Title: Research of Information Extraction Algorithm based on Hidden Markov Model
Abstract: Building on research into web information extraction algorithms based on the Hidden Markov Model (HMM), this paper focuses on the application of HMMs to text information extraction. It improves the extraction method by constructing a granularity-refined DOM tree and combining it with regular expressions to extract detailed information points. In addition, the probabilities of unknown observations are smoothed. Test results show that the improved HMM achieves better extraction performance.
Publication Year: 2010
Publication Date: 2010-12-01
Language: en
Type: article
Indexed In: ['crossref']
Access and Citation
Cited By Count: 6