Title: Protection Techniques from Information Extraction
Abstract:Information extraction technologies meet the market need for automatic tools for extracting semi-structured information from Web pages. However, pages may change over time due to different reasons, ra...Information extraction technologies meet the market need for automatic tools for extracting semi-structured information from Web pages. However, pages may change over time due to different reasons, ranging from restyling pages to on-purpose modifications brought about into pages in order to puzzle Web wrappers. In this paper we deal with this latter scenario, by studying the issue of on-purpose wrapper spoiling and its relationship to wrapping. We present an architecture and a tool implementing a wrapper spoiling system, and discuss some practical spoiling techniques which are also experimentally testedRead More
Publication Year: 2006
Publication Date: 2006-12-01
Language: en
Type: article
Indexed In: ['crossref']
Access and Citation
Cited By Count: 1
AI Researcher Chatbot
Get quick answers to your questions about the article from our AI researcher chatbot