Semi-Supervised Web Wrapper Repair via Recursive Tree Matching
CoRR, Volume abs/1505.01303, 2015.
Continuous data extraction pipelines using wrappers have become common and integral parts of businesses dealing with stock, flight, or product information. Extracting data from websites that use HTML templates is difficult because available wrapper methods are not designed to deal with websites that change over time (the inclusion or re...