Spreading excellence and disseminating the cutting-edge results of our research and development efforts is crucial to our institute. Check out our educational offerings for Bachelor, Master and PhD studies at the University of Innsbruck!

Web Data Extraction and Reconditioning

Student name: Alex Stolz

The current web provides a vast amount of information. Unfortunately, most of it is available only in semi-structured form (HTML), which limits how well computers can process it automatically. Web services provide a standardized interface to information sources and allow any source on the web to be integrated easily into any computer program. For this thesis you will first survey the existing tools for information extraction (screen scrapers), then implement a set of scrapers and provide the extracted information as a web service.

Required knowledge: Basic understanding of HTML, Java and Servlet technology.
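To make the two steps of the task concrete (extracting data from HTML, then offering it in a structured form), here is a minimal sketch in Java. It is only an illustration under stated assumptions: the class name `HeadlineScraper`, the `<h2 class="headline">` markup and the XML element names are all hypothetical and not part of the thesis description. A regular expression is used for brevity; a real scraper would more likely rely on an HTML parser.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class HeadlineScraper {

    // Assumption: the target page marks headlines as <h2 class="headline">...</h2>.
    private static final Pattern HEADLINE = Pattern.compile(
            "<h2\\s+class=\"headline\"\\s*>(.*?)</h2>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Step 1: pull the headline texts out of semi-structured HTML.
    public static List<String> extractHeadlines(String html) {
        List<String> result = new ArrayList<>();
        Matcher m = HEADLINE.matcher(html);
        while (m.find()) {
            result.add(m.group(1).trim());
        }
        return result;
    }

    // Step 2: render the extracted data in a structured form that a
    // web service could return (element names are hypothetical).
    public static String toXml(List<String> headlines) {
        StringBuilder sb = new StringBuilder("<headlines>");
        for (String h : headlines) {
            sb.append("<headline>").append(escape(h)).append("</headline>");
        }
        return sb.append("</headlines>").toString();
    }

    // Treats the extracted text as plain text; HTML entities already
    // present in the source are not decoded first.
    private static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    public static void main(String[] args) {
        String html = "<html><body>"
                + "<h2 class=\"headline\">First story</h2><p>text</p>"
                + "<h2 class=\"headline\">Second story</h2>"
                + "</body></html>";
        System.out.println(toXml(extractHeadlines(html)));
    }
}
```

In a servlet-based service, the string returned by `toXml` could be written to the response inside a `doGet` method; that part is left out here so the sketch runs with the JDK alone.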