Abstract
Internet Query Systems (IQS) are information systems used to query the World Wide Web by finding data sources relevant to a given query and retrieving data from the identified data sources. They differ from traditional database management systems in that data to be processed need to be found by a search engine, fetched from remote data sources and processed taking into account issues such as the unpredictability of access and transfer rates, infinite streams of data, and the ability to produce partial results. Despite the powerful query functionality provided by internet query systems when compared to traditional search engines, their uptake has been slow partly due to the difficulty of assessing and filtering low quality data resulting from internet queries. In this paper we investigate how an internet query system can be extended to support data quality aware query processing. In particular, we illustrate the metadata support, XML-based data quality measurement method, algebraic query processing operators, and query plan structures of a query processing framework aimed at helping users to identify, assess, and filter out data regarded as of low completeness data quality for the intended use.
Original language | English |
---|---|
Title of host publication | ICSOFT 2008 - Proceedings of the 3rd International Conference on Software and Data Technologies|ICSOFT - Int. Conf. Softw. Data Technol., Proc. |
Place of Publication | Portugal |
Publisher | INSTICC Press |
Pages | 234-239 |
Number of pages | 5 |
Volume | ISDM |
ISBN (Print) | 9789898111524, 9789898111531 |
Publication status | Published - 2008 |
Event | 3rd International Conference on Software and Data Technologies, ICSOFT 2008 - Porto Duration: 1 Jul 2008 → … |
Conference
Conference | 3rd International Conference on Software and Data Technologies, ICSOFT 2008 |
---|---|
City | Porto |
Period | 1/07/08 → … |
Keywords
- Completeness
- Data quality
- Internet query systems
- Query processing