A completeness-aware data quality processing approach for web queries

Research output: Chapter in Book/Conference proceeding; Conference contribution; peer-review

Abstract

Internet Query Systems (IQS) are information systems that query the World Wide Web by finding data sources relevant to a given query and retrieving data from them. They differ from traditional database management systems in that the data to be processed must be found by a search engine, fetched from remote data sources, and processed while taking into account issues such as unpredictable access and transfer rates, infinite streams of data, and the ability to produce partial results. Although internet query systems provide more powerful query functionality than traditional search engines, their uptake has been slow, partly because of the difficulty of assessing and filtering the low-quality data that internet queries return. In this paper we investigate how an internet query system can be extended to support data-quality-aware query processing. In particular, we illustrate the metadata support, XML-based data quality measurement method, algebraic query processing operators, and query plan structures of a query processing framework aimed at helping users to identify, assess, and filter out data regarded as having low completeness for the intended use.
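The abstract does not spell out the measurement method or the operators, so the following is only an illustrative sketch of the general idea, not the paper's framework. It assumes that completeness is scored as the fraction of expected fields that are present and non-empty in an XML record, and that a filter-style operator drops records below a user-chosen threshold. The field list, the function names `completeness` and `completeness_filter`, and the sample data are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical schema: the fields a "complete" record is expected to carry.
EXPECTED_FIELDS = ("title", "author", "year")


def completeness(record, expected=EXPECTED_FIELDS):
    """Fraction of expected child elements that are present and non-empty."""
    filled = 0
    for name in expected:
        child = record.find(name)
        if child is not None and (child.text or "").strip():
            filled += 1
    return filled / len(expected)


def completeness_filter(records, threshold):
    """Filter-style operator: lazily yield records scoring at least `threshold`."""
    for record in records:
        if completeness(record) >= threshold:
            yield record


# Made-up sample data standing in for results fetched from a web source.
SAMPLE = """
<results>
  <record><title>A</title><author>X</author><year>2008</year></record>
  <record><title>B</title><author></author></record>
  <record><title>C</title><year>2007</year></record>
</results>
"""

root = ET.fromstring(SAMPLE)
records = root.findall("record")
scores = [round(completeness(r), 2) for r in records]
kept = [r.findtext("title") for r in completeness_filter(records, 0.6)]
```

The filter is written as a generator so that it can emit results as records arrive, which fits the partial-result and streaming setting the abstract describes. The 0.6 threshold is arbitrary and would in practice depend on the user's intended use of the data.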
Original language: English
Title of host publication: ICSOFT 2008 - Proceedings of the 3rd International Conference on Software and Data Technologies | ICSOFT - Int. Conf. Softw. Data Technol., Proc.
Place of Publication: Portugal
Publisher: INSTICC Press
Pages: 234-239
Number of pages: 5
Volume: ISDM
ISBN (Print): 9789898111524, 9789898111531
Publication status: Published - 2008
Event: 3rd International Conference on Software and Data Technologies, ICSOFT 2008 - Porto
Duration: 1 Jul 2008 → …

Conference

Conference: 3rd International Conference on Software and Data Technologies, ICSOFT 2008
City: Porto
Period: 1/07/08 → …

Keywords

  • Completeness
  • Data quality
  • Internet query systems
  • Query processing
