Feedback-based annotation, selection and refinement of schema mappings for dataspaces

Khalid Belhajjame, Norman W. Paton, Suzanne M. Embury, Alvaro A A Fernandes, Cornelia Hedeler

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    The specification of schema mappings has proved to be time and resource consuming, and has been recognized as a critical bottleneck to the large scale deployment of data integration systems. In an attempt to address this issue, dataspaces have been proposed as a data management abstraction that aims to reduce the up-front cost required to setup a data integration system by gradually specifying schema mappings through interaction with end users in a pay-as-you-go fashion. As a step in this direction, we explore an approach for incrementally annotating schema mappings using feedback obtained from end users. In doing so, we do not expect users to examine mapping specifications; rather, they comment on results to queries evaluated using the mappings. Using annotations computed on the basis of user feedback, we present a method for selecting from the set of candidate mappings, those to be used for query evaluation considering user requirements in terms of precision and recall. In doing so, we cast mapping selection as an optimization problem. Mapping annotations may reveal that the quality of schema mappings is poor. We also show how feedback can be used to support the derivation of better quality mappings from existing mappings through refinement. An evolutionary algorithm is used to efficiently and effectively explore the large space of mappings that can be obtained through refinement. The results of evaluation exercises show the effectiveness of our solution for annotating, selecting and refining schema mappings. Copyright 2010 ACM.
    Original languageEnglish
    Title of host publicationAdvances in Database Technology - EDBT 2010 - 13th International Conference on Extending Database Technology, Proceedings|Adv. Database Technol. - EDBT - Int. Conf. Extending Database Technol., Proc.
    PublisherAssociation for Computing Machinery
    Pages573-584
    Number of pages11
    ISBN (Print)9781605589459
    DOIs
    Publication statusPublished - 2010
    Event13th International Conference on Extending Database Technology: Advances in Database Technology - EDBT 2010 - Lausanne
    Duration: 1 Jul 2010 → …

    Conference

    Conference13th International Conference on Extending Database Technology: Advances in Database Technology - EDBT 2010
    CityLausanne
    Period1/07/10 → …

    Keywords

    • Data integration
    • Dataspaces
    • Mapping annotation
    • Mapping refinement
    • Mapping selection
    • Pay-as-you-go
    • User feedback

    Fingerprint

    Dive into the research topics of 'Feedback-based annotation, selection and refinement of schema mappings for dataspaces'. Together they form a unique fingerprint.

    Cite this