Abstract
Data integration systems typically make use of mappings to capture the relationships between the data resources to be integrated and the integrated representations presented to users. Manual development and maintenance of such mappings is time consuming and thus costly. Pay-as-you-go approaches to data integration support automatic construction of initial mappings, which are generally of rather poor quality, for refinement in the light of user feedback. However, automatic approaches that produce these mappings typically lead to the generation of multiple, overlapping candidate mappings. To present the most relevant set of results to user queries, the mappings have to be ranked. We proposed a ranking technique that uses information from query logs to discriminate among candidate mappings. The technique is evaluated in terms of how quickly stable rankings can be produced, and to investigate how the rankings track query patterns that are skewed towards specific sources. © 2012 Springer-Verlag.
Original language | English |
---|---|
Title of host publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|Lect. Notes Comput. Sci. |
Editors | Olivier Bodenreider, Bastien Rance |
Publisher | Springer Nature |
Pages | 37-52 |
Number of pages | 15 |
Volume | 7348 |
ISBN (Print) | 9783642310393 |
DOIs | |
Publication status | Published - 2012 |
Event | 8th International Conference on Data Integration in the Life Sciences, DILS 2012 - College Park, MD Duration: 1 Jul 2012 → … |
Conference
Conference | 8th International Conference on Data Integration in the Life Sciences, DILS 2012 |
---|---|
City | College Park, MD |
Period | 1/07/12 → … |
Keywords
- Dataspaces
- Implicit Feedback
- Pay-as-you-go Data Integration
- Ranking
- Schema Mapping