Abstract
Format transformation is one of the most labor-intensive tasks in a data wrangling process. Recent advances in programming by example have produced synthesis algorithms that show promising results on spreadsheet data. However, when applied to repositories consisting of multiple sources and a large number of examples, such algorithms exhibit scalability issues. This paper introduces a new transformation synthesis technique based on edit operations that enables efficient learning of transformation programs. Empirical results show comparable effectiveness and dramatic improvements in efficiency over the state of the art.
Original language | English |
---|---|
Title of host publication | Advances in Database Technology - EDBT 2019 |
Subtitle of host publication | 22nd International Conference on Extending Database Technology, Proceedings |
Editors | Zoi Kaoudi, Helena Galhardas, Irini Fundulaki, Berthold Reinwald, Melanie Herschel, Carsten Binnig |
Publisher | OpenProceedings |
Pages | 714-717 |
Number of pages | 4 |
Volume | 2019-March |
ISBN (Electronic) | 9783893180813 |
Publication status | Published - 26 Mar 2019 |
Event | 22nd International Conference on Extending Database Technology - Lisbon, Portugal. Duration: 26 Mar 2019 → 29 Mar 2019 |
Conference
Conference | 22nd International Conference on Extending Database Technology |
---|---|
Abbreviated title | EDBT 2019 |
Country/Territory | Portugal |
City | Lisbon |
Period | 26/03/19 → 29/03/19 |