A semi-automated workflow for biodiversity data retrieval, cleaning, and quality control

Cherian Mathew, Anton Güntsch, Matthias Obst, Saverio Vicario, Robert Haines, Alan Williams, Yde de Jong, Carole Goble

    Research output: Contribution to journalArticlepeer-review

    Abstract

    The compilation and cleaning of data needed for analyses and prediction of species distributions is a time consuming process requiring a solid understanding of data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Data Refinement Workflow which integrates taxonomic data retrieval, data cleaning, and data selection into a consistent, standards-based, and effective system hiding the complexity of underlying service infrastructures. The workflow can be freely used both locally and through a web-portal which does not require additional software installations by users.

    Original languageEnglish
    Pages (from-to)e4221
    JournalBiodiversity data journal
    Issue number2
    DOIs
    Publication statusPublished - 2014

    Keywords

    • Journal Article

    Fingerprint

    Dive into the research topics of 'A semi-automated workflow for biodiversity data retrieval, cleaning, and quality control'. Together they form a unique fingerprint.

    Cite this