Subjective data models in bioinformatics and how wet lab and computational biologists conceptualise data

Research output: Contribution to journalArticlepeer-review

Abstract

Biological science produces "big data" in varied formats, which necessitates using computational tools to process, integrate, and analyse data. Researchers using computational biology tools range from those using computers for communication, to those writing analysis code. We examine differences in how researchers conceptualise the same data, which we call "subjective data models". We interviewed 22 people with biological experience and varied levels of computational experience, and found that many had fluid subjective data models that changed depending on circumstance. Surprisingly, results did not cluster around participants' computational experience levels. People did not consistently map entities from abstract data models to the real-world entities in files, and certain data identifier formats were easier to infer meaning from than others. Real-world implications: 1) software engineers should design interfaces for task performance, emulating popular user interfaces, rather than targeting professional backgrounds; 2) when insufficient context is provided, people may guess what data means, whether or not they are correct, emphasising the importance of contextual metadata to remove the need for erroneous guesswork.

Original languageEnglish
Article number756
Number of pages11
JournalScientific Data
Volume10
Issue number1
DOIs
Publication statusPublished - 2 Nov 2023

Fingerprint

Dive into the research topics of 'Subjective data models in bioinformatics and how wet lab and computational biologists conceptualise data'. Together they form a unique fingerprint.

Cite this