The Consequences of Data Silos on Data Quality in Biomedical Research

elucidata

posted on 1 day ago — updated on 1 second ago


In biomedical research, data silos are isolated repositories of information accessible only to specific departments, teams, or organizations.

These silos emerge as a natural consequence of the diverse sources and proprietary systems that generate data in the field. Clinical data, genomic sequences, proteomic datasets, and imaging files are often stored separately, each following its own standards, formats, and storage mechanisms. While this setup might suit individual teams, the lack of interconnectedness leads to fragmented data ecosystems that are difficult to integrate and analyze holistically.

The impact of data silos on data quality is profound. When datasets remain isolated, inconsistencies and redundancies become hard to avoid. For example, one research team may record patient data using different units of measurement or variable names than another, making harmonization a labor-intensive and error-prone process. Additionally, crucial metadata, the contextual information that gives raw data its meaning, is often incomplete or missing altogether, further reducing the usability of siloed datasets.
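To make the harmonization problem concrete, here is a minimal Python sketch. Everything in it is an illustrative assumption rather than anything from a real pipeline: the two teams' records, the field names, the canonical schema, and the `harmonize` and `missing_fields` helpers are all hypothetical. It maps each team's variable names and units onto one shared schema and flags records with missing required fields.

```python
# Hypothetical records from two teams that measure the same things differently.
team_a = [{"patient_id": "P001", "weight_kg": 70.0, "glucose_mg_dl": 90.0}]
team_b = [{"PatientID": "P002", "weight_lb": 154.0, "glucose_mmol_l": 5.0}]

LB_TO_KG = 0.45359237        # exact definition of the international pound
MMOL_L_TO_MG_DL = 18.016     # approximate factor for glucose (about 180.16 g/mol)

# Assumed mapping: source field -> (canonical field, factor to canonical unit).
# A factor of None means the value is copied unchanged (for example, identifiers).
FIELD_MAP = {
    "patient_id": ("patient_id", None),
    "PatientID": ("patient_id", None),
    "weight_kg": ("weight_kg", 1.0),
    "weight_lb": ("weight_kg", LB_TO_KG),
    "glucose_mg_dl": ("glucose_mg_dl", 1.0),
    "glucose_mmol_l": ("glucose_mg_dl", MMOL_L_TO_MG_DL),
}

REQUIRED_FIELDS = {"patient_id", "weight_kg", "glucose_mg_dl"}


def harmonize(record):
    """Rename fields and convert units so the record follows the canonical schema."""
    harmonized = {}
    for field, value in record.items():
        if field not in FIELD_MAP:
            # Fail loudly instead of silently dropping data nobody has mapped yet.
            raise KeyError(f"no mapping defined for field {field!r}")
        canonical, factor = FIELD_MAP[field]
        harmonized[canonical] = value if factor is None else round(value * factor, 2)
    return harmonized


def missing_fields(record):
    """Return the required canonical fields that a harmonized record lacks."""
    return sorted(REQUIRED_FIELDS - record.keys())


combined = [harmonize(r) for r in team_a + team_b]
for row in combined:
    print(row, "missing:", missing_fields(row))
```

Even in this toy case, every new source field needs an explicit mapping and a verified conversion factor, which is where much of the manual effort and risk of error comes from.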

elucidata

Website: https://www.elucidata.io/
Elucidata leverages its platform, Polly, to augment the quality of data in pre-clinical drug discovery. It curates multi-omics and assay data to make them ML-ready or analysis-ready.
Address: 114 Sansome Street, Suite 250, San Francisco, CA 94104
Phone No: 9716140329
Contact Email: info@elucidata.io