- Create Date December 15, 2021
- Last Updated December 15, 2021
D1.1 Datasets and repositories benchmarking - Full report
This document assesses currently available data repositories across the major omics (genomics, transcriptomics, proteomics, metabolomics and phenomics) with a view to their potential as data sources for the GLOMICAVE project. The assessment indicates that the longer-established omics, genomics and transcriptomics, have comprehensive, well-established repositories, whereas repositories for the newer omics contain relatively few datasets. However, the primary goal of these repositories is, understandably, to preserve raw datasets and to relieve scientific publishers of the data preservation burden. Access to quantitative data, which would simplify automated data integration within GLOMICAVE, is given a lower priority. Importantly, the majority of repositories support programmatic data access via APIs. Finally, while data licensing may be an issue with some specialized repositories, the most important repositories do not appear to place any restrictions on data reuse.