Ontology-Driven Metadata Enrichment for Genomic Datasets
2018-12-05T13:01:39Z (GMT) by
Data-driven genomic research requires accessing several repositories of genomic datasets, produced by international consortia, which provide open access to extremely valuable and well curated biological content. The associated metadata, describing experimental and biological conditions, are highly heterogeneous; consequently, dataset collection and
integration is difficult -- it requires data conversions and
term matching which needs to be done by humans, with biological expertise.
In this paper, we present a method and tools for ontology-driven metadata enrichment. We select few relevant features which are provided by most repositories, and then we
comparatively evaluate several search services providing ontological access, eventually associating each feature with the specific ontologies which are most suited to describe them. We also provide an expert validation of the approach. The method and tools are deployed in a large repository of open data, which will be soon available to the research community.