Metagenome analyses explore the functional potential and biodiversity of prokaryotes, eukaryotes, and viruses starting from sequencing data and recovering metagenome-assembled genomes (MAGs). This process involves several complex bioinformatics approaches, such as sequence assembly, genome binning and quality estimation, taxonomic assignment, functional annotation, and data integration with other analyses (metadata or other omics technologies). Researchers working on metagenomic studies require comparable genome sequences and datasets. Many of the metagenomes deposited in public repositories have insufficient or incomplete metadata. This issue also extends to information on the bioinformatic tools used to generate these metagenomes.
To enable meta-analyses on metagenomes, MetaProv will assess and optimize the scalability and reproducibility of data generation tools and workflows and enhance user-friendliness. Creating a suitable tool to track provenance (e.g. used thresholds, tools, database versions) will enhance reproducibility and guide the users to define the necessary computer resources for their data analysis. MetaProv will contribute to developing a modular implementation of the current standards and analytical services provided by NFDI4Microbiota, facilitating the introduction or update of workflows. Ultimately, MetaProv will showcase and enable users to easily search for extra metagenomes that could help answer their research question or test their hypothesis.
Graphical abstract “Use Case MetaProv” by Ulisses Nunes da Rocha and Jonas Coelho Kasmanas with visual adaptation by Charlie Pauvert is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
You are a …
Below, you will find the output provided by the Use Case. If you are interested in the development stages of the project, these are indicated by the following tenses and suffixes.
The output for the research community is already established (-ed for past tense), is currently being (-ing for present progressive) in progress, or will be soon set-up (present tense) for future endeavors.
metagenomics
workflows
metadata standard
provenance
metagenome-assembled genomes