Digital repository of Slovenian research organisations

Show document
A+ | A- | Help | SLO | ENG

Title:MetaBakery : a singularity implementation of bioBakery tools as a skeleton application for efficient HPC deconvolution of microbiome metagenomic sequencing data to machine learning ready information
Authors:ID Murovec, Boštjan (Author)
ID Deutsch, Leon (Author)
ID Osredkar, Damjan (Author)
ID Stres, Blaž (Author)
Files:.pdf PDF - Presentation file, download (1,12 MB)
MD5: E950A4B0E71E62CEEA8D16BF8C7AFBB3
 
URL URL - Source URL, visit https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2024.1426465/full
 
Language:English
Typology:1.01 - Original Scientific Article
Organization:Logo UKC LJ - Ljubljana University Medical Centre
Abstract:In this study, we present MetaBakery (http://metabakery.fe.uni-lj.si), an integrated application designed as a framework for synergistically executing the bioBakery workflow and associated utilities. MetaBakery streamlines the processing of any number of paired or unpaired fastq files, or a mixture of both, with optional compression (gzip, zip, bzip2, xz, or mixed) within a single run. MetaBakery uses programs such as KneadData (https://github.com/bioBakery/kneaddata), MetaPhlAn, HUMAnN and StrainPhlAn as well as integrated utilities and extends the original functionality of bioBakery. In particular, it includes MelonnPan for the prediction of metabolites and Mothur for calculation of microbial alpha diversity. Written in Python 3 and C++ the whole pipeline was encapsulated as Singularity container for efficient execution on various computing infrastructures, including large High-Performance Computing clusters. MetaBakery facilitates crash recovery, efficient re-execution upon parameter changes, and processing of large data sets through subset handling and is offered in three editions with bioBakery ingredients versions 4, 3 and 2 as versatile, transparent and well documented within the MetaBakery Users’ Manual (http://metabakery.fe.uni-lj.si/metabakery_manual.pdf). It provides automatic handling of command line parameters, file formats and comprehensive hierarchical storage of output to simplify navigation and debugging. MetaBakery filters out potential human contamination and excludes samples with low read counts. It calculates estimates of alpha diversity and represents a comprehensive and augmented re-implementation of the bioBakery workflow. The robustness and flexibility of the system enables efficient exploration of changing parameters and input datasets, increasing its utility for microbiome analysis. Furthermore, we have shown that the MetaBakery tool can be used in modern biostatistical and machine learning approaches including large-scale microbiome studies.
Publication status:Published
Publication version:Version of Record
Year of publishing:2024
Number of pages:11 str.
Numbering:Vol. 15, [article no.] 1426465
PID:20.500.12556/DiRROS-29849 New window
UDC:579
ISSN on article:1664-302X
DOI:10.3389/fmicb.2024.1426465 New window
COBISS.SI-ID:204189699 New window
Note:Nasl. z nasl. zaslona; Opis vira z dne 12. 8. 2024;
Publication date in DiRROS:08.06.2026
Views:57
Downloads:36
Metadata:XML DC-XML DC-RDF
:
Copy citation
  
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:Frontiers in microbiology
Shortened title:Front. microbiol.
Publisher:Frontiers Research Foundation
ISSN:1664-302X
COBISS.SI-ID:4146296 New window

Document is financed by a project

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:P2-0095-2020
Name:Znanstveno računanje

Funder:ARRS - Slovenian Research Agency
Funding programme:Slovenian Research and Innovation Agency
Project number:SRA R#51867
Name:MR+

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:P2-0180-2014
Name:Vodarstvo in geotehnika

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:J7-50230-2024
Name:Izgradnja učinkovitih orodij za odkrivanje neprenosljivih bolezni

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.

Secondary language

Language:Slovenian
Keywords:mikrobiologija, mikrobna metagenomika, bioinformatika, strojno učenje, črevesni mikrobiom, medicina, nenalezljive bolezni


Back