DNNGIOR

03 Jan 2024 in Software on Metabolic, Model

Highlights

We trained a deep neural network on >11k bacterial species to recover missing reactions
Reaction frequency and query similarity to the training data impacted performance
DNNGIOR models can simulate real data similar to CarveMe with fewer false positives

A data-driven approach to gapfill Genome Scale Metabolic Reconstructions in an unbiased way.

Open Source

03 Jan 2024

Establishing the ELIXIR Microbiome Community

01 Jan 2024 in Publications on microbiome

The ELIXIR Marine Metagenomics Community, initiated amongst researchers focusing on marine microbiomes, has concentrated on promoting standards around microbiome-derived sequence analysis, as well as on understanding the gaps in methods and reference databases and finding solutions to the computational overheads of performing such analyses.

A visit at ETH

30 Dec 2023 in Blog on Thoughts

Last month, I had the chance to visit ETH and the Institute of Food, Nutrition and Health.

I was lucky to see the awesome bioreactors they have set up over the years in the Laboratory of Food Biotechnology but, most importantly, to meet a group of people who struggle to keep doing what they love.

I was able to see how the data are produced but also to spread the word about microbetag, our co-occurrence network annotator.

For sure, two weeks were not enough, so I hope we will meet again, even somewhere not as beautiful as snowy Zurich. Many thanks to Dr. Annelies Geirnaert and Professor for this opportunity, and to all the lab members for making my stay so pleasant. Of course, a special thanks to Dr. Andi Erega for all the explanations and for his patience with me knowing only the basics in the lab, but mostly for the burek!

See you soon I hope! :)

metaGOflow

30 Oct 2023 in Software on Shotgun, Metadata

metaGOflow is a Common Workflow Language (CWL) based pipeline for the analysis of shotgun metagenomic data at the sample level. It was initially built to address the needs of the EMO BON community but it can be used for any type of shotgun sequencing data.

metaGOflow is based on the tools and subworkflows implemented in the framework of MGnify.

A pile of pipelines: An overview of the bioinformatics software for metabarcoding data analyses

18 Aug 2023 in Publications on Amplicon

A multitude of metabarcoding data analysis tools and pipelines have also been developed. Often, several developed workflows are designed to process the same amplicon sequencing data, making it somewhat puzzling to choose one among the plethora of existing pipelines. However, each pipeline has its own specific philosophy, strengths and limitations, which should be considered depending on the aims of any specific study, as well as the bioinformatics expertise of the user. In this review, we outline the input data requirements, supported operating systems and particular attributes of thirty-two amplicon processing pipelines with the goal of helping users to select a pipeline for their metabarcoding projects.

metaGOflow: a workflow for the analysis of marine Genomic Observatories shotgun metagenomics data

18 Aug 2023 in Publications on Shotgun

Based on the established MGnify resource, we developed metaGOflow. metaGOflow supports the fast inference of taxonomic profiles from GO-derived data based on ribosomal RNA genes and their functional annotation using the raw reads. Thanks to the Research Object Crate packaging, relevant metadata about the sample under study, and the details of the bioinformatics analysis it has been subjected to, are passed on to the data product, while its modular implementation allows running the workflow partially. The analysis of 2 EMO BON samples and 1 Tara Oceans sample was performed as a use case.

metaGOflow is an efficient and robust workflow that scales to the needs of projects producing big metagenomic data such as EMO BON. It highlights how containerization technologies along with modern workflow languages and metadata package approaches can support the needs of researchers when dealing with ever-increasing volumes of biological data. Despite being initially oriented to address the needs of EMO BON, metaGOflow is a flexible and easy-to-use workflow that can be broadly used for one-sample-at-a-time analysis of shotgun metagenomics data.

Geometric algorithms for sampling the flux space of metabolic networks

18 Aug 2023 in Publications on Metabolic, Networks

Constraint-based approaches have been widely used for the analysis of such models and have led to intriguing geometry-oriented challenges. In this setting, sampling points uniformly from polytopes derived from metabolic models (flux sampling) provides a representation of the solution space of the model under various conditions. However, the polytopes that result from such models are of high dimension (in the order of thousands) and usually considerably skinny. Therefore, sampling uniformly at random from such polytopes calls for a novel algorithmic and computational framework specially tailored to the properties of metabolic models. We present a Multiphase Monte Carlo Sampling (MMCS) algorithm that unifies rounding and sampling in one pass, yielding both upon termination. It exploits an optimized variant of the Billiard Walk that enjoys faster arithmetic complexity per step than the original. Sampling on the most complicated human metabolic network accessible today, Recon3D, corresponding to a polytope of dimension 5 335, took less than 30 hours.
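
To give a feel for what sampling from a polytope involves, here is a minimal sketch of the classical hit-and-run walk on a polytope written as {x : A x <= b}. This is not the MMCS algorithm or the Billiard Walk from the paper, just a simpler textbook random walk, and it does no rounding. The function name `hit_and_run`, its arguments and the tolerance value are all made up for illustration, and the sketch assumes the polytope is bounded and that `x0` lies strictly inside it.

```python
import math
import random


def hit_and_run(A, b, x0, n_samples, rng=None):
    """Hit-and-run walk on the bounded polytope {x : A x <= b}.

    A is a list of rows, b the list of right-hand sides, and x0 a strictly
    interior starting point (all assumptions of this sketch).
    """
    rng = rng or random.Random(0)
    x = list(x0)
    samples = []
    for _ in range(n_samples):
        # Draw a uniformly random direction on the unit sphere.
        d = [rng.gauss(0.0, 1.0) for _ in x]
        norm = math.sqrt(sum(v * v for v in d))
        d = [v / norm for v in d]

        # Intersect the line x + t*d with every half-space to get the
        # chord t_min <= t <= t_max that stays inside the polytope.
        t_min, t_max = -math.inf, math.inf
        for row, bound in zip(A, b):
            ad = sum(a * v for a, v in zip(row, d))
            slack = bound - sum(a * v for a, v in zip(row, x))
            if ad > 1e-12:
                t_max = min(t_max, slack / ad)
            elif ad < -1e-12:
                t_min = max(t_min, slack / ad)

        # Jump to a uniformly random point on that chord.
        t = rng.uniform(t_min, t_max)
        x = [xi + t * di for xi, di in zip(x, d)]
        samples.append(x)
    return samples


# Toy example: the unit square 0 <= x, y <= 1 (not a metabolic model).
square_A = [[1, 0], [-1, 0], [0, 1], [0, -1]]
square_b = [1, 0, 1, 0]
points = hit_and_run(square_A, square_b, [0.5, 0.5], 1000)
```

On a "skinny" polytope most chords are very short in all but a few directions, so a walk like this mixes slowly. That is the kind of difficulty that rounding the polytope, as MMCS does alongside sampling, is meant to address.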

eDNA metabarcoding, pipeline development & high performance computing

30 May 2023 in Teaching on Amplicon

Class in the MSc course “Applied Bioinformatics & Data Analysis”.

Genome-scale metabolic model reconstruction

30 May 2023 in Teaching on Model

With Dr. Daniel Garza, we had a class in the MSc course “Modélisation et contrôle des systèmes dynamiques en bioingénierie”.

Many thanks to Prof. Didier Gonze for inviting us!

GColab