.. starting from a co-occurrence network

In case you already have a co-occurrence network based on your data, you can ask microbetag to directly annotate it. In this tutorial, we show how to run microbetag using a network of your own.

INPUT FILES USED IN THIS TUTORIAL

The network we will use (edgelist.tsv) comes from the study of Hessler et al. (2023). We would like to thank the authors for sharing their data. The abundance table (vitAbund.tsv) however does not represent the one of the actual study. It is a pseudo abundance table that only includes the bin names, exactly as they are called in the network file, to highlight that it is not the abundance data that are of interest now. Hessler et al. investigated factors that control community organization in mine tailings-derived laboratory microbial consortia and were able to link a Variovorax species as an important source of thiamine.

If you already have a network, then you need to provide both the network and the abundance table and make sure that the sequence identifiers in those two files are the same; meaning that the node ids of the network are present in the abundance table in the column representing the sequence identifier.

For example, a toy model of a network file would be:

node_Anode_Bweight
bin_1bin_20.84

then, the corresponding abundance table would have, among other records, to have the following two lines:

sequenceIdentifiersample_1sample_2sample_3taxonomy
bin_12344243g__Devosia;s__Devosia sp001899045
bin_23245443g__Pseudonocardia;s__Pseudonocardia sp001899645

Taxonomy here is only partial. Make sure you always have a 7-level taxonomy, e.g. d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Rhizobiales;f__Devosiaceae;g__Devosia_A;s__Devosia_A sp001899075

So, this time we will load the vitAbund.tsv file. Here how this looks like:

check_abundance

Then, you need to import your network first on Cytoscape through the main File tab:

import_network

Cytoscape will display your network and on the bottom of your screen you will have the core tables of a Cytoscape network. As your network may have several edges, you need to make clear which edge attribute you would like microbetag to use; this is essential in cases network clustering will be performed. To do so, you need to move on the Edge table by clicking on the arrow next to the Node table that is displayed by default, and then rename the column of your choice to microbetag::weight.

rename

Now you are ready to import your network to MGG though its main menu on the Apps tab:

import_network

Finally, you can again check your network as loaded on MGG through the Check Data Files tab:

check_network

If the node names of the network are not included in the sequence identifiers of the abundance table, you will not be able to import your network to MGG.

Once both your abundance file and your network are imported, you can proceed as in the Starting from an abundance table case by clicking on Get Annotated Network on the main menu of the MGG app on the Apps tab and setting the parameters required.

settings

Make sure that you set the Choose Input Type as network this time, otherwise microbetag will ignore your network and try to build on of their own.

This will take significantly less time and here is the returned network:

annotated_net

Now, you can go through the Investigating the annotations tutorial to check how good microbetag did with Variovorax - related annotations.