A new software platform to analyze large scale ‘omic’ data according to their metabolic machinery: the case of the biogeochemical sulfur cycle

$2,000

Prizes

2,303

Views

1. Background

The increasing expansion in the number of metagenomic and genomic sequences has dramatically improving our understanding of life’s microbial diversity to an unprecedented level of detail. Yet, our ability to infer metabolic capabilities in a large omic datasets remain biologically and computationally challenging. Here we propose a new Multigenomic Entropy Based Score (MEBS), which enclose the information derived from complex metabolic pathways into a single Score. To test MEBS we focused on the biogeochemical Sulfur cycle due to the lack of studies aiming to integrate all the microbiological and geochemical transformations and their corresponding metabolic pathways in global scale.  

2. Method

MEBS algorithm is a software platform written in Bash, Perl and Python and have been tested under Linux environments. The first step of MEBS consists of the systematic manual acquisition and curation of the molecular and ecological information required to describe the metabolic machinery of interest, for example, the sulfur metabolism. This information is represented by two input files: a list of microorganisms and a multi FASTA file of proteins. MEBS then evaluate the presence/absence patterns of the input proteins in a Genomic dataset (Gen), containing 2,107 non-redundant complete sequenced genomes. Then, the expected vs observed pattern in the input organisms is obtained for each of the input proteins using the mathematical framework of relative entropy (H’). The last step consists in the summation of all the input protein entropies present in the omics data to be evaluated (either genomes or metagenomes) in order to obtain the final Entropy Score. MEBS was thoroughly tested to capture the importance of biogeochemical Sulfur (S) cycles in 935 metagenomes 2107 genomes. The performance, reproducibility and robustness of MEBS was evaluated using several approaches including a random sampling test, linear regression models and ROC curves.


3. Results

We present MEBS, a new open source software platform aimed to quantitatively evaluate, compare and infer the metabolic machinery of interest, in large ‘omic’ datasets, including complex metabolic pathways such as entire biogeochemical cycles. MEBS algorithm is free, open source and available through: through https://github.com/eead-csic-compbio/metagenome_Pfam_score. The curation effort reported here represents the first comprehensive inventory of the genes, enzymes, pathways, compounds and organisms involved in the sulfur cycle. The input protein domains enriched among sulfur-based microorganisms were obtained with the relative entropy (H’) mathematical framework. The clustering of the 112 H’ values of the input sulfur proteins obtained in a large collection of non-redundant genomes, highlight the possibility of use 12 sulfur informative domains as sulfur cycle marker genes in metagenomic data. Finally the summation of 112’ H’ values in a given genome or metagenome dataset build up the MEBS final Score (Sulfur Score: SS). The SS values in the genomic and metagenomic data collections strongly highlight the broad applicability of our proposed algorithm to accurately detect the sulfur cycle metabolic machinery in large OMIC scale in a fast and a simple fashion manner

4. Conclusions

Our Sulfur cycle benchmark using MEBS software platform, indicate that the use of a single informative Score the metabolic machinery of interest holds the potential to dramatically change the current view of inferring metabolic capabilities in the present omic-era. We have demonstrated that MEBS is very accurate to detect and classify genomes and metagenomes known to be closely involved in the Sulfur Cycle, suggesting several applications like, the prediction of metabolic capabilities in uncultivated/unexplored taxa and the generation of a measurable score devoted to evaluating any given metabolic pathway or cycle in large meta- genomic scale. 

5. Future ideas/collaborators needed to further research?

In this study, we focused on evaluate the Sulfur cycle, but we are currently preparing the manuscript for the carbon, nitrogen, oxygen, phosphorous and iron cycles. Furthermore, we are also working in improve MEBS algorithm by using only a list of microorganisms of interest to avoid the manual exhaustive curation of the proteins involved in the metabolic pathway of interest. We are looking forward to collaborating and help other researchers interested in integrate this software platform in large scale analysis (i.e climate change, bioremediation studies, etc)   


Comments

25
Angelo Aquino
over 1 year ago

Oh wow you won the prize?? This is such a cool topic..

quick help
about 1 year ago

In Essay writing help with the Center for Systems Genomics at the University of Melbourne, The Australian Regenerative Medicine Institute at Monash University and Melbourne Bioinformatics, Thinkable is eager to dispatch the inaugural 'Companion Prize' for bioinformatics.

Jack William
about 1 year ago

In this quick case study, I learn lots of things that really important to me.Especially when you talk about dog ear cleaner under $50 and the way you explain each and everything was really good.

Valerie De Anda
11 months ago

We have updated the main script of MEBS to compute with a single script the importance of the main biogeochemical cycles (C,N,O,Fe and S) in metagenomic and genomic data. Please have a look at :
Main Software page: https://eead-csic-compbio.github.io/metagenome_Pfam_score/
Readme: https://eead-csic-compbio.github.io/metagenome_Pfam_score/READMEv1.html
Paper: https://academic.oup.com/gigascience/article/6/11/1/4561660

Justin Brunker
8 months ago

Challenges of the data are done for the inclusion of ten norms for the humans. The phase of the data and research papers writing help is approved for the use of the reforms for the humans. The perspective of the data collector is new for the youngsters.

aliyah brown
7 months ago

Thanks for sharing. I learn lots of things that really important to me spanish dictionary

kelly Leona
6 months ago

What's more, you're correct - my experimental research included a branch of connected humanism (interpersonal organization investigation of correspondence/data stream on assignment service around particular, geologically important open issues ).

Muneer Ahmed
5 months ago

You have a great sense of writing I must say. Your post has those facts which are not accessible from anywhere else. It’s my humble request to u please keep writing such remarkable articles How to access multiple Gmail accounts in one login?

Muneer Ahmed
5 months ago

start with fresh vegetables such as spinach, kale, broccoli or others as your base. In a study published restaurants near me open now

sheeraz khatri
4 months ago

It proved to be Very helpful to me and I am sure to all the commentators here! Relationship Rewrite Method

sheeraz khatri
4 months ago

I must say, I thought this was a pretty interesting read when it comes to this topic. Liked the material. . . . . Relationship Rewrite Method

sheeraz khatri
4 months ago

I see some amazingly important and kept up to length of your strength searching for in your on the site Tetanus

sheeraz khatri
4 months ago

It proved to be Very helpful to me and I am sure to all the commentators here! Anorexia

sheeraz khatri
4 months ago

You re in point of fact a just right webmaster. The website loading speed is amazing. It kind of feels that you're doing any distinctive trick. Moreover, The contents are masterpiece. you have done a fantastic activity on this subject! Mouth Sores

sheeraz khatri
4 months ago

Hi buddies, it is great written piece entirely defined, continue the good work constantly. Knee Injury

sheeraz khatri
4 months ago

This website and I conceive this internet site is really informative ! Keep on putting up! cd duplication services

sheeraz khatri
4 months ago

Vancouver SEO Agency offers complete business listings that let you have dofollow links. This is great for your business and drives up your sites rankings. SEO Vancouver

sheeraz khatri
4 months ago

Hey, great blog, but I don’t understand how to add your site in my rss reader. Can you Help me please? BA 3rd Year Time Table 2019

sheeraz khatri
4 months ago

We offer a wide range of Ready To Assemble(RTA) kitchen cabinets and bathroom vanities. We have plywood boxes and real hardwood doors, as well as top quality MDF doors with great finishes. Modern Shaker style doors, high gloss flat panel doors, or classic styling hardwood doors. We have a large selection of kitchens to choose from. For the DYI or have installation services from the pros. We also offer kitchen countertops in marble, quartz, or arborite finishes. We offer complete kitchen renovations from start to finish, demo to done with the highest in quality every step of the way. kitchen cabinet

Alberta Martin
3 months ago

Good job. I want to thank you for this informative read; I really appreciate sharing this great post. Chaga Pilz kaufen - chagapilz-tee.de

Merck Seo
3 months ago

Truly, this article is really one of the very best in the history of articles. I am a antique ’Article’ collector and I sometimes read some new articles if I find them interesting. And I found this one pretty fascinating and it should go into my collection. Very good work! gutter replacement

Khatri SEO
3 months ago

An obligation of appreciation is all together to share the data, continue doing magnificent... I really savored the experience of exploring your site. incredible resource... 토토

Khatri SEO
3 months ago

The worst part of it was that the software only worked intermittently and the data was not accurate. You obviously canot confront anyone about what you have discovered if the information is not right. Africa news

Khatri SEO
3 months ago

I at last discovered grand post here.I will get back here. I just added your blog to my bookmark districts. thanks.Quality presents is the major on welcome the guests to visit the site page, that is the thing that this site page is giving. William Bronchick 

Image1503436605?1503436605

PhD student at Ecology Institute UNAM Using 'omic' approaches to understand the individual reactions that make life possible on Earth, focusing on how the genes involved in biogeochemcial cycle...

Round: Open Peer Voting
Category: Student Prize

Votes

Ranking

Researcher

13

1

Recent Voters