Bayesian prediction of microbial oxygen requirement

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Bayesian prediction of microbial oxygen requirement. / Jensen, Dan B.; Ussery, David W.

I: F1000Research, Bind 2, 184, 2013.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Jensen, DB & Ussery, DW 2013, 'Bayesian prediction of microbial oxygen requirement', F1000Research, bind 2, 184. https://doi.org/10.12688/f1000research.2-184.v1

APA

Jensen, D. B., & Ussery, D. W. (2013). Bayesian prediction of microbial oxygen requirement. F1000Research, 2, [184]. https://doi.org/10.12688/f1000research.2-184.v1

Vancouver

Jensen DB, Ussery DW. Bayesian prediction of microbial oxygen requirement. F1000Research. 2013;2. 184. https://doi.org/10.12688/f1000research.2-184.v1

Author

Jensen, Dan B. ; Ussery, David W. / Bayesian prediction of microbial oxygen requirement. I: F1000Research. 2013 ; Bind 2.

Bibtex

@article{d7a8335ddbb945b88618af80c736df35,
title = "Bayesian prediction of microbial oxygen requirement",
abstract = "Background: Prediction of the optimal habitat conditions for a given bacterium, based on genome sequence alone would be of value for scientific as well as industrial purposes. One example of such a habitat adaptation is the requirement for oxygen. In spite of good genome data availability, there have been only a few prediction attempts of bacterial oxygen requirements, using genome sequences. Here, we describe a method for distinguishing aerobic, anaerobic and facultative anaerobic bacteria, based on genome sequence-derived input, using naive Bayesian inference. In contrast, other studies found in literature only demonstrate the ability to distinguish two classes at a time. Results: The results shown in the present study are as good as or better than comparable methods previously described in the scientific literature, with an arguably simpler method, when results are directly compared. This method further compares the performance of a single-step naive Bayesian prediction of the three included classifications, compared to a simple Bayesian network with two steps. A two-step network, distinguishing first respiring from non-respiring organisms, followed by the distinction of aerobe and facultative anaerobe organisms within the respiring group, is found to perform best. Conclusions: A simple naive Bayesian network based on the presence or absence of specific protein domains within a genome is an effective and easy way to predict bacterial habitat preferences, such as oxygen requirement.",
author = "Jensen, {Dan B.} and Ussery, {David W.}",
note = "Publisher Copyright: {\textcopyright} 2013 Jensen DB and Ussery DW.",
year = "2013",
doi = "10.12688/f1000research.2-184.v1",
language = "English",
volume = "2",
journal = "F1000Research",
issn = "2046-1402",
publisher = "F1000Research",

}

RIS

TY - JOUR

T1 - Bayesian prediction of microbial oxygen requirement

AU - Jensen, Dan B.

AU - Ussery, David W.

N1 - Publisher Copyright: © 2013 Jensen DB and Ussery DW.

PY - 2013

Y1 - 2013

N2 - Background: Prediction of the optimal habitat conditions for a given bacterium, based on genome sequence alone would be of value for scientific as well as industrial purposes. One example of such a habitat adaptation is the requirement for oxygen. In spite of good genome data availability, there have been only a few prediction attempts of bacterial oxygen requirements, using genome sequences. Here, we describe a method for distinguishing aerobic, anaerobic and facultative anaerobic bacteria, based on genome sequence-derived input, using naive Bayesian inference. In contrast, other studies found in literature only demonstrate the ability to distinguish two classes at a time. Results: The results shown in the present study are as good as or better than comparable methods previously described in the scientific literature, with an arguably simpler method, when results are directly compared. This method further compares the performance of a single-step naive Bayesian prediction of the three included classifications, compared to a simple Bayesian network with two steps. A two-step network, distinguishing first respiring from non-respiring organisms, followed by the distinction of aerobe and facultative anaerobe organisms within the respiring group, is found to perform best. Conclusions: A simple naive Bayesian network based on the presence or absence of specific protein domains within a genome is an effective and easy way to predict bacterial habitat preferences, such as oxygen requirement.

AB - Background: Prediction of the optimal habitat conditions for a given bacterium, based on genome sequence alone would be of value for scientific as well as industrial purposes. One example of such a habitat adaptation is the requirement for oxygen. In spite of good genome data availability, there have been only a few prediction attempts of bacterial oxygen requirements, using genome sequences. Here, we describe a method for distinguishing aerobic, anaerobic and facultative anaerobic bacteria, based on genome sequence-derived input, using naive Bayesian inference. In contrast, other studies found in literature only demonstrate the ability to distinguish two classes at a time. Results: The results shown in the present study are as good as or better than comparable methods previously described in the scientific literature, with an arguably simpler method, when results are directly compared. This method further compares the performance of a single-step naive Bayesian prediction of the three included classifications, compared to a simple Bayesian network with two steps. A two-step network, distinguishing first respiring from non-respiring organisms, followed by the distinction of aerobe and facultative anaerobe organisms within the respiring group, is found to perform best. Conclusions: A simple naive Bayesian network based on the presence or absence of specific protein domains within a genome is an effective and easy way to predict bacterial habitat preferences, such as oxygen requirement.

UR - http://www.scopus.com/inward/record.url?scp=84971012158&partnerID=8YFLogxK

U2 - 10.12688/f1000research.2-184.v1

DO - 10.12688/f1000research.2-184.v1

M3 - Journal article

AN - SCOPUS:84971012158

VL - 2

JO - F1000Research

JF - F1000Research

SN - 2046-1402

M1 - 184

ER -

ID: 292229390