Computational Biology and Bioinformatics

Biology has, to a large extent, become an information science. Given the quantity of biological data being generated, for example by high-throughput sequencing techniques, the data can only be processed meaningfully by computer.

The processing of the data and the subsequent search for patterns (sequence and structure) in DNA (genome), RNA and proteins help us to identify functional genomic components. To do this, however, programs need to be efficient so that run times are kept short.

The group has a general focus on animal genomics, including non-coding RNAs (ncRNAs), structure and interactions, CRISPR and analysis of high-throughput sequencing data. ncRNAs are rapidly becoming a central focus of genomic biology, and given that only ~1% of the (~3 billion base) mammalian genome encodes proteins, the potential for the genome to host many ncRNAs is large.

In our group we develop new computational methods (computational biology) and set up pipelines for genome annotation (bioinformatics). We relate the findings to diseases and other phenotypes. We work with animal models of human disease, and we study bacteria used as cell factories in industrial contexts, with the aim of understanding production yield.

The group hosts the Center for non-coding RNA in Technology and Health (see details at http://rth.dk), which takes a new approach to disease studies by searching for ncRNAs and structured RNAs as disease components and biomarkers. It does so by developing in silico search tools for ncRNA analysis, complemented by experimental analysis and further functional studies. The disease focus is on inflammatory diseases and diabetes, using human and animal material.

Recent selected publications:

  • CRISPR/Cas9 gRNA activity depends on free energy changes and on the target PAM context. Corsi GI, Qu K, Alkan F, Pan X, Luo Y*, Gorodkin J* Nature Communications 2022, 13(1):3006.
  • The impact of PrsA over-expression on the Bacillus subtilis transcriptome during fed-batch fermentation of alpha-amylase production. Geissler AS, Poulsen LD, Doncheva NT, Anthon C, Seemann SE, González-Tortuero E, Breüner A, Jensen LJ, Hjort C, Vinther J, Gorodkin J* Frontiers in Microbiology 2022, 13.
  • CRISPRroots: on- and off-target assessment of RNA-seq data in CRISPR-Cas9 edited cells. Corsi GI, Gadekar VP, Gorodkin J*, Seemann SE* Nucleic Acids Res. 2022, 50(4):e20.
  • Massively targeted evaluation of therapeutic CRISPR off-targets in cells. Pan X, Qu K, Yuan H, Xiang X, Anthon C, Pashkova L, Liang X, Han P, Corsi GI, Xu F, Liu P, Zhong J, Zhou Y, Ma T, Jiang H, Liu J, Wang J, Jessen N, Bolund L, Yang H, Xu X, Church GM*, Gorodkin J*, Lin L*, Luo Y* Nature Communications 2022 Jul 13;13(1):4049.
  • The Bacillaceae-1 RNA motif comprises two distinct classes. Gonzalez-Tortuero E, Anthon C, Havgaard JH, Geissler AS, Breuner A, Hjort C, Gorodkin J*, Seemann SE* Gene. 2022 Jul 26;841:146756.
  • Does rapid sequence divergence preclude RNA structure conservation in vertebrates? Seemann SE*, Mirza AH, Bang-Berthelsen CH, Garde C, Christensen-Dalsgaard M, Workman CT, Pociot F, Tommerup N, Gorodkin J, Ruzzo WL Nucleic Acids Res. 2022 Mar 21;50(5):2452-2463.
  • A non-enzymatic, isothermal strand displacement and amplification assay for rapid detection of SARS-CoV-2 RNA. Mohammadniaei M, Zhang M, Ashley J, Christensen UB, Friis-Hansen LJ, Gregersen R, Lisby JG, Benfield TL, Nielsen FE, Henning Rasmussen J, Pedersen EB, Olinger ACR, Kolding LT, Naseri M, Zheng T, Wang W, Gorodkin J, Sun Y. Nature Communications 2021, 12(1):5089.
  • Enhancing CRISPR-Cas9 gRNA efficiency prediction by data integration and deep learning. Xiang X, Corsi GI, Anthon C, Qu K, Pan X, Liang X, Han P, Dong Z, Liu L, Zhong J, Ma T, Wang J, Zhang X, Jiang H, Xu F, Liu X, Xu X, Wang J, Yang H, Bolund L, Church GM, Lin L, Gorodkin J*, Luo Y* Nature Communications 2021, 12(1):3238.
  • Human pathways in animal models: possibilities and limitations. Doncheva NT*, Palasca O, Yarani R, Litman T, Anthon C, Groenen MAM, Stadler PF, Pociot F, Jensen LJ*, Gorodkin J*. Nucleic Acids Res. 2021.
  • BSGatlas: a unified Bacillus subtilis genome and transcriptome annotation atlas with enhanced information access. Geissler AS, Anthon C, Alkan F, Gonzalez-Tortuero E, Poulsen LD, Kallehauge TB, Breuner A, Seemann SE, Vinther J, Gorodkin J. Microb Genom. 2021 Feb 4.