
Computational genomics

Article Id: WHEBN0002571276
Reproduction Date:

Title: Computational genomics  
Author: World Heritage Encyclopedia
Language: English
Subject: Genomics, Bioinformatics, Cytoscape, The Proteolysis Map, TopFIND
Collection: Bioinformatics, Genomics, Omics
Publisher: World Heritage Encyclopedia


Computational genomics (often referred to as computational genetics) is the use of computational and statistical analysis to decipher biology from genome sequences and related data,[1] including both DNA and RNA sequence as well as other "post-genomic" data (that is, experimental data obtained with technologies that require the genome sequence, such as genomic DNA microarrays). Because it combines computational and statistical approaches to understanding gene function with statistical association analysis, the field is also often referred to as computational and statistical genetics/genomics. Computational genomics may therefore be regarded as a subset of bioinformatics and computational biology, but with a focus on using whole genomes (rather than individual genes) to understand how the DNA of a species controls its biology at the molecular level and beyond. With the current abundance of massive biological datasets, computational studies have become one of the most important means of biological discovery.[2]




History

The roots of computational genomics are shared with those of bioinformatics. During the 1960s, Margaret Dayhoff and others at the National Biomedical Research Foundation assembled databases of homologous protein sequences for evolutionary study.[3] Their research produced phylogenetic trees that traced the evolutionary changes required for one protein to change into another, based on the underlying amino acid sequences. This led them to create a scoring matrix that assessed the likelihood of one protein being related to another.
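As a rough illustration of what a single entry in such a scoring matrix can express, the sketch below computes a log-odds score: how much more often two residues are seen aligned in trusted alignments than would be expected by chance. Substitution matrices in the Dayhoff tradition are commonly described in these terms, but the function name `log_odds_score` and all frequencies used here are hypothetical values chosen for illustration, not data from the original work.

```python
import math


def log_odds_score(pair_freq, freq_a, freq_b):
    """Log-odds score, in bits, for seeing residues a and b aligned.

    pair_freq: observed frequency of the aligned pair in trusted alignments
    freq_a, freq_b: background frequencies of the two residues
    """
    expected_by_chance = freq_a * freq_b
    return math.log2(pair_freq / expected_by_chance)


# Hypothetical frequencies, chosen only to show the sign of the score.
# A pair seen more often than chance predicts gets a positive score.
conserved = log_odds_score(pair_freq=0.02, freq_a=0.05, freq_b=0.05)
# A pair seen less often than chance predicts gets a negative score.
rare = log_odds_score(pair_freq=0.00125, freq_a=0.05, freq_b=0.05)

print(round(conserved, 3), round(rare, 3))
```

Positive entries mark substitutions that appear more often than chance among related proteins, and negative entries mark substitutions that appear less often.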

Beginning in the 1980s, databases of genome sequences began to be recorded, which presented new challenges in searching and comparing databases of gene information. Unlike the text-searching algorithms used on websites such as Google or Wikipedia, searching for sections of genetic similarity requires finding strings that are not simply identical, but similar. The Needleman-Wunsch algorithm, a dynamic programming algorithm first published in 1970, addresses this problem by aligning pairs of sequences, and for amino acid sequences it can use scoring matrices derived from the earlier research by Dayhoff. Later, the BLAST algorithm was developed for performing fast, optimized searches of gene sequence databases. BLAST and its derivatives are probably the most widely used algorithms for this purpose.[4]
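The dynamic programming idea behind Needleman-Wunsch global alignment can be sketched in a few lines. This is a minimal illustration, assuming a simple match/mismatch/gap scheme (the default values +1, -1, -1 are arbitrary choices for this example) in place of a full amino acid scoring matrix; the function name `needleman_wunsch` is likewise only illustrative.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align strings a and b; return (score, aligned_a, aligned_b)."""
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * cols for _ in range(rows)]
    for i in range(1, rows):
        score[i][0] = i * gap  # a[:i] aligned against gaps only
    for j in range(1, cols):
        score[0][j] = j * gap  # b[:j] aligned against gaps only

    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )

    # Trace back from the bottom-right cell to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = len(a), len(b)
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[len(a)][len(b)], "".join(reversed(out_a)), "".join(reversed(out_b))


total, top, bottom = needleman_wunsch("ACGT", "AGT")
print(total)
print(top)
print(bottom)
```

Replacing the match/mismatch rule with a lookup into a substitution matrix would turn this sketch into the protein-oriented form described above. The table costs time and memory proportional to the product of the two sequence lengths, which is one reason faster heuristic tools such as BLAST are preferred for searching whole databases.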

The emergence of the phrase "computational genomics" coincides with the availability of complete sequenced genomes in the mid-to-late 1990s. The first meeting of the Annual Conference on Computational Genomics was organized by scientists from The Institute for Genomic Research (TIGR) in 1998, providing a forum for this speciality and effectively distinguishing this area of science from the more general fields of Genomics or Computational Biology.[5][6] The first use of this term in scientific literature, according to MEDLINE abstracts, was just one year earlier in Nucleic Acids Research.[7] The final Computational Genomics conference was held in 2006, featuring a keynote talk by Nobel Laureate Barry Marshall, co-discoverer of the link between Helicobacter pylori and stomach ulcers. As of 2014, the leading conferences in the field include Intelligent Systems for Molecular Biology (ISMB) and RECOMB.

The development of computer-assisted mathematics (using products such as Mathematica or Matlab) has helped engineers, mathematicians and computer scientists to start working in this domain, and a public collection of case studies and demonstrations is growing, ranging from whole-genome comparisons to gene expression analysis.[8] This has brought in a wider range of ideas, including concepts from systems and control, information theory, string analysis and data mining. Computational approaches are expected to become and remain a standard topic for research and teaching, as the many courses created in recent years begin to train students fluent in both computation and genomics.

Contributions of computational genomics research to biology

Contributions of computational genomics research to biology include:[2][9]

  • discovering subtle patterns in genomic sequences[9]
  • proposing cellular signalling networks
  • proposing mechanisms of genome evolution
  • predicting precise locations of all human genes using comparative genomics techniques with several mammalian and vertebrate species
  • predicting conserved genomic regions that are related to early embryonic development
  • discovering potential links between repeated sequence motifs and tissue-specific gene expression
  • measuring regions of genomes that have undergone unusually rapid evolution

Latest Development (from 2012)

First Computer Model of an Organism

Researchers at Stanford University produced the first complete computer model of an organism.[10][11][12]

Such a ‘silicon cell’ could act as a computerized laboratory, performing experiments that are difficult to do on an actual organism or carrying out procedures much faster. Applications include faster screening of new compounds and a better understanding of basic cellular principles and behavior.[10][12]

References


  1. ^ Koonin EV (March 2001). "Computational genomics". Curr. Biol. 11 (5): R155–8.  
  2. ^ a b Computational Genomics and Proteomics at MIT
  3. ^ Mount, David (2000). Bioinformatics, Sequence and Genome Analysis. Cold Spring Harbor Laboratory Press. pp. 2–3.  
  4. ^ Brown, T.A. (1999). Genomes. Wiley.  
  5. ^ The 7th Annual Conference on Computational Genomics (2004)
  6. ^ The 9th Annual Conference on Computational Genomics (2006)
  7. ^ Wagner A (September 1997). "A computational genomics approach to the identification of gene networks". Nucleic Acids Res. 25 (18): 3594–604.  
  8. ^ Cristianini, N.; Hahn, M. (2006). Introduction to Computational Genomics. Cambridge University Press.  
  9. ^ a b Gagniuc, P; Ionescu-Tirgoviste, C (Sep 28, 2012). "Eukaryotic genomes may exhibit up to 10 generic classes of gene promoters.". BMC Genomics 13: 512.  
  10. ^ a b McClure, Max (19 July 2012). "Stanford researchers produce first complete computer model of an organism". Stanford University News. Retrieved 3 August 2012. 
  11. ^ Karr JR, Sanghvi JC, Macklin DN; et al. (July 2012). "A whole-cell computational model predicts phenotype from genotype". Cell 150 (2): 389–401.  
  12. ^ a b John Markoff (20 July 2012). "In First, Software Emulates Lifespan of Entire Organism".  

External links

  • Harvard Extension School Biophysics 101, Genomics and Computational Biology
  • University of Bristol course in Computational Genomics