Comparative genomics - A perspective (2024)

  • Journal List
  • Bioinformation
  • v.1(9); 2007
  • PMC1891719

As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsem*nt of, or agreement with, the contents by NLM or the National Institutes of Health.
Learn more: PMC Disclaimer | PMC Copyright Notice

Comparative genomics - A perspective (1)

Link to Publisher's site

Bioinformation. 2007; 1(9): 376–378.

Published online 2007 Mar 27. doi:10.6026/97320630001376

PMCID: PMC1891719

PMID: 17597925

Selvarajan Sivashankari1 and Piramanayagam Shanmughavel2,*

Abstract

The rapidly emerging field of comparative genomics has yielded dramatic results. Comparative genome analysis has become feasible with the availability of a number of completely sequenced genomes. Comparison of complete genomes between organisms allow for global views on genome evolution and the availability of many completely sequenced genomes increases the predictive power in deciphering the hidden information in genome design, function and evolution. Thus, comparison of human genes with genes from other genomes in a genomic landscape could help assign novel functions for un-annotated genes. Here, we discuss the recently used techniques for comparative genomics and their derived inferences in genome biology.

Keywords: comparative genomics, genome correspondence, gene identification, genome evolution

Background

As on Jan 25, 2007, 472 genomes are completely sequenced and yet another 498 are in progress. The rapid progress in genome sequencing demands more comparativeanalysis to gain new insights into evolutionary, biochemical, genetic, metabolic, and physiological pathways. Comparative genomics is the direct comparisonof complete genetic material of one organism against that of another to gain a better understanding of how species evolved and to determine the function of genes and noncodingregions in genomes. It includes a comparison of gene number, gene content, and gene location, the length and number of coding regions (called exons) within genes,the amount of non coding DNA in each genome, and conserved regions maintained in both prokaryotic and eukaryotic groups of organisms. Comparative genomics notonly can trace out the evolutionary relationship between organisms but also differences and similarities within and between species. The difference between humans and otherorganisms can be obtained by comparative investigations. For the purpose of documenting the distinctive features of humans, the most informative research involves comparinghumans to our closest relatives, the chimpanzees and apes.

Methodology

Genome correspondence

Genome correspondence [1], the method of determining the correct correspondence of chromosomal segments andfunctional elements across the species compared is the first step in comparative genomics. This involves determining orthologous (genes diverged after a speciation event)segments of DNA that descend from the same region in the common ancestor of the species compared, and paralogous (genes diverged after a duplication event) regions that aroseby duplication events prior to the divergence of the species compared. The mapping of regions across two genomes can be one-to-one in absence of duplication events; one-tomanyif a region has undergone duplication or loss in one of the species, or many-to-many if duplication/loss has occurred in both lineages. Fitch et al., [2] developed amethod called BBH (Best Bidirectional Hits), which identifies gene pairs that are best matches of each other as orthologous. Tatusov et al., [3] further enhanced thismethod, which matches groups of genes to groups of genes.

Understanding the ancestry of the functional elements compared is central to our understanding and applications of genome comparison. Most comparative methods havefocused on one-to-one orthologous regions, but it is equally important to recognize which segments have undergone duplication events, and which segments were lost since thedivergence of the species. Comparing segments that arose before the divergence of the species may result in the wrong interpretations of sequence conservation anddivergence. Further, in the presence of gene duplication, some of the evolutionary constraints that a region is under are relieved, and uniform models of evolution no longercapture the underlying selection for these sites. Thus, our methods for determining gene correspondence should account for duplication and loss events, and ensure that the segments we compare are orthologous.

Applications

Gene identification

Once genome correspondence is established, comparative genomics can aid gene identification. Comparative genomics can recognize real genes based on their patternsof nucleotide conservation across evolutionary time. With the availability of genome-wide alignments across the genomes compared, the different ways by which sequenceschange in known genes and in intergenic regions can be analyzed. The alignments of known genes will reveal the conservation of the reading frame of protein translation.

The genome of a species encodes genes and other functional elements, interspersed with non-functional nucleotides in a single uninterrupted string of DNA.Recognizing protein-coding genes typically relies on finding stretches of nucleotides free of stop codons (called Open Reading Frames, or ORFs) that are too long to havelikely occurred by chance. Since stop codons occur at a frequency of roughly 1 in 20 in random sequence, ORFs of at least 60 amino acids will occur frequently by chance (5%under a simple Poisson model), and even ORFs of 150 amino acids will appear by chance in a large genome (0.05%). This poses a huge challenge for higher eukaryotesin which genes are typically broken into many, small exons (on average 125 nucleotides long for internal exons) in mammals. The basic problem is distinguishing real genes –those ORFs encoding a translated protein product – from spurious ORFs – the remaining ORFs whose presence is simply due to chance. In mammalian genomes, estimates ofhypothetical genes have ranged from 28,000 to more than 120,000 genes. The internal coding exons were easily identified using Comparative analysis of human genomewith mouse genome. [4]

Regulatory motif discovery

Regulatory motifs are short DNA sequences about 6 to 15bp long that are used to control the expression of genes, dictating the conditions under which a gene will be turnedon or off. Each motif is typically recognized by a specific DNA-binding protein called a transcription factor (TF). A transcription factor binds precise sites in the promoterregion of target genes in a sequence-specific way, but this contact can tolerate some degree of sequence variation. Thus, different binding sites may contain slight variationsof the same underlying motif, and the definition of a regulatory motif should capture these variations while remaining as specific as possible. Comparative genomicsprovides a powerful way to distinguish regulatory motifs from non-functional patterns based on their conservation. One such example is the identification of TF DNA-bindingmotif [5] using comparative genomics and denovo motif. The regulatory motifs of the Human Promoters wereidentified by comparison with other mammals. [6] Yet another important finding is the gene and regulatoryelement by comparison of yeast species. [7]

Other applications

Comparative genomics has wide applications in the field of molecular medicine and molecular evolution. The most significant application of comparative genomics inmolecular medicine is the identification of drug targets of many infectious diseases. For example, comparative analyses of fungal genomes have led to the identification ofmany putative targets for novel antifungal. [8] This discovery can aid in target based drug design to cure fungaldiseases in human. Comparative analysis of genomes of individuals with genetic disease against healthy individuals may reveal clues of eliminating that disease.

Comparative genomics helps in selecting model organisms. A model system [9] is a simple, idealized system that canbe accessible and easily manipulated. For example, a comparison of the fruit fly genome with the human genome discovered that about 60 percent of genes are conservedbetween fly and human. Researchers have found that two-thirds of human genes known to be involved in cancer have counterparts in the fruit fly. Even more surprisingly, whenscientists inserted a human gene associated with early-onset Parkinson's disease into fruit flies, they displayed symptoms similar to those seen in humans with thedisorder, raising the possibility that the tiny insects could serve as a new model for testing therapies aimed at Parkinson's. Thus, comparative genomics may providegene functional annotation. Gene finding is an important application of comparative genomics. Comparative genomics identify Synteny (genes present in the same orderin the genomes) and hence reveal gene clusters.

Comparative genomics also helps in the clustering of regulatory sites [10], which can help in the recognition ofunknown regulatory regions in other genomes. The metabolic pathway regulation can also be recognized by means of comparative genomics of a species. Dmitry andcolleagues [11] have identified the regulons of methionine metabolism in gram-positive bacteria using comparativegenomics analysis. Similarly Kai Tan [12] and colleagues have identified regulatory networks of H. influenzae bycomparing its genome with that of E. coli. The adaptive properties of organisms [13] like evolution of sex, genesilencing can also be correlated to genome sequence by comparative genomics.

Conclusion

The most unexpected finding in comparing [14] the mouse and human genomes lies in the similarities between “junk”DNA, mostly retro-transposons, (transposons copied from mRNA by reverse transcriptase) in the two species. A survey of the location of retrotransposon DNA in bothspecies shows that it has independently ended up in comparable regions of the genome. Thus “junk” DNA may have more of a function than was previously assumed. Highperformance computing tools help in comparing huge genomes. Because of its wide applications and feasibility, automation of comparing genomics is possible. [15] SuchComparisons can aid in predicting the function of numerous hypothetical proteins.

Acknowledgments

The authors wish to acknowledge DBT-Bioinformatics Infrastructure Facility, Bharathiar University, Coimbatore 641 046 for providing facilities for their work.

Footnotes

Citation:Sivashankari & Shanmughavel, Bioinformation 1(9): 376-378 (2007)

References

1. Kellis M, et al. J Comput Biol. 2004;11:319. [PubMed] [Google Scholar]

2. Fitch WM, et al. Syst Zool. 1970;19:99. [PubMed] [Google Scholar]

3. Fitch WM, et al. Philos Trans R Soc Lond B Biol Sci. 1995;349:93. [PubMed] [Google Scholar]

4. Batzoglou S, et al. Genome Res. 2000;10:950. [PMC free article] [PubMed] [Google Scholar]

5. Mao L, Zheng WJ. BMC Bioinformatics. 2006;7:s21. [PMC free article] [PubMed] [Google Scholar]

6. Xie X, et al. Nature. 2005;434:338. [PMC free article] [PubMed] [Google Scholar]

7. Kellis M, et al. Nature. 2003;423:241. [PubMed] [Google Scholar]

8. Odds FC, et al. Rev Iberoam Micol. 2005;22:229. [PubMed] [Google Scholar]

9. Preuss TM, et al. Journal of Biomedical Discovery and Collaboration. 2006;1:17. [PMC free article] [PubMed] [Google Scholar]

10. Nimwegen EV, et al. PNAS. 2002;99:7323. [Google Scholar]

11. Dmitry A, et al. Nucleic Acids Research. 2004;32:3340. [PMC free article] [PubMed] [Google Scholar]

12. Tan K, et al. Genome Res. 2001;11:566. [PMC free article] [PubMed] [Google Scholar]

13. Fabre E, et al. Mol Biol Evol. 2005;22:856. [PubMed] [Google Scholar]

14. Preuss TM, et al. Journal of Biomedical Discovery and Collaboration. 2006;1:17. [PMC free article] [PubMed] [Google Scholar]

15. Alm EJ, et al. Genome Res. 2005;15:1015. [PMC free article] [PubMed] [Google Scholar]

Articles from Bioinformation are provided here courtesy of Biomedical Informatics Publishing Group

Comparative genomics - A perspective (2024)
Top Articles
What Are the 3 Basic Functions of a Finance Manager?
Best Personal Loans for Fair Credit In Canada for September 2024
Tiny Tina Deadshot Build
Unitedhealthcare Hwp
Get train & bus departures - Android
Otterbrook Goldens
The Best English Movie Theaters In Germany [Ultimate Guide]
7543460065
Geometry Escape Challenge A Answer Key
Ucf Event Calendar
13 The Musical Common Sense Media
Jack Daniels Pop Tarts
Reddit Wisconsin Badgers Leaked
Busty Bruce Lee
Sony E 18-200mm F3.5-6.3 OSS LE Review
Youravon Comcom
Rachel Griffin Bikini
Craiglist Kpr
Wgu Academy Phone Number
Decosmo Industrial Auctions
Fsga Golf
Filthy Rich Boys (Rich Boys Of Burberry Prep #1) - C.M. Stunich [PDF] | Online Book Share
The Many Faces of the Craigslist Killer
How to Make Ghee - How We Flourish
2021 MTV Video Music Awards: See the Complete List of Nominees - E! Online
Workshops - Canadian Dam Association (CDA-ACB)
Chelsea Hardie Leaked
Bfsfcu Truecar
Keshi with Mac Ayres and Starfall (Rescheduled from 11/1/2024) (POSTPONED) Tickets Thu, Nov 1, 2029 8:00 pm at Pechanga Arena - San Diego in San Diego, CA
How To Improve Your Pilates C-Curve
Busch Gardens Wait Times
Wcostream Attack On Titan
Aladtec Login Denver Health
Greencastle Railcam
Best Workers Compensation Lawyer Hill & Moin
Ewwwww Gif
Case Funeral Home Obituaries
Myql Loan Login
How To Paint Dinos In Ark
Linda Sublette Actress
2700 Yen To Usd
The Realreal Temporary Closure
Homeloanserv Account Login
Bekah Birdsall Measurements
Login
Gabrielle Abbate Obituary
20 Mr. Miyagi Inspirational Quotes For Wisdom
Washington Craigslist Housing
Model Center Jasmin
One Facing Life Maybe Crossword
Escape From Tarkov Supply Plans Therapist Quest Guide
Latest Posts
Article information

Author: Rev. Porsche Oberbrunner

Last Updated:

Views: 5811

Rating: 4.2 / 5 (53 voted)

Reviews: 92% of readers found this page helpful

Author information

Name: Rev. Porsche Oberbrunner

Birthday: 1994-06-25

Address: Suite 153 582 Lubowitz Walks, Port Alfredoborough, IN 72879-2838

Phone: +128413562823324

Job: IT Strategist

Hobby: Video gaming, Basketball, Web surfing, Book restoration, Jogging, Shooting, Fishing

Introduction: My name is Rev. Porsche Oberbrunner, I am a zany, graceful, talented, witty, determined, shiny, enchanting person who loves writing and wants to share my knowledge and understanding with you.