Finding Genes in DNA with a Hidden Markov Model

dc.contributor.authorHenderson, John
dc.contributor.authorSalzberg, Steven
dc.contributor.authorFasman, Kenneth H
dc.date.accessioned2008-06-18T13:26:43Z
dc.date.available2008-06-18T13:26:43Z
dc.date.issued1997
dc.description.abstractThis study describes a new Hidden Markov Model (HMM) system for segmenting uncharacterized genomic DNA sequences into exons, introns, and intergenic regions. Separate HMM modules were designed and trained for specific regions of DNA: exons, introns, intergenic regions, and splice sites. The models were then tied together to form a biologically feasible topology. The integrated HMM was trained further on a set of eukaryotic DNA sequences, and tested by using it to segment a separate set of sequences. The resulting HMM system, which is called VEIL (Viterbi Exon-Intron Locator), obtains an overall accuracy on test data of 92% of total bases correctly labelled, with a correlation coefficient of 0.73. Using the more stringent test of exact exon prediction, VEIL correctly located both ends of 53% of the coding exons, and 49% of the exons it predicts are exactly correct. These results compare favorably to the best previous results for gene structure prediction, and demonstrate the benefits of using HMMs for this problem.en
dc.format.extent249205 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.citationFinding Genes in Human DNA with a Hidden Markov Model. J. Henderson, S.L. Salzberg, and K. Fasman. This describes the VEIL system for finding genes. Journal of Computational Biology 4:2 (1997), 127-141.en
dc.identifier.urihttp://hdl.handle.net/1903/8004
dc.language.isoen_USen
dc.publisherJournal of Computational Biologyen
dc.relation.isAvailableAtCollege of Computer, Mathematical & Physical Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectHidden Markov Model (HMMM)en
dc.subjectexonsen
dc.subjectintronsen
dc.subjectintergenic regionsen
dc.subjectDNAen
dc.subjectVEIL (Viterbi Exon/Intron Locator)en
dc.titleFinding Genes in DNA with a Hidden Markov Modelen
dc.typeArticleen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
FindingGenes.pdf
Size:
243.36 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.8 KB
Format:
Item-specific license agreed upon to submission
Description: