Las Vegas algorithms for gene recognition: suboptimal and error-tolerant spliced alignment.

abstract

Recently, Gelfand, Mironov and Pevzner (1996) proposed a spliced alignment approach to gene recognition that provides 99% accurate recognition of human genes if a related mammalian protein is available. However, even 99% accurate gene predictions are insufficient for automated sequence annotation in large-scale sequencing projects and therefore have to be complemented by experimental gene verification. One hundred percent accurate gene predictions would lead to a substantial reduction of experimental work on gene identification. Our goal is to develop an algorithm that either predicts an exon assembly with accuracy sufficient for sequence annotation or warns a biologist that the accuracy of a prediction is insufficient and further experimental work is required. We study suboptimal and error-tolerant spliced alignment problems as the first steps towards such an algorithm, and report an algorithm which provides 100% accurate recognition of human genes in 37% of cases (if a related mammalian protein is available). In 52% of genes, the algorithm predicts at least one exon with 100% accuracy.

authors

Sze, Sing-Hoi

published proceedings

J Comput Biol

author list (cited authors)

Sze, S. H., & Pevzner, P. A.

citation count

17

complete list of authors

Sze, SH||Pevzner, PA

publication date

January 1997

publisher

Mary Ann Liebert Publisher

published in

Journal of Computational Biology Journal

keywords

Algorithms
Amino Acid Sequence
Animals
Binding Sites
Genes
Humans
Nucleic Acid Conformation
RNA Splicing
Sequence Alignment

PubMed Central ID

9278061

Digital Object Identifier (DOI)

10.1089/cmb.1997.4.297

start page

297

end page

309

volume

4

issue

3

URL

http://dx.doi.org/10.1089/cmb.1997.4.297

Las Vegas algorithms for gene recognition: suboptimal and error-tolerant spliced alignment. Conference Paper

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

PubMed Central ID

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL