A Practical Exact Algorithm for the Individual Haplotyping Problem MEC/GI Conference Paper uri icon

abstract

  • Haplotypes play an important role in genetic association studies of complex diseases. Recently, computational techniques helping to determine human haplotypes were studied extensively. Given the genotype and the aligned single nucleotide polymorphism (SNP) fragments of an individual, Minimum Error Correction with Genotype Information (MEC/GI) is an important computational model to infer a pair of haplotypes compatible with the genotype by correcting minimum number of SNPs in the given SNP fragments. The MEC/GI problem has been proven NP-hard, for which there is no practical exact algorithm. Despite the rapid advances in molecular biological techniques, modern high-throughput sequencers cannot sequence directly a DNA fragment that contains more than 1200 nucleotide bases. With low SNP density, current available data reveal that the number k of SNP sites that a DNA fragment covers is usually smaller than 10. Based on the above fact, we develop a new dynamic programming algorithm with running time O(mk2 k +mlogm+mk), where m is the number of fragments. Since k is small in real biological applications, the algorithm is practical and efficient. 2009 Springer Science+Business Media, LLC.

published proceedings

  • ALGORITHMICA

author list (cited authors)

  • Wang, J., Xie, M., & Chen, J.

citation count

  • 11

complete list of authors

  • Wang, Jianxin||Xie, Minzhu||Chen, Jianer

publication date

  • March 2010