Improving SNP discovery by base alignment quality

Heng Li Heng Li
2011 Bioinformatics 332 citations

Abstract

Abstract Summary: I propose a new application of profile Hidden Markov Models in the area of SNP discovery from resequencing data, to greatly reduce false SNP calls caused by misalignments around insertions and deletions (indels). The central concept is per-Base Alignment Quality, which accurately measures the probability of a read base being wrongly aligned. The effectiveness of BAQ has been positively confirmed on large datasets by the 1000 Genomes Project analysis subgroup. Availability: http://samtools.sourceforge.net Contact: hengli@broadinstitute.org

Keywords

IndelSNPBase (topology)Computer scienceHidden Markov modelData miningComputational biologyMarkov chainINDEL MutationQuality (philosophy)GeneticsArtificial intelligenceBiologyMachine learningSingle-nucleotide polymorphismMathematicsGene

Affiliated Institutions

Related Publications

Publication Info

Year
2011
Type
article
Volume
27
Issue
8
Pages
1157-1158
Citations
332
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

332
OpenAlex

Cite This

Heng Li (2011). Improving SNP discovery by base alignment quality. Bioinformatics , 27 (8) , 1157-1158. https://doi.org/10.1093/bioinformatics/btr076

Identifiers

DOI
10.1093/bioinformatics/btr076