Abstract
Abstract Summary: I propose a new application of profile Hidden Markov Models in the area of SNP discovery from resequencing data, to greatly reduce false SNP calls caused by misalignments around insertions and deletions (indels). The central concept is per-Base Alignment Quality, which accurately measures the probability of a read base being wrongly aligned. The effectiveness of BAQ has been positively confirmed on large datasets by the 1000 Genomes Project analysis subgroup. Availability: http://samtools.sourceforge.net Contact: hengli@broadinstitute.org
Keywords
Affiliated Institutions
Related Publications
An initial map of insertion and deletion (INDEL) variation in the human genome
Although many studies have been conducted to identify single nucleotide polymorphisms (SNPs) in humans, few studies have been conducted to identify alternative forms of natural ...
CLEVER: clique-enumerating variant finder
Abstract Motivation: Next-generation sequencing techniques have facilitated a large-scale analysis of human genetic variation. Despite the advances in sequencing speed, the comp...
Hidden Markov Models in Computational Biology
Hidden Markov Models (HMMs) are applied to the problems of statistical modeling, database searching and multiple sequence alignment of protein families and protein domains. Thes...
Publication Info
- Year
- 2011
- Type
- article
- Volume
- 27
- Issue
- 8
- Pages
- 1157-1158
- Citations
- 332
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1093/bioinformatics/btr076