Variant detection

NGS is frequently used to identify mutations in DNA samples from individual patients or experimental organisms. Sequencing can be done at the whole-genome scale; RNA-seq, which targets expressed genes; exome capture, which targets specific exon regions captured by hybridization to probes of known sequence; or amplicons for genes or regions of interest. In all cases, sequence variants are detected by alignment of NGS reads to a reference sequence and then identification of differences between the reads and the reference. Variant detection algorithms must distinguish between random sequencing errors, differences caused by incorrect alignment, and true variants in the genome of the target organism. Various combinations of base quality scores, alignment quality scores, depth of coverage, variant allele frequency, and the presence of nearby sequence variants and indels are all used to differentiate true variants from false positives. Recent algorithms have also made use of machine learning methods based on training sets of genotype data or large sets of samples from different patients/organisms that are sequenced in parallel with the same sample preparation methods on the same NGS machines.

Tags
Data and Resources
To access the resources you must log in
References
Field Value
PublicationTitle Glossary of Sequencing Terms
PublicationType Website
PublicationYear Accessed: 16.05.2019
Publisher DNA Link Sequencing Lab
Website https://www.dnalinkseqlab.com/glossary/
ZoteroURL https://www.zotero.org/groups/2344323/orion/items/itemKey/YAPAR7FS
Glossary Term Classification
Field Value
Category Data processing and analysis; Sampling and Laboratory testing
ModifiedDefinition false
Sector Shared Definition
Additional Info
Field Value
Provided by: EJP ORION project
system:type GlossaryTerm
Management Info
Field Value
Author taras_guenther
Last Updated 29 April 2020, 02:25 (CEST)
Created 3 September 2019, 13:20 (CEST)