By Sadaf Tariq (M.Phil Scholar)
Ambreen Zahra (M.Sc Scholar)
Next-Generation Sequencing, an innovation in science
Abstract
Next-generation
sequencing (NGS) is also known as high throughput sequencing. The Human Genome Project was completed by using
Sanger sequencing in over 10 years and its cost was nearly $3 billion (Sanger et
al., 1977). NGS is an innovative sequencing method during which millions of
parallel reactions take place for high throughput sequencing. Next-generation
sequencing makes it easier to sequence the whole genome in a single experiment.
With its ultra-high-speed, scalability, and throughput, NGS has completely
revolutionized biological sciences. This parallel sequencing technique is used
to study genomes at the next level. NGS
has filled the gap to address all the complex genomics questions and has become
an everyday research technology. With innovative data analysis options,
researchers can easily sequence the whole genome within a day. NGS is also playing
its role to uncover the genetic susceptibility of mildly ill COVID-19 patients.
Introduction
DNA sequencing is a process to determine the nucleotides
sequence of the targeted area in DNA. Sanger sequencing was the first
commercialized method of DNA sequencing introduced in 1977 and remained dominated
for many decades. It requires a single-stranded DNA template, DNA primers, DNA
polymerases, dNTPs, and modified ddNTPs. These modified ddNTPs lack 2' and 3' hydroxyl group, required for the formation of a phosphodiester bond between two nucleotides. Hence, it ceases the extension of
DNA. These are radioactively or fluorescently labeled for detection. NGS is at an
advantage to conventional techniques for its high throughput and massively parallel
sequencing at a reduced cost. Different technologies are involved in NGS: Illumina sequencing works by simultaneous
detection by the addition of nucleotides and emits unique fluorescent signals upon the addition of labeled nucleotides. Roche
sequencing detects the release of phosphate group as fluorescence is emitted and
new nucleotide is added to the existing chain. Ion
semiconductor sequencing detects the release of protons upon the incorporation
of individual nucleotides by DNA polymerases. It does not emit any light signal
(Yohe&Thyagarajan, 2017).
Applications of NGS
·
To sequence
whole genome
NGS platforms
are used to perform the whole genome sequencing. Millions of small fragments of the genome are sequenced in parallel. There is a wide variety of platforms using
different sequencing technologies. Bioinformatics tools are used to analyze the
sequenced data and join the individual reads to get a final draft (Ansorge,
2009).
·
To sequence
only specific regions of the genome
Exons, introns, repeated sequences, tandem repeats are sequenced,
identified and detected by NGS. Open reading frames and non-open reading frames
can also be sequenced and identified (Adams &Eng, 2017).
·
To sequence
cancer subclones and other rare variants
Single-cell
techniques can be used to study the cell microenvironment, interpret all the
gene expression patterns provide insights on drug resistance patterns and
metastasis (Aravanis et al., 2017).
·
To discover
susceptibility of novel pathogens
Each of the three
billion bases are sequenced multiple times for high depth insight into DNA
variation and provides accurate data on unexpected variations in the genome. A full
spectrum of genomic variation can be obtained by this technology (Ansorge,
2009).
·
To sequence
and map whole-exome
NGS
is also used to sequence constrained areas of genome for better interpretation
of gene position and locality. NGS is able to sequence and map all 22,000
coding genes (whole-exome sequencing) or only a specific numbers of individual
exomes.
·
To sequence
transcriptome for uncovering rare diseases
NGS can help
us to study the transcriptome of high-risk populations. It can uncover rare
diseases, genetic susceptibility, and their risk factors. Thus, preventive
measures can be taken earlier to prevent diseases (Adams &Eng, 2017).
·
To assess risk
factors of COVID-19
NGS is being
used by UK researchers to compare sequenced genomes of COVID-19 patients. It is
uncovering the genetic susceptibility of patients by revealing all the involved
genetic factors (Lu et al., 2020).
Methodology
Following common steps are followed during all NGS
techniques:
1.
Sample
Preparation
DNA is used as a starting material. Only pure DNA can
lead to precise and accurate results. Spin column DNA extraction is the most
preferable extraction technique of all. About 100ng DNA is used for the reaction.
The extracted DNA is further amplified by PCR assay. The amplicons are run on
2% agarose gel with DNA ladder. Spin column PCR purification is done after
agarose gel electrophoresis for amplicons purification as the amplicons may be
contaminated with unbound primers, dimers, unused templates, and unused
buffers. These contaminants can abort sequencing. The sample preparation for NGS
requires a genome library which can be obtained either by ligation or
amplification to an adapter sequence. These adapter sequences are crucial to
provide a universal priming site for sequencer primers. These adapters also work
as sequencing chips during hybridization (Ronaghi, 2001; Shendure&Ji,
2008; McCombie et al., 2019).
2.
Library
preparation
The genomic library is prepared by fragmentizing the whole
genome by restriction nucleases. The whole-genome DNA fragments are joined with
known DNA fragments. The ligation is followed by the addition of adapters at the
terminals and the process is adapter ligation. Only the ligated fragments are
added in the genomic library. The unbound sequences are washed away by washing buffers
(Mardis, 2008; Slatko et al., 2018).
3.
Cluster
generation
The fragmented DNA is loaded on the immobilized oligos
on the surface. The clusters of DNA are generated by bridge amplification. The
DNA fragments are bent over one adapter sequence by one end and bind to another
sequence by another end. The primers bind to the DNA and amplification is
occurred vertically. The bridge amplification results in the generation of two new
DNA strands.
4.
Sequencing
The polymerases add the nucleotides into the
bridge amplification; the sequencing is the next step. The reaction tubes
containing amplicons are placed into a sequencer machine. A number of cycles of
denaturation, annealing, and extension take place simultaneously in the sequencer.
The labeled nucleotides produce signals during the addition reactions which are
recorded by a computer (Shendure & Ji, 2008). Each library fragment undergoes
amplification on a solid matter surface. These surfaces can be either bead or
silicon derived sheets with DNA linkers used to hybridize library adapters.
After repeated cycles of nucleotide incorporation, amplification results in
formation of clusters of DNA, each of them acts as an individual reaction.
These clusters are sequenced and read optically in the form of light or
fluorescent signals. All these signals are sent to the computer for data
analysis.
5. Data
output
Each cycle of nucleotide incorporation provides data about the complete sequencing draft. This data is collected and further analyzed for more meaningful results. The bioinformatics based inbuilt software process the file saved after DNA sequencing and compare it with available data for analysis. After data analysis, the software can interpret the variations and mutations in any gene (Slatko et al., 2018).
Table: Basic features of NGS platforms
NGS Platforms |
Read length
per run (bp) |
Time |
Error
rate(%) |
Chemistry |
Illumina sequencing |
2x300 |
27 h |
0.8 |
Reversible terminators |
Roche sequencing |
700 |
24 h |
1 |
Pyrosequencing |
Ion semiconductor sequencing |
200 |
2-5 h |
1 |
Proton detection sequencing |
SOLiD sequencing |
50 |
14 days |
0.01 |
Sequencing by ligation |
Nanopore sequencing |
>5000 |
48 h |
1 |
Real-time sequencing |
Future Prospects
NGS has
revolutionized biological sciences, medical sciences, and health sciences. It is
being used to diagnose rare diseases, multiple genetic disorders, and syndromes.
Optimizations and advancements are reducing the error rate and improving the
cost and time issues. Semiconductor sequencing is revolutionizing with CMOS technology. It is eager to provide high accuracy data by
one channel SBS. The two-channel SBS is even faster sequencing than original
SBS version with high throughput and even higher accuracy. The NextSeq 2000 the system is working for new emerging applications to analyze all the data in less
than 2 hours (Brink et al., 2019).
Author's Details
Sadaf Tariq 1, Ambreen Zahra 2
1M.Phil Scholar (Biochemistry), 2 M.Sc Scholar (Biochemistry)
1Department of Biochemistry, GC University, Faisalabad, Pakistan.
Reviewed & Edited by
Aysha Yasmin1*, M.Ahsan ul Haq 2*
1 Ph.D. Scholar,2 M.Phil Scholar
*Department of Biochemistry, GC University, Faisalabad, Pakistan.
References
1.
Sanger, F., Nicklen,
S., & Coulson, A. R. (1977). DNA sequencing with chain-terminating
inhibitors. Proceedings of the national academy of sciences, 74(12),
5463-5467.
2.
Ronaghi, M. (2001). Pyrosequencing
sheds light on DNA sequencing. Genome research, 11(1),
3-11.
3.
Shendure, J., &Ji,
H. (2008). Next-generation DNA sequencing. Nature biotechnology, 26(10),
1135-1145.
4.
Mardis, E. R. (2008). Next-generation DNA sequencing
methods. Annu. Rev. Genomics Hum. Genet., 9, 387-402.
5.
Ansorge, W. J. (2009).
Next-generation DNA sequencing techniques. New biotechnology, 25(4),
195-203.
6.
Aravanis, A. M., Lee,
M., &Klausner, R. D. (2017). Next generation sequencing of circulating
tumor DNA for early cancer detection. Cell,
168(4), 571-574.
7.
Yohe, S.,
&Thyagarajan, B. (2017). Review of clinical next generation sequencing. Archives of pathology & laboratory
medicine, 141(11), 1544-1557.
8.
Adams, D. R., &Eng, C. M. (2018). Next-generation
sequencing to diagnose suspected genetic disorders. New England Journal
of Medicine, 379(14), 1353-1362.
9.
vanDijk, E. L., Jaszczyszyn, Y., Naquin, D., &Thermes, C.
(2018). The third revolution in sequencing technology. Trends in
Genetics, 34(9), 666-681.
10.
Slatko, B. E., Gardner, A. F., &Ausubel, F. M. (2018).
Overview of next‐generation sequencing technologies. Current protocols
in molecular biology, 122(1), e59.
11.
McCombie, W. R., McPherson, J.D. ,&Mardis, E.R. (2019).
Next-generation sequencing technologies. Cold
Spring Harbor Perspectives in Medicine,9(11), a036798.
12.
Brink, k., Anitha, S.C., Beatty, M.k., Anderson, J.A., Lyon,
M., Weaver, j., & Dietrich, N. (2019). Comparison of Southern-by-Sequencing
(SbSTM) technology and southern blot analysis for molecular characterization of
genetically modified crops. Journal of
Regualtory Science, 7, 1-14.
13.
Lu, R., Zhao, X., Li, J., Niu,P., Yang, B., Wu, H., & Bi,
Y. (2020). Genomic characterization and epidemiology of 2019 novel coronavirus:
implications of virus origins and receptor binding. The Lancet, 395(10224),
565-574.
Comments
Post a Comment