Approximately 80% of rare diseases have genetic defects and hereditary characteristics that are often difficult to diagnose clinically. Short-read sequencing (SRS) methods are commonly used in clinical laboratory medicine due to their cost-effectiveness and low per-base error rate. However, SRS has limitations in capturing the full range of genomic variation. Areas of low complexity, such as repeats, and regions of high polymorphism, like the human leukocyte antigen region, present challenges to SRS and reference-based genome assembly. Additionally, structural variations are often beyond the resolution of SRS.
Long-read sequencing (LRS) offers complementary strengths to SRS by producing longer reads of several thousand base pairs with uniform coverage. Although individual long reads have lower accuracy, errors are random and correctable with sufficient coverage, resulting in high consensus accuracy. LRS surpasses the limitations of SRS and enables the detection of large structural variations and complex variants. It accurately maps to the genome and accesses regions beyond the reach of short reads.
Applicable to the most challenging rare disease cases, long-read sequencing can be employed.
- Identification of disease-causing mutations: Long-read sequencing has high accuracy in detecting structural variants (SVs) such as insertions, duplications, inversions, and deletions. This enables the identification of disease-causing mutations in complex disorders, including rare diseases like Williams syndrome, Cri du Chat syndrome, and Angelman syndrome, where structural variations play a significant role in disease manifestation.[4]
- Determining haplotypes: Long-read sequencing allows for the detection and characterization of complicated genetic diseases by accurately determining haplotypes, which are collections of inherited genetic variations. [4].
- Detection of epigenetic changes: Third-generation sequencing provides a comprehensive analysis of DNA methylation patterns, histone modifications, and chromatin accessibility. This is particularly crucial in rare diseases like Rett syndrome, Prader-Willi syndrome, and Beckwith-Wiedemann syndrome, where understanding epigenetic alterations is important for disease understanding.[5]
- Understanding gene expression and disease symptoms: Long-read sequencing enables the direct sequencing of full-length RNA transcripts, providing critical insights into alternative splicing, isoform identification, and gene expression levels. This information is crucial for understanding the impact of genetic differences on gene expression and disease symptoms.[6]
- Identification of structural variations (SV): Long-read sequencing is highly valuable in identifying SVs, which are often associated with rare genetic disorders. This capability is especially important in diseases like Williams syndrome, Cri du Chat syndrome, and Angelman syndrome, where SVs significantly impact disease development and manifestation [7].
By providing high-quality sequencing data and in-depth insights into the genetic pathways associated with rare diseases, the long-read length of third-generation sequencing enables researchers and clinicians to gain a comprehensive understanding of the complex genetic landscape. This, in turn, facilitates more precise diagnoses, personalized treatment strategies, and ultimately, better patient outcomes. Learn more, please visit our website: Novogene.