Access the epigenome to explore DNA modifications without any special library preparation needed
Introduction to epigenetics with long-read sequencing
The unique chemistry of long-read HiFi sequencing technology enables researchers to directly reveal the epigenetic landscape of samples.
HiFi sequencing provides the most accurate genome-wide calls for SNVs, indels, and SVs. With 5-base HiFi sequencing, the same sequencing library identifies genome-wide methylation patterns associated with gene activity and other biological functions. Further, long HiFi reads enable phasing of the genetic and epigenetic variants into parental haplotypes.
On-instrument 5-base HiFi sequencing detects 5mC methylation in standard sequencing runs without any changes to library prep or sequencing workflows required.
ADVANTAGES OF EPIGENETIC ANALYSIS WITH 5-BASE HIFI SEQUENCING
With the power of long-read sequencing, you can achieve:
Epigenetics in every run — no bisulfite treatment required
Unlike methods that require chemical conversion of DNA, HiFi sequencing detects modifications in native DNA through impacts on the kinetics of base incorporation.
High accuracy of sequence and methylation
Methylation detection with HiFi sequencing is highly concordant to bisulfite sequencing.
Access the full genome
Access difficult regions of the genome like repeats and centromeres that are beyond the reach of short-read sequencing.
Identify allele-specific methylation, whether due to parental imprinting, genetic variation, or repeat expansions.
“SMRT sequencing is opening up new diagnostic avenues, such as the ability to determine tandem repeat lengths, interruptions, and even epigenetics in a single test at base pair resolution.”1– Ardui, et al., 20181
HOW 5-BASE HIFI SEQUENCING CALLS METHYLATION STATUS
HiFi sequencing provides two channels of information: fluorescence and kinetics. Utilizing both enables highly accurate reads (fluorescence) plus methylation status (kinetics) from a single library.
HiFi sequencing observes a polymerase incorporating fluorescently labeled nucleotides complementary to a native DNA strand. The label identifies the base (A, C, G, T). Epigenetic modifications like 5mC impact polymerase kinetics — how fast bases are incorporated. No special library prep is required.
- A convolutional neural network model processes polymerase kinetics to determine the methylation status of each CpG site in a HiFi read.
- The model runs directly on the Sequel IIe system and is also available in SMRT Link.
- Methylation status is output using the BAM standard MM and ML tags.
Epigenetic analysis in action
The pb-CpG-tools collection provides tools for secondary analysis of CpG methylation data from PacBio HiFi reads. Starting from a pileup of HiFi reads with methylation tags, the tools calculate the percent of reads methylated at every CpG site in the genome.
Direct detection of DNA methylation
See how scientists use PacBio sequencing to detect methylation with basepair resolution.
Genome-wide detection of cytosine methylation
Read how researchers use the kinetics in HiFi reads to determine methylation status at CpG sites.
DNA 5mC detection and methylation phasing
Read how circular consensus sequencing enables genome-wide detection of cytosine methylation by single molecule real-time sequencing.
EPIGENETIC SEQUENCING - HOW PACBIO COMPARES
What 5-Base Sequencing Reveals
Regional methylation patterns
Methylation levels vary across the genome in many species. In vertebrates like human most CpG sites are methylated. Active gene transcription start sites are often hypomethylated.
In this example genomic region, 5-base HiFi sequencing of the human HG002 sample shows overall hypermethylation (red) with hypomethylation (blue) specifically at transcription start sites.
Phased genetics and epigenetics
HiFi sequencing enables simultaneous phasing of reads into maternal and paternal haplotypes and detection of methylation. This reveals allele-specific methylation patterns, which can be due to genetic variation (where epigenetic status is affected by a difference in sequence) or parental imprinting (where epigenetic status is affected by whether a chromosome was inherited from the mother or father).
In this example, the HG002/3/4 trio from Genome in a Bottle, HiFi reads show the expected maternal imprinting at the gene PEG3. HiFi sequencing allows phasing of the haplotypes per sample, the trio identifies which allele is transmitted from which parent, and 5-base sequencing shows allele-specific methylation.
HiFi sequencing phases and identifies hypermethylation of expanded FMR1 repeats in NA07537.
Methylation and disease
Atypical methylation patterns contribute to rare diseases like Prader-Willi syndrome and are important factors in pathogenic repeat expansion, such as the CGG expansion at the FMR1 locus that cause Fragile X syndrome. With high accuracy, long reads, and methylation detection, HiFi sequencing is ideal for characterizing these repeat expansions.
APPLICATIONS FOR EPIGENETIC ANALYSIS
Epigenetic sequencing workflow at a glance
- Simultaneously detect accurate base sequence and accurate epigenetic modifications
- Detect microbial base modifications and motifs with the microbial genome analysis application in SMRT Link
- Call 5mC at CpG sites directly from the sequencing instrument or in SMRT Link
- Visualize 5mC annotation directly in IGV