Repeat expansions Archives - Page 2 of 3

June 1, 2021

Comprehensive variant detection in a human genome with highly accurate long reads

Introduction: Long-read sequencing has revealed more than 20,000 structural variants spanning over 12 Mb in a healthy human genome. Short-read sequencing fails to detect most structural variants but has remained the more effective approach for small variants, due to 10-15% error rates in long reads, and copy-number variants (CNVs), due to lack of effective long-read variant callers. The development of PacBio highly accurate long reads (HiFi reads) with read lengths of 10-25 kb and quality >99% presents the opportunity to capture all classes of variation with one approach.Methods: We sequence the Genome in a Bottle benchmark sample HG002 and an individual with a presumed Mendelian disease with HiFi reads. We call SNVs and indels with DeepVariant and extend the structural variant caller pbsv to call CNVs using read depth and clipping signatures. Results: For 18-fold coverage with 13 kb HiFi reads, variant calling in HG002 achieves an F1 score of 99.7% for SNVs, 96.6% for indels, and 96.4% for structural variants. Additionally, we detect more than 300 CNVs spanning around 10 Mb. For the Mendelian disease case, HiFi reads reveal thousands of variants that were overlooked by short-read sequencing, including a candidate causative structural variant. Conclusions: These results illustrate the ability of HiFi reads to comprehensively detect variants, including those associated with human disease.

June 1, 2021

Amplification-free targeted enrichment powered by CRISPR-Cas9 and long-read Single Molecule Real-Time (SMRT) Sequencing can efficiently and accurately sequence challenging repeat expansion disorders

Genomic regions with extreme base composition bias and repetitive sequences have long proven challenging for targeted enrichment methods, as they rely upon some form of amplification. Similarly, most DNA sequencing technologies struggle to faithfully sequence regions of low complexity. This has been especially trying for repeat expansion disorders such as Fragile-X disease, Huntington disease and various Ataxias, where the repetitive elements range from several hundreds of bases to tens of kilobases. We have developed a robust, amplification-free targeted enrichment technique, called No-Amp Targeted Sequencing, that employs the CRISPR-Cas9 system. In conjunction with SMRT Sequencing, which delivers long reads spanning the entire repeat expansion, high consensus accuracy, and uniform coverage, these previously inaccessible regions are now accessible. This method is completely amplification-free, therefore removing any PCR errors and biases from the experiment. Furthermore, this technique also preserves native DNA molecules, allowing for direct detection and characterization of epigenetic signatures. The No-Amp method is a two-day protocol that is compatible with multiplexing of multiple targets and multiple samples in a single reaction, using as little as 1 µg of genomic DNA input per sample. We have successfully targeted a number of repeat expansion disorder loci including HTT, FMR1, C9orf7,2 as well as built an Ataxia panel which consists of 15 different disease-causing repeat expansion regions. Using the No-Amp method we have isolated hundreds of individual on-target molecules, allowing for reliable repeat size estimation, mosaicism detection and identification of interruption sequences with alleles as long as >2700 repeat unites ( >13 kb). In addition to multiplexing several targets, we have also multiplexed at least 20 samples in one experiment making the No-Amp Targeted Sequencing method a cost-effective option. Combining the CRISPR-Cas9 enrichment method with Single Molecule, Real-Time Sequencing provided us with base-level resolution of previously inaccessible regions of the genome, like disease-causing repeat expansions. No-Amp Targeted Sequencing captures, in one experiment, many aspects of repeat expansion disorders which are important for better understanding the underlying disease mechanisms.

February 5, 2021

ASHG PacBio Workshop: SMRT Sequencing as a translational research tool to investigate germline, somatic and infectious diseases

Melissa Laird Smith discussed how the Icahn School of Medicine at Mount Sinai uses long-read sequencing for translational research. She gave several examples of targeted sequencing projects run on the…

February 5, 2021

AGBT Virtual Poster: Analysis method for amplification-free SMRT sequencing and assessment on repeat expansions in Huntington’s disease

Adam Ameur from the National Genomics Infrastructure at SciLifeLab presented this poster at AGBT 2017. In it, he details a validation study for the use of CRISPR/Cas9 to capture genomic…

February 5, 2021

Webinar: Addressing “NGS Dead Zones” with third generation PacBio sequencing

SMRT Sequencing is a DNA sequencing technology characterized by long read lengths and high consensus accuracy, regardless of the sequence complexity or GC content of the DNA sample. These characteristics…

February 5, 2021

ASHG PacBio Workshop: Expansion sequence variations underlie distinct disease phenotypes in SCA10

In this ASHG 2017 presentation, Karen McFarland of the University of Florida presented research on spinocerebellar ataxia type 10 (SCA10), a progressive neurodegenerative disease caused by repeat expansions. She outlined…

February 5, 2021

ASHG PacBio Workshop: PacBio applications updates & future roadmap

In this ASHG 2017 presentation, Jonas Korlach, the CSO of PacBio shared updates on three applications featuring SMRT Sequencing on the Sequel System, highlighting structural variant detection, targeted sequencing and…

February 5, 2021

Webinar: Assessing the targeted regions using a PacBio Amplicon analysis applications

This webinar, presented by Nisha Pillai, provides an overview of amplicon sequencing to target specific regions of a genome using PacBio Single Molecule, Real-Time (SMRT) Sequencing. This session provides an…

February 5, 2021

Webinar: Amplicon sequencing with confidence – High-fidelity, long-read PacBio sequencing solutions

In this webinar, Lori Aro and Cheryl Heiner of PacBio describe how high-throughput amplicon sequencing using Single Molecule, Real-Time (SMRT) Sequencing and the Sequel System allows for the easy and…

February 5, 2021

Webinar: Beyond Gene Editing: How CRISPR/Cas9 enables sequencing of difficult regions of the genome

In this webinar, Jenny Ekholm and Paul Kotturi provide an overview of the PacBio No-Amp targeted sequencing application and its uses for targeting hard-to-amplify genes. This approach couples CRISPR-Cas9 with…

February 5, 2021

ASHG PacBio Workshop: Sequence with confidence – A new era of highly accurate long-read sequencing

In this presentation, Emily Hatas of PacBio offers a look a how SMRT Sequencing has changed over the years as well as the most common applications in human genome analysis:…

February 5, 2021

Webinar: SMRT Sequencing applications for human genomics and medicine

In this webinar, Adam Ameur of SciLifeLab at Uppsala University shares how he uses Single Molecule, Real-Time (SMRT) Sequencing applications for medical diagnostics and human genetics research, including sequencing of…

April 21, 2020

Long-read sequencing for rare human genetic diseases.

During the past decade, the search for pathogenic mutations in rare human genetic diseases has involved huge efforts to sequence coding regions, or the entire genome, using massively parallel short-read sequencers. However, the approximate current diagnostic rate is <50% using these approaches, and there remain many rare genetic diseases with unknown cause. There may be many reasons for this, but one plausible explanation is that the responsible mutations are in regions of the genome that are difficult to sequence using conventional technologies (e.g., tandem-repeat expansion or complex chromosomal structural aberrations). Despite the drawbacks of high cost and a shortage of standard analytical methods, several studies have analyzed pathogenic changes in the genome using long-read sequencers. The results of these studies provide hope that further application of long-read sequencers to identify the causative mutations in unsolved genetic diseases may expand our understanding of the human genome and diseases. Such approaches may also be applied to molecular diagnosis and therapeutic strategies for patients with genetic diseases in the future.

April 21, 2020

Transcriptional initiation of a small RNA, not R-loop stability, dictates the frequency of pilin antigenic variation in Neisseria gonorrhoeae.

Neisseria gonorrhoeae, the sole causative agent of gonorrhea, constitutively undergoes diversification of the Type IV pilus. Gene conversion occurs between one of the several donor silent copies located in distinct loci and the recipient pilE gene, encoding the major pilin subunit of the pilus. A guanine quadruplex (G4) DNA structure and a cis-acting sRNA (G4-sRNA) are located upstream of the pilE gene and both are required for pilin antigenic variation (Av). We show that the reduced sRNA transcription lowers pilin Av frequencies. Extended transcriptional elongation is not required for Av, since limiting the transcript to 32 nt allows for normal Av frequencies. Using chromatin immunoprecipitation (ChIP) assays, we show that cellular G4s are less abundant when sRNA transcription is lower. In addition, using ChIP, we demonstrate that the G4-sRNA forms a stable RNA:DNA hybrid (R-loop) with its template strand. However, modulating R-loop levels by controlling RNase HI expression does not alter G4 abundance quantified through ChIP. Since pilin Av frequencies were not altered when modulating R-loop levels by controlling RNase HI expression, we conclude that transcription of the sRNA is necessary, but stable R-loops are not required to promote pilin Av. © 2019 John Wiley & Sons Ltd.

April 21, 2020

High satellite repeat turnover in great apes studied with short- and long-read technologies.

Satellite repeats are a structural component of centromeres and telomeres, and in some instances their divergence is known to drive speciation. Due to their highly repetitive nature, satellite sequences have been understudied and underrepresented in genome assemblies. To investigate their turnover in great apes, we studied satellite repeats of unit sizes up to 50?bp in human, chimpanzee, bonobo, gorilla, and Sumatran and Bornean orangutans, using unassembled short and long sequencing reads. The density of satellite repeats, as identified from accurate short reads (Illumina), varied greatly among great ape genomes. These were dominated by a handful of abundant repeated motifs, frequently shared among species, which formed two groups: (1) the (AATGG)n repeat (critical for heat shock response) and its derivatives; and (2) subtelomeric 32-mers involved in telomeric metabolism. Using the densities of abundant repeats, individuals could be classified into species. However clustering did not reproduce the accepted species phylogeny, suggesting rapid repeat evolution. Several abundant repeats were enriched in males vs. females; using Y chromosome assemblies or FIuorescent In Situ Hybridization, we validated their location on the Y. Finally, applying a novel computational tool, we identified many satellite repeats completely embedded within long Oxford Nanopore and Pacific Biosciences reads. Such repeats were up to 59?kb in length and consisted of perfect repeats interspersed with other similar sequences. Our results based on sequencing reads generated with three different technologies provide the first detailed characterization of great ape satellite repeats, and open new avenues for exploring their functions. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Auto Tag: Repeat expansions

Comprehensive variant detection in a human genome with highly accurate long reads

Amplification-free targeted enrichment powered by CRISPR-Cas9 and long-read Single Molecule Real-Time (SMRT) Sequencing can efficiently and accurately sequence challenging repeat expansion disorders

ASHG PacBio Workshop: SMRT Sequencing as a translational research tool to investigate germline, somatic and infectious diseases

AGBT Virtual Poster: Analysis method for amplification-free SMRT sequencing and assessment on repeat expansions in Huntington’s disease

Webinar: Addressing “NGS Dead Zones” with third generation PacBio sequencing

ASHG PacBio Workshop: Expansion sequence variations underlie distinct disease phenotypes in SCA10

ASHG PacBio Workshop: PacBio applications updates & future roadmap

Webinar: Assessing the targeted regions using a PacBio Amplicon analysis applications

Webinar: Amplicon sequencing with confidence – High-fidelity, long-read PacBio sequencing solutions

Webinar: Beyond Gene Editing: How CRISPR/Cas9 enables sequencing of difficult regions of the genome

ASHG PacBio Workshop: Sequence with confidence – A new era of highly accurate long-read sequencing

Webinar: SMRT Sequencing applications for human genomics and medicine

Long-read sequencing for rare human genetic diseases.

Transcriptional initiation of a small RNA, not R-loop stability, dictates the frequency of pilin antigenic variation in Neisseria gonorrhoeae.

High satellite repeat turnover in great apes studied with short- and long-read technologies.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert