Menu
June 1, 2021

Making the most of long reads: towards efficient assemblers for reference quality, de novo reconstructions

2015 SMRT Informatics Developers Conference Presentation Slides: Gene Myers, Ph.D., Founding Director, Systems Biology Center, Max Planck Institute delivered the keynote presentation. He talked about building efficient assemblers, the importance of random error distribution in sequencing data, and resolving tricky repeats with very long reads. He also encouraged developers to release assembly modules openly, and noted that data should be straightforward to parse since sharing data interfaces is easier than sharing software interfaces.


June 1, 2021

Application specific barcoding strategies for SMRT Sequencing

Over the last few years, several advances were implemented in the PacBio RS II System to maximize throughput and efficiency while reducing the cost per sample. The number of useable bases per SMRT Cell now exceeds 1 Gb with the latest P6-C4 chemistry and 6-hour movies. For applications such as microbial sequencing, targeted sequencing, Iso-Seq (full-length isoform sequencing) and Nimblegen’s target enrichment method, current SMRT Cell yields could be an excess relative to project requirements. To this end, barcoding is a viable option for multiplexing samples. For microbial sequencing, multiplexing can be accomplished by tagging sheared genomic DNA during library construction with modified SMRTbell adapters. We studied the performance of 2- to 8-plex microbial sequencing. For full-length amplicon sequencing such as HLA typing, amplicons as large as 5 kb may be barcoded during amplification using barcoded locus-specific primers. Alternatively, amplicons may be barcoded during SMRTbell library construction using barcoded SMRTbell adapters. The preferred barcoding strategy depends on the user’s existing workflow and flexibility to changing and/or updating existing workflows. Using barcoded adapters, five Class I and II genes (3.3 – 5.8 kb) x 96 patients can be multiplexed and typed. For Iso-Seq full-length cDNA sequencing, barcodes are incorporated during 1st-strand synthesis and are enabled by tailing the oligo-dT primer with any PacBio published 16-bp barcode sequences. RNA samples from 6 maize tissues were multiplexed to generate barcoded cDNA libraries. The NimbleGen SeqCap Target Enrichment method, combined with PacBio’s long-read sequencing, provides comprehensive view of multi-kilobase contiguous regions, both exonic and intronic regions. To make this cost effective, we recommend barcoding samples for pooling prior to target enrichment and capture. Here, we present specific examples of strategies and best practices for multiplexing samples for different applications for SMRT Sequencing. Additionally, we describe recommendations for analyzing barcoded samples.


June 1, 2021

Multiplexing strategies for microbial whole genome SMRT Sequencing

As the throughput of the PacBio Systems continues to increase, so has the desire to fully utilize SMRT Cell sequencing capacity to multiplex microbes for whole genome sequencing. Multiplexing is readily achieved by incorporating a unique barcode for each microbe into the SMRTbell adapters and using a streamlined library preparation process. Incorporating barcodes without PCR amplification prevents the loss of epigenetic information and the generation of chimeric sequences, while eliminating the need to generate separate SMRTbell libraries. We multiplexed the genomes of up to 8 unique strains of H. pylori. Each genome was sheared and processed through adapter ligation in a single, addition-only reaction. The barcoded samples were pooled in equimolar quantities and a single SMRTbell library was prepared. We demonstrate successful de novo microbial assembly from all multiplexes tested (2- through 8-plex) using data generated from a single SMRTbell library, run on a single SMRT Cell with the PacBio RS II, and analyzed with standard SMRT Analysis assembly methods. This strategy was successful using both small (1.6 Mb, H. pylori) and medium (5 Mb, E. coli) genomes. This protocol facilitates the sequencing of multiple microbial genomes in a single run, greatly increasing throughput and reducing costs per genome.


June 1, 2021

Multiplexing strategies for microbial whole genome sequencing using the Sequel System

For microbial sequencing on the PacBio Sequel System, the current yield per SMRT Cell is in excess relative to project requirements. Multiplexing offers a viable solution; greatly increasing throughput, efficiency, and reducing costs per genome. This approach is achieved by incorporating a unique barcode for each microbial sample into the SMRTbell adapters and using a streamlined library preparation process. To demonstrate performance,12 unique barcodes assigned to B. subtilis and sequenced on a single SMRT Cell. To further demonstrate the applicability of this method, we multiplexed the genomes of 16 strains of H. pylori. Each DNA was sheared to 10 kb, end-repaired and ligated with a barcoded adapter in a single-tube reaction. The barcoded samples were pooled in equimolar quantities and a single SMRTbell library was prepared. Successful de novo microbial assemblies were achieved from all multiplexes tested (12-, and 16-plex) using data generated from a single SMRTbell library, run on a single SMRT Cell 1M with the PacBio Sequel System, and analyzed with standard SMRT Analysis assembly methods. Here, we describe a protocol that facilitated the multiplexing up to 12-plex of microbial genomes in one SMRT Cell 1M on the Sequel System that produced near-complete microbial de novo assemblies of <10 contigs for genomes <5 Mb in size.


June 1, 2021

SMRT-Cappable-seq reveals the complex operome of bacteria

SMRT-Cappable-seq combines the isolation of full-length prokaryotic primary transcripts with long read sequencing technology. It is the first experimental methodology to sequence entire prokaryotic transcripts. It identifies the transcription start site and termination site, thereby directly defines the operon structures genome-wide in prokaryotes. Applied to E.coli, SMRT-Cappable-seq identifies a total of ~2300 operons, among which ~900 are novel. Importantly, our result reveals a pervasive read-through of previous experimentally validated transcription termination sites. Termination read-through represents a powerful strategy to control gene expression. Taken together this data provides a first glance at the complexity of the ‘operome’ in bacteria and presents an invaluable resource for understanding gene regulation and function in bacteria.


June 1, 2021

Single chromosomal genome assemblies on the Sequel System with Circulomics high molecular weight DNA extraction for microbes

Background: The Nanobind technology from Circulomics provides an elegant HMW DNA extraction solution for genome sequencing of Gram-positive and -negative microbes. Nanobind is a nanostructured magnetic disk that can be used for rapid extraction of high molecular weight (HMW) DNA from diverse sample types including cultured cells, blood, plant nuclei, and bacteria. Processing can be completed in <1 hour for most sample types and can be performed manually or automated with common instruments. Methods:We have validated several critical steps for generating high-quality microbial genome assemblies in a streamlined microbial multiplexing workflow. This new workflow enables high-volume, cost-effective sequencing of up to 16 microbes totaling 30 Mb in genome size on a single SMRT Cell 1M using a target shear size of 10 kb. We also evaluated this method on a pool of four “class 3” microbes that contain >7 kb repeats. Fragment size was increased to ~14 kb, with some fragments >30 kb. Results: Here we present a demonstration of these capabilities using isolates relevant to high-throughput sequencing applications, including common foodborne pathogens (Shigella, Listeria, Salmonella), and species often seen in hospital settings (Klebsiella, Staphylococcus). For nearly all microbes, including difficult-to-assemble class III microbes, we achieved complete de novo microbial assemblies of =5 chromosomal contigs with minimum quality scores of 40 (99.99% accuracy) using data from multiplexed SMRTbell libraries. Each library was sequenced on a single SMRT Cell 1M with the PacBio Sequel System and analyzed with streamlined SMRT Analysis assembly methods. Conclusions: We achieved high-quality, closed microbial genomes using a combination of Circulomics Nanobind extraction and PacBio SMRT Sequencing, along with a newly streamlined workflow that includes automated demultiplexing and push-button assembly.


June 1, 2021

Comparative metagenome-assembled genome analysis of “Candidatus Lachnocurva vaginae”, formerly known as Bacterial Vaginosis Associated bacterium – 1 (BVAB1)

Bacterial Vaginosis Associated bacterium 1 (BVAB1) is an as-yet uncultured bacterial species found in the human vagina that belongs to the family Lachnospiraceae within the order Clostridiales. As its name suggests, this bacterium is often associated with bacterial vaginosis (BV), a common vaginal disorder that has been shown to increase a woman’s risk for HIV, Chlamydia trachomatis, and Neisseria gonorrhoeae infections as well as preterm birth. Further, BVAB1 is associated with the persistence of BV following metronidazole treatment, increased vaginal inflammation, and adverse obstetrics outcomes. There is no available complete genome sequence of BVAB1, which has made it di?cult to mechanistically understand its role in disease. We present here a circularized metagenome-assembled genome (cMAG) of B VAB1 as well as a comparative analysis including an additional six metagenome-assembled genomes (MAGs) of this species. These sequences were derived from cervicovaginal samples of seven separate women. The cMAG is 1.649 Mb in size and encodes 1,578 genes. We propose to rename BVAB1 to “Candidatus Lachnocurva vaginae” based on phylogenetic analyses, and provide genomic evidence that this candidate species may metabolize D-lactate, produce trimethylamine (one of the chemicals responsible for BV-associated odor), and be motile. The cMAG and the six MAGs are valuable resources that will further contribute to our understanding of the heterogeneous etiology of bacterial vaginosis.


April 21, 2020

Chryseobacterium mulctrae sp. nov., isolated from raw cow’s milk.

A Gram-stain-negative bacterial strain, designated CA10T, was isolated from bovine raw milk sampled in Anseong, Republic of Korea. Cells were yellow-pigmented, aerobic, non-motile bacilli and grew optimally at 30?°C and pH 7.0 on tryptic soy agar without supplementation of NaCl. Phylogenetic analysis based on the 16S rRNA gene sequences revealed that strain CA10T belonged to the genus Chryseobacterium, family Flavobacteriaceae, and was most closely related to Chryseobacterium indoltheticum ATCC 27950T (98.75?% similarity). The average nucleotide identity and digital DNA-DNA hybridization values of strain CA10T were 94.4 and 56.9?%, respectively, relative to Chryseobacterium scophthalmum DSM 16779T, being lower than the cut-off values of 95-96?and 70?%, respectively. The predominant respiratory quinone was menaquinone-6; major polar lipid, phosphatidylethanolamine; major fatty acids, iso-C15?:?0, summed feature 9 (iso-C17?:?1?9c and/or C16?:?0 10-methyl), summed feature 3 (iso-C15?:?0 2-OH and/or C16?:?1?7c) and iso-C17?:?0 3-OH. The results of physiological, chemotaxonomic and biochemical analyses suggested that strain CA10T is a novel species of genus Chryseobacterium, for which the name Chryseobacterium mulctrae sp. nov. is proposed. The type strain is CA10T (=KACC 21234T=JCM 33443T).


April 21, 2020

Allopseudarcicella aquatilis gen. nov., sp. nov., isolated from freshwater.

A Gram-stain-negative, rod-shaped and red-pigmented strain, HME7025T, was isolated from freshwater sampled in the Republic of Korea. Phylogenetic analysis based on its 16S rRNA gene sequence revealed that strain HME7025T formed a lineage within the family Cytophagaceae of the phylum Bacteroidetes. Strain HME7025T was closely related to the genera Pseudarcicella, Arcicella and Flectobacillus. The 16S rRNA gene sequence similarity values of strain HME7025T were under 94.5?% to its closest phylogenetic neighbours. The major fatty acids of strain HME7025T were iso-C15?:?0 (41.9?%), summed feature 3 (comprising C16?:?1?7c and/or C16?:?1?6c; 12.2?%) and anteiso-C15?:?0 (10.8?%). The major respiratory quinone was menaquinone-7. The major polar lipids were phosphatidylethanolamine, two unidentified aminophospholipids and one unidentified polar lipid. The DNA G+C content of strain HME7025T was 37.9?mol%. On the basis of the evidence presented in this study, strain HME7025T represents a novel species of a novel genus within the family Cytophagaceae, for which the name Allopseudarcicella aquatilis gen. nov., sp. nov. is proposed. The type strain is HME7025T (=KCTC 23617T=CECT 7957T).


April 21, 2020

Complete genome sequence of Antarcticibacterium flavum JB01H24T from an Antarctic marine sediment

Antarcticibacterium flavum JB01H24T was isolated from a marine sediment of the Ross Sea, Antarctica. Whole-genome sequencing of the strain Antarcticibacterium flavum JB01H24T was achieved using PacBio RS II platform. The resulting complete genome comprised of one closed, complete chromosome of 4,319,074 base pairs with a 40.87% G?+?C content, where genomic analyses demonstrated that it is constituted mostly by putative ORFs with unknown functions, representing a novel genetic feature. It is the first complete genome sequence of the Antarcticibacterium strain.


April 21, 2020

The complete genome sequence and comparative genome analysis of the multi-drug resistant food-borne pathogen Bacillus cereus.

Bacillus cereus is an opportunistic human pathogen causing food-borne gastrointestinal infections and non-gastrointestinal infections worldwide. The strain B. cereus FORC_013 was isolated from fried eel. Its genome was completely sequenced by PacBio technology, analyzed and compared with other complete genome sequences of Bacillus to elucidate the distinct pathogenic features of the strain isolated in South Korea. Genomic analysis revealed pathogenesis and host immune evasion-associated genes encoding tissue-destructive exoenzymes, and pore-forming toxins. In particular, tissue-destructive (hemolysin BL, nonhaemolytic enterotoxins) and cytolytic proteins (cytolysin) were observed in the genome, which damage the plasma membrane of the epithelial cells of the small intestine causing diarrhea in humans. Capsule biosynthesis gene found in both chromosome and plasmid, which might be responsible for protecting the pathogen from the host cell immune defense system after host cell invasion. Additionally, multidrug resistance operon and efflux pumps were identified in the genome, which play a prominent role in multi-antibiotic resistance. Comparative phylogenetic tree analysis of the strain FORC_013 and other B. cereus strains revealed that the closest strains are ATCC 14579 and B4264. This genome data can be used to identify virulence factors that can be applied for the development of novel biomarkers for the rapid detection of this pathogen in foods.Copyright © 2018. Published by Elsevier Inc.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.