July 7, 2019

Single-locus enrichment without amplification for sequencing and direct detection of epigenetic modifications.

A gene-level targeted enrichment method for direct detection of epigenetic modifications is described. The approach is demonstrated on the CGG-repeat region of the FMR1 gene, for which large repeat expansions, hitherto refractory to sequencing, are known to cause fragile X syndrome. In addition to achieving a single-locus enrichment of nearly 700,000-fold, the elimination of all amplification steps removes PCR-induced bias in the repeat count and preserves the native epigenetic modifications of the DNA. In conjunction with the single-molecule real-time sequencing approach, this enrichment method enables direct readout of the methylation status and the CGG repeat number of the FMR1 allele(s) for a clonally derived cell line. The current method avoids potential biases introduced through chemical modification and/or amplification methods for indirect detection of CpG methylation events.


July 7, 2019

Complete chloroplast genome sequences of Eucommia ulmoides: genome structure and evolution.

Eucommia ulmoides is an important traditional medicinal plant that is used for the production of locative Eucommia rubber. In this study, the complete chloroplast (cp) genome sequence of E. ulmoides was obtained by total DNA sequencing; this is the first cp genome sequence of the order Garryales. The cp genome of E. ulmoides was 163,341 bp long and included a pair of inverted repeat (IR) regions (31,300 bp), one large single copy (LSC) region (86,592 bp), and one small single copy (SSC) region (14,149 bp). The genome structure and GC content were similar to those of typical angiosperm cp genomes and contained 115 unique genes, including 80 protein-coding genes, 31 transfer RNAs (tRNAs), and four ribosomal RNAs (rRNAs). Compared with the entire cp genome sequence, three unique genome rearrangements were observed in the LSC region. Moreover, compared with the Sesamum and Nicotiana cp genomes, E. ulmoides contained no indels in the IR regions, and variable regions were identified in noncoding regions. The E. ulmoides cp genome showed extreme expansion at the IR/SSC boundary owing to the integration of an additional complete gene, ycf1. Twenty-nine simple sequence repeats (SSRs) were identified in the E. ulmoides cp genome. In addition, 36 protein-coding genes were used for phylogenetic inference, supporting a sister relationship between E. ulmoides and Aucuba, which belongs to Euasterids I. In summary, we described the complete cp genome sequence of E. ulmoides; this information will be useful for phylogenetic and evolutionary studies.
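The region sizes quoted in this abstract can be checked against the reported total with a short calculation. The sketch below assumes the usual quadripartite chloroplast layout (LSC, SSC, and two IR copies) and assumes that the 31,300 bp figure is the length of each IR copy; the function name is hypothetical and is used only for illustration.

```python
def quadripartite_length(lsc_bp, ssc_bp, ir_bp):
    """Total genome length for one LSC, one SSC, and two IR copies.

    Assumption: ir_bp is the length of a single IR copy.
    """
    return lsc_bp + ssc_bp + 2 * ir_bp


# Region sizes as stated in the abstract (in bp).
total = quadripartite_length(lsc_bp=86_592, ssc_bp=14_149, ir_bp=31_300)
print(total)
```

Under that assumption the sum of the regions agrees with the 163,341 bp genome length given in the abstract.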


July 7, 2019

Microsatellite length scoring by Single Molecule Real Time Sequencing – Effects of sequence structure and PCR regime.

Microsatellites are DNA sequences consisting of repeated, short (1-6 bp) sequence motifs that are highly mutable by enzymatic slippage during replication. Due to their high intrinsic variability, microsatellites have important applications in population genetics, forensics, genome mapping, as well as cancer diagnostics and prognosis. The current analytical standard for microsatellites is based on length scoring by high precision electrophoresis, but due to increasing efficiency next-generation sequencing techniques may provide a viable alternative. Here, we evaluated single molecule real time (SMRT) sequencing, implemented in the PacBio series of sequencing apparatuses, as a means of microsatellite length scoring. To this end we carried out multiplexed SMRT sequencing of plasmid-carried artificial microsatellites of varying structure under different pre-sequencing PCR regimes. For each repeat structure, reads corresponding to the target length dominated. We found that pre-sequencing amplification had large effects on scoring accuracy and error distribution relative to controls, but that the effects of the number of amplification cycles were generally weak. In line with expectations enzymatic slippage decreased proportionally with microsatellite repeat unit length and increased with repetition number. Finally, we determined directional mutation trends, showing that PCR and SMRT sequencing introduced consistent but opposing error patterns in contraction and expansion of the microsatellites on the repeat motif and single nucleotide level.
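As a rough illustration of what length scoring from sequencing reads involves, the sketch below counts the longest uninterrupted run of a repeat motif in each read and tallies the resulting repeat numbers. It is a minimal example under simplifying assumptions (a known motif, no tolerance for sequencing errors inside the repeat), not the scoring procedure used in the study; the function names are hypothetical.

```python
import re
from collections import Counter


def longest_repeat_count(read, motif):
    """Return the largest number of consecutive copies of motif in read."""
    # Find every maximal run made up of whole copies of the motif.
    runs = re.findall(f"(?:{re.escape(motif)})+", read)
    return max((len(run) // len(motif) for run in runs), default=0)


def score_reads(reads, motif):
    """Tally the repeat number observed in each read."""
    return Counter(longest_repeat_count(read, motif) for read in reads)


# Toy reads: two carry ten CA repeats, one carries a contraction to nine.
reads = ["GG" + "CA" * 10 + "TT", "GG" + "CA" * 10 + "TT", "GG" + "CA" * 9 + "TT"]
scores = score_reads(reads, "CA")
print(scores.most_common())
```

The most frequent repeat number in such a tally plays the role of the dominant target-length reads described in the abstract, while the remaining counts correspond to contraction or expansion errors.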


July 7, 2019

Bacterial genetics: SMRT-seq reveals an epigenetic switch.

Streptococcus pneumoniae uses genetic diversification as a strategy to achieve phenotypic plasticity. For example, DNA inversion of the hsdS genes of type I restriction-modification (R-M) systems determines whether S. pneumoniae forms opaque or transparent colonies, which have different colonization and virulence characteristics. Zhang and colleagues now use single-molecule, real-time sequencing (SMRT-seq) to show that the allelic variation of hsdS that results from site-specific recombination forms part of an epigenetic switch.


July 7, 2019

Comparative methylome analysis of the occasional ruminant respiratory pathogen Bibersteinia trehalosi.

We examined and compared both the methylomes and the modification-related gene content of four sequenced strains of Bibersteinia trehalosi isolated from the nasopharyngeal tracts of Nebraska cattle with symptoms of bovine respiratory disease complex. The methylation patterns and the encoded DNA methyltransferase (MTase) gene sets were different between each strain, with the only common pattern being that of Dam (GATC). Among the observed patterns were three novel motifs attributable to Type I restriction-modification systems. In some cases the differences in methylation patterns corresponded to the gain or loss of MTase genes, or to recombination at target recognition domains that resulted in changes of enzyme specificity. However, in other cases the differences could be attributed to differential expression of the same MTase gene across strains. The most obvious regulatory mechanism responsible for these differences was slipped strand mispairing within short sequence repeat regions. The combined action of these evolutionary forces allows for alteration of different parts of the methylome at different time scales. We hypothesize that pleiotropic transcriptional modulation resulting from the observed methylomic changes may be involved with the switch between the commensal and pathogenic states of this common member of ruminant microflora.


July 7, 2019

Novel m4C modification in type I restriction-modification systems.

We identify a new subgroup of Type I Restriction-Modification enzymes that modify cytosine in one DNA strand and adenine in the opposite strand for host protection. Recognition specificity has been determined for ten systems using SMRT sequencing and each recognizes a novel DNA sequence motif. Previously characterized Type I systems use two identical copies of a single methyltransferase (MTase) subunit, with one bound at each half site of the specificity (S) subunit to form the MTase. The new m4C-producing Type I systems we describe have two separate yet highly similar MTase subunits that form a heterodimeric M1M2S MTase. The MTase subunits from these systems group into two families, one of which has NPPF in the highly conserved catalytic motif IV and modifies adenine to m6A, and one having an NPPY catalytic motif IV and modifying cytosine to m4C. The high degree of similarity among their cytosine-recognizing components (MTase and S) suggests they have recently evolved, most likely from the far more common m6A Type I systems. Type I enzymes that modify cytosine exclusively were formed by replacing the adenine target recognition domain (TRD) with a cytosine-recognizing TRD. These are the first examples of m4C modification in Type I RM systems. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

Genome and plasmid analysis of blaIMP-4-carrying Citrobacter freundii B38.

Sequencing of the blaIMP-4-carrying C. freundii B38 using the PacBio SMRT technique revealed that the genome contained a chromosome of 5,134,500 bp, and three plasmids, pOZ172 (127,005 bp), pOZ181 (277,592 bp), and pOZ182 (18,467 bp). Plasmid pOZ172 was identified as IncFIIY, like pP10164-NDM and pNDM-EcGN174. It carries a class 1 integron with four cassettes: blaIMP-4-qacG2-aacA4-aphA15, and a complete hybrid tni module (tniR-tniQ-tniB-tniA). The recombination of tniR from Tn402 (identical) with tniQBA (99%) from Tn5053 occurred within the res site of Tn402/5053. The Tn402/5053-like integron, named Tn6017, was inserted into Tn1722 at the res II site. The replication, partitioning and transfer systems of pOZ181 were similar to IncHI2 (e.g. R478) and contained a sul1-type class 1 integron with the cassette array: orf-dfrA1-orf-gcu37-aadA5 linked to an upstream Tn1696 tnpA-tnpR and to a downstream 3′-CS and ISCR1. A Tn2 transposon with a blaTEM-1b β-lactamase was identified on pOZ182. Other interesting resistance determinants on the B38 chromosome included MDR efflux pumps, AmpC β-lactamase, and resistances to Cu, Ag, As, and Zn. This is the first report of a complete tni module linked to a blaIMP-4-carrying class 1 integron, and together with other recently reported non-sul1 integrons, represents the emergence of a distinct evolutionary lineage of class 1 integrons lacking a 3′-CS (qacEΔ1-sul1). The unique cassette array, complete tni module of Tn6017, and incompatibility group of pOZ172 suggest a different blaIMP-4 evolutionary pathway in C. freundii B38 compared to other blaIMP-4 found in Gram-negative bacteria in the Western Pacific Region. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Plasmids from Shiga toxin-producing Escherichia coli strains with rare enterohemolysin gene (ehxA) subtypes reveal pathogenicity potential and display a novel evolutionary path.

Most Shiga toxin-producing Escherichia coli (STEC) strains associated with severe disease, such as hemolytic-uremic syndrome (HUS), carry large enterohemolysin-encoding (ehxA) plasmids, e.g., pO157 and pO103, that contribute to STEC clinical manifestations. Six ehxA subtypes (A through F) exist that phylogenetically cluster into eae-positive (B, C, F), a mix of eae-positive (E) and eae-negative (A), and a third, more distantly related, cluster of eae-negative (D) STEC strains. While subtype B, C, and F plasmids share a number of virulence traits that are distinct from those of subtype A, sequence data have not been available for subtype D and E plasmids. Here, we determined and compared the genetic composition of four subtype D and two subtype E plasmids to establish their evolutionary relatedness among ehxA subtypes and define their potential role in pathogenicity. We found that subtype D strains carry one exceptionally large plasmid (>200 kbp) that carries a variety of virulence genes that are associated with enterotoxigenic and enterohemorrhagic E. coli, which, quite possibly, enables these strains to cause disease despite being food isolates. Our data offer further support for the hypothesis that this subtype D plasmid represents a novel virulence plasmid, sharing very few genetic features with other plasmids; we conclude that these plasmids have evolved from a different evolutionary lineage than the plasmids carrying the other ehxA subtypes. In contrast, the 50-kbp plasmids of subtype E (pO145), although isolated from HUS outbreak strains, carried only few virulence-associated determinants, suggesting that the clinical presentation of subtype E strains is largely a result of chromosomally encoded virulence factors. Bacterial plasmids are known to be key agents of change in microbial populations, promoting the dissemination of various traits, such as drug resistance and virulence.
This study determined the genetic makeup of virulence plasmids from rare enterohemolysin subtype D and E Shiga toxin-producing E. coli strains. We demonstrated that ehxA subtype D plasmids represent a novel E. coli virulence plasmid, and although subtype D plasmids were derived from nonclinical isolates, they encoded a variety of virulence determinants that are associated with pathogenic E. coli. In contrast, subtype E plasmids, isolated from strains recovered from severely ill patients, carry only a few virulence determinants. The results of this study reemphasize the plasticity and vast diversity among E. coli plasmids. This work demonstrates that, although E. coli strains of certain serogroups may not be frequently associated with disease, they should not be underestimated in protecting human health and food safety. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Genomic analyses of multidrug resistant Pseudomonas aeruginosa PA1 resequenced by single-molecule real-time sequencing.

As a third-generation sequencing (TGS) method, single-molecule real-time (SMRT) technology provides long read length, and it is well suited for resequencing projects and de novo assembly. In the present study, Pseudomonas aeruginosa PA1 was characterized and resequenced using SMRT technology. PA1 was also subjected to genomic, comparative and pan-genomic analyses. The multidrug resistant strain PA1 possesses a 6,498,072 bp genome and a sequence type of ST-782. The genome of PA1 was also visualized, and the results revealed the details of general genome annotations, virulence factors, regulatory proteins (RPs), secretion system proteins, type II toxin-antitoxin (T-A) pairs and genomic islands. Whole genome comparison analysis suggested that PA1 exhibits similarity to other P. aeruginosa strains but differs in terms of horizontal gene transfer (HGT) regions, such as prophages and genomic islands. Phylogenetic analyses based on 16S rRNA sequences demonstrated that PA1 is closely related to PAO1, and P. aeruginosa strains can be divided into two main groups. The pan-genome of P. aeruginosa consists of a core genome of approximately 4,000 genes and an accessory genome of at least 6,600 genes. The present study presented a detailed, visualized and comparative analysis of the PA1 genome, to enhance our understanding of this notorious pathogen. © 2016 The Author(s).


July 7, 2019

Genomic insights into Campylobacter jejuni virulence and population genetics

Campylobacter jejuni has long been recognized as a main food-borne pathogen in many parts of the world. Natural reservoirs include a wide variety of domestic and wild birds and mammals, whose intestines offer a suitable biological niche for the survival and dissemination of the organism. Understanding the genetic basis of the biology and pathogenicity of C. jejuni is vital to prevent and control Campylobacter-associated infections. The recent progress in sequencing techniques has allowed for a rapid increase in our knowledge of the molecular biology and the genetic structures of Campylobacter. Single-molecule real-time (SMRT) sequencing, which goes beyond four-base sequencing, revealed the role of DNA methylation in modulating the biology and virulence of C. jejuni at the level of epigenetics. In this review, we provide an up-to-date overview of recent advances in understanding C. jejuni genomics, including structural features of genomes, genetic traits of virulence, population genetics, and epigenetics.


July 7, 2019

Genomic analysis of phylotype I strain EP1 reveals substantial divergence from other strains in the Ralstonia solanacearum species complex.

Ralstonia solanacearum species complex is a devastating group of phytopathogens with an unusually wide host range and broad geographical distribution. R. solanacearum isolates may differ considerably in various properties including host range and pathogenicity, but the underlying genetic bases remain vague. Here, we conducted the genome sequencing of strain EP1 isolated from Guangdong Province of China, which belongs to phylotype I and is highly virulent to a range of solanaceous crops. Its complete genome contains a 3.95-Mb chromosome and a 2.05-Mb mega-plasmid, which is considerably bigger than reported genomes of other R. solanacearum strains. Both the chromosome and the mega-plasmid have essential house-keeping genes and many virulence genes. Comparative analysis of strain EP1 with other 3 phylotype I and 3 phylotype II, III, IV strains unveiled substantial genome rearrangements, insertions and deletions. Genome sequences are relatively conserved among the 4 phylotype I strains, but more divergent among strains of different phylotypes. Moreover, the strains exhibited considerable variations in their key virulence genes, including those encoding secretion systems and type III effectors. Our results provide valuable information for further elucidation of the genetic basis of diversified virulences and host range of R. solanacearum species.


July 7, 2019

Complete genome sequence of Clostridium estertheticum DSM 8809, a microbe identified in spoiled vacuum packed beef.

Blown pack spoilage (BPS) is a major issue for the beef industry. Etiological agents of BPS involve members of a group of Clostridium species, including Clostridium estertheticum which has the ability to produce gas, mostly carbon dioxide, under anaerobic psychrotrophic growth conditions. This spore-forming bacterium grows slowly under laboratory conditions, and it can take up to 3 months to produce a workable culture. These characteristics have limited the study of this commercially challenging bacterium. Consequently information on this bacterium is limited and no effective controls are currently available to confidently detect and manage this production risk. In this study the complete genome of C. estertheticum DSM 8809 was determined by SMRT® sequencing. The genome consists of a circular chromosome of 4.7 Mbp along with a single plasmid carrying a potential tellurite resistance gene tehB and a Tn3-like resolvase-encoding gene tnpR. The genome sequence was searched for central metabolic pathways that would support its biochemical profile and several enzymes contributing to this phenotype were identified. Several putative antibiotic/biocide/metal resistance-encoding genes and virulence factors were also identified in the genome, a feature that requires further research. The availability of the genome sequence will provide a basic blueprint from which to develop valuable biomarkers that could support and improve the detection and control of this bacterium along the beef production chain.


July 7, 2019

Cell cycle constraints and environmental control of local DNA hypomethylation in α-proteobacteria.

Heritable DNA methylation imprints are ubiquitous and underlie genetic variability from bacteria to humans. In microbial genomes, DNA methylation has been implicated in gene transcription, DNA replication and repair, nucleoid segregation, transposition and virulence of pathogenic strains. Despite the importance of local (hypo)methylation at specific loci, how and when these patterns are established during the cell cycle remains poorly characterized. Taking advantage of the small genomes and the synchronizability of α-proteobacteria, we discovered that conserved determinants of the cell cycle transcriptional circuitry establish specific hypomethylation patterns in the cell cycle model system Caulobacter crescentus. We used genome-wide methyl-N6-adenine (m6A-) analyses by restriction-enzyme-cleavage sequencing (REC-Seq) and single-molecule real-time (SMRT) sequencing to show that MucR, a transcriptional regulator that represses virulence and cell cycle genes in S-phase but no longer in G1-phase, occludes 5′-GANTC-3′ sequence motifs that are methylated by the DNA adenine methyltransferase CcrM. Constitutive expression of CcrM or heterologous methylases in at least two different α-proteobacteria homogenizes m6A patterns even when MucR is present and affects promoter activity. Environmental stress (phosphate limitation) can override and reconfigure local hypomethylation patterns imposed by the cell cycle circuitry that dictate when and where local hypomethylation is instated.


July 7, 2019

Smooth q-Gram, and its applications to detection of overlaps among long, error-prone sequencing reads

We propose smooth q-gram, the first variant of q-gram that captures q-gram pairs within a small edit distance. We apply smooth q-gram to the problem of detecting overlapping pairs of error-prone reads produced by single molecule real time sequencing (SMRT), which is the first and most critical step of the de novo fragment assembly of SMRT reads. We have implemented and tested our algorithm on a set of real world benchmarks. Our empirical results demonstrated the significant superiority of our algorithm over the existing q-gram based algorithms in accuracy.
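To make the idea of matching q-grams within a small edit distance concrete, the sketch below compares exact q-gram sharing with a brute-force count of q-gram pairs whose edit distance stays under a threshold. It is not the smooth q-gram algorithm from the paper, whose construction the abstract does not describe; it only illustrates why tolerating a few edits recovers matches that exact q-grams miss on error-prone reads. All function names and the toy reads are assumptions made for illustration.

```python
from itertools import product


def qgrams(seq, q):
    """Return all overlapping substrings of length q, in order."""
    return [seq[i:i + q] for i in range(len(seq) - q + 1)]


def edit_distance(a, b):
    """Levenshtein distance computed by row-wise dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                # delete from a
                            curr[j - 1] + 1,            # insert into a
                            prev[j - 1] + (ca != cb)))  # substitute or match
        prev = curr
    return prev[-1]


def matching_qgram_pairs(read_a, read_b, q, max_dist):
    """Count q-gram pairs (one per read) within max_dist edits.

    Brute force over all pairs: quadratic in read length, for illustration only.
    """
    return sum(1 for x, y in product(qgrams(read_a, q), qgrams(read_b, q))
               if edit_distance(x, y) <= max_dist)


# Two toy reads that differ by a single substitution.
read_a, read_b = "ACGTACGT", "ACGTTCGT"
exact = matching_qgram_pairs(read_a, read_b, q=4, max_dist=0)
near = matching_qgram_pairs(read_a, read_b, q=4, max_dist=1)
print(exact, near)
```

With max_dist set to 0 the count reduces to ordinary exact q-gram sharing; raising the threshold admits the q-grams that span the substituted base, at the cost of more spurious pairs, which is the trade-off any error-tolerant q-gram scheme has to manage.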

