Menu
July 7, 2019

Next-generation polyploid phylogenetics: rapid resolution of hybrid polyploid complexes using PacBio single-molecule sequencing.

Difficulties in generating nuclear data for polyploids have impeded phylogenetic study of these groups. We describe a high-throughput protocol and an associated bioinformatics pipeline (Pipeline for Untangling Reticulate Complexes (Purc)) that is able to generate these data quickly and conveniently, and demonstrate its efficacy on accessions from the fern family Cystopteridaceae. We conclude with a demonstration of the downstream utility of these data by inferring a multi-labeled species tree for a subset of our accessions. We amplified four c. 1-kb-long nuclear loci and sequenced them in a parallel-tagged amplicon sequencing approach using the PacBio platform. Purc infers the final sequences from the raw reads via an iterative approach that corrects PCR and sequencing errors and removes PCR-mediated recombinant sequences (chimeras). We generated data for all gene copies (homeologs, paralogs, and segregating alleles) present in each of three sets of 50 mostly polyploid accessions, for four loci, in three PacBio runs (one run per set). From the raw sequencing reads, Purc was able to accurately infer the underlying sequences. This approach makes it easy and economical to study the phylogenetics of polyploids, and, in conjunction with recent analytical advances, facilitates investigation of broad patterns of polyploid evolution.© 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.


July 7, 2019

A gapless genome sequence of the fungus Botrytis cinerea.

Following earlier incomplete and fragmented versions of a genome sequence for the grey mould Botrytis cinerea, we here report a gapless, near-finished genome sequence for B. cinerea strain B05.10. The assembly comprises 18 chromosomes and was confirmed by an optical map and a genetic map based on ~75 000 SNP markers. All chromosomes contain fully assembled centromeric regions, and 10 chromosomes have telomeres on both ends. The genetic map consisted of 4153 cM and comparison of genetic distances with the physical distances identified 40 recombination hotspots. The linkage map also identified two mutations, located in the previously described genes Bos1 and BcsdhB, that confer resistance to the fungicides boscalid and iprodione. The genome was predicted to encode 11 701 proteins. RNAseq data from >20 different samples were used to validate and improve gene models. Manual curation of chromosome 1 revealed interesting features, such as the occurrence of a dicistronic transcript and fully overlapping genes in opposite orientations, as well as many spliced antisense transcripts. Manual curation also revealed that UTRs of genes can be complex and long, with many UTRs exceeding lengths of 1 kb and possessing multiple introns. Community annotation is in progress. This article is protected by copyright. All rights reserved. © 2016 BSPP AND JOHN WILEY & SONS LTD.


July 7, 2019

Draft genome assembly and annotation of Glycyrrhiza uralensis, a medicinal legume.

Chinese liquorice/licorice (Glycyrrhiza uralensis) is a leguminous plant species whose roots and rhizomes have been widely used as a herbal medicine and natural sweetener. Whole-genome sequencing is essential for gene discovery studies and molecular breeding in liquorice. Here, we report a draft assembly of the approximately 379-Mb whole-genome sequence of strain 308-19 of G. uralensis; this assembly contains 34 445 predicted protein-coding genes. Comparative analyses suggested well-conserved genomic components and collinearity of gene loci (synteny) between the genome of liquorice and those of other legumes such as Medicago and chickpea. We observed that three genes involved in isoflavonoid biosynthesis, namely, 2-hydroxyisoflavanone synthase (CYP93C), 2,7,4′-trihydroxyisoflavanone 4′-O-methyltransferase/isoflavone 4′-O-methyltransferase (HI4OMT) and isoflavone-7-O-methyltransferase (7-IOMT) formed a cluster on the scaffold of the liquorice genome and showed conserved microsynteny with Medicago and chickpea. Based on the liquorice genome annotation, we predicted genes in the P450 and UDP-dependent glycosyltransferase (UGT) superfamilies, some of which are involved in triterpenoid saponin biosynthesis, and characterised their gene expression with the reference genome sequence. The genome sequencing and its annotations provide an essential resource for liquorice improvement through molecular breeding and the discovery of useful genes for engineering bioactive components through synthetic biology approaches.© 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.


July 7, 2019

Methods for genome-wide methylome profiling of Campylobacter jejuni.

Methylation has a profound role in the regulation of numerous biological processes in bacteria including virulence. The study of methylation in bacteria has greatly advanced thanks to next-generation sequencing technologies. These technologies have expedited the process of uncovering unique features of many bacterial methylomes such as characterizing previously uncharacterized methyltransferases, cataloging genome-wide DNA methylations in bacteria, identifying the frequency of methylation at particular genomic loci, and revealing regulatory roles of methylation in the biology of various bacterial species. For instance, methylation has been cited as a potential source for the pathogenicity differences observed in C. jejuni strains with syntenic genomes as seen in recent publications. Here, we describe the methodology for the use of Pacific Biosciences’ single molecule real-time (SMRT) sequencing for detecting methylation patterns in C. jejuni and bioinformatics tools to profile its methylome.


July 7, 2019

Resequencing and annotation of the Nostoc punctiforme ATTC 29133 genome: facilitating biofuel and high-value chemical production.

Cyanobacteria have the potential to produce bulk and fine chemicals and members belonging to Nostoc sp. have received particular attention due to their relatively fast growth rate and the relative ease with which they can be harvested. Nostoc punctiforme is an aerobic, motile, Gram-negative, filamentous cyanobacterium that has been studied intensively to enhance our understanding of microbial carbon and nitrogen fixation. The genome of the type strain N. punctiforme ATCC 29133 was sequenced in 2001 and the scientific community has used these genome data extensively since then. Advances in bioinformatics tools for sequence annotation and the importance of this organism prompted us to resequence and reanalyze its genome and to make both, the initial and improved annotation, available to the scientific community. The new draft genome has a total size of 9.1 Mbp and consists of 65 contiguous pieces of DNA with a GC content of 41.38% and 7664 protein-coding genes. Furthermore, the resequenced genome is slightly (5152 bp) larger and contains 987 more genes with functional prediction when compared to the previously published version. We deposited the annotation of both genomes in the Department of Energy’s IMG database to facilitate easy genome exploration by the scientific community without the need of in-depth bioinformatics skills. We expect that an facilitated access and ability to search the N. punctiforme ATCC 29133 for genes of interest will significantly facilitate metabolic engineering and genome prospecting efforts and ultimately the synthesis of biofuels and natural products from this keystone organism and closely related cyanobacteria.


July 7, 2019

Two stable variants of Burkholderia pseudomallei strain MSHR5848 express broadly divergent in vitro phenotypes associated with their virulence differences.

Burkholderia pseudomallei (Bp), the agent of melioidosis, causes disease ranging from acute and rapidly fatal to protracted and chronic. Bp is highly infectious by aerosol, can cause severe disease with nonspecific symptoms, and is naturally resistant to multiple antibiotics. However, no vaccine exists. Unlike many Bp strains, which exhibit random variability in traits such as colony morphology, Bp strain MSHR5848 exhibited two distinct and relatively stable colony morphologies on sheep blood agar plates: a smooth, glossy, pale yellow colony and a flat, rough, white colony. Passage of the two variants, designated “Smooth” and “Rough”, under standard laboratory conditions produced cultures composed of > 99.9% of the single corresponding type; however, both could switch to the other type at different frequencies when incubated in certain nutritionally stringent or stressful growth conditions. These MSHR5848 derivatives were extensively characterized to identify variant-associated differences. Microscopic and colony morphology differences on six differential media were observed and only the Rough variant metabolized sugars in selective agar. Antimicrobial susceptibilities and lipopolysaccharide (LPS) features were characterized and phenotype microarray profiles revealed distinct metabolic and susceptibility disparities between the variants. Results using the phenotype microarray system narrowed the 1,920 substrates to a subset which differentiated the two variants. Smooth grew more rapidly in vitro than Rough, yet the latter exhibited a nearly 10-fold lower lethal dose for mice than Smooth. Finally, the Smooth variant was phagocytosed and replicated to a greater extent and was more cytotoxic than Rough in macrophages. In contrast, multiple locus sequence type (MLST) analysis, ribotyping, and whole genome sequence analysis demonstrated the variants’ genetic conservation; only a single consistent genetic difference between the two was identified for further study. These distinct differences shown by two variants of a Bp strain will be leveraged to better understand the mechanism of Bp phenotypic variability and to possibly identify in vitro markers of infection.


July 7, 2019

First complete genome sequence of Marinilactibacillus piezotolerans strain 15R, a marine lactobacillus isolated from coal-bearing sediment 2.0 kilometers below the seafloor, determined by PacBio single-molecule real-time technology.

Marinilactibacillus piezotolerans strain 15R is a facultatively anaerobic heterotrophic lactobacillus isolated from deep marine subsurface sediment nearly 2 km below the seafloor in the northwestern Pacific. We report here the first whole-genome sequence of strain 15R. The identified genome sequence has 2,767,908 bp, 35.4% G+C content, and predicted 2,552 candidate protein-coding sequences, with no identified plasmids. Copyright © 2017 Wei et al.


July 7, 2019

Fallacy of the unique genome: sequence diversity within single Helicobacter pylori strains.

Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB a-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.IMPORTANCE Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish “the genome” of a bacterial strain. Variability is usually reduced (“only sequence from a single colony”), ignored (“just publish the consensus”), or placed in the “too-hard” basket (“analysis of raw read data is more robust”). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen Helicobacter pylori model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as “the genome” of a bacterial strain may be misleading. Copyright © 2017 Draper et al.


July 7, 2019

Complete genome sequences of three Cupriavidus strains isolated from various Malaysian environments.

Cupriavidus sp. USMAA1020, USMAA2-4, and USMAHM13 are capable of producing polyhydroxyalkanoate (PHA). This biopolymer is an alternative solution to synthetic plastics, whereby polyhydroxyalkanoate synthase is the key enzyme involved in PHA biosynthesis. Here, we report the complete genomes of three Cupriavidus sp. strains: USMAA1020, USMAA2-4, and USMAHM13. Copyright © 2017 Shafie et al.


July 7, 2019

Complete genome sequence of Bradyrhizobium japonicum J5, isolated from a soybean nodule in Hokkaido, Japan.

Soybean bradyrhizobia form root nodules on soybean plants and symbiotically fix N2 Strain J5 is phylogenetically far from well-known representatives within the Bradyrhizobium japonicum linage. The complete genome showed the largest single chromosomal (10.1 Mb) and symbiosis island (998 kb) among complete genomes of soybean bradyrhizobia. Copyright © 2017 Kanehara and Minamisawa.


July 7, 2019

Genome features of moderately halophilic polyhydroxyalkanoate-producing Yangia sp. CCB-MM3.

Yangia sp. CCB-MM3 was one of several halophilic bacteria isolated from soil sediment in the estuarine Matang Mangrove, Malaysia. So far, no member from the genus Yangia, a member of the Rhodobacteraceae family, has been reported sequenced. In the current study, we present the first complete genome sequence of Yangia sp. strain CCB-MM3. The genome includes two chromosomes and five plasmids with a total length of 5,522,061 bp and an average GC content of 65%. Since a different strain of Yangia sp. (ND199) was reported to produce a polyhydroxyalkanoate copolymer, the ability for this production was tested in vitro and confirmed for strain CCB-MM3. Analysis of its genome sequence confirmed presence of a pathway for production of propionyl-CoA and gene cluster for PHA production in the sequenced strain. The genome sequence described will be a useful resource for understanding the physiology and metabolic potential of Yangia as well as for comparative genomic analysis with other Rhodobacteraceae.


July 7, 2019

Identification of a Pseudomonas aeruginosa PAO1 DNA methyltransferase, its targets, and physiological roles.

DNA methylation is widespread among prokaryotes, and most DNA methylation reactions are catalyzed by adenine DNA methyltransferases, which are part of restriction-modification (R-M) systems. R-M systems are known for their role in the defense against foreign DNA; however, DNA methyltransferases also play functional roles in gene regulation. In this study, we used single-molecule real-time (SMRT) sequencing to uncover the genome-wide DNA methylation pattern in the opportunistic pathogen Pseudomonas aeruginosa PAO1. We identified a conserved sequence motif targeted by an adenine methyltransferase of a type I R-M system and quantified the presence of N(6)-methyladenine using liquid chromatography-tandem mass spectrometry (LC-MS/MS). Changes in the PAO1 methylation status were dependent on growth conditions and affected P. aeruginosa pathogenicity in a Galleria mellonella infection model. Furthermore, we found that methylated motifs in promoter regions led to shifts in sense and antisense gene expression, emphasizing the role of enzymatic DNA methylation as an epigenetic control of phenotypic traits in P. aeruginosa Since the DNA methylation enzymes are not encoded in the core genome, our findings illustrate how the acquisition of accessory genes can shape the global P. aeruginosa transcriptome and thus may facilitate adaptation to new and challenging habitats.IMPORTANCE With the introduction of advanced technologies, epigenetic regulation by DNA methyltransferases in bacteria has become a subject of intense studies. Here we identified an adenosine DNA methyltransferase in the opportunistic pathogen Pseudomonas aeruginosa PAO1, which is responsible for DNA methylation of a conserved sequence motif. The methylation level of all target sequences throughout the PAO1 genome was approximated to be in the range of 65 to 85% and was dependent on growth conditions. Inactivation of the methyltransferase revealed an attenuated-virulence phenotype in the Galleria mellonella infection model. Furthermore, differential expression of more than 90 genes was detected, including the small regulatory RNA prrF1, which contributes to a global iron-sparing response via the repression of a set of gene targets. Our finding of a methylation-dependent repression of the antisense transcript of the prrF1 small regulatory RNA significantly expands our understanding of the regulatory mechanisms underlying active DNA methylation in bacteria. Copyright © 2017 Doberenz et al.


July 7, 2019

ThermoAlign: a genome-aware primer design tool for tiled amplicon resequencing.

Isolating and sequencing specific regions in a genome is a cornerstone of molecular biology. This has been facilitated by computationally encoding the thermodynamics of DNA hybridization for automated design of hybridization and priming oligonucleotides. However, the repetitive composition of genomes challenges the identification of target-specific oligonucleotides, which limits genetics and genomics research on many species. Here, a tool called ThermoAlign was developed that ensures the design of target-specific primer pairs for DNA amplification. This is achieved by evaluating the thermodynamics of hybridization for full-length oligonucleotide-template alignments – thermoalignments – across the genome to identify primers predicted to bind specifically to the target site. For amplification-based resequencing of regions that cannot be amplified by a single primer pair, a directed graph analysis method is used to identify minimum amplicon tiling paths. Laboratory validation by standard and long-range polymerase chain reaction and amplicon resequencing with maize, one of the most repetitive genomes sequenced to date (˜85% repeat content), demonstrated the specificity-by-design functionality of ThermoAlign. ThermoAlign is released under an open source license and bundled in a dependency-free container for wide distribution. It is anticipated that this tool will facilitate multiple applications in genetics and genomics and be useful in the workflow of high-throughput targeted resequencing studies.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.