Menu
July 7, 2019

The evolutionary life cycle of the polysaccharide biosynthetic gene cluster based on the Sphingomonadaceae.

Although clustering of genes from the same metabolic pathway is a widespread phenomenon, the evolution of the polysaccharide biosynthetic gene cluster remains poorly understood. To determine the evolution of this pathway, we identified a scattered production pathway of the polysaccharide sanxan by Sphingomonas sanxanigenens NX02, and compared the distribution of genes between sphingan-producing and other Sphingomonadaceae strains. This allowed us to determine how the scattered sanxan pathway developed, and how the polysaccharide gene cluster evolved. Our findings suggested that the evolution of microbial polysaccharide biosynthesis gene clusters is a lengthy cyclic process comprising cluster 1???scatter???cluster 2. The sanxan biosynthetic pathway proved the existence of a dispersive process. We also report the complete genome sequence of NX02, in which we identified many unstable genetic elements and powerful secretion systems. Furthermore, nine enzymes for the formation of activated precursors, four glycosyltransferases, four acyltransferases, and four polymerization and export proteins were identified. These genes were scattered in the NX02 genome, and the positive regulator SpnA of sphingans synthesis could not regulate sanxan production. Finally, we concluded that the evolution of the sanxan pathway was independent. NX02 evolved naturally as a polysaccharide producing strain over a long-time evolution involving gene acquisitions and adaptive mutations.


July 7, 2019

Acquisition of virulence factors in livestock-associated MRSA: Lysogenic conversion of CC398 strains by virulence gene-containing phages.

Staphylococcus aureus MRSA strains belonging to the clonal complex 398 (CC398) are highly prevalent in livestock and companion animals but may also cause serious infections in humans. CC398 strains in livestock usually do not possess well-known virulence factors that can be frequently found in other MRSA sequence types (ST). Since many staphylococcal virulence genes are residing on the genomes of temperate phages, the question arises why livestock-associated (LA-) CC398 strains are only rarely infected by those phages. We isolated and characterized four temperate phages (P240, P282, P630 and P1105) containing genes of the immune evasion cluster (IEC) and/or for the Panton-Valentine leucocidin (PVL). Sequence analysis of the phage genomes showed that they are closely related to known phages and that the DNA region encoding lysis proteins, virulence factors and the integrase exhibits numerous DNA repeats which may facilitate genomic rearrangements. All phages lysed and lysogenized LA-CC398 strains. Integration of IEC phage P282 was detected at ten sites of the hosts’ chromosome. The prophages were stably inherited in LA-CC398 and enterotoxin A, staphylokinase and PVL toxin were produced. The data demonstrate that lysogenic conversion of LA-CC398 strains by virulence-associated phages may occur and that new pathotypes may emerge by this mechanism.


July 7, 2019

The complete chloroplast genome sequence of tung tree (Vernicia fordii): Organization and phylogenetic relationships with other angiosperms.

Tung tree (Vernicia fordii) is an economically important tree widely cultivated for industrial oil production in China. To better understand the molecular basis of tung tree chloroplasts, we sequenced and characterized its genome using PacBio RS II sequencing platforms. The chloroplast genome was sequenced with 161,528?bp in length, composed with one pair of inverted repeats (IRs) of 26,819?bp, which were separated by one small single copy (SSC; 18,758?bp) and one large single copy (LSC; 89,132?bp). The genome contains 114 genes, coding for 81 protein, four ribosomal RNAs and 29 transfer RNAs. An expansion with integration of an additional rps19 gene in the IR regions was identified. Compared to the chloroplast genome of Jatropha curcas, a species from the same family, the tung tree chloroplast genome is distinct with 85 single nucleotide polymorphisms (SNPs) and 82 indels. Phylogenetic analysis suggests that V. fordii is a sister species with J. curcas within the Eurosids I. The nucleotide sequence provides vital molecular information for understanding the biology of this important oil tree.


July 7, 2019

Genome sequencing and population genomic analyses provide insights into the adaptive landscape of silver birch.

Silver birch (Betula pendula) is a pioneer boreal tree that can be induced to flower within 1 year. Its rapid life cycle, small (440-Mb) genome, and advanced germplasm resources make birch an attractive model for forest biotechnology. We assembled and chromosomally anchored the nuclear genome of an inbred B. pendula individual. Gene duplicates from the paleohexaploid event were enriched for transcriptional regulation, whereas tandem duplicates were overrepresented by environmental responses. Population resequencing of 80 individuals showed effective population size crashes at major points of climatic upheaval. Selective sweeps were enriched among polyploid duplicates encoding key developmental and physiological triggering functions, suggesting that local adaptation has tuned the timing of and cross-talk between fundamental plant processes. Variation around the tightly-linked light response genes PHYC and FRS10 correlated with latitude and longitude and temperature, and with precipitation for PHYC. Similar associations characterized the growth-promoting cytokinin response regulator ARR1, and the wood development genes KAK and MED5A.


July 7, 2019

High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development.

Using the latest sequencing and optical mapping technologies, we have produced a high-quality de novo assembly of the apple (Malus domestica Borkh.) genome. Repeat sequences, which represented over half of the assembly, provided an unprecedented opportunity to investigate the uncharacterized regions of a tree genome; we identified a new hyper-repetitive retrotransposon sequence that was over-represented in heterochromatic regions and estimated that a major burst of different transposable elements (TEs) occurred 21 million years ago. Notably, the timing of this TE burst coincided with the uplift of the Tian Shan mountains, which is thought to be the center of the location where the apple originated, suggesting that TEs and associated processes may have contributed to the diversification of the apple ancestor and possibly to its divergence from pear. Finally, genome-wide DNA methylation data suggest that epigenetic marks may contribute to agronomically relevant aspects, such as apple fruit development.


July 7, 2019

Lost in plasmids: next generation sequencing and the complex genome of the tick-borne pathogen Borrelia burgdorferi.

Borrelia (B.) burgdorferi sensu lato, including the tick-transmitted agents of human Lyme borreliosis, have particularly complex genomes, consisting of a linear main chromosome and numerous linear and circular plasmids. The number and structure of plasmids is variable even in strains within a single genospecies. Genes on these plasmids are known to play essential roles in virulence and pathogenicity as well as host and vector associations. For this reason, it is essential to explore methods for rapid and reliable characterisation of molecular level changes on plasmids. In this study we used three strains: a low passage isolate of B. burgdorferi sensu stricto strain B31(-NRZ) and two closely related strains (PAli and PAbe) that were isolated from human patients. Sequences of these strains were compared to the previously sequenced reference strain B31 (available in GenBank) to obtain proof-of-principle information on the suitability of next generation sequencing (NGS) library construction and sequencing methods on the assembly of bacterial plasmids. We tested the effectiveness of different short read assemblers on Illumina sequences, and of long read generation methods on sequence data from Pacific Bioscience single-molecule real-time (SMRT) and nanopore (Oxford Nanopore Technologies) sequencing technology.Inclusion of mate pair library reads improved the assembly in some plasmids as did prior enrichment of plasmids. While cp32 plasmids remained refractory to assembly using only short reads they were effectively assembled by long read sequencing methods. The long read SMRT and nanopore sequences came, however, at the cost of indels (insertions or deletions) appearing in an unpredictable manner. Using long and short read technologies together allowed us to show that the three B. burgdorferi s.s. strains investigated here, whilst having similar plasmid structures to each other (apart from fusion of cp32 plasmids), differed significantly from the reference strain B31-GB, especially in the case of cp32 plasmids.Short read methods are sufficient to assemble the main chromosome and many of the plasmids in B. burgdorferi. However, a combination of short and long read sequencing methods is essential for proper assembly of all plasmids including cp32 and thus, for gaining an understanding of host- or vector adaptations. An important conclusion from our work is that the evolution of Borrelia plasmids appears to be dynamic. This has important implications for the development of useful research strategies to monitor the risk of Lyme disease occurrence and how to medically manage it.


July 7, 2019

Complete genome of a panresistant Pseudomonas aeruginosa strain, isolated from a patient with respiratory failure in a Canadian community hospital.

We report here the complete genome sequence of a panresistant Pseudomonas aeruginosa strain, isolated from a patient with respiratory failure in Canada. No carbapenemase genes were identified. Carbapenem resistance is attributable to a frameshift in the oprD gene; the basis for colistin resistance remains undetermined. Copyright © 2017 Xiong et al.


July 7, 2019

Complete genome sequence of Staphylococcus epidermidis 1457.

Staphylococcus epidermidis 1457 is a frequently utilized strain that is amenable to genetic manipulation and has been widely used for biofilm-related research. We report here the whole-genome sequence of this strain, which encodes 2,277 protein-coding genes and 81 RNAs within its 2.4-Mb genome and plasmid. Copyright © 2017 Galac et al.


July 7, 2019

Complete genome sequence of a community-associated methicillin-resistant Staphylococcusaureus hypervirulent strain, USA300-C2406, isolated from a patient with a lethal case of necrotizing pneumonia.

USA300 is a predominant community-associated methicillin-resistant Staphylococcus aureus strain causing significant morbidity and mortality. We present here the full annotated genome of a USA300 hypervirulent clinical strain, USA300-C2406, isolated from a patient with a lethal case of necrotizing pneumonia, to gain a better understanding of USA300 hypervirulence. Copyright © 2017 McClure and Zhang.


July 7, 2019

High metabolic versatility of different toxigenic and non-toxigenic Clostridioides difficile isolates.

Clostridioides difficile (formerly Clostridium difficile) is a major nosocomial pathogen with an increasing number of community-acquired infections causing symptoms from mild diarrhea to life-threatening colitis. The pathogenicity of C. difficile is considered to be mainly associated with the production of genome-encoded toxins A and B. In addition, some strains also encode and express the binary toxin CDT. However; a large number of non-toxigenic C. difficile strains have been isolated from the human gut and the environment. In this study, we characterized the growth behavior, motility and fermentation product formation of 17 different C. difficile isolates comprising five different major genomic clades and five different toxin inventories in relation to the C. difficile model strains 630?erm and R20291. Within 33 determined fermentation products, we identified two yet undescribed products (5-methylhexanoate and 4-(methylthio)-butanoate) of C. difficile. Our data revealed major differences in the fermentation products obtained after growth in a medium containing casamino acids and glucose as carbon and energy source. While the metabolism of branched chain amino acids remained comparable in all isolates, the aromatic amino acid uptake and metabolism and the central carbon metabolism-associated fermentation pathways varied strongly between the isolates. The patterns obtained followed neither the classification of the clades nor the ribotyping patterns nor the toxin distribution. As the toxin formation is strongly connected to the metabolism, our data allow an improved differentiation of C. difficile strains. The observed metabolic flexibility provides the optimal basis for the adaption in the course of infection and to changing conditions in different environments including the human gut. Copyright © 2017 Elsevier GmbH. All rights reserved.


July 7, 2019

Toolkit for automated and rapid discovery of structural variants.

Structural variations (SV) are broadly defined as genomic alterations that affect > 50 bp of DNA, which are shown to have significant effect on evolution and disease. The advent of high throughput sequencing (HTS) technologies and the ability to perform whole genome sequencing (WGS), makes it feasible to study these variants in depth. However, discovery of all forms of SV using WGS has proven to be challenging as the short reads produced by the predominant HTS platforms (<200bp for current technologies) and the fact that most genomes include large amounts of repeats make it very difficult to unambiguously map and accurately characterize such variants. Furthermore, existing tools for SV discovery are primarily developed for only a few of the SV types, which may have conflicting sequence signatures (i.e. read pairs, read depth, split reads) with other, untargeted SV classes. Here we are introduce a new framework, Tardis, which combines multiple read signatures into a single package to characterize most SV types simultaneously, while preventing such conflicts. Tardis also has a modular structure that makes it easy to extend for the discovery of additional forms of SV. Copyright © 2017. Published by Elsevier Inc.


July 7, 2019

Analysis of complete genome sequence and major surface antigens of Neorickettsia helminthoeca, causative agent of salmon poisoning disease.

Neorickettsia helminthoeca, a type species of the genus Neorickettsia, is an endosymbiont of digenetic trematodes of veterinary importance. Upon ingestion of salmonid fish parasitized with infected trematodes, canids develop salmon poisoning disease (SPD), an acute febrile illness that is particularly severe and often fatal in dogs without adequate treatment. We determined and analysed the complete genome sequence of N. helminthoeca: a single small circular chromosome of 884 232 bp encoding 774 potential proteins. N. helminthoeca is unable to synthesize lipopolysaccharides and most amino acids, but is capable of synthesizing vitamins, cofactors, nucleotides and bacterioferritin. N. helminthoeca is, however, distinct from majority of the family Anaplasmataceae to which it belongs, as it encodes nearly all enzymes required for peptidoglycan biosynthesis, suggesting its structural hardiness and inflammatory potential. Using sera from dogs that were experimentally infected by feeding with parasitized fish or naturally infected in southern California, Western blot analysis revealed that among five predicted N. helminthoeca outer membrane proteins, P51 and strain-variable surface antigen were uniformly recognized. Our finding will help understanding pathogenesis, prevalence of N. helminthoeca infection among trematodes, canids and potentially other animals in nature to develop effective SPD diagnostic and preventive measures. Recent progresses in large-scale genome sequencing have been uncovering broad distribution of Neorickettsia spp., the comparative genomics will facilitate understanding of biology and the natural history of these elusive environmental bacteria.© 2017 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.


July 7, 2019

Complete genome sequences of 12 isolates of Listeria monocytogenes belonging to serotypes 1/2a, 1/2b, and 4b obtained from food products and food-processing environments in Canada.

Listeria monocytogenes is the etiological agent for an often fatal foodborne illness known as listeriosis. Here, we present the complete genome sequences of 12 L. monocytogenes isolates representing the three most common serotypes of this pathogen (1/2a, 1/2b, and 4b), collected in Canada from different food products and environmental sources.© Crown copyright 2017.


July 7, 2019

Whole genome and core genome multilocus sequence typing and single nucleotide polymorphism analyses of Listeria monocytogenes associated with an outbreak linked to cheese, United States, 2013.

Epidemiological findings of a listeriosis outbreak in 2013 implicated Hispanic-style cheese produced by Company A, and pulsed-field gel electrophoresis (PFGE) and whole genome sequencing (WGS) were performed on clinical isolates and representative isolates collected from Company A cheese and environmental samples during the investigation. The results strengthened the evidence for cheese as the vehicle. Surveillance sampling and WGS three months later revealed that the equipment purchased by Company B from Company A yielded an environmental isolate highly similar to all outbreak isolates. The whole genome and core genome multilocus sequence typing and single nucleotide polymorphism (SNP) analyses were compared to demonstrate the maximum discriminatory power obtained by using multiple analyses, which were needed to differentiate outbreak-associated isolates from a PFGE-indistinguishable isolate collected in a non-implicated food source in 2012. This unrelated isolate differed from the outbreak isolates by only 7 to 14 SNPs, and as a result, minimum spanning tree by the whole genome analyses and certain variant calling approach and phylogenetic algorithm for core genome-based analyses could not provide the differentiation between unrelated isolates. Our data also suggest that SNP/allele counts should always be combined with WGS clustering generated by phylogenetically meaningful algorithms on sufficient number of isolates, and SNP/allele threshold alone is not sufficient evidence to delineate an outbreak. The putative prophages were conserved across all the outbreak isolates. All outbreak isolates belonged to clonal complex 5 and serotype 1/2b, had an identical inlA sequence, which did not have premature stop codons.IMPORTANCE In this outbreak, multiple analytical approaches were used for maximum discriminatory power. A PFGE-matched, epidemiologically unrelated isolate had high genetic similarity to the outbreak-associated isolates, with as few as only 7 SNP differences. Therefore, the SNP/allele threshold should not be used as the only evidence to define the scope of an outbreak. It is critical that the SNP/allele counts be complemented by WGS clustering generated by phylogenetically meaningful algorithms to distinguish outbreak-associated isolates from epidemiologically unrelated isolates. Careful selection of a variant calling approach and phylogenetic algorithm is critical for core genome-based analyses. The whole genome-based analyses were able to construct the highly resolved phylogeny needed to support the findings of the outbreak investigation. Ultimately, epidemiologic evidence and multiple WGS analyses should be combined to increase the confidence in outbreak investigations. Copyright © 2017 Chen et al.


July 7, 2019

Genome sequencing: Illuminating the sunflower genome.

A high-quality sunflower genome provides insight into Asterid genome evolution. Moreover, integrative analyses based on quantitative genetics, expression and diversity data uncover the gene networks and candidate genes for oil metabolism and flowering time, two important agronomic traits for sunflowers.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.