Menu
July 7, 2019

De novo genome assembly of the economically important weed horseweed using integrated data from multiple sequencing platforms.

Horseweed (Conyza canadensis), a member of the Compositae (Asteraceae) family, was the first broadleaf weed to evolve resistance to glyphosate. Horseweed, one of the most problematic weeds in the world, is a true diploid (2n = 2x = 18), with the smallest genome of any known agricultural weed (335 Mb). Thus, it is an appropriate candidate to help us understand the genetic and genomic bases of weediness. We undertook a draft de novo genome assembly of horseweed by combining data from multiple sequencing platforms (454 GS-FLX, Illumina HiSeq 2000, and PacBio RS) using various libraries with different insertion sizes (approximately 350 bp, 600 bp, 3 kb, and 10 kb) of a Tennessee-accessed, glyphosate-resistant horseweed biotype. From 116.3 Gb (approximately 350× coverage) of data, the genome was assembled into 13,966 scaffolds with 50% of the assembly = 33,561 bp. The assembly covered 92.3% of the genome, including the complete chloroplast genome (approximately 153 kb) and a nearly complete mitochondrial genome (approximately 450 kb in 120 scaffolds). The nuclear genome is composed of 44,592 protein-coding genes. Genome resequencing of seven additional horseweed biotypes was performed. These sequence data were assembled and used to analyze genome variation. Simple sequence repeat and single-nucleotide polymorphisms were surveyed. Genomic patterns were detected that associated with glyphosate-resistant or -susceptible biotypes. The draft genome will be useful to better understand weediness and the evolution of herbicide resistance and to devise new management strategies. The genome will also be useful as another reference genome in the Compositae. To our knowledge, this article represents the first published draft genome of an agricultural weed.© 2014 American Society of Plant Biologists. All Rights Reserved.


July 7, 2019

Simultaneous sequencing of oxidized methylcytosines produced by TET/JBP dioxygenases in Coprinopsis cinerea.

TET/JBP enzymes oxidize 5-methylpyrimidines in DNA. In mammals, the oxidized methylcytosines (oxi-mCs) function as epigenetic marks and likely intermediates in DNA demethylation. Here we present a method based on diglucosylation of 5-hydroxymethylcytosine (5hmC) to simultaneously map 5hmC, 5-formylcytosine, and 5-carboxylcytosine at near-base-pair resolution. We have used the method to map the distribution of oxi-mC across the genome of Coprinopsis cinerea, a basidiomycete that encodes 47 TET/JBP paralogs in a previously unidentified class of DNA transposons. Like 5-methylcytosine residues from which they are derived, oxi-mC modifications are enriched at centromeres, TET/JBP transposons, and multicopy paralogous genes that are not expressed, but rarely mark genes whose expression changes between two developmental stages. Our study provides evidence for the emergence of an epigenetic regulatory system through recruitment of selfish elements in a eukaryotic lineage, and describes a method to map all three different species of oxi-mCs simultaneously.


July 7, 2019

Genome annotation provides insight into carbon monoxide and hydrogen metabolism in Rubrivivax gelatinosus.

We report here the sequencing and analysis of the genome of the purple non-sulfur photosynthetic bacterium Rubrivivax gelatinosus CBS. This microbe is a model for studies of its carboxydotrophic life style under anaerobic condition, based on its ability to utilize carbon monoxide (CO) as the sole carbon substrate and water as the electron acceptor, yielding CO2 and H2 as the end products. The CO-oxidation reaction is known to be catalyzed by two enzyme complexes, the CO dehydrogenase and hydrogenase. As expected, analysis of the genome of Rx. gelatinosus CBS reveals the presence of genes encoding both enzyme complexes. The CO-oxidation reaction is CO-inducible, which is consistent with the presence of two putative CO-sensing transcription factors in its genome. Genome analysis also reveals the presence of two additional hydrogenases, an uptake hydrogenase that liberates the electrons in H2 in support of cell growth, and a regulatory hydrogenase that senses H2 and relays the signal to a two-component system that ultimately controls synthesis of the uptake hydrogenase. The genome also contains two sets of hydrogenase maturation genes which are known to assemble the catalytic metallocluster of the hydrogenase NiFe active site. Collectively, the genome sequence and analysis information reveals the blueprint of an intricate network of signal transduction pathways and its underlying regulation that enables Rx. gelatinosus CBS to thrive on CO or H2 in support of cell growth.


July 7, 2019

Co-option of Sox3 as the male-determining factor on the Y chromosome in the fish Oryzias dancena.

Sex chromosomes harbour a primary sex-determining signal that triggers sexual development of the organism. However, diverse sex chromosome systems have been evolved in vertebrates. Here we use positional cloning to identify the sex-determining locus of a medaka-related fish, Oryzias dancena, and find that the locus on the Y chromosome contains a cis-regulatory element that upregulates neighbouring Sox3 expression in developing gonad. Sex-reversed phenotypes in Sox3(Y) transgenic fish, and Sox3(Y) loss-of-function mutants all point to its critical role in sex determination. Furthermore, we demonstrate that Sox3 initiates testicular differentiation by upregulating expression of downstream Gsdf, which is highly conserved in fish sex differentiation pathways. Our results not only provide strong evidence for the independent recruitment of Sox3 to male determination in distantly related vertebrates, but also provide direct evidence that a novel sex determination pathway has evolved through co-option of a transcriptional regulator potentially interacted with a conserved downstream component.


July 7, 2019

Complete genome determination and analysis of Acholeplasma oculi strain 19L, highlighting the loss of basic genetic features in the Acholeplasmataceae.

BACKGROUND: Acholeplasma oculi belongs to the Acholeplasmataceae family, comprising the genera Acholeplasma and ‘Candidatus Phytoplasma’. Acholeplasmas are ubiquitous saprophytic bacteria. Several isolates are derived from plants or animals, whereas phytoplasmas are characterised as intracellular parasitic pathogens of plant phloem and depend on insect vectors for their spread. The complete genome sequences for eight strains of this family have been resolved so far, all of which were determined depending on clone-based sequencing. RESULTS:The A. oculi strain 19L chromosome was sequenced using two independent approaches. The first approach comprised sequencing by synthesis (Illumina) in combination with Sanger sequencing, while single molecule real time sequencing (PacBio) was used in the second. The genome was determined to be 1,587,120bp in size. Sequencing by synthesis resulted in six large genome fragments, while the single molecule real time sequencing approach yielded one circular chromosome sequence. High-quality sequences were obtained by both strategies differing in six positions, which are interpreted as reliable variations present in the culture population. Our genome analysis revealed 1,471 protein-coding genes and highlighted the absence of the F1FO-type Na+ ATPase system and GroEL/ES chaperone. Comparison of the four available Acholeplasma sequences revealed a core-genome encoding 703 proteins and a pan-genome of 2,867 proteins. CONCLUSIONS:The application of two state-of-the-art sequencing technologies highlights the potential of single molecule real time sequencing for complete genome determination. Comparative genome analyses revealed that the process of losing particular basic genetic features during genome reduction occurs in both genera, as indicated for several phytoplasma strains and at least A. oculi. The loss of the F1FO-type Na+ ATPase system may separate Acholeplasmataceae from other Mollicutes, while the loss of those genes encoding the chaperone GroEL/ES is not a rare exception in this bacterial class.


July 7, 2019

Global phylogenomic analysis of nonencapsulated Streptococcus pneumoniae reveals a deep-branching classic lineage that is distinct from multiple sporadic lineages.

The surrounding capsule of Streptococcus pneumoniae has been identified as a major virulence factor and is targeted by pneumococcal conjugate vaccines (PCV). However, nonencapsulated S. pneumoniae (non-Ec-Sp) have also been isolated globally, mainly in carriage studies. It is unknown if non-Ec-Sp evolve sporadically, if they have high antibiotic nonsusceptiblity rates and a unique, specific gene content. Here, whole-genome sequencing of 131 non-Ec-Sp isolates sourced from 17 different locations around the world was performed. Results revealed a deep-branching classic lineage that is distinct from multiple sporadic lineages. The sporadic lineages clustered with a previously sequenced, global collection of encapsulated S. pneumoniae (Ec-Sp) isolates while the classic lineage is comprised mainly of the frequently identified multilocus sequences types (STs) ST344 (n = 39) and ST448 (n = 40). All ST344 and nine ST448 isolates had high nonsusceptiblity rates to ß-lactams and other antimicrobials. Analysis of the accessory genome reveals that the classic non-Ec-Sp contained an increased number of mobile elements, than Ec-Sp and sporadic non-Ec-Sp. Performing adherence assays to human epithelial cells for selected classic and sporadic non-Ec-Sp revealed that the presence of a integrative conjugative element (ICE) results in increased adherence to human epithelial cells (P = 0.005). In contrast, sporadic non-Ec-Sp lacking the ICE had greater growth in vitro possibly resulting in improved fitness. In conclusion, non-Ec-Sp isolates from the classic lineage have evolved separately. They have spread globally, are well adapted to nasopharyngeal carriage and are able to coexist with Ec-Sp. Due to continued use of PCV, non-Ec-Sp may become more prevalent. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Dissemination of cephalosporin resistance genes between Escherichia coli strains from farm animals and humans by specific plasmid lineages.

Third-generation cephalosporins are a class of ß-lactam antibiotics that are often used for the treatment of human infections caused by Gram-negative bacteria, especially Escherichia coli. Worryingly, the incidence of human infections caused by third-generation cephalosporin-resistant E. coli is increasing worldwide. Recent studies have suggested that these E. coli strains, and their antibiotic resistance genes, can spread from food-producing animals, via the food-chain, to humans. However, these studies used traditional typing methods, which may not have provided sufficient resolution to reliably assess the relatedness of these strains. We therefore used whole-genome sequencing (WGS) to study the relatedness of cephalosporin-resistant E. coli from humans, chicken meat, poultry and pigs. One strain collection included pairs of human and poultry-associated strains that had previously been considered to be identical based on Multi-Locus Sequence Typing, plasmid typing and antibiotic resistance gene sequencing. The second collection included isolates from farmers and their pigs. WGS analysis revealed considerable heterogeneity between human and poultry-associated isolates. The most closely related pairs of strains from both sources carried 1263 Single-Nucleotide Polymorphisms (SNPs) per Mbp core genome. In contrast, epidemiologically linked strains from humans and pigs differed by only 1.8 SNPs per Mbp core genome. WGS-based plasmid reconstructions revealed three distinct plasmid lineages (IncI1- and IncK-type) that carried cephalosporin resistance genes of the Extended-Spectrum Beta-Lactamase (ESBL)- and AmpC-types. The plasmid backbones within each lineage were virtually identical and were shared by genetically unrelated human and animal isolates. Plasmid reconstructions from short-read sequencing data were validated by long-read DNA sequencing for two strains. Our findings failed to demonstrate evidence for recent clonal transmission of cephalosporin-resistant E. coli strains from poultry to humans, as has been suggested based on traditional, low-resolution typing methods. Instead, our data suggest that cephalosporin resistance genes are mainly disseminated in animals and humans via distinct plasmids.


July 7, 2019

The first 50 plant genomes

Fifty-five plant genomes have been published to date representing 49 different species (Table 1 includes PubMed IDs for complete reference). What have we learned from the first wave of plant genomes? It has been said that plant genome papers (and genome papers in general) are dry and lack “biology” and that the days of high impact plant genome papers are drawing to a close unless they explore significant biology. However, with each new genome, earlier observations are refined and plant genome papers continue to reveal novel aspects of genome biology. For example, the tomato and banana genome papers refined current thinking on the whole genome duplications (WGD) that shaped dicot and monocot genome evolution (D’Hont et al., 2012; Tomato Genome Consortium, 2012). These observations were enabled not only by high quality genome assemblies but also by a greater number of genomes available for com- parisons. In addition, the initial round of plant genomes enabled the first generation of functional genomics that helped to define the roles of hundreds of genes, provided unprecedented access to sequence-based markers for breeding, and provided glimpses into plant evolutionary history. More genomes, representing the diverse array of species in Viridiplantae are still required to gain a full understanding of plant genome structure, evolution, and complexity.


July 7, 2019

Genome sequence of Phaeobacter daeponensis type strain (DSM 23529(T)), a facultatively anaerobic bacterium isolated from marine sediment, and emendation of Phaeobacter daeponensis.

TF-218(T) is the type strain of the species Phaeobacter daeponensis Yoon et al. 2007, a facultatively anaerobic Phaeobacter species isolated from tidal flats. Here we describe the draft genome sequence and annotation of this bacterium together with previously unreported aspects of its phenotype. We analyzed the genome for genes involved in secondary metabolite production and its anaerobic lifestyle, which have also been described for its closest relative Phaeobacter caeruleus. The 4,642,596 bp long genome of strain TF-218(T) contains 4,310 protein-coding genes and 78 RNA genes including four rRNA operons and consists of five replicons: one chromosome and four extrachromosomal elements with sizes of 276 kb, 174 kb, 117 kb and 90 kb. Genome analysis showed that TF-218(T) possesses all of the genes for indigoidine biosynthesis, and on specific media the strain showed a blue pigmentation. We also found genes for dissimilatory nitrate reduction, gene-transfer agents, NRPS/ PKS genes and signaling systems homologous to the LuxR/I system.


July 7, 2019

Mutation in the C-di-AMP cyclase dacA affects fitness and resistance of methicillin resistant Staphylococcus aureus.

Faster growing and more virulent strains of methicillin resistant Staphylococcus aureus (MRSA) are increasingly displacing highly resistant MRSA. Elevated fitness in these MRSA is often accompanied by decreased and heterogeneous levels of methicillin resistance; however, the mechanisms for this phenomenon are not yet fully understood. Whole genome sequencing was used to investigate the genetic basis of this apparent correlation, in an isogenic MRSA strain pair that differed in methicillin resistance levels and fitness, with respect to growth rate. Sequencing revealed only one single nucleotide polymorphism (SNP) in the diadenylate cyclase gene dacA in the faster growing but less resistant strain. Diadenylate cyclases were recently discovered to synthesize the new second messenger cyclic diadenosine monophosphate (c-di-AMP). Introduction of this mutation into the highly resistant but slower growing strain reduced resistance and increased its growth rate, suggesting a direct connection between the dacA mutation and the phenotypic differences of these strains. Quantification of cellular c-di-AMP revealed that the dacA mutation decreased c-di-AMP levels resulting in reduced autolysis, increased salt tolerance and a reduction in the basal expression of the cell wall stress stimulon. These results indicate that c-di-AMP affects cell envelope-related signalling in S. aureus. The influence of c-di-AMP on growth rate and methicillin resistance in MRSA indicate that altering c-di-AMP levels could be a mechanism by which MRSA strains can increase their fitness levels by reducing their methicillin resistance levels.


July 7, 2019

A hybrid approach for the automated finishing of bacterial genomes.

Advances in DNA sequencing technology have improved our ability to characterize most genomic diversity. However, accurate resolution of large structural events is challenging because of the short read lengths of second-generation technologies. Third-generation sequencing technologies, which can yield longer multikilobase reads, have the potential to address limitations associated with genome assembly. Here we combine sequencing data from second- and third-generation DNA sequencing technologies to assemble the two-chromosome genome of a recent Haitian cholera outbreak strain into two nearly finished contigs at >99.9% accuracy. Complex regions with clinically relevant structure were completely resolved. In separate control assemblies on experimental and simulated data for the canonical N16961 cholera reference strain, we obtained 14 scaffolds of greater than 1 kb for the experimental data and 8 scaffolds of greater than 1 kb for the simulated data, which allowed us to correct several errors in contigs assembled from the short-read data alone. This work provides a blueprint for the next generation of rapid microbial identification and full-genome assembly.


July 7, 2019

Analysis of a unique Clostridium botulinum strain from the Southern hemisphere producing a novel type E botulinum neurotoxin subtype.

Clostridium botulinum strains that produce botulinum neurotoxin type E (BoNT/E) are most commonly isolated from botulism cases, marine environments, and animals in regions of high latitude in the Northern hemisphere. A strain of C. botulinum type E (CDC66177) was isolated from soil in Chubut, Argentina. Previous studies showed that the amino acid sequences of BoNT/E produced by various strains differ by < 6% and that the type E neurotoxin gene cluster inserts into the rarA operon.Genetic and mass spectral analysis demonstrated that the BoNT/E produced by CDC66177 is a novel toxin subtype (E9). Toxin gene sequencing indicated that BoNT/E9 differed by nearly 11% at the amino acid level compared to BoNT/E1. Mass spectrometric analysis of BoNT/E9 revealed that its endopeptidase substrate cleavage site was identical to other BoNT/E subtypes. Further analysis of this strain demonstrated that its 16S rRNA sequence clustered with other Group II C. botulinum (producing BoNT types B, E, and F) strains. Genomic DNA isolated from strain CDC66177 hybridized with fewer probes using a Group II C. botulinum subtyping microarray compared to other type E strains examined. Whole genome shotgun sequencing of strain CDC66177 revealed that while the toxin gene cluster inserted into the rarA operon similar to other type E strains, its overall genome content shared greater similarity with a Group II C. botulinum type B strain (17B).These results expand our understanding of the global distribution of C. botulinum type E strains and suggest that the type E toxin gene cluster may be able to insert into C. botulinum strains with a more diverse genetic background than previously recognized.


July 7, 2019

Genomic analysis of 495 vancomycin-resistant Enterococcus faecium reveals broad dissemination of a vanA plasmid in more than 19 clones from Copenhagen, Denmark.

From 2012 to 2014, there has been a huge increase in vancomycin-resistant (vanA) Enterococcus faecium (VREfm) in Copenhagen, Denmark, with 602 patients infected or colonized with VREfm in 2014 compared with just 22 in 2012. The objective of this study was to describe the genetic epidemiology of VREfm to assess the contribution of clonal spread and horizontal transfer of the vanA transposon (Tn1546) and plasmid in the dissemination of VREfm in hospitals.VREfm from Copenhagen, Denmark (2012-14) were whole-genome sequenced. The clonal structure was determined and the structure of Tn1546-like transposons was characterized. One VREfm isolate belonging to the largest clonal group was sequenced using long-read technology to close a 37 kb vanA plasmid.Phylogeny revealed a polyclonal structure where 495 VREfm isolates were divided into 13 main groups and 7 small groups. The majority of the isolates were located in three groups (n?=?44, 100 and 218) and clonal spread of VREfm between wards and hospitals was identified. Five Tn1546-like transposon types were identified. A dominant truncated transposon (type 4, 92%) was spread across all but one VREfm group. The closed vanA plasmid was highly covered by reads from isolates containing the type 4 transposon.This study suggests that it was the dissemination of the type 4 Tn1546-like transposon and plasmid via horizontal transfer to multiple populations of E. faecium, followed by clonal spread of new VREfm clones, that contributed to the increase in and diversity of VREfm in Danish hospitals.© The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.