Menu
September 22, 2019

The complete replicons of 16 Ensifer meliloti strains offer insights into intra- and inter-replicon gene transfer, transposon-associated loci, and repeat elements.

Ensifer meliloti (formerly Rhizobium meliloti and Sinorhizobium meliloti) is a model bacterium for understanding legume-rhizobial symbioses. The tripartite genome of E. meliloti consists of a chromosome, pSymA and pSymB, and in some instances strain-specific accessory plasmids. The majority of previous sequencing studies have relied on the use of assemblies generated from short read sequencing, which leads to gaps and assembly errors. Here we used PacBio-based, long-read assemblies and were able to assemble, de novo, complete circular replicons. In this study, we sequenced, de novo-assembled and analysed 10 E. meliloti strains. Sequence comparisons were also done with data from six previously published genomes. We identified genome differences between the replicons, including mol% G+C and gene content, nucleotide repeats, and transposon-associated loci. Additionally, genomic rearrangements both within and between replicons were identified, providing insight into evolutionary processes at the structural level. There were few cases of inter-replicon gene transfer of core genes between the main replicons. Accessory plasmids were more similar to pSymA than to either pSymB or the chromosome, with respect to gene content, transposon content and G+C content. In our population, the accessory plasmids appeared to share an open genome with pSymA, which contains many nodulation- and nitrogen fixation-related genes. This may explain previous observations that horizontal gene transfer has a greater effect on the content of pSymA than pSymB, or the chromosome, and why some rhizobia show unstable nodulation phenotypes on legume hosts.


September 22, 2019

Whole genome sequence of an edible and potential medicinal fungus, Cordyceps guangdongensis.

Cordyceps guangdongensis is an edible fungus which was approved as a novel food by the Chinese Ministry of Public Health in 2013. It also has a broad prospect of application in pharmaceutical industries, with many medicinal activities. In this study, the whole genome of C. guangdongensis GD15, a single spore isolate from a wild strain, was sequenced and assembled with Illumina and PacBio sequencing technology. The generated genome is 29.05 Mb in size, comprising nine scaffolds with an average GC content of 57.01%. It is predicted to contain a total of 9150 protein-coding genes. Sequence identification and comparative analysis indicated that the assembled scaffolds contained two complete chromosomes and four single-end chromosomes, showing a high level assembly. Gene annotation revealed a diversity of transposons that could contribute to the genome size and evolution. Besides, approximately 15.57% and 12.01% genes involved in metabolic processes were annotated by KEGG and COG respectively. Genes belonging to CAZymes accounted for 3.15% of the total genes. In addition, 435 transcription factors, involved in various biological processes, were identified. Among the identified transcription factors, the fungal transcription regulatory proteins (18.39%) and fungal-specific transcription factors (19.77%) represented the two largest classes of transcription factors. This genomic resource provided a new insight into better understanding the relevance of phenotypic characters and genetic mechanisms in C. guangdongensis. Copyright © 2018 Zhang et al.


September 22, 2019

Comparative genomics of cocci-shaped Sporosarcina strains with diverse spatial isolation.

Cocci-shaped Sporosarcina strains are currently one of the few known cocci-shaped spore-forming bacteria, yet we know very little about the genomics. The goal of this study is to utilize comparative genomics to investigate the diversity of cocci-shaped Sporosarcina strains that differ in their geographical isolation and show different nutritional requirements.For this study, we sequenced 28 genomes of cocci-shaped Sporosarcina strains isolated from 13 different locations around the world. We generated the first six complete genomes and methylomes utilizing PacBio sequencing, and an additional 22 draft genomes using Illumina sequencing. Genomic analysis revealed that cocci-shaped Sporosarcina strains contained an average genome of 3.3 Mb comprised of 3222 CDS, 54 tRNAs and 6 rRNAs, while only two strains contained plasmids. The cocci-shaped Sporosarcina genome on average contained 2.3 prophages and 15.6 IS elements, while methylome analysis supported the diversity of these strains as only one of 31 methylation motifs were shared under identical growth conditions. Analysis with a 90% identity cut-off revealed 221 core genes or ~?7% of the genome, while a 30% identity cut-off generated a pan-genome of 8610 genes. The phylogenetic relationship of the cocci-shaped Sporosarcina strains based on either core genes, accessory genes or spore-related genes consistently resulted in the 29 strains being divided into eight clades.This study begins to unravel the phylogenetic relationship of cocci-shaped Sporosarcina strains, and the comparative genomics of these strains supports identification of several new species.


September 22, 2019

The Egyptian rousette genome reveals unexpected features of bat antiviral immunity.

Bats harbor many viruses asymptomatically, including several notorious for causing extreme virulence in humans. To identify differences between antiviral mechanisms in humans and bats, we sequenced, assembled, and analyzed the genome of Rousettus aegyptiacus, a natural reservoir of Marburg virus and the only known reservoir for any filovirus. We found an expanded and diversified KLRC/KLRD family of natural killer cell receptors, MHC class I genes, and type I interferons, which dramatically differ from their functional counterparts in other mammals. Such concerted evolution of key components of bat immunity is strongly suggestive of novel modes of antiviral defense. An evaluation of the theoretical function of these genes suggests that an inhibitory immune state may exist in bats. Based on our findings, we hypothesize that tolerance of viral infection, rather than enhanced potency of antiviral defenses, may be a key mechanism by which bats asymptomatically host viruses that are pathogenic in humans. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

Insights into platypus population structure and history from whole-genome sequencing.

The platypus is an egg-laying mammal which, alongside the echidna, occupies a unique place in the mammalian phylogenetic tree. Despite widespread interest in its unusual biology, little is known about its population structure or recent evolutionary history. To provide new insights into the dispersal and demographic history of this iconic species, we sequenced the genomes of 57 platypuses from across the whole species range in eastern mainland Australia and Tasmania. Using a highly improved reference genome, we called over 6.7?M SNPs, providing an informative genetic data set for population analyses. Our results show very strong population structure in the platypus, with our sampling locations corresponding to discrete groupings between which there is no evidence for recent gene flow. Genome-wide data allowed us to establish that 28 of the 57 sampled individuals had at least a third-degree relative among other samples from the same river, often taken at different times. Taking advantage of a sampled family quartet, we estimated the de novo mutation rate in the platypus at 7.0?×?10-9/bp/generation (95% CI 4.1?×?10-9-1.2?×?10-8/bp/generation). We estimated effective population sizes of ancestral populations and haplotype sharing between current groupings, and found evidence for bottlenecks and long-term population decline in multiple regions, and early divergence between populations in different regions. This study demonstrates the power of whole-genome sequencing for studying natural populations of an evolutionarily important species.


September 22, 2019

Double insertion of transposable elements provides a substrate for the evolution of satellite DNA.

Eukaryotic genomes are replete with repeated sequences in the form of transposable elements (TEs) dispersed across the genome or as satellite arrays, large stretches of tandemly repeated sequences. Many satellites clearly originated as TEs, but it is unclear how mobile genetic parasites can transform into megabase-sized tandem arrays. Comprehensive population genomic sampling is needed to determine the frequency and generative mechanisms of tandem TEs, at all stages from their initial formation to their subsequent expansion and maintenance as satellites. The best available population resources, short-read DNA sequences, are often considered to be of limited utility for analyzing repetitive DNA due to the challenge of mapping individual repeats to unique genomic locations. Here we develop a new pipeline called ConTExt that demonstrates that paired-end Illumina data can be successfully leveraged to identify a wide range of structural variation within repetitive sequence, including tandem elements. By analyzing 85 genomes from five populations of Drosophila melanogaster, we discover that TEs commonly form tandem dimers. Our results further suggest that insertion site preference is the major mechanism by which dimers arise and that, consequently, dimers form rapidly during periods of active transposition. This abundance of TE dimers has the potential to provide source material for future expansion into satellite arrays, and we discover one such copy number expansion of the DNA transposon hobo to approximately 16 tandem copies in a single line. The very process that defines TEs-transposition-thus regularly generates sequences from which new satellites can arise.© 2018 McGurk and Barbash; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Spread of plasmid-encoded NDM-1 and GES-5 carbapenemases among extensively drug-resistant and pandrug-resistant clinical Enterobacteriaceae in Durban, South Africa.

Whole-genome sequence analyses revealed the presence of blaNDM-1 (n = 31), blaGES-5 (n = 8), blaOXA-232 (n = 1), or blaNDM-5 (n = 1) in extensively drug-resistant and pandrug-resistant Enterobacteriaceae organisms isolated from in-patients in 10 private hospitals (2012 to 2013) in Durban, South Africa. Two novel NDM-1-encoding plasmids from Klebsiella pneumoniae were circularized by PacBio sequencing. In p19-10_01 [IncFIB(K); 223.434 bp], blaNDM-1 was part of a Tn1548-like structure (16.276 bp) delineated by IS26 The multireplicon plasmid p18-43_01 [IncR_1/IncFIB(pB171)/IncFII(Yp); 212.326 bp] shared an 80-kb region with p19-10_01, not including the blaNDM-1-containing region. The two plasmids were used as references for tracing NDM-1-encoding plasmids in the other genome assemblies. The p19-10_01 sequence was detected in K. pneumoniae (n = 7) only, whereas p18-43_01 was tracked to K. pneumoniae (n = 4), Klebsiella michiganensis (n = 1), Serratia marcescens (n = 11), Enterobacter spp. (n = 7), and Citrobacter freundii (n = 1), revealing horizontal spread of this blaNDM-1-bearing plasmid structure. Global phylogeny showed clustering of the K. pneumoniae (18/20) isolates together with closely related carbapenemase-negative ST101 isolates from other geographical origins. The South African isolates were divided into three phylogenetic subbranches, where each group had distinct resistance and replicon profiles, carrying either p19-10_01, p18-10_01, or pCHE-A1 (8,201 bp). The latter plasmid carried blaGES-5 and aacA4 within an integron mobilization unit. Our findings imply independent plasmid acquisition followed by local dissemination. Additionally, we detected blaOXA-232 carried by pPKPN4 in K. pneumoniae (ST14) and blaNDM-5 contained by a pNDM-MGR194-like genetic structure in Escherichia coli (ST167), adding even more complexity to the multilayer molecular mechanisms behind nosocomial spread of carbapenem-resistant Enterobacteriaceae in Durban, South Africa. Copyright © 2018 American Society for Microbiology.


September 22, 2019

PhdA catalyzes the first step of phenazine-1-carboxylic acid degradation in Mycobacterium fortuitum.

Phenazines are a class of bacterially produced redox-active metabolites that are found in natural, industrial, and clinical environments. In Pseudomonas spp., phenazine-1-carboxylic acid (PCA)-the precursor of all phenazine metabolites-facilitates nutrient acquisition, biofilm formation, and competition with other organisms. While the removal of phenazines negatively impacts these activities, little is known about the genes or enzymes responsible for phenazine degradation by other organisms. Here, we report that the first step of PCA degradation by Mycobacterium fortuitum is catalyzed by a phenazine-degrading decarboxylase (PhdA). PhdA is related to members of the UbiD protein family that rely on a prenylated flavin mononucleotide cofactor for activity. The gene for PhdB, the enzyme responsible for cofactor synthesis, is present in a putative operon with the gene encoding PhdA in a region of the M. fortuitum genome that is essential for PCA degradation. PhdA and PhdB are present in all known PCA-degrading organisms from the ActinobacteriaM. fortuitum can also catabolize other Pseudomonas-derived phenazines such as phenazine-1-carboxamide, 1-hydroxyphenazine, and pyocyanin. On the basis of our previous work and the current characterization of PhdA, we propose that degradation converges on a common intermediate: dihydroxyphenazine. An understanding of the genes responsible for degradation will enable targeted studies of phenazine degraders in diverse environments.IMPORTANCE Bacteria from phylogenetically diverse groups secrete redox-active metabolites that provide a fitness advantage for their producers. For example, phenazines from Pseudomonas spp. benefit the producers by facilitating anoxic survival and biofilm formation and additionally inhibit competitors by serving as antimicrobials. Phenazine-producing pseudomonads act as biocontrol agents by leveraging these antibiotic properties to inhibit plant pests. Despite this importance, the fate of phenazines in the environment is poorly understood. Here, we characterize an enzyme from Mycobacterium fortuitum that catalyzes the first step of phenazine-1-carboxylic acid degradation. Knowledge of the genetic basis of phenazine degradation will facilitate the identification of environments where this activity influences the microbial community structure. Copyright © 2018 American Society for Microbiology.


September 22, 2019

The consistent differential expression of genetic pathways following exposure of an industrial Pseudomonas aeruginosa strain to preservatives and a laundry detergent formulation.

Pseudomonas aeruginosa is a common contaminant associated with product recalls in the home and personal care industry. Preservation systems are used to prevent spoilage and protect consumers, but greater knowledge is needed of preservative resistance mechanisms used by P. aeruginosa contaminants. We aimed to identify genetic pathways associated with preservative exposure by using an industrial P. aeruginosa strain and implementing RNA-Seq to understand gene expression changes in response to industry relevant conditions. The consistent differential expression of five genetic pathways during exposure to multiple industrial growth conditions associated with benzisothiazolone (BIT) and phenoxyethanol (POE) preservatives, and a laundry detergent (LD) formulation, was observed. A MexPQ-OpmE Resistance Nodulation Division efflux pump system was commonly upregulated in response to POE, a combination of BIT and POE, and LD together with BIT. In response to all industry conditions, a putative sialic acid transporter and isoprenoid biosynthesis gnyRDBHAL operon demonstrated consistent upregulation. Two operons phnBA and pqsEDCBA involved in Pseudomonas quinolone signaling production and quorum-sensing were also consistently downregulated during exposure to all the industry conditions. The ability to identify consistently differentially expressed genetic pathways in P. aeruginosa can inform the development of future targeted preservation systems that maintain product safety and minimise resistance development.


September 22, 2019

Characterization of phenotypic variation and genome aberrations observed among Phytophthora ramorum isolates from diverse hosts.

Accumulating evidence suggests that genome plasticity allows filamentous plant pathogens to adapt to changing environments. Recently, the generalist plant pathogen Phytophthora ramorum has been documented to undergo irreversible phenotypic alterations accompanied by chromosomal aberrations when infecting trunks of mature oak trees (genus Quercus). In contrast, genomes and phenotypes of the pathogen derived from the foliage of California bay (Umbellularia californica) are usually stable. We define this phenomenon as host-induced phenotypic diversification (HIPD). P. ramorum also causes a severe foliar blight in some ornamental plants such as Rhododendron spp. and Viburnum spp., and isolates from these hosts occasionally show phenotypes resembling those from oak trunks that carry chromosomal aberrations. The aim of this study was to investigate variations in phenotypes and genomes of P. ramorum isolates from non-oak hosts and substrates to determine whether HIPD changes may be equivalent to those among isolates from oaks.We analyzed genomes of diverse non-oak isolates including those taken from foliage of Rhododendron and other ornamental plants, as well as from natural host species, soil, and water. Isolates recovered from artificially inoculated oak logs were also examined. We identified diverse chromosomal aberrations including copy neutral loss of heterozygosity (cnLOH) and aneuploidy in isolates from non-oak hosts. Most identified aberrations in non-oak hosts were also common among oak isolates; however, trisomy, a frequent type of chromosomal aberration in oak isolates was not observed in isolates from Rhododendron.This work cross-examined phenotypic variation and chromosomal aberrations in P. ramorum isolates from oak and non-oak hosts and substrates. The results suggest that HIPD comparable to that occurring in oak hosts occurs in non-oak environments such as in Rhododendron leaves. Rhododendron leaves are more easily available than mature oak stems and thus can potentially serve as a model host for the investigation of HIPD, the newly described plant-pathogen interaction.


September 22, 2019

High-oleate yeast oil without polyunsaturated fatty acids.

Oleate-enriched triacylglycerides are well-suited for lubricant applications that require high oxidative stability. Fatty acid carbon chain length and degree of desaturation are key determinants of triacylglyceride properties and the ability to manipulate fatty acid composition in living organisms is critical to developing a source of bio-based oil tailored to meet specific application requirements.We sought to engineer the oleaginous yeast Yarrowia lipolytica for production of high-oleate triacylglyceride oil. We studied the effect of deletions and overexpressions in the fatty acid and triacylglyceride synthesis pathways to identify modifications that increase oleate levels. Oleic acid accumulation in triacylglycerides was promoted by exchanging the native ?9 fatty acid desaturase and glycerol-3-phosphate acyltransferase with heterologous enzymes, as well as deletion of the ?12 fatty acid desaturase and expression of a fatty acid elongase. By combining these engineering steps, we eliminated polyunsaturated fatty acids and created a Y. lipolytica strain that accumulates triglycerides with >?90% oleate content.High-oleate content and lack of polyunsaturates distinguish this triacylglyceride oil from plant and algal derived oils. Its composition renders the oil suitable for applications that require high oxidative stability and further demonstrates the potential of Y. lipolytica as a producer of tailored lipid profiles.


September 22, 2019

Genomic changes associated with the evolutionary transitions of Nostoc to a plant symbiont.

Cyanobacteria belonging to the genus Nostoc comprise free-living strains and also facultative plant symbionts. Symbiotic strains can enter into symbiosis with taxonomically diverse range of host plants. Little is known about genomic changes associated with evolutionary transition of Nostoc from free-living to plant symbiont. Here, we compared the genomes derived from 11 symbiotic Nostoc strains isolated from different host plants and infer phylogenetic relationships between strains. Phylogenetic reconstructions of 89 Nostocales showed that symbiotic Nostoc strains with a broad host range, entering epiphytic and intracellular or extracellular endophytic interactions, form a monophyletic clade indicating a common evolutionary history. A polyphyletic origin was found for Nostoc strains which enter only extracellular symbioses, and inference of transfer events implied that this trait was likely acquired several times in the evolution of the Nostocales. Symbiotic Nostoc strains showed enriched functions in transport and metabolism of organic sulfur, chemotaxis and motility, as well as the uptake of phosphate, branched-chain amino acids, and ammonium. The genomes of the intracellular clade differ from that of other Nostoc strains, with a gain/enrichment of genes encoding proteins to generate l-methionine from sulfite and pathways for the degradation of the plant metabolites vanillin and vanillate, and of the macromolecule xylan present in plant cell walls. These compounds could function as C-sources for members of the intracellular clade. Molecular clock analysis indicated that the intracellular clade emerged ca. 600 Ma, suggesting that intracellular Nostoc symbioses predate the origin of land plants and the emergence of their extant hosts.


September 22, 2019

NextSV: a meta-caller for structural variants from low-coverage long-read sequencing data.

Structural variants (SVs) in human genomes are implicated in a variety of human diseases. Long-read sequencing delivers much longer read lengths than short-read sequencing and may greatly improve SV detection. However, due to the relatively high cost of long-read sequencing, it is unclear what coverage is needed and how to optimally use the aligners and SV callers.In this study, we developed NextSV, a meta-caller to perform SV calling from low coverage long-read sequencing data. NextSV integrates three aligners and three SV callers and generates two integrated call sets (sensitive/stringent) for different analysis purposes. We evaluated SV calling performance of NextSV under different PacBio coverages on two personal genomes, NA12878 and HX1. Our results showed that, compared with running any single SV caller, NextSV stringent call set had higher precision and balanced accuracy (F1 score) while NextSV sensitive call set had a higher recall. At 10X coverage, the recall of NextSV sensitive call set was 93.5 to 94.1% for deletions and 87.9 to 93.2% for insertions, indicating that ~10X coverage might be an optimal coverage to use in practice, considering the balance between the sequencing costs and the recall rates. We further evaluated the Mendelian errors on an Ashkenazi Jewish trio dataset.Our results provide useful guidelines for SV detection from low coverage whole-genome PacBio data and we expect that NextSV will facilitate the analysis of SVs on long-read sequencing data.


September 22, 2019

Phenotypic diversification by enhanced genome restructuring after induction of multiple DNA double-strand breaks.

DNA double-strand break (DSB)-mediated genome rearrangements are assumed to provide diverse raw genetic materials enabling accelerated adaptive evolution; however, it remains unclear about the consequences of massive simultaneous DSB formation in cells and their resulting phenotypic impact. Here, we establish an artificial genome-restructuring technology by conditionally introducing multiple genomic DSBs in vivo using a temperature-dependent endonuclease TaqI. Application in yeast and Arabidopsis thaliana generates strains with phenotypes, including improved ethanol production from xylose at higher temperature and increased plant biomass, that are stably inherited to offspring after multiple passages. High-throughput genome resequencing revealed that these strains harbor diverse rearrangements, including copy number variations, translocations in retrotransposons, and direct end-joinings at TaqI-cleavage sites. Furthermore, large-scale rearrangements occur frequently in diploid yeasts (28.1%) and tetraploid plants (46.3%), whereas haploid yeasts and diploid plants undergo minimal rearrangement. This genome-restructuring system (TAQing system) will enable rapid genome breeding and aid genome-evolution studies.


September 22, 2019

Novel haloarchaeon Natrinema thermophila having the highest growth temperature among haloarchaea with a large genome size.

Environmental temperature is one of the most important factors for the growth and survival of microorganisms. Here we describe a novel extremely halophilic archaeon (haloarchaea) designated as strain CBA1119T isolated from solar salt. Strain CBA1119T had the highest maximum and optimal growth temperatures (66?°C and 55?°C, respectively) and one of the largest genome sizes among haloarchaea (5.1?Mb). It also had the largest number of strain-specific pan-genome orthologous groups and unique pathways among members of the genus Natrinema in the class Halobacteria. A dendrogram based on the presence/absence of genes and a phylogenetic tree constructed based on OrthoANI values highlighted the particularities of strain CBA1119T as compared to other Natrinema species and other haloarchaea members. The large genome of strain CBA1119T may provide information on genes that confer tolerance to extreme environmental conditions, which may lead to the discovery of other thermophilic strains with potential applications in industrial biotechnology.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.