Menu
September 22, 2019

Biosynthetic Baeyer-Villiger chemistry enables access to two anthracene scaffolds from a single gene cluster in Deep-Sea-derived Streptomyces olivaceus SCSIO T05.

Four known compounds, rishirilide B (1), rishirilide C (2), lupinacidin A (3), and galvaquinone B (4), representing two anthracene scaffolds typical of aromatic polyketides, were isolated from a culture of the deep-sea-derived Streptomyces olivaceus SCSIO T05. From the S. olivaceus producer was cloned and sequenced the rsd biosynthetic gene cluster (BGC) that drives rishirilide biosynthesis. The structural gene rsdK2 inactivation and heterologous expression of the rsd BGC confirmed the single rsd BGC encodes construction of 1-4 and, thus, accounts for two anthracene scaffolds. Precursor incubation experiments with 13C-labeled acetate revealed that a Baeyer-Villiger-type rearrangement plays a central role in construction of 1-4. Two luciferase monooxygenase components, along with a reductase component, are presumably involved in the Baeyer-Villiger-type rearrangement reaction enabling access to the two anthracene scaffold variants. Engineering of the rsd BGC unveiled three SARP family transcriptional regulators, enhancing anthracene production. Inactivation of rsdR4, a MarR family transcriptional regulator, failed to impact production of 1-4, although production of 3 was slightly improved; most importantly rsdR4 inactivation led to the new adduct 6 in high titer. Notably, inactivation of rsdH, a putative amidohydrolase, substantially improved the overall titers of 1-4 by more than 4-fold.


September 22, 2019

MultiMotifMaker: a multi-thread tool for identifying DNA methylation motifs from Pacbio reads.

The methylation of DNA is important mechanism to control biological processes. Recently, the Pacbio SMRT technology provides a new way to identify base methylation in the genome. MotifMaker is a tool developed by Pacbio for discovering DNA methylation motifs from methylated DNA sequences. However, MotifMaker is single-threaded and computational expensive for identifying methylation motifs from large genomes. Here, we present an efficient motif finding algorithm (MultiMotifMaker) by implementing multi threads of the MotifMaker. The MultiMotifMaker, speeds up the motif search about 8-9 times on a 32 core computer comparing to MotifMaker. MultiMotifMaker makes it possible to identify methylation motifs from Pacbio reads for large genomes.


September 22, 2019

Sequencing of Panax notoginseng genome reveals genes involved in disease resistance and ginsenoside biosynthesis

Background: Panax notoginseng is a traditional Chinese herb with high medicinal and economic value. There has been considerable research on the pharmacological activities of ginsenosides contained in Panax spp.; however, very little is known about the ginsenoside biosynthetic pathway. Results: We reported the first de novo genome of 2.36 Gb of sequences from P. notoginseng with 35,451 protein-encoding genes. Compared to other plants, we found notable gene family contraction of disease-resistance genes in P. notoginseng, but notable expansion for several ATP-binding cassette (ABC) transporter subfamilies, such as the Gpdr subfamily, indicating that ABCs might be an additional mechanism for the plant to cope with biotic stress. Combining eight transcriptomes of roots and aerial parts, we identified several key genes, their transcription factor binding sites and all their family members involved in the synthesis pathway of ginsenosides in P. notoginseng, including dammarenediol synthase, CYP716 and UGT71. Conclusions: The complete genome analysis of P. notoginseng, the first in genus Panax, will serve as an important reference sequence for improving breeding and cultivation of this important nutraceutical and medicinal but vulnerable plant species.


September 22, 2019

The integrative conjugative element clc (ICEclc) of Pseudomonas aeruginosa JB2.

Integrative conjugative elements (ICE) are a diverse group of chromosomally integrated, self-transmissible mobile genetic elements (MGE) that are active in shaping the functions of bacteria and bacterial communities. Each type of ICE carries a characteristic set of core genes encoding functions essential for maintenance and self-transmission, and cargo genes that endow on hosts phenotypes beneficial for niche adaptation. An important area to which ICE can contribute beneficial functions is the biodegradation of xenobiotic compounds. In the biodegradation realm, the best-characterized ICE is ICEclc, which carries cargo genes encoding for ortho-cleavage of chlorocatechols (clc genes) and aminophenol metabolism (amn genes). The element was originally identified in the 3-chlorobenzoate-degrader Pseudomonas knackmussii B13, and the closest relative is a nearly identical element in Burkholderia xenovorans LB400 (designated ICEclc-B13 and ICEclc-LB400, respectively). In the present report, genome sequencing of the o-chlorobenzoate degrader Pseudomonas aeruginosa JB2 was used to identify a new member of the ICEclc family, ICEclc-JB2. The cargo of ICEclc-JB2 differs from that of ICEclc-B13 and ICEclc-LB400 in consisting of a unique combination of genes that encode for the utilization of o-halobenzoates and o-hydroxybenzoate as growth substrates (ohb genes and hyb genes, respectively) and which are duplicated in a tandem repeat. Also, ICEclc-JB2 lacks an operon of regulatory genes (tciR-marR-mfsR) that is present in the other two ICEclc, and which controls excision from the host. Thus, the mechanisms regulating intracellular behavior of ICEclc-JB2 may differ from that of its close relatives. The entire tandem repeat in ICEclc-JB2 can excise independently from the element in a process apparently involving transposases/insertion sequence associated with the repeats. Excision of the repeats removes important niche adaptation genes from ICEclc-JB2, rendering it less beneficial to the host. However, the reduced version of ICEclc-JB2 could now acquire new genes that might be beneficial to a future host and, consequently, to the survival of ICEclc-JB2. Collectively, the present identification and characterization of ICEclc-JB2 provides insights into roles of MGE in bacterial niche adaptation and the evolution of catabolic pathways for biodegradation of xenobiotic compounds.


September 22, 2019

Comparative genomics and genotype-phenotype associations in Bifidobacterium breve.

Bifidobacteria are common members of the gastro-intestinal microbiota of a broad range of animal hosts. Their successful adaptation to this particular niche is linked to their saccharolytic metabolism, which is supported by a wide range of glycosyl hydrolases. In the current study a large-scale gene-trait matching (GTM) effort was performed to explore glycan degradation capabilities in B. breve. By correlating the presence/absence of genes and associated genomic clusters with growth/no-growth patterns across a dataset of 20 Bifidobacterium breve strains and nearly 80 different potential growth substrates, we not only validated the approach for a number of previously characterized carbohydrate utilization clusters, but we were also able to discover novel genetic clusters linked to the metabolism of salicin and sucrose. Using GTM, genetic associations were also established for antibiotic resistance and exopolysaccharide production, thereby identifying (novel) bifidobacterial antibiotic resistance markers and showing that the GTM approach is applicable to a variety of phenotypes. Overall, the GTM findings clearly expand our knowledge on members of the B. breve species, in particular how their variable genetic features can be linked to specific phenotypes.


September 22, 2019

Integrating long-range connectivity information into de Bruijn graphs.

The de Bruijn graph is a simple and efficient data structure that is used in many areas of sequence analysis including genome assembly, read error correction and variant calling. The data structure has a single parameter k, is straightforward to implement and is tractable for large genomes with high sequencing depth. It also enables representation of multiple samples simultaneously to facilitate comparison. However, unlike the string graph, a de Bruijn graph does not retain long range information that is inherent in the read data. For this reason, applications that rely on de Bruijn graphs can produce sub-optimal results given their input data.We present a novel assembly graph data structure: the Linked de Bruijn Graph (LdBG). Constructed by adding annotations on top of a de Bruijn graph, it stores long range connectivity information through the graph. We show that with error-free data it is possible to losslessly store and recover sequence from a Linked de Bruijn graph. With assembly simulations we demonstrate that the LdBG data structure outperforms both our de Bruijn graph and the String Graph Assembler (SGA). Finally we apply the LdBG to Klebsiella pneumoniae short read data to make large (12 kbp) variant calls, which we validate using PacBio sequencing data, and to characterize the genomic context of drug-resistance genes.Linked de Bruijn Graphs and associated algorithms are implemented as part of McCortex, which is available under the MIT license at https://github.com/mcveanlab/mccortex.Supplementary data are available at Bioinformatics online.


September 22, 2019

Identification of the DNA methyltransferases establishing the methylome of the cyanobacterium Synechocystis sp. PCC 6803.

DNA methylation in bacteria is important for defense against foreign DNA, but is also involved in DNA repair, replication, chromosome partitioning, and regulatory processes. Thus, characterization of the underlying DNA methyltransferases in genetically tractable bacteria is of paramount importance. Here, we characterized the methylome and orphan methyltransferases in the model cyanobacterium Synechocystis sp. PCC 6803. Single molecule real-time (SMRT) sequencing revealed four DNA methylation recognition sequences in addition to the previously known motif m5CGATCG, which is recognized by M.Ssp6803I. For three of the new recognition sequences, we identified the responsible methyltransferases. M.Ssp6803II, encoded by the sll0729 gene, modifies GGm4CC, M.Ssp6803III, encoded by slr1803, represents the cyanobacterial dam-like methyltransferase modifying Gm6ATC, and M.Ssp6803V, encoded by slr6095 on plasmid pSYSX, transfers methyl groups to the bipartite motif GGm6AN7TTGG/CCAm6AN7TCC. The remaining methylation recognition sequence GAm6AGGC is probably recognized by methyltransferase M.Ssp6803IV encoded by slr6050. M.Ssp6803III and M.Ssp6803IV were essential for the viability of Synechocystis, while the strains lacking M.Ssp6803I and M.Ssp6803V showed growth similar to the wild type. In contrast, growth was strongly diminished of the ?sll0729 mutant lacking M.Ssp6803II. These data provide the basis for systematic studies on the molecular mechanisms impacted by these methyltransferases.


September 22, 2019

Sequencing of pT5282-CTXM, p13190-KPC and p30860-NR, and comparative genomics analysis of IncX8 plasmids.

This study proposes a replicon-based scheme for typing IncX plasmids into nine separately clustering subgroups, including IncX1a, IncX1ß and IncX2-8. The complete nucleotide sequences of three IncX8 plasmids, namely pT5282-CTXM and p30860-NR from Enterobacter cloacae and p13190-KPC from Klebsiella pneumoniae, were determined and were compared with two other previously sequenced IncX8 plasmids (pCAV1043-58 and pCAV1741-16). These five plasmids possessed conserved IncX8 backbones with limited genetic variation with respect to gene content and organisation, and each of them carried one or three accessory modules that harboured resistance markers and metabolic gene clusters as well as transposons, insertion sequence (IS)-based transposition units and miniature inverted repeat transposable elements (MITEs), indicating that the relatively small IncX8 backbones were able to integrate various foreign genetic contents. The resistance genes blaCTX-M-3 and blaTEM-1 (ß-lactam resistance), blaKPC-2 (carbapenem resistance) and ?blaTEM-1, and tet(A) (tetracycline resistance) and mph(E) (macrolide resistance) were found in pT5282-CTXM, p13190-KPC and pCAV1741-16, respectively, whilst p30860-NR and pCAV1043-58 carried no resistance genes. The data presented here provide an insight into the diversification and evolution history of IncX8 plasmids. Copyright © 2018 Elsevier B.V. and International Society of Chemotherapy. All rights reserved.


September 22, 2019

The plant growth-promoting rhizobacterium Variovorax boronicumulans CGMCC 4969 regulates the level of indole-3-acetic acid synthesized from indole-3-acetonitrile.

Variovorax is a metabolically diverse genus of plant growth-promoting rhizobacteria (PGPR) that engages in mutually beneficial interactions between plants and microbes. Unlike most PGPR, Variovorax cannot synthesize the phytohormone indole-3-acetic acid (IAA) via tryptophan. However, we found that V. boronicumulans strain CGMCC 4969 could produce IAA using indole-3-acetonitrile (IAN) as the precursor. Thus, in the present study, the IAA synthesis mechanism of V. boronicumulans CGMCC 4969 was investigated. V. boronicumulans CGMCC 4969 metabolized IAN to IAA through both a nitrilase-dependent pathway and a nitrile hydratase (NHase) and amidase-dependent pathway. Cobalt enhanced the metabolic flux via the NHase/amidase, by which IAN was rapidly converted to indole-3-acetamide (IAM) and in turn to IAA. IAN stimulated the metabolic flux via the nitrilase, by which IAN was rapidly converted to IAA. Subsequently, the IAA was degraded. V. boronicumulans CGMCC 4969 could use IAN as the sole carbon and nitrogen source for growth. Genome sequencing confirmed the IAA synthesis pathways. Gene cloning and overexpression in Escherichia coli indicated that NitA has the nitrilase activity, and IamA has the amidase activity to respectively transform IAN and IAM to IAA. Interestingly, NitA showed a close genetic relationship with the nitrilase of the phytopathogen Pseudomonas syringae Quantitative PCR analysis indicated that the NHase/amidase system is constitutively expressed, whereas the nitrilase is inducible. The present study helps our understanding of the versatile functions of Variovorax nitrile-converting enzymes that mediate IAA synthesis and the interactions between plants and these bacteria.IMPORTANCE We demonstrated that Variovorax boronicumulans CGMCC 4969 has two enzymatic systems-nitrilase and nitrile hydratase/amidase-that convert indole-3-acetonitrile (IAN) to the important plant hormone indole-3-acetic acid (IAA). The two IAA synthesis systems have very different regulatory mechanisms, affecting the IAA synthesis rate and duration. The nitrilase was induced by IAN, which was rapidly converted to IAA; subsequently IAA was rapidly consumed for cell growth. The NHase and amidase system was constitutively expressed and slowly but continuously synthesized IAA. In addition to synthesizing IAA from IAN, CGMCC 4969 has a rapid IAA degradation system, which would be helpful for a host plant to eliminate redundant IAA. This study indicates that the plant growth-promoting rhizobacterium V. boronicumulans CGMCC 4969 has the potential to be used by host plants to regulate the IAA level. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Analysis of the Gli-D2 locus identifies a genetic target for simultaneously improving the breadmaking and health-related traits of common wheat.

Gliadins are a major component of wheat seed proteins. However, the complex homoeologous Gli-2 loci (Gli-A2, -B2 and -D2) that encode the a-gliadins in commercial wheat are still poorly understood. Here we analyzed the Gli-D2 locus of Xiaoyan 81 (Xy81), a winter wheat cultivar. A total of 421.091 kb of the Gli-D2 sequence was assembled from sequencing multiple bacterial artificial clones, and 10 a-gliadin genes were annotated. Comparative genomic analysis showed that Xy81 carried only eight of the a-gliadin genes of the D genome donor Aegilops tauschii, with two of them each experiencing a tandem duplication. A mutant line lacking Gli-D2 (DLGliD2) consistently exhibited better breadmaking quality and dough functionalities than its progenitor Xy81, but without penalties in other agronomic traits. It also had an elevated lysine content in the grains. Transcriptome analysis verified the lack of Gli-D2 a-gliadin gene expression in DLGliD2. Furthermore, the transcript and protein levels of protein disulfide isomerase were both upregulated in DLGliD2 grains. Consistent with this finding, DLGliD2 had increased disulfide content in the flour. Our work sheds light on the structure and function of Gli-D2 in commercial wheat, and suggests that the removal of Gli-D2 and the gliadins specified by it is likely to be useful for simultaneously enhancing the end-use and health-related traits of common wheat. Because gliadins and homologous proteins are widely present in grass species, the strategy and information reported here may be broadly useful for improving the quality traits of diverse cereal crops.© 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.


September 22, 2019

Human copy number variants are enriched in regions of low mappability.

Copy number variants (CNVs) are known to affect a large portion of the human genome and have been implicated in many diseases. Although whole-genome sequencing (WGS) can help identify CNVs, most analytical methods suffer from limited sensitivity and specificity, especially in regions of low mappability. To address this, we use PopSV, a CNV caller that relies on multiple samples to control for technical variation. We demonstrate that our calls are stable across different types of repeat-rich regions and validate the accuracy of our predictions using orthogonal approaches. Applying PopSV to 640 human genomes, we find that low-mappability regions are approximately 5 times more likely to harbor germline CNVs, in stark contrast to the nearly uniform distribution observed for somatic CNVs in 95 cancer genomes. In addition to known enrichments in segmental duplication and near centromeres and telomeres, we also report that CNVs are enriched in specific types of satellite and in some of the most recent families of transposable elements. Finally, using this comprehensive approach, we identify 3455 regions with recurrent CNVs that were missing from existing catalogs. In particular, we identify 347 genes with a novel exonic CNV in low-mappability regions, including 29 genes previously associated with disease.


September 22, 2019

Citrobacter freundii fitness during bloodstream infection.

Sepsis resulting from microbial colonization of the bloodstream is a serious health concern associated with high mortality rates. The objective of this study was to define the physiologic requirements of Citrobacter freundii in the bloodstream as a model for bacteremia caused by opportunistic Gram-negative pathogens. A genetic screen in a murine host identified 177 genes that contributed significantly to fitness, the majority of which were broadly classified as having metabolic or cellular maintenance functions. Among the pathways examined, the Tat protein secretion system conferred the single largest fitness contribution during competition infections and a putative Tat-secreted protein, SufI, was also identified as a fitness factor. Additional work was focused on identifying relevant metabolic pathways for bacteria in the bloodstream environment. Mutations that eliminated the use of glucose or mannitol as carbon sources in vitro resulted in loss of fitness in the murine model and similar results were obtained upon disruption of the cysteine biosynthetic pathway. Finally, the conservation of identified fitness factors was compared within a cohort of Citrobacter bloodstream isolates and between Citrobacter and Serratia marcescens, the results of which suggest the presence of conserved strategies for bacterial survival and replication in the bloodstream environment.


September 22, 2019

Biology and genome of a newly discovered sibling species of Caenorhabditis elegans.

A ‘sibling’ species of the model organism Caenorhabditis elegans has long been sought for use in comparative analyses that would enable deep evolutionary interpretations of biological phenomena. Here, we describe the first sibling species of C. elegans, C. inopinata n. sp., isolated from fig syconia in Okinawa, Japan. We investigate the morphology, developmental processes and behaviour of C. inopinata, which differ significantly from those of C. elegans. The 123-Mb C. inopinata genome was sequenced and assembled into six nuclear chromosomes, allowing delineation of Caenorhabditis genome evolution and revealing unique characteristics, such as highly expanded transposable elements that might have contributed to the genome evolution of C. inopinata. In addition, C. inopinata exhibits massive gene losses in chemoreceptor gene families, which could be correlated with its limited habitat area. We have developed genetic and molecular techniques for C. inopinata; thus C. inopinata provides an exciting new platform for comparative evolutionary studies.


September 22, 2019

The complete methylome of an entomopathogenic bacterium reveals the existence of loci with unmethylated adenines.

DNA methylation can serve to control diverse phenomena in eukaryotes and prokaryotes, including gene regulation leading to cell differentiation. In bacteria, DNA methylomes (i.e., methylation state of each base of the whole genome) have been described for several species, but methylome profile variation during the lifecycle has rarely been studied, and only in a few model organisms. Moreover, major phenotypic changes have been reported in several bacterial strains with a deregulated methyltransferase, but the corresponding methylome has rarely been described. Here we report the first methylome description of an entomopathogenic bacterium, Photorhabdus luminescens. Eight motifs displaying a high rate of methylation (>94%) were identified. The methylome was strikingly stable over course of growth, but also in a subpopulation responsible for a critical step in the bacterium’s lifecycle: successful survival and proliferation in insects. The rare unmethylated GATC motifs were preferentially located in putative promoter regions, and most of them were methylated after Dam methyltransferase overexpression, suggesting that DNA methylation is involved in gene regulation. Our findings bring key insight into bacterial methylomes and encourage further research to decipher the role of loci protected from DNA methylation in gene regulation.


September 22, 2019

Complete genome of streamlined marine actinobacterium Pontimonas salivibrio strain CL-TW6T adapted to coastal planktonic lifestyle.

Pontimonas salivibrio strain CL-TW6T (=KCCM 90105?=?JCM18206) was characterized as the type strain of a new genus within the Actinobacterial family Microbacteriaceae. It was isolated from a coastal marine environment in which members of Microbactericeae have not been previously characterized.The genome of P. salivibrio CL-TW6T was a single chromosome of 1,760,810 bp. Genomes of this small size are typically found in bacteria growing slowly in oligotrophic zones and said to be streamlined. Phylogenetic analysis showed it to represent a lineage originating in the Microbacteriaceae radiation occurring before the snowball Earth glaciations, and to have a closer relationship with some streamlined bacteria known through metagenomic data. Several genomic characteristics typical of streamlined bacteria are found: %G?+?C is lower than non-streamlined members of the phylum; there are a minimal number of rRNA and tRNA genes, fewer paralogs in most gene families, and only two sigma factors; there is a noticeable absence of some nonessential metabolic pathways, including polyketide synthesis and catabolism of some amino acids. There was no indication of any phage genes or plasmids, however, a system of active insertion elements was present. P. salivibrio appears to be unusual in having polyrhamnose-based cell wall oligosaccharides instead of mycolic acid or teichoic acid-based oligosaccharides. Oddly, it conducts sulfate assimilation apparently for sulfating cell wall components, but not for synthesizing amino acids. One gene family it has more of, rather than fewer of, are toxin/antitoxin systems, which are thought to down-regulate growth during nutrient deprivation or other stressful conditions.Because of the relatively small number of paralogs and its relationship to the heavily characterized Mycobacterium tuberculosis, we were able to heavily annotate the genome of P. salivibrio CL-TW6T. Its streamlined status and relationship to streamlined metagenomic constructs makes it an important reference genome for study of the streamlining concept. The final evolutionary trajectory of CL-TW6 T was to adapt to growth in a non-oligotrophic coastal zone. To understand that adaptive process, we give a thorough accounting of gene content, contrasting with both oligotrophic streamlined bacteria and large genome bacteria, and distinguishing between genes derived by vertical and horizontal descent.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.