Menu
July 7, 2019

Genome sequencing and comparative genomics reveal the potential pathogenic mechanism of Cercospora sojina Hara on soybean.

Frogeye leaf spot, caused by Cercospora sojina Hara, is a common disease of soybean in most soybean-growing countries of the world. In this study, we report a high-quality genome sequence of C. sojina by Single Molecule Real-Time sequencing method. The 40.8-Mb genome encodes 11,655 predicated genes, and 8,474 genes are revealed by RNA sequencing. Cercospora sojina genome contains large numbers of gene clusters that are involved in synthesis of secondary metabolites, including mycotoxins and pigments. However, much less carbohydrate-binding module protein encoding genes are identified in C. sojina genome, when compared with other phytopathogenic fungi. Bioinformatics analysis reveals that C. sojina harbours about 752 secreted proteins, and 233 of them are effectors. During early infection, the genes for metabolite biosynthesis and effectors are significantly enriched, suggesting that they may play essential roles in pathogenicity. We further identify 13 effectors that can inhibit BAX-induced cell death. Taken together, our results provide insights into the infection mechanisms of C. sojina on soybean.© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Parallel evolution of group B Streptococcus hypervirulent clonal complex 17 unveils new pathoadaptive mutations.

Group B Streptococcus (GBS) is a commensal of the gastrointestinal and genitourinary tracts, while a prevailing cause of neonatal disease worldwide. Of the various clonal complexes (CCs), CC17 is overrepresented in GBS-infected newborns for reasons that are still largely unknown. Here, we report a comprehensive genomic analysis of 626 CC17 isolates collected worldwide, identifying the genetic traits behind their successful adaptation to humans and the underlying differences between carriage and clinical strains. Comparative analysis with 923 GBS genomes belonging to CC1, CC19, and CC23 revealed that the evolution of CC17 is distinct from that of other human-adapted lineages and recurrently targets functions related to nucleotide and amino acid metabolism, cell adhesion, regulation, and immune evasion. We show that the most distinctive features of disease-specific CC17 isolates were frequent mutations in the virulence-associated CovS and Stk1 kinases, underscoring the crucial role of the entire CovRS regulatory pathway in modulating the pathogenicity of GBS. Importantly, parallel and convergent evolution of major components of the bacterial cell envelope, such as the capsule biosynthesis operon, the pilus, and Rib, reflects adaptation to host immune pressures and should be taken into account in the ongoing development of a GBS vaccine. The presence of recurrent targets of evolution not previously implicated in virulence also opens the way for uncovering new functions involved in host colonization and GBS pathogenesis. IMPORTANCE The incidence of group B Streptococcus (GBS) neonatal disease continues to be a significant cause of concern worldwide. Strains belonging to clonal complex 17 (CC17) are the most frequently responsible for GBS infections in neonates, especially among late-onset disease cases. Therefore, we undertook the largest genomic study of GBS CC17 strains to date to decipher the genetic bases of their remarkable colonization and infection ability. We show that crucial functions involved in different steps of the colonization or infection process of GBS are distinctly mutated during the adaptation of CC17 to the human host. In particular, our results implicate the CovRS two-component regulator of virulence in the differentiation between carriage- and disease-associated isolates. Not only does this work raise important implications for the ongoing development of a vaccine against GBS but might also drive the discovery of key functions for GBS adaptation and pathogenesis that have been overlooked until now. Author Video: An author video summary of this article is available.


July 7, 2019

Recent expansion and adaptive evolution of the carcinoembryonic antigen family in bats of the Yangochiroptera subgroup.

Expansions of gene families are predictive for ongoing genetic adaptation to environmental cues. We describe such an expansion of the carcinoembryonic antigen (CEA) gene family in certain bat families. Members of the CEA family in humans and mice are exploited as cellular receptors by a number of pathogens, possibly due to their function in immunity and reproduction. The CEA family is composed of CEA-related cell adhesion molecules (CEACAMs) and secreted pregnancy-specific glycoproteins (PSGs). PSGs are almost exclusively expressed by trophoblast cells at the maternal-fetal interface. The reason why PSGs exist only in a minority of mammals is still unknown.Analysis of the CEA gene family in bats revealed that in certain bat families, belonging to the subgroup Yangochiroptera but not the Yinpterochiroptera subgroup an expansion of the CEA gene family took place, resulting in approximately one hundred CEA family genes in some species of the Vespertilionidae. The majority of these genes encode secreted PSG-like proteins (further referred to as PSG). Remarkably, we found strong evidence that the ligand-binding domain (IgV-like domain) of PSG is under diversifying positive selection indicating that bat PSGs may interact with structurally highly variable ligands. Such ligands might represent bacterial or viral pathogen adhesins. We have identified two distinct clusters of PSGs in three Myotis species. The two PSG cluster differ in the amino acids under positive selection. One cluster was only expanded in members of the Vespertilionidae while the other was found to be expanded in addition in members of the Miniopteridae and Mormoopidae. Thus one round of PSG expansion may have occurred in an ancestry of all three families and a second only in Vespertilionidae. Although maternal ligands of PSGs may exist selective challenges by two distinct pathogens seem to be likely responsible for the expansion of PSGs in Vespertilionidae.The rapid expansion of PSGs in certain bat species together with selection for diversification suggest that bat PSGs could be part of a pathogen defense system by serving as decoy receptors and/or regulators of feto-maternal interactions.


July 7, 2019

Convergent evolution of Y chromosome gene content in flies.

Sex-chromosomes have formed repeatedly across Diptera from ordinary autosomes, and X-chromosomes mostly conserve their ancestral genes. Y-chromosomes are characterized by abundant gene-loss and an accumulation of repetitive DNA, yet the nature of the gene repertoire of fly Y-chromosomes is largely unknown. Here we trace gene-content evolution of Y-chromosomes across 22 Diptera species, using a subtraction pipeline that infers Y genes from male and female genome, and transcriptome data. Few genes remain on old Y-chromosomes, but the number of inferred Y-genes varies substantially between species. Young Y-chromosomes still show clear evidence of their autosomal origins, but most genes on old Y-chromosomes are not simply remnants of genes originally present on the proto-sex-chromosome that escaped degeneration, but instead were recruited secondarily from autosomes. Despite almost no overlap in Y-linked gene content in different species with independently formed sex-chromosomes, we find that Y-linked genes have evolved convergent gene functions associated with testis expression. Thus, male-specific selection appears as a dominant force shaping gene-content evolution of Y-chromosomes across fly species.While X-chromosome gene content tends to be conserved, Y-chromosome evolution is dynamic and difficult to reconstruct. Here, Mahajan and Bachtrog use a subtraction pipeline to identify Y-linked genes in 22 Diptera species, revealing patterns of Y-chromosome gene-content evolution.


July 7, 2019

LOGAN: A framework for LOssless Graph-based ANalysis of high throughput sequence data

Recent massive growth in the production of sequencing data necessitates matching improvements in bioinformatics tools to effectively utilize it. Existing tools suffer from limitations in both scalability and applicability which are inherent to their underlying algorithms and data structures. We identify the key requirements for the ideal data structure for sequence analyses: it should be informationally lossless, locally updatable, and memory efficient; requirements which are not met by data structures underlying the major assembly strategies Overlap Layout Consensus and De Bruijn Graphs. We therefore propose a new data structure, the LOGAN graph, which is based on a memory efficient Sparse De Bruijn Graph with routing information. Innovations in storing routing information and careful implementation allow sequence datasets for Escherichia coli (4.6Mbp, 117x coverage), Arabidopsis thaliana (135Mbp, 17.5x coverage) and Solanum pennellii (1.2Gbp, 47x coverage) to be loaded into memory on a desktop computer in seconds, minutes, and hours respectively. Memory consumption is competitive with state of the art alternatives, while losslessly representing the reads in an indexed and updatable form. Both Second and Third Generation Sequencing reads are supported. Thus, the LOGAN graph is positioned to be the backbone for major breakthroughs in sequence analysis such as integrated hybrid assembly, assembly of exceptionally large and repetitive genomes, as well as assembly and representation of pan-genomes.


July 7, 2019

Multiple hybrid de novo genome assembly of finger millet, an orphan allotetraploid crop.

Finger millet (Eleusine coracana (L.) Gaertn) is an important crop for food security because of its tolerance to drought, which is expected to be exacerbated by global climate changes. Nevertheless, it is often classified as an orphan/underutilized crop because of the paucity of scientific attention. Among several small millets, finger millet is considered as an excellent source of essential nutrient elements, such as iron and zinc; hence, it has potential as an alternate coarse cereal. However, high-quality genome sequence data of finger millet are currently not available. One of the major problems encountered in the genome assembly of this species was its polyploidy, which hampers genome assembly compared with a diploid genome. To overcome this problem, we sequenced its genome using diverse technologies with sufficient coverage and assembled it via a novel multiple hybrid assembly workflow that combines next-generation with single-molecule sequencing, followed by whole-genome optical mapping using the Bionano Irys® system. The total number of scaffolds was 1,897 with an N50 length?>2.6?Mb and detection of 96% of the universal single-copy orthologs. The majority of the homeologs were assembled separately. This indicates that the proposed workflow is applicable to the assembly of other allotetraploid genomes.© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

The biofilm inhibitor carolacton enters Gram-negative cells: studies using a TolC-deficient strain of Escherichia coli.

The myxobacterial secondary metabolite carolacton inhibits growth of Streptococcus pneumoniae and kills biofilm cells of the caries- and endocarditis-associated pathogen Streptococcus mutans at nanomolar concentrations. Here, we studied the response to carolacton of an Escherichia coli strain that lacked the outer membrane protein TolC. Whole-genome sequencing of the laboratory E. coli strain TolC revealed the integration of an insertion element, IS5, at the tolC locus and a close phylogenetic relationship to the ancient E. coli K-12. We demonstrated via transcriptome sequencing (RNA-seq) and determination of MIC values that carolacton penetrates the phospholipid bilayer of the Gram-negative cell envelope and inhibits growth of E. coli TolC at similar concentrations as for streptococci. This inhibition is completely lost for a C-9 (R) epimer of carolacton, a derivative with an inverted stereocenter at carbon atom 9 [(S) ? (R)] as the sole difference from the native molecule, which is also inactive in S. pneumoniae and S. mutans, suggesting a specific interaction of native carolacton with a conserved cellular target present in bacterial phyla as distantly related as Firmicutes and Proteobacteria. The efflux pump inhibitor (EPI) phenylalanine arginine ß-naphthylamide (PAßN), which specifically inhibits AcrAB-TolC, renders E. coli susceptible to carolacton. Our data indicate that carolacton has potential for use in antimicrobial chemotherapy against Gram-negative bacteria, as a single drug or in combination with EPIs. Strain E. coli TolC has been deposited at the DSMZ; together with the associated RNA-seq data and MIC values, it can be used as a reference during future screenings for novel bioactive compounds. IMPORTANCE The emergence of pathogens resistant against most or all of the antibiotics currently used in human therapy is a global threat, and therefore the search for antimicrobials with novel targets and modes of action is of utmost importance. The myxobacterial secondary metabolite carolacton had previously been shown to inhibit biofilm formation and growth of streptococci. Here, we investigated if carolacton could act against Gram-negative bacteria, which are difficult targets because of their double-layered cytoplasmic envelope. We found that the model organism Escherichia coli is susceptible to carolacton, similar to the Gram-positive Streptococcus pneumoniae, if its multidrug efflux system AcrAB-TolC is either inactivated genetically, by disruption of the tolC gene, or physiologically by coadministering an efflux pump inhibitor. A carolacton epimer that has a different steric configuration at carbon atom 9 is completely inactive, suggesting that carolacton may interact with the same molecular target in both Gram-positive and Gram-negative bacteria.


July 7, 2019

Echinobase: an expanding resource for echinoderm genomic information

Echinobase, a web accessible information system of diverse genomics and biological data for the echinoderm clade, grew out of SpBase, the first echinoderm genome project for sea urchin, Strongylocentrotus purpuratus. Sea urchins and their relatives are utilitarian research models in fields ranging from marine biology to developmental biology and gene regulatory systems. Echinobase is a user-friendly web interface that links an array of biological data that would otherwise have been tedious and frustrating for researchers to extract and organize. The system hosts a powerful gene search engine, genomics browser and other bioinformatics tools to investigate genomics and high throughput data. The Echinobase information system now serves genomic information for eight echinoderm species: S. purpuratus, Strongylocentrotus fransciscanus, Allocentrotus fragilis, Lytechinus variegatus, Patiria miniata, Parastichopus parvimensis and Ophiothrix spiculata, Eucidaris tribuloides. Herein lies a description of the web information system, genomics data types and content hosted by Echinobase.org. The goal of Echinobase is to connect genomic information to various experimental data and accelerate the research in field of molecular biology, developmental process, gene regulatory networks and more recently engineering biological systems0.


July 7, 2019

Systems biotechnology for protein production in Pichia pastoris.

The methylotrophic yeast Pichia pastoris (syn. Komagataella spp.) is one of the most important production systems for heterologous proteins. After the first genome sequences were published in 2009, tremendous effort was made to establish systems-level analytical methods. Methylotrophic lifestyle was one of the most thoroughly investigated topics, studied at the levels of transcriptome, proteome and metabolic flux. Also the responses of P. pastoris to environmental stress conditions experienced during high cell density production processes were studied. Metabolomics and flux analysis revealed the plasticity of the cellular metabolism in its adaption to the production of foreign proteins and served as blueprints for subsequent cell engineering and/or process design. The transcriptional response elicited by overexpression of heterologous proteins seems to depend on the nature and complexity of the recombinant product. Based on these data, novel targets for strain engineering could be deduced from transcriptomics and proteomics data mining and effectively enhanced protein secretion. Transcriptional regulation data also served as a valuable resource to identify novel promoters with the desired regulatory characteristics. This review aims to provide a comprehensive overview of systems biology applications in P. pastoris ranging from increased understanding of cell physiology to improving recombinant protein production in this cell factory.© FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Building a locally diploid genome and transcriptome of the diatom Fragilariopsis cylindrus.

The genome of the cold-adapted diatom Fragilariopsis cylindrus is characterized by highly diverged haplotypes that intersperse its homozygous genome. Here, we describe how a combination of PacBio DNA and Illumina RNA sequencing can be used to resolve this complex genomic landscape locally into the highly diverged haplotypes, and how to map various environmentally controlled transcripts onto individual haplotypes. We assembled PacBio sequence data with the FALCON assembler and created a haplotype resolved annotation of the assembly using annotations of a Sanger sequenced F. cylindrus genome. RNA-seq datasets from six different growth conditions were used to resolve allele-specifc gene expression in F. cylindrus. This approach enables to study differential expression of alleles in a complex genomic landscape and provides a useful tool to study how diverged haplotypes in diploid organisms are used for adaptation and evolution to highly variable environments.


July 7, 2019

Echinochloa crus-galli genome analysis provides insight into its adaptation and invasiveness as a weed.

Barnyardgrass (Echinochloa crus-galli) is a pernicious weed in agricultural fields worldwide. The molecular mechanisms underlying its success in the absence of human intervention are presently unknown. Here we report a draft genome sequence of the hexaploid species E. crus-galli, i.e., a 1.27?Gb assembly representing 90.7% of the predicted genome size. An extremely large repertoire of genes encoding cytochrome P450 monooxygenases and glutathione S-transferases associated with detoxification are found. Two gene clusters involved in the biosynthesis of an allelochemical 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) and a phytoalexin momilactone A are found in the E. crus-galli genome, respectively. The allelochemical DIMBOA gene cluster is activated in response to co-cultivation with rice, while the phytoalexin momilactone A gene cluster specifically to infection by pathogenic Pyricularia oryzae. Our results provide a new understanding of the molecular mechanisms underlying the extreme adaptation of the weed.


July 7, 2019

Determination of the genome and primary transcriptome of syngas fermenting Eubacterium limosum ATCC 8486.

Autotrophic conversion of CO2 to value-added biochemicals has received considerable attention as a sustainable route to replace fossil fuels. Particularly, anaerobic acetogenic bacteria are naturally capable of reducing CO2 or CO to various metabolites. To fully utilize their biosynthetic potential, an understanding of acetogenesis-related genes and their regulatory elements is required. Here, we completed the genome sequence of the syngas fermenting Eubacterium limosum ATCC 8486 and determined its transcription start sites (TSS). We constructed a 4.4?Mb long circular genome with a GC content of 47.2% and 4,090 protein encoding genes. To understand the transcriptional and translational regulation, the primary transcriptome was augmented, identifying 1,458 TSSs containing a high pyrimidine (T/C) and purine nucleotide (A/G) content at the -1 and +1 position, respectively, along with 1,253 5′-untranslated regions, and principal promoter elements such as -10 (TATAAT) and -35 (TTGACA), and Shine-Dalgarno motifs (GGAGR). Further analysis revealed 93 non-coding RNAs, including one for potential transcriptional regulation of the hydrogenase complex via interaction with molybdenum or tungsten cofactors, which in turn controls formate dehydrogenase activity of the initial step of Wood-Ljungdahl pathway. Our results provide comprehensive genomic information for strain engineering to enhance the syngas fermenting capacity of acetogenic bacteria.


July 7, 2019

Commensal Propionibacterium strain UF1 mitigates intestinal inflammation via Th17 cell regulation.

Consumption of human breast milk (HBM) attenuates the incidence of necrotizing enterocolitis (NEC), which remains a leading and intractable cause of mortality in preterm infants. Here, we report that this diminution correlates with alterations in the gut microbiota, particularly enrichment of Propionibacterium species. Transfaunation of microbiota from HBM-fed preterm infants or a newly identified and cultured Propionibacterium strain, P. UF1, to germfree mice conferred protection against pathogen infection and correlated with profound increases in intestinal Th17 cells. The induction of Th17 cells was dependent on bacterial dihydrolipoamide acetyltransferase (DlaT), a major protein expressed on the P. UF1 surface layer (S-layer). Binding of P. UF1 to its cognate receptor, SIGNR1, on dendritic cells resulted in the regulation of intestinal phagocytes. Importantly, transfer of P. UF1 profoundly mitigated induced NEC-like injury in neonatal mice. Together, these results mechanistically elucidate the protective effects of HBM and P. UF1-induced immunoregulation, which safeguard against proinflammatory diseases, including NEC.


July 7, 2019

Complete circular genome sequence and temperature independent adaptation to anaerobiosis of Listeria weihenstephanensis DSM 24698.

The aim of this study was to analyze the adaptation of the environmental Listeria weihenstephanensis DSM 24698 to anaerobiosis. The complete circular genome sequence of this species is reported and the adaptation of L. weihenstephanensis DSM 24698 to oxygen availability was investigated by global transcriptional analyses via RNAseq at 18 and 34°C. A list of operons was created based on the transcriptional data. Forty-two genes were upregulated anaerobically and 62 genes were downregulated anaerobically. The oxygen dependent gene expression of selected genes was further validated via qPCR. Many of the differentially regulated genes encode metabolic enzymes indicating broad metabolic adaptations with respect to oxygen availability. Genes showing the strongest oxygen-dependent adaption encoded nitrate (narGHJI) and nitrite (nirBD) reductases. Together with the observation that nitrate supported anaerobic growth, these data indicate that L. weihenstephanensis DSM 24698 performs anaerobic nitrate respiration. The wide overlap between the oxygen-dependent transcriptional regulation at 18 and 34°C suggest that temperature does not play a key role in the oxygen-dependent transcriptional regulation of L. weihenstephanensis DSM 24698.


July 7, 2019

The Tartary buckwheat genome provides insights into rutin biosynthesis and abiotic stress tolerance.

Tartary buckwheat (Fagopyrum tataricum) is an important pseudocereal crop that is strongly adapted to growth in adverse environments. Its gluten-free grain contains complete proteins with a well-balanced composition of essential amino acids and is a rich source of beneficial phytochemicals that provide significant health benefits. Here, we report a high-quality, chromosome-scale Tartary buckwheat genome sequence of 489.3 Mb that is assembled by combining whole-genome shotgun sequencing of both Illumina short reads and single-molecule real-time long reads, sequence tags of a large DNA insert fosmid library, Hi-C sequencing data, and BioNano genome maps. We annotated 33 366 high-confidence protein-coding genes based on expression evidence. Comparisons of the intra-genome with the sugar beet genome revealed an independent whole-genome duplication that occurred in the buckwheat lineage after they diverged from the common ancestor, which was not shared with rosids or asterids. The reference genome facilitated the identification of many new genes predicted to be involved in rutin biosynthesis and regulation, aluminum stress resistance, and in drought and cold stress responses. Our data suggest that Tartary buckwheat’s ability to tolerate high levels of abiotic stress is attributed to the expansion of several gene families involved in signal transduction, gene regulation, and membrane transport. The availability of these genomic resources will facilitate the discovery of agronomically and nutritionally important genes and genetic improvement of Tartary buckwheat. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.