Menu
July 7, 2019

The cacao Criollo genome v2.0: an improved version of the genome for genetic and functional genomic studies.

Theobroma cacao L., native to the Amazonian basin of South America, is an economically important fruit tree crop for tropical countries as a source of chocolate. The first draft genome of the species, from a Criollo cultivar, was published in 2011. Although a useful resource, some improvements are possible, including identifying misassemblies, reducing the number of scaffolds and gaps, and anchoring un-anchored sequences to the 10 chromosomes.We used a NGS-based approach to significantly improve the assembly of the Belizian Criollo B97-61/B2 genome. We combined four Illumina large insert size mate paired libraries with 52x of Pacific Biosciences long reads to correct misassembled regions and reduced the number of scaffolds. We then used genotyping by sequencing (GBS) methods to increase the proportion of the assembly anchored to chromosomes.The scaffold number decreased from 4,792 in assembly V1 to 554 in V2 while the scaffold N50 size has increased from 0.47 Mb in V1 to 6.5 Mb in V2. A total of 96.7% of the assembly was anchored to the 10 chromosomes compared to 66.8% in the previous version. Unknown sites (Ns) were reduced from 10.8% to 5.7%. In addition, we updated the functional annotations and performed a new RefSeq structural annotation based on RNAseq evidence.Theobroma cacao Criollo genome version 2 will be a valuable resource for the investigation of complex traits at the genomic level and for future comparative genomics and genetics studies in cacao tree. New functional tools and annotations are available on the Cocoa Genome Hub ( http://cocoa-genome-hub.southgreen.fr ).


July 7, 2019

Complete genome sequences of the plant pathogens Ralstonia solanacearum type strain K60 and R. solanacearum race 3 biovar 2 strain UW551.

Ralstonia solanacearum is a globally distributed plant pathogen that causes bacterial wilt diseases of many crop hosts, threatening both sustenance farming and industrial agriculture. Here, we present closed genome sequences for the R. solanacearum type strain, K60, and the cool-tolerant potato brown rot strain R. solanacearum UW551, a highly regulated U.S. select agent pathogen. Copyright © 2017 Hayes et al.


July 7, 2019

Genomic and functional analysis of Romboutsia ilealis CRIBT reveals adaptation to the small intestine.

The microbiota in the small intestine relies on their capacity to rapidly import and ferment available carbohydrates to survive in a complex and highly competitive ecosystem. Understanding how these communities function requires elucidating the role of its key players, the interactions among them and with their environment/host.The genome of the gut bacterium Romboutsia ilealis CRIBT was sequenced with multiple technologies (Illumina paired-end, mate-pair and PacBio). The transcriptome was sequenced (Illumina HiSeq) after growth on three different carbohydrate sources, and short chain fatty acids were measured via HPLC.We present the complete genome of Romboutsia ilealis CRIBT, a natural inhabitant and key player of the small intestine of rats. R. ilealis CRIBT possesses a circular chromosome of 2,581,778 bp and a plasmid of 6,145 bp, carrying 2,351 and eight predicted protein coding sequences, respectively. Analysis of the genome revealed limited capacity to synthesize amino acids and vitamins, whereas multiple and partially redundant pathways for the utilization of different relatively simple carbohydrates are present. Transcriptome analysis allowed identification of the key components in the degradation of glucose, L-fucose and fructo-oligosaccharides.This revealed that R. ilealis CRIBT is adapted to a nutrient-rich environment where carbohydrates, amino acids and vitamins are abundantly available.


July 7, 2019

Genome sequencing and comparative genomics reveal the potential pathogenic mechanism of Cercospora sojina Hara on soybean.

Frogeye leaf spot, caused by Cercospora sojina Hara, is a common disease of soybean in most soybean-growing countries of the world. In this study, we report a high-quality genome sequence of C. sojina by Single Molecule Real-Time sequencing method. The 40.8-Mb genome encodes 11,655 predicated genes, and 8,474 genes are revealed by RNA sequencing. Cercospora sojina genome contains large numbers of gene clusters that are involved in synthesis of secondary metabolites, including mycotoxins and pigments. However, much less carbohydrate-binding module protein encoding genes are identified in C. sojina genome, when compared with other phytopathogenic fungi. Bioinformatics analysis reveals that C. sojina harbours about 752 secreted proteins, and 233 of them are effectors. During early infection, the genes for metabolite biosynthesis and effectors are significantly enriched, suggesting that they may play essential roles in pathogenicity. We further identify 13 effectors that can inhibit BAX-induced cell death. Taken together, our results provide insights into the infection mechanisms of C. sojina on soybean.© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Genome sequence of the white-rot fungus Irpex lacteus F17, a type strain of lignin degrader fungus.

Irpex lacteus, a cosmopolitan white-rot fungus, degrades lignin and lignin-derived aromatic compounds. In this study, we report the high-quality draft genome sequence of I. lacteus F17, isolated from a decaying hardwood tree in the vicinity of Hefei, China. The genome is 44,362,654 bp, with a GC content of 49.64% and a total of 10,391 predicted protein-coding genes. In addition, a total of 18 snRNA, 842 tRNA, 15 rRNA operons and 11,710 repetitive sequences were also identified. The genomic data provides insights into the mechanisms of the efficient lignin decomposition of this strain.


July 7, 2019

Recent expansion and adaptive evolution of the carcinoembryonic antigen family in bats of the Yangochiroptera subgroup.

Expansions of gene families are predictive for ongoing genetic adaptation to environmental cues. We describe such an expansion of the carcinoembryonic antigen (CEA) gene family in certain bat families. Members of the CEA family in humans and mice are exploited as cellular receptors by a number of pathogens, possibly due to their function in immunity and reproduction. The CEA family is composed of CEA-related cell adhesion molecules (CEACAMs) and secreted pregnancy-specific glycoproteins (PSGs). PSGs are almost exclusively expressed by trophoblast cells at the maternal-fetal interface. The reason why PSGs exist only in a minority of mammals is still unknown.Analysis of the CEA gene family in bats revealed that in certain bat families, belonging to the subgroup Yangochiroptera but not the Yinpterochiroptera subgroup an expansion of the CEA gene family took place, resulting in approximately one hundred CEA family genes in some species of the Vespertilionidae. The majority of these genes encode secreted PSG-like proteins (further referred to as PSG). Remarkably, we found strong evidence that the ligand-binding domain (IgV-like domain) of PSG is under diversifying positive selection indicating that bat PSGs may interact with structurally highly variable ligands. Such ligands might represent bacterial or viral pathogen adhesins. We have identified two distinct clusters of PSGs in three Myotis species. The two PSG cluster differ in the amino acids under positive selection. One cluster was only expanded in members of the Vespertilionidae while the other was found to be expanded in addition in members of the Miniopteridae and Mormoopidae. Thus one round of PSG expansion may have occurred in an ancestry of all three families and a second only in Vespertilionidae. Although maternal ligands of PSGs may exist selective challenges by two distinct pathogens seem to be likely responsible for the expansion of PSGs in Vespertilionidae.The rapid expansion of PSGs in certain bat species together with selection for diversification suggest that bat PSGs could be part of a pathogen defense system by serving as decoy receptors and/or regulators of feto-maternal interactions.


July 7, 2019

Convergent evolution of Y chromosome gene content in flies.

Sex-chromosomes have formed repeatedly across Diptera from ordinary autosomes, and X-chromosomes mostly conserve their ancestral genes. Y-chromosomes are characterized by abundant gene-loss and an accumulation of repetitive DNA, yet the nature of the gene repertoire of fly Y-chromosomes is largely unknown. Here we trace gene-content evolution of Y-chromosomes across 22 Diptera species, using a subtraction pipeline that infers Y genes from male and female genome, and transcriptome data. Few genes remain on old Y-chromosomes, but the number of inferred Y-genes varies substantially between species. Young Y-chromosomes still show clear evidence of their autosomal origins, but most genes on old Y-chromosomes are not simply remnants of genes originally present on the proto-sex-chromosome that escaped degeneration, but instead were recruited secondarily from autosomes. Despite almost no overlap in Y-linked gene content in different species with independently formed sex-chromosomes, we find that Y-linked genes have evolved convergent gene functions associated with testis expression. Thus, male-specific selection appears as a dominant force shaping gene-content evolution of Y-chromosomes across fly species.While X-chromosome gene content tends to be conserved, Y-chromosome evolution is dynamic and difficult to reconstruct. Here, Mahajan and Bachtrog use a subtraction pipeline to identify Y-linked genes in 22 Diptera species, revealing patterns of Y-chromosome gene-content evolution.


July 7, 2019

Multiple hybrid de novo genome assembly of finger millet, an orphan allotetraploid crop.

Finger millet (Eleusine coracana (L.) Gaertn) is an important crop for food security because of its tolerance to drought, which is expected to be exacerbated by global climate changes. Nevertheless, it is often classified as an orphan/underutilized crop because of the paucity of scientific attention. Among several small millets, finger millet is considered as an excellent source of essential nutrient elements, such as iron and zinc; hence, it has potential as an alternate coarse cereal. However, high-quality genome sequence data of finger millet are currently not available. One of the major problems encountered in the genome assembly of this species was its polyploidy, which hampers genome assembly compared with a diploid genome. To overcome this problem, we sequenced its genome using diverse technologies with sufficient coverage and assembled it via a novel multiple hybrid assembly workflow that combines next-generation with single-molecule sequencing, followed by whole-genome optical mapping using the Bionano Irys® system. The total number of scaffolds was 1,897 with an N50 length?>2.6?Mb and detection of 96% of the universal single-copy orthologs. The majority of the homeologs were assembled separately. This indicates that the proposed workflow is applicable to the assembly of other allotetraploid genomes.© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

The biofilm inhibitor carolacton enters Gram-negative cells: studies using a TolC-deficient strain of Escherichia coli.

The myxobacterial secondary metabolite carolacton inhibits growth of Streptococcus pneumoniae and kills biofilm cells of the caries- and endocarditis-associated pathogen Streptococcus mutans at nanomolar concentrations. Here, we studied the response to carolacton of an Escherichia coli strain that lacked the outer membrane protein TolC. Whole-genome sequencing of the laboratory E. coli strain TolC revealed the integration of an insertion element, IS5, at the tolC locus and a close phylogenetic relationship to the ancient E. coli K-12. We demonstrated via transcriptome sequencing (RNA-seq) and determination of MIC values that carolacton penetrates the phospholipid bilayer of the Gram-negative cell envelope and inhibits growth of E. coli TolC at similar concentrations as for streptococci. This inhibition is completely lost for a C-9 (R) epimer of carolacton, a derivative with an inverted stereocenter at carbon atom 9 [(S) ? (R)] as the sole difference from the native molecule, which is also inactive in S. pneumoniae and S. mutans, suggesting a specific interaction of native carolacton with a conserved cellular target present in bacterial phyla as distantly related as Firmicutes and Proteobacteria. The efflux pump inhibitor (EPI) phenylalanine arginine ß-naphthylamide (PAßN), which specifically inhibits AcrAB-TolC, renders E. coli susceptible to carolacton. Our data indicate that carolacton has potential for use in antimicrobial chemotherapy against Gram-negative bacteria, as a single drug or in combination with EPIs. Strain E. coli TolC has been deposited at the DSMZ; together with the associated RNA-seq data and MIC values, it can be used as a reference during future screenings for novel bioactive compounds. IMPORTANCE The emergence of pathogens resistant against most or all of the antibiotics currently used in human therapy is a global threat, and therefore the search for antimicrobials with novel targets and modes of action is of utmost importance. The myxobacterial secondary metabolite carolacton had previously been shown to inhibit biofilm formation and growth of streptococci. Here, we investigated if carolacton could act against Gram-negative bacteria, which are difficult targets because of their double-layered cytoplasmic envelope. We found that the model organism Escherichia coli is susceptible to carolacton, similar to the Gram-positive Streptococcus pneumoniae, if its multidrug efflux system AcrAB-TolC is either inactivated genetically, by disruption of the tolC gene, or physiologically by coadministering an efflux pump inhibitor. A carolacton epimer that has a different steric configuration at carbon atom 9 is completely inactive, suggesting that carolacton may interact with the same molecular target in both Gram-positive and Gram-negative bacteria.


July 7, 2019

Echinobase: an expanding resource for echinoderm genomic information

Echinobase, a web accessible information system of diverse genomics and biological data for the echinoderm clade, grew out of SpBase, the first echinoderm genome project for sea urchin, Strongylocentrotus purpuratus. Sea urchins and their relatives are utilitarian research models in fields ranging from marine biology to developmental biology and gene regulatory systems. Echinobase is a user-friendly web interface that links an array of biological data that would otherwise have been tedious and frustrating for researchers to extract and organize. The system hosts a powerful gene search engine, genomics browser and other bioinformatics tools to investigate genomics and high throughput data. The Echinobase information system now serves genomic information for eight echinoderm species: S. purpuratus, Strongylocentrotus fransciscanus, Allocentrotus fragilis, Lytechinus variegatus, Patiria miniata, Parastichopus parvimensis and Ophiothrix spiculata, Eucidaris tribuloides. Herein lies a description of the web information system, genomics data types and content hosted by Echinobase.org. The goal of Echinobase is to connect genomic information to various experimental data and accelerate the research in field of molecular biology, developmental process, gene regulatory networks and more recently engineering biological systems0.


July 7, 2019

Systems biotechnology for protein production in Pichia pastoris.

The methylotrophic yeast Pichia pastoris (syn. Komagataella spp.) is one of the most important production systems for heterologous proteins. After the first genome sequences were published in 2009, tremendous effort was made to establish systems-level analytical methods. Methylotrophic lifestyle was one of the most thoroughly investigated topics, studied at the levels of transcriptome, proteome and metabolic flux. Also the responses of P. pastoris to environmental stress conditions experienced during high cell density production processes were studied. Metabolomics and flux analysis revealed the plasticity of the cellular metabolism in its adaption to the production of foreign proteins and served as blueprints for subsequent cell engineering and/or process design. The transcriptional response elicited by overexpression of heterologous proteins seems to depend on the nature and complexity of the recombinant product. Based on these data, novel targets for strain engineering could be deduced from transcriptomics and proteomics data mining and effectively enhanced protein secretion. Transcriptional regulation data also served as a valuable resource to identify novel promoters with the desired regulatory characteristics. This review aims to provide a comprehensive overview of systems biology applications in P. pastoris ranging from increased understanding of cell physiology to improving recombinant protein production in this cell factory.© FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

A high-quality genome assembly of quinoa provides insights into the molecular basis of salt bladder-based salinity tolerance and the exceptional nutritional value.

Chenopodium quinoa is a halophytic pseudocereal crop that is being cultivated in an ever-growing number of countries. Because quinoa is highly resistant to multiple abiotic stresses and its seed has a better nutritional value than any other major cereals, it is regarded as a future crop to ensure global food security. We generated a high-quality genome draft using an inbred line of the quinoa cultivar Real. The quinoa genome experienced one recent genome duplication about 4.3 million years ago, likely reflecting the genome fusion of two Chenopodium parents, in addition to the ? paleohexaploidization reported for most eudicots. The genome is highly repetitive (64.5% repeat content) and contains 54 438 protein-coding genes and 192 microRNA genes, with more than 99.3% having orthologous genes from glycophylic species. Stress tolerance in quinoa is associated with the expansion of genes involved in ion and nutrient transport, ABA homeostasis and signaling, and enhanced basal-level ABA responses. Epidermal salt bladder cells exhibit similar characteristics as trichomes, with a significantly higher expression of genes related to energy import and ABA biosynthesis compared with the leaf lamina. The quinoa genome sequence provides insights into its exceptional nutritional value and the evolution of halophytes, enabling the identification of genes involved in salinity tolerance, and providing the basis for molecular breeding in quinoa.


July 7, 2019

Building a locally diploid genome and transcriptome of the diatom Fragilariopsis cylindrus.

The genome of the cold-adapted diatom Fragilariopsis cylindrus is characterized by highly diverged haplotypes that intersperse its homozygous genome. Here, we describe how a combination of PacBio DNA and Illumina RNA sequencing can be used to resolve this complex genomic landscape locally into the highly diverged haplotypes, and how to map various environmentally controlled transcripts onto individual haplotypes. We assembled PacBio sequence data with the FALCON assembler and created a haplotype resolved annotation of the assembly using annotations of a Sanger sequenced F. cylindrus genome. RNA-seq datasets from six different growth conditions were used to resolve allele-specifc gene expression in F. cylindrus. This approach enables to study differential expression of alleles in a complex genomic landscape and provides a useful tool to study how diverged haplotypes in diploid organisms are used for adaptation and evolution to highly variable environments.


July 7, 2019

Revealing the saline adaptation strategies of the halophilic bacterium Halomonas beimenensis through high-throughput omics and transposon mutagenesis approaches.

Studies on the halotolerance of bacteria are attractive to the fermentation industry. However, a lack of sufficient genomic information has precluded an investigation of the halotolerance of Halomonas beimenensis. Here, we describe the molecular mechanisms of saline adaptation in H. beimenensis based on high-throughput omics and Tn5 transposon mutagenesis. The H. beimenensis genome is 4.05 Mbp and contains 3,807 genes, which were sequenced using short and long reads obtained via deep sequencing. Sixteen Tn5 mutants with a loss of halotolerance were identified. Orthologs of the mutated genes, such as nqrA, trkA, atpC, nadA, and gdhB, have significant biological functions in sodium efflux, potassium uptake, hydrogen ion transport for energy conversion, and compatible solute synthesis, which are known to control halotolerance. Other genes, such as spoT, prkA, mtnN, rsbV, lon, smpB, rfbC, rfbP, tatB, acrR1, and lacA, function in cellular signaling, quorum sensing, transcription/translation, and cell motility also shown critical functions for promoting a halotolerance. In addition, KCl application increased halotolerance and potassium-dependent cell motility in a high-salinity environment. Our results demonstrated that a combination of omics and mutagenesis could be used to facilitate the mechanistic exploitation of saline adaptation in H. beimenensis, which can be applied for biotechnological purposes.


July 7, 2019

Echinochloa crus-galli genome analysis provides insight into its adaptation and invasiveness as a weed.

Barnyardgrass (Echinochloa crus-galli) is a pernicious weed in agricultural fields worldwide. The molecular mechanisms underlying its success in the absence of human intervention are presently unknown. Here we report a draft genome sequence of the hexaploid species E. crus-galli, i.e., a 1.27?Gb assembly representing 90.7% of the predicted genome size. An extremely large repertoire of genes encoding cytochrome P450 monooxygenases and glutathione S-transferases associated with detoxification are found. Two gene clusters involved in the biosynthesis of an allelochemical 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) and a phytoalexin momilactone A are found in the E. crus-galli genome, respectively. The allelochemical DIMBOA gene cluster is activated in response to co-cultivation with rice, while the phytoalexin momilactone A gene cluster specifically to infection by pathogenic Pyricularia oryzae. Our results provide a new understanding of the molecular mechanisms underlying the extreme adaptation of the weed.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.