April 21, 2020  |  

Comparative genomic analysis of Lactobacillus mucosae LM1 identifies potential niche-specific genes and pathways for gastrointestinal adaptation.

Lactobacillus mucosae is currently of interest as a putative probiotic due to its metabolic capabilities and ability to colonize host mucosal niches. L. mucosae LM1 has been studied for its functions in cell adhesion and pathogen inhibition, among others. It demonstrated unique abilities to use energy from carbohydrate and non-carbohydrate sources. Due to these functions, we report the first complete genome sequence of an L. mucosae strain, L. mucosae LM1. Analysis of the pan-genome in comparison with closely-related Lactobacillus species identified a complete glycogen metabolism pathway, as well as folate biosynthesis, complementing previous proteomic data on the LM1 strain. It also revealed common and unique niche-adaptation genes among the various L. mucosae strains. The aim of this study was to derive genomic information that would reveal the probable mechanisms underlying the probiotic effect of L. mucosae LM1, and provide a better understanding of the nature of L. mucosae sp. Copyright © 2017 Elsevier Inc. All rights reserved.


April 21, 2020  |  

Genetic basis for the establishment of endosymbiosis in Paramecium.

The single-celled ciliate Paramecium bursaria is an indispensable model for investigating endosymbiosis between protists and green-algal symbionts. To elucidate the mechanism of this type of endosymbiosis, we combined PacBio and Illumina sequencing to assemble a high-quality and near-complete macronuclear genome of P. bursaria. The genomic characteristics and phylogenetic analyses indicate that P. bursaria is the basal clade of the Paramecium genus. Through comparative genomic analyses with its close relatives, we found that P. bursaria encodes more genes related to nitrogen metabolism and mineral absorption, but encodes fewer genes involved in oxygen binding and N-glycan biosynthesis. A comparison of the transcriptomic profiles between P. bursaria with and without endosymbiotic Chlorella showed differential expression of a wide range of metabolic genes. We selected the 32 most differentially expressed genes for RNA interference experiments in P. bursaria, and found that P. bursaria can regulate the abundance of its symbionts through glutamine supply. This study provides novel insights into Paramecium evolution and will extend our knowledge of the molecular mechanism for the induction of endosymbiosis between P. bursaria and green algae.


April 21, 2020  |  

Diploid Genome Assembly of the Wine Grape Carménère.

In this genome report, we describe the sequencing and annotation of the genome of the wine grape Carménère (clone 02, VCR-702). Long considered extinct, this old French wine grape variety is now cultivated mostly in Chile where it was imported in the 1850s just before the European phylloxera epidemic. Genomic DNA was sequenced using Single Molecule Real Time technology and assembled with FALCON-Unzip, a diploid-aware assembly pipeline. To optimize the contiguity and completeness of the assembly, we tested about a thousand combinations of assembly parameters, sequencing coverage, error correction and repeat masking methods. The final scaffolds provide a complete and phased representation of the diploid genome of this wine grape. Comparison of the two haplotypes revealed numerous heterozygous variants, including loss-of-function ones, some of which are in genes associated with polyphenol biosynthesis. Comparisons with other publicly available grape genomes and transcriptomes showed the impact of structural variation on gene content differences between Carménère and other wine grape cultivars. Among the putative cultivar-specific genes, we identified genes potentially involved in aroma production and stress responses. The genome assembly of Carménère expands the representation of the genomic variability in grapes and will enable studies that aim to understand its distinctive organoleptic and agronomical features and assess its still elusive extant genetic variability. A genome browser for Carménère, its annotation, and an associated blast tool are available at http://cantulab.github.io/data. Copyright © 2019 Minio et al.


April 21, 2020  |  

Mutation of a bHLH transcription factor allowed almond domestication.

Wild almond species accumulate the bitter and toxic cyanogenic diglucoside amygdalin. Almond domestication was enabled by the selection of genotypes harboring sweet kernels. We report the completion of the almond reference genome. Map-based cloning using an F1 population segregating for kernel taste led to the identification of a 46-kilobase gene cluster encoding five basic helix-loop-helix transcription factors, bHLH1 to bHLH5. Functional characterization demonstrated that bHLH2 controls transcription of the P450 monooxygenase-encoding genes PdCYP79D16 and PdCYP71AN24, which are involved in the amygdalin biosynthetic pathway. A nonsynonymous point mutation (Leu to Phe) in the dimerization domain of bHLH2 prevents transcription of the two cytochrome P450 genes, resulting in the sweet kernel trait. Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


April 21, 2020  |  

Single-molecule real-time sequencing reveals diverse allelic variations in carotenoid biosynthetic genes in pepper (Capsicum spp.).

The diverse colours of mature pepper (Capsicum spp.) fruit result from the accumulation of different carotenoids. The carotenoid biosynthetic pathway has been well elucidated in Solanaceous plants, and analysis of candidate genes involved in this process has revealed variations in carotenoid biosynthetic genes in Capsicum spp. However, the allelic variations revealed by previous studies could not fully explain the variation in fruit colour in Capsicum spp. due to technical difficulties in detecting allelic variation in multiple candidate genes in numerous samples. In this study, we uncovered allelic variations in six carotenoid biosynthetic genes, including phytoene synthase (PSY1, PSY2), lycopene β-cyclase, β-carotene hydroxylase, zeaxanthin epoxidase and capsanthin-capsorubin synthase (CCS) genes, in 94 pepper accessions by single-molecule real-time (SMRT) sequencing. To investigate the relationship between allelic variations in the candidate genes and differences in fruit colour, we performed ultra-performance liquid chromatography analysis using 43 accessions representing each allelic variation. Different combinations of dysfunctional mutations in PSY1 and CCS could explain variation in the compositions and levels of carotenoids in the accessions examined in this study. Our results demonstrate that SMRT sequencing technology can be used to rapidly identify allelic variation in target genes in various germplasms. The newly identified allelic variants will be useful for pepper breeding and for further analysis of carotenoid biosynthesis pathways. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020  |  

A global survey of full-length transcriptome of Ginkgo biloba reveals transcript variants involved in flavonoid biosynthesis

Ginkgo biloba, which contains flavonoids as bioactive components, is widely used in traditional Chinese medicine. Increasing the flavonoid production of medicinal plants through genetic engineering generally focuses on the key genes involved in flavonoid biosynthesis. However, the molecular mechanisms underlying such biosynthesis are not yet well understood. To understand these mechanisms, a combination of second-generation sequencing (SGS) and single-molecule real-time (SMRT) sequencing was applied to G. biloba. Eight tissues were sampled for SMRT sequencing to generate a high-quality, full-length transcriptome database. From 23.36 Gb clean reads, 12,954 alternative polyadenylation events, 12,290 alternative splicing (AS) events, 929 fusion transcripts, 2,286 novel transcripts, and 1,270 lncRNAs were predicted after removing redundant reads. Further analysis identified 7 AS, 5 lncRNA, and 6 fusion gene events in flavonoid biosynthesis. Co-expression networks revealed a total of 12 gene modules involving structural genes and transcription factors of flavonoid metabolism. Weighted gene coexpression network analysis (WGCNA) reveals that some hub genes, including transcription factors (TFs) and structural genes, operate during the biosynthesis. Seven key hub genes were also identified by analyzing the correlation between gene expression level and flavonoid content. The results highlight the importance of SMRT sequencing of the full-length transcriptome in improving genome annotation and elucidating the gene regulation of flavonoid biosynthesis in G. biloba by providing a comprehensive set of reference transcripts.


April 21, 2020  |  

Urinary tract colonization is enhanced by a plasmid that regulates uropathogenic Acinetobacter baumannii chromosomal genes.

Multidrug resistant (MDR) Acinetobacter baumannii poses a growing threat to global health. Research on Acinetobacter pathogenesis has primarily focused on pneumonia and bloodstream infections, even though one in five A. baumannii strains are isolated from urinary sites. In this study, we highlight the role of A. baumannii as a uropathogen. We develop the first A. baumannii catheter-associated urinary tract infection (CAUTI) murine model using UPAB1, a recent MDR urinary isolate. UPAB1 carries the plasmid pAB5, a member of the family of large conjugative plasmids that represses the type VI secretion system (T6SS) in multiple Acinetobacter strains. pAB5 confers niche specificity, as its carriage improves UPAB1 survival in a CAUTI model and decreases virulence in a pneumonia model. Comparative proteomic and transcriptomic analyses show that pAB5 regulates the expression of multiple chromosomally-encoded virulence factors besides T6SS. Our results demonstrate that plasmids can impact bacterial infections by controlling the expression of chromosomal genes.


April 21, 2020  |  

Comprehensive identification of the full-length transcripts and alternative splicing related to the secondary metabolism pathways in the tea plant (Camellia sinensis).

Flavonoids, theanine and caffeine are the main secondary metabolites of the tea plant (Camellia sinensis), which account for the tea’s unique flavor quality and health benefits. The biosynthesis pathways of these metabolites have been extensively studied at the transcriptional level, but the regulatory mechanisms are still unclear. In this study, to explore the transcriptome diversity and complexity of tea plant, PacBio Iso-Seq and RNA-seq analysis were combined to obtain full-length transcripts and to profile the changes in gene expression during the leaf development. A total of 1,388,066 reads of insert (ROI) were generated with an average length of 1,762 bp, and more than 54% (755,716) of the ROIs were full-length non-chimeric (FLNC) reads. The Benchmarking Universal Single-Copy Orthologue (BUSCO) completeness was 92.7%. A total of 93,883 non-redundant transcripts were obtained, and 87,395 (93.1%) were new alternatively spliced isoforms. Meanwhile, 7,650 differential expression transcripts (DETs) were identified. A total of 28,980 alternative splicing (AS) events were predicted, including 1,297 differential AS (DAS) events. The transcript isoforms of the key genes involved in the flavonoid, theanine and caffeine biosynthesis pathways were characterized. Additionally, 5,777 fusion transcripts and 9,052 long non-coding RNAs (lncRNAs) were also predicted. Our results revealed that AS potentially plays a crucial role in the regulation of the secondary metabolism of the tea plant. These findings enhanced our understanding of the complexity of the secondary metabolic regulation of tea plants and provided a basis for the subsequent exploration of the regulatory mechanisms of flavonoid, theanine and caffeine biosynthesis in tea plants.


April 21, 2020  |  

Efomycins K and L From a Termite-Associated Streptomyces sp. M56 and Their Putative Biosynthetic Origin.

Two new elaiophylin derivatives, efomycins K (1) and L (2), and five known elaiophylin derivatives (3-7) were isolated from the termite-associated Streptomyces sp. M56. The structures were determined by 1D and 2D NMR and HR-ESIMS analyses and comparative CD spectroscopy. The putative gene cluster responsible for the production of the elaiophylin and efomycin derivatives was identified based on significant homology to related clusters. Phylogenetic analysis of gene cluster domains was used to provide a biosynthetic rationale for these new derivatives and to demonstrate how a single biosynthetic pathway can produce diverse structures.


April 21, 2020  |  

Complete genome sequence analysis of the thermoacidophilic verrucomicrobial methanotroph “Candidatus Methylacidiphilum kamchatkense” strain Kam1 and comparison with its closest relatives.

The candidate genus “Methylacidiphilum” comprises thermoacidophilic aerobic methane oxidizers belonging to the Verrucomicrobia phylum. These are the first described non-proteobacterial aerobic methane oxidizers. The genes pmoCAB, encoding the particulate methane monooxygenase, do not originate from horizontal gene transfer from proteobacteria. Instead, “Ca. Methylacidiphilum” and the sister genus “Ca. Methylacidimicrobium” represent a novel and hitherto understudied evolutionary lineage of aerobic methane oxidizers. Obtaining and comparing the full genome sequences is an important step towards understanding the evolution and physiology of this novel group of organisms. Here we present the closed genome of “Ca. Methylacidiphilum kamchatkense” strain Kam1 and a comparison with the genomes of its two closest relatives “Ca. Methylacidiphilum fumariolicum” strain SolV and “Ca. Methylacidiphilum infernorum” strain V4. The genome consists of a single 2.2 Mbp chromosome with 2119 predicted protein coding sequences. Genome analysis showed that the majority of the genes connected with metabolic traits described for one member of “Ca. Methylacidiphilum” are conserved between all three genomes. All three strains encode class I CRISPR-cas systems. The average nucleotide identity between “Ca. M. kamchatkense” strain Kam1 and strains SolV and V4 is ≤95%, showing that they should be regarded as separate species. Whole genome comparison revealed a high degree of synteny between the genomes of strains Kam1 and SolV. In contrast, comparison of the genomes of strains Kam1 and V4 revealed a number of rearrangements. There are large differences in the numbers of transposable elements found in the genomes of the three strains, with 12, 37 and 80 transposable elements in the genomes of strains Kam1, V4 and SolV respectively. Genomic rearrangements and the activity of transposable elements explain much of the genomic differences between strains. For example, a type 1h uptake hydrogenase is conserved between strains Kam1 and SolV but seems to have been lost from strain V4 due to genomic rearrangements. Comparing three closed genomes of “Ca. Methylacidiphilum” spp. has given new insights into the evolution of these organisms and revealed large differences in numbers of transposable elements between strains; the activity of these elements explains much of the genomic differences between strains.


April 21, 2020  |  

Hybrid sequencing of the Gynostemma pentaphyllum transcriptome provides new insights into gypenoside biosynthesis.

Gypenosides are a group of triterpene saponins from Gynostemma pentaphyllum that are the same as or very similar to ginsenosides from the Panax species. Several enzymes involved in ginsenoside biosynthesis have been characterized, which provide important clues for elucidating the gypenoside biosynthetic pathway. We suppose that gypenosides and ginsenosides may have a similar biosynthetic mechanism and that the corresponding enzymes in the two pathways may have considerable similarity in their sequences. To further understand gypenoside biosynthesis, we sequenced the G. pentaphyllum transcriptome with a hybrid sequencing-based strategy and then determined the candidate genes involved in this pathway using phylogenetic tree construction and gene expression analysis. Following the PacBio standard analysis pipeline, 66,046 polished consensus sequences were obtained, while Illumina data were assembled into 140,601 unigenes with Trinity software. Then, these output sequences from the two analytical routes were merged. After removing redundant data with CD-HIT software, a total of 140,157 final unigenes were obtained. After functional annotation, five 2,3-oxidosqualene cyclase genes, 145 cytochrome P450 genes and 254 UDP-glycosyltransferase genes were selected for the screening of genes involved in gypenoside biosynthesis. Using phylogenetic analysis, several genes were divided into the same subfamilies or closely related evolutionary branches with characterized enzymes involved in ginsenoside biosynthesis. Using real-time PCR technology, their expression patterns were investigated in different tissues and at different times after methyl jasmonate induction. Since the genes in the same biosynthetic pathway are generally coexpressed, we speculated that GpOSC1, GpCYP89, and GpUGT35 were the leading candidates for gypenoside biosynthesis. In addition, six GpWRKYs and one GpbHLH might play a possible role in regulating gypenoside biosynthesis. We developed a hybrid sequencing strategy to obtain longer length transcriptomes with increased accuracy, which will greatly contribute to downstream gene screening and characterization, thus improving our ability to elucidate secondary metabolite biosynthetic pathways. With this strategy, we found several candidate genes that may be involved in gypenoside biosynthesis, which laid an important foundation for the elucidation of this biosynthetic pathway, thus greatly contributing to further research in metabolic regulation, synthetic biology and molecular breeding in this species.


April 21, 2020  |  

Genomics-driven discovery of a biosynthetic gene cluster required for the synthesis of BII-Rafflesfungin from the fungus Phoma sp. F3723.

Phomafungin is a recently reported broad spectrum antifungal compound but its biosynthetic pathway is unknown. We combed publicly available Phoma genomes but failed to find any putative biosynthetic gene cluster that could account for its biosynthesis. Therefore, we sequenced the genome of one of our Phoma strains (F3723) previously identified as having antifungal activity in a high-throughput screen. We found a biosynthetic gene cluster that was predicted to synthesize a cyclic lipodepsipeptide that differs in the amino acid composition compared to Phomafungin. Antifungal activity guided isolation yielded a new compound, BII-Rafflesfungin, the structure of which was determined. We describe the NRPS-t1PKS cluster ‘BIIRfg’ compatible with the synthesis of the cyclic lipodepsipeptide BII-Rafflesfungin [HMHDA-L-Ala-L-Glu-L-Asn-L-Ser-L-Ser-D-Ser-D-allo-Thr-Gly]. We report new Stachelhaus codes for Ala, Glu, Asn, Ser, Thr, and Gly. We propose a mechanism for BII-Rafflesfungin biosynthesis, which involves the formation of the lipid part by BIIRfg_PKS followed by activation and transfer of the lipid chain by a predicted AMP-ligase on to the first PCP domain of the BIIRfg_NRPS gene.


April 21, 2020  |  

Reconstruction of the full-length transcriptome atlas using PacBio Iso-Seq provides insight into the alternative splicing in Gossypium australe.

Gossypium australe F. Mueller (2n = 2x = 26, G2 genome) possesses valuable characteristics. For example, the delayed gland morphogenesis trait causes cottonseed protein and oil to be edible while retaining resistance to biotic stress. However, gene sequences and their alternative splicing (AS) in G. australe remain unclear, hindering the exploration of species-specific biological morphogenesis. Here, we report the first sequencing of the full-length transcriptome of the Australian wild cotton species, G. australe, using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) from the pooled cDNA of ten tissues to identify transcript loci and splice isoforms. We reconstructed the G. australe full-length transcriptome and identified 25,246 genes, 86 pre-miRNAs and 1468 lncRNAs. Most genes (12,832, 50.83%) exhibited two or more isoforms, suggesting a high degree of transcriptome complexity in G. australe. A total of 31,448 AS events in five major types were found among the 9944 gene loci. Among these five major types, intron retention was the most frequent, accounting for 68.85% of AS events. A total of 29,718 polyadenylation sites were detected from 14,536 genes, 7900 of which have alternative polyadenylation sites (APA). In addition, based on our AS events annotations, RNA-Seq short reads from germinating seeds showed that differential expression of these events occurred during seed germination. Ten AS events that were randomly selected were further confirmed by RT-PCR amplification in leaf and germinating seeds. The reconstructed gene sequences and their AS in G. australe would provide information for exploring beneficial characteristics in G. australe.


April 21, 2020  |  

Iso-Seq analysis of the Taxus cuspidata transcriptome reveals the complexity of Taxol biosynthesis.

Taxus cuspidata is well known worldwide for its ability to produce Taxol, one of the top-selling natural anticancer drugs. However, current Taxol production cannot match the increasing needs of the market, and novel strategies should be considered to increase the supply of Taxol. Since the biosynthetic mechanism of Taxol remains largely unknown, elucidating this pathway in detail will be very helpful in exploring alternative methods for Taxol production. Here, we sequenced Taxus cuspidata transcriptomes with next-generation sequencing (NGS) and third-generation sequencing (TGS) platforms. After correction with Illumina reads and removal of redundant reads, more than 180,000 nonredundant transcripts were generated from the raw Iso-Seq data. Using Cogent software and an alignment-based method, we identified a total of 139 cytochrome P450s (CYP450s), 31 BAHD acyltransferases (ACTs) and 1940 transcription factors (TFs). Based on phylogenetic and coexpression analysis, we identified 9 CYP450s and 7 BAHD ACTs as potential lead candidates for Taxol biosynthesis and 6 TFs that are possibly involved in the regulation of this process. Using coexpression analysis of genes known to be involved in Taxol biosynthesis, we elucidated the stem biosynthetic pathway. In addition, we analyzed the expression patterns of 12 characterized genes in the Taxol pathway and speculated that the isoprene precursors for Taxol biosynthesis were mainly synthesized via the MEP pathway. In addition, we found and confirmed that the alternative splicing patterns of some genes varied in different tissues, which may be an important tissue-specific method of posttranscriptional regulation. A strategy was developed to generate corrected full-length or nearly full-length transcripts without assembly to ensure sequence accuracy, thus greatly improving the reliability of coexpression and phylogenetic analysis and greatly facilitating gene cloning and characterization. This strategy was successfully utilized to elucidate the Taxol biosynthetic pathway, which will greatly contribute to the goals of improving the Taxol content in Taxus spp. using molecular breeding or plant management strategies and synthesizing Taxol in microorganisms using synthetic biological technology.


April 21, 2020  |  

Genome of lethal Lepiota venenata and insights into the evolution of toxin-biosynthetic genes.

Genomes of lethal Amanita and Galerina mushrooms have gradually become available in the past ten years; in contrast the other known amanitin-producing genus, Lepiota, is still vacant in this aspect. A fatal mushroom poisoning case in China has led to acquisition of fresh L. venenata fruiting bodies, based on which a draft genome was obtained through PacBio and Illumina sequencing platforms. Toxin-biosynthetic MSDIN family and prolyl oligopeptidase B (POPB) genes were mined from the genome and used for phylogenetic and statistical studies to gain insights into the evolution of the biosynthetic pathway. The analysis of the genome data illustrated that only one MSDIN, named LvAMA1, exists in the genome, along with a POPB gene. No POPA homolog was identified by direct homology searching; however, one additional POP gene, named LvPOPC, was cloned and the gene structure determined. Similar to ApAMA1 in A. phalloides and GmAMA1 in G. marginata, LvAMA1 directly encodes α-amanitin. The two toxin genes were mapped to the draft genome, and the structures analyzed. Furthermore, phylogenetic and statistical analyses were conducted to study the evolution history of the POPB genes. Compared to our previous report, the phylogenetic trees unambiguously showed that a monophyletic POPB lineage clearly conflicted with the species phylogeny. In contrast, phylogeny of POPA genes resembled the species phylogeny. Topology and divergence tests showed that the POPB lineage was robust and these genes exhibited significantly shorter genetic distances than those of the house-keeping rbp2, a characteristic feature of genes with horizontal gene transfer (HGT) background. Consistently, the same scenario applied to the only MSDIN, LvAMA1, in the genome. To the best of our knowledge, this is the first reported genome of Lepiota. The analyses of the toxin genes indicate that the cyclic peptides are synthesized through a ribosomal mechanism. The toxin genes, LvAMA1 and LvPOPB, are not in the vicinity of each other. Phylogenetic and evolutionary studies suggest that HGT is the underlying cause for the occurrence of POPB and MSDIN in Amanita, Galerina and Lepiota, which are allocated in three distantly-related families.

