April 21, 2020

Complete genome sequence of Streptomyces spongiicola HNM0071T, a marine sponge-associated actinomycete producing staurosporine and echinomycin

Streptomyces spongiicola HNM0071T is a novel marine sponge-associated actinomycete with potential to produce antitumor agents including staurosporine and echinomycin. Here, we present the complete genome sequence of S. spongiicola HNM0071T, which consists of a linear chromosome of 7,180,417 bp, 5669 protein-coding genes, 18 rRNA genes, and 66 tRNA genes. Twenty-seven putative secondary metabolite biosynthetic gene clusters were found in the genome. Among them, the staurosporine and echinomycin gene clusters have been described completely. The complete genome information presented here will enable us to investigate the biosynthetic mechanism of two well-known antitumor antibiotics and to discover novel secondary metabolites with potential antitumor activities.


April 21, 2020

Primary transcriptome and translatome analysis determines transcriptional and translational regulatory elements encoded in the Streptomyces clavuligerus genome.

Determining transcriptional and translational regulatory elements in GC-rich Streptomyces genomes is essential to elucidating the complex regulatory networks that govern secondary metabolite biosynthetic gene cluster (BGC) expression. However, information about such regulatory elements has been limited for Streptomyces genomes. To address this limitation, a high-quality genome sequence of β-lactam antibiotic-producing Streptomyces clavuligerus ATCC 27064 is completed, which contains 7163 newly annotated genes. This provides a fundamental reference genome sequence to integrate multiple genome-scale data types, including dRNA-Seq, RNA-Seq and ribosome profiling. Data integration results in the precise determination of 2659 transcription start sites which reveal transcriptional and translational regulatory elements, including -10 and -35 promoter components specific to sigma (σ) factors, and 5′-untranslated region as a determinant for translation efficiency regulation. Particularly, sequence analysis of a wide diversity of the -35 components enables us to predict potential σ-factor regulons, along with various spacer lengths between the -10 and -35 elements. Finally, the primary transcriptome landscape of the β-lactam biosynthetic pathway is analyzed, suggesting temporal changes in metabolism for the synthesis of secondary metabolites driven by transcriptional regulation. This comprehensive genetic information provides a versatile genetic resource for rational engineering of secondary metabolite BGCs in Streptomyces. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

Do the toll-like receptors and complement systems play equally important roles in freshwater adapted Dolly Varden char (Salvelinus malma)?

Unlike the normal anadromous lifestyle, Chinese native Dolly Varden char (Salvelinus malma) is landlocked and lives in fresh water throughout its life. To explore the effect of freshwater adaptation on its immune system, we constructed a pooled cDNA library of hepatopancreas and spleen of Chinese freshwater Dolly Varden char (S. malma). A total of 27,829 unigenes were generated from 31,233 high-quality transcripts and 17,670 complete open reading frames (ORFs) were identified. In total, 25,809 unigenes were successfully annotated; the annotation classified more innate than adaptive immunity-associated genes, and more genes involved in the toll-like receptor signaling pathway than in the complement and coagulation cascades (51 vs 3), implying a relatively more important role of toll-like receptors than of the complement system under bacterial injection for the freshwater Dolly Varden char. This large difference in the numbers of TLR and complement system genes identified in freshwater Dolly Varden char was probably caused by distinct evolutionary pressure patterns between the fish TLR and complement systems, represented by TLR3 and TLR5 as well as C4 and C6, which were under purifying and positive selection pressure, respectively. A further seawater adaptation experiment and a comparison study with our library will no doubt be helpful to elucidate the effect of freshwater adaptation of Chinese native Dolly Varden char on its immune system. Copyright © 2018 Elsevier Ltd. All rights reserved.


April 21, 2020

The Genome of Cucurbita argyrosperma (Silver-Seed Gourd) Reveals Faster Rates of Protein-Coding Gene and Long Noncoding RNA Turnover and Neofunctionalization within Cucurbita.

Whole-genome duplications are an important source of evolutionary novelties that change the mode and tempo at which genetic elements evolve within a genome. The Cucurbita genus experienced a whole-genome duplication around 30 million years ago, although the evolutionary dynamics of the coding and noncoding genes in this genus have not yet been scrutinized. Here, we analyzed the genomes of four Cucurbita species, including a newly assembled genome of Cucurbita argyrosperma, and compared the gene contents of these species with those of five other members of the Cucurbitaceae family to assess the evolutionary dynamics of protein-coding and long intergenic noncoding RNA (lincRNA) genes after the genome duplication. We report that Cucurbita genomes have a higher protein-coding gene birth-death rate compared with the genomes of the other members of the Cucurbitaceae family. C. argyrosperma gene families associated with pollination and transmembrane transport had significantly faster evolutionary rates. lincRNA families showed high levels of gene turnover throughout the phylogeny, and 67.7% of the lincRNA families in Cucurbita showed evidence of birth from the neofunctionalization of previously existing protein-coding genes. Collectively, our results suggest that the whole-genome duplication in Cucurbita resulted in faster rates of gene family evolution through the neofunctionalization of duplicated genes. Copyright © 2019 The Author. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Complete genome sequence of Pseudomonas frederiksbergensis ERDD5:01 revealed genetic bases for survivability at high altitude ecosystem and bioprospection potential.

Pseudomonas frederiksbergensis ERDD5:01 is a psychrotrophic bacterium isolated from the glacial stream flowing from East Rathong glacier in Sikkim Himalaya. The strain showed survivability at high altitude stress conditions like freezing, frequent freeze-thaw cycles, and UV-C radiation. The complete genome, comprising a circular chromosome of 5,746,824 bp and a plasmid of 371,027 bp, was sequenced to understand the genetic basis of its survival strategy. Multiple copies of cold-associated genes encoding cold-active chaperones, general stress response, osmotic stress, oxidative stress, membrane/cell wall alteration, carbon storage/starvation, and DNA repair mechanisms supported its survivability at extreme cold and radiation, corroborating the bacterial physiological findings. The molecular cold adaptation analysis in comparison with the genomes of 15 mesophilic Pseudomonas species revealed functional insight into the strategies of cold adaptation. The genomic data also revealed the presence of industrially important enzymes. Copyright © 2018 Elsevier Inc. All rights reserved.


April 21, 2020

Potential of TLR-gene diversity in Czech indigenous cattle for resistance breeding as revealed by hybrid sequencing

A production herd of Czech Simmental cattle (Czech Red Pied, CRP), the conserved subpopulation of this breed, and the ancient local breed Czech Red cattle (CR) were screened for diversity in the antibacterial toll-like receptors (TLRs), which are members of the innate immune system. Polymerase chain reaction (PCR) amplicons of TLR1, TLR2, TLR4, TLR5, and TLR6 from pooled DNA samples were sequenced with PacBio technology, with 3–5× coverage per gene per animal. To increase the reliability of variant detection, the gDNA pools were sequenced in parallel with the Illumina X-ten platform at low coverage (60× per gene). The diversity in conserved CRP and CR was similar to the diversity in conserved and modern CRP, representing 76.4% and 70.9% of its variants, respectively. Sixty-eight (54.4%) polymorphisms in the five TLR genes were shared by the two breeds, whereas 38 (30.4%) were specific to the production herd of CRP; 4 (3.2%) were specific to the broad CRP population; 7 (5.6%) were present in both conserved populations; 5 (4.0%) were present solely for the conserved CRP; and 3 (2.4%) were restricted to CR. Consequently, gene pool erosion related to intensive breeding did not occur in Czech Simmental cattle. Similarly, no considerable consequences were found from known bottlenecks in the history of Czech Red cattle. On the other hand, the distinctness of the conserved populations and their potential for resistance breeding were only moderate. This relationship might be transferable to other non-abundant historical cattle breeds that are conserved as genetic resources. The estimates of polymorphism impact using Variant Effect Predictor and SIFT software tools allowed for the identification of candidate single-nucleotide polymorphisms (SNPs) for association studies related to infection resistance and targeted breeding.
Knowledge of TLR-gene diversity present in Czech Simmental populations may aid in the potential transfer of variant characteristics from other breeds.


April 21, 2020

Patterns of non-ARD variation in more than 300 full-length HLA-DPB1 alleles.

Our understanding of sequence variation in the HLA-DPB1 gene is largely restricted to the hypervariable antigen recognition domain (ARD) encoded by exon 2. Here, we employed a redundant sequencing strategy combining long-read and short-read data to accurately phase and characterise in full length the majority of common and well-documented (CWD) DPB1 alleles as well as alleles with an observed frequency of at least 0.0006% in our predominantly European sample set. We generated 664 DPB1 sequences, comprising 279 distinct allelic variants. This allows us to present the most comprehensive analysis to date of the nature and extent of DPB1 sequence variation. The full-length sequence analysis revealed the existence of two highly diverged allele clades. These clades correlate with the rs9277534 A→G variant, a known expression marker located in the 3′-UTR. The two clades are fully differentiated by 174 fixed polymorphisms throughout a 3.6 kb stretch at the 3′-end of DPB1. The region upstream of this differentiation zone is characterised by increasingly shared variation between the clades. The low-expression A clade comprises 59% of the distinct allelic sequences including the three by far most frequent DPB1 alleles, DPB1*04:01, DPB1*02:01 and DPB1*04:02. Alleles in the A clade show reduced nucleotide diversity with an excess of rare variants when compared to the high-expression G clade. This pattern is consistent with a scenario of recent proliferation of A-clade alleles. The full-length characterisation of all but the rarest DPB1 alleles will benefit the application of NGS for DPB1 genotyping and provides a helpful framework for a deeper understanding of high- and low-expression alleles and their implications in the context of unrelated haematopoietic stem-cell transplantation. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Diversity of phytobeneficial traits revealed by whole-genome analysis of worldwide-isolated phenazine-producing Pseudomonas spp.

Plant-beneficial Pseudomonas spp. competitively colonize the rhizosphere and display plant-growth promotion and/or disease-suppression activities. Some strains within the P. fluorescens species complex produce phenazine derivatives, such as phenazine-1-carboxylic acid. These antimicrobial compounds are broadly inhibitory to numerous soil-dwelling plant pathogens and play a role in the ecological competence of phenazine-producing Pseudomonas spp. We assembled a collection encompassing 63 strains representative of the worldwide diversity of plant-beneficial phenazine-producing Pseudomonas spp. In this study, we report the sequencing of 58 complete genomes using PacBio RS II sequencing technology. Distributed among four subgroups within the P. fluorescens species complex, the diversity of our collection is reflected by the large pangenome which accounts for 25 413 protein-coding genes. We identified genes and clusters encoding for numerous phytobeneficial traits, including antibiotics, siderophores and cyclic lipopeptides biosynthesis, some of which were previously unknown in these microorganisms. Finally, we gained insight into the evolutionary history of the phenazine biosynthetic operon. Given its diverse genomic context, it is likely that this operon was relocated several times during Pseudomonas evolution. Our findings acknowledge the tremendous diversity of plant-beneficial phenazine-producing Pseudomonas spp., paving the way for comparative analyses to identify new genetic determinants involved in biocontrol, plant-growth promotion and rhizosphere competence. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


April 21, 2020

Nephromyces encodes a urate metabolism pathway and predicted peroxisomes, demonstrating that these are not ancient losses of apicomplexans.

The phylum Apicomplexa is a quintessentially parasitic lineage, whose members infect a broad range of animals. One exception to this may be the apicomplexan genus Nephromyces, which has been described as having a mutualistic relationship with its host. Here we analyze transcriptome data from Nephromyces and its parasitic sister taxon, Cardiosporidium, revealing an ancestral purine degradation pathway thought to have been lost early in apicomplexan evolution. The predicted localization of many of the purine degradation enzymes to peroxisomes, and the in silico identification of a full set of peroxisome proteins, indicates that loss of both features in other apicomplexans occurred multiple times. The degradation of purines is thought to play a key role in the unusual relationship between Nephromyces and its host. Transcriptome data confirm previous biochemical results of a functional pathway for the utilization of uric acid as a primary nitrogen source for this unusual apicomplexan.


April 21, 2020

In-depth analysis of the genome of Trypanosoma evansi, an etiologic agent of surra.

Trypanosoma evansi is the causative agent of the animal trypanosomiasis surra, a disease with serious economic burden worldwide. The availability of the genome of its closely related parasite Trypanosoma brucei allows us to compare their genetic and evolutionarily shared and distinct biological features. The complete genomic sequence of the T. evansi YNB strain was obtained using a combination of genomic and transcriptomic sequencing, de novo assembly, and bioinformatic analysis. The genome size of the T. evansi YNB strain was 35.2 Mb, showing 96.59% similarity in sequence and 88.97% in scaffold alignment with T. brucei. A total of 8,617 protein-coding genes, accounting for 31% of the genome, were predicted. Approximately 1,641 alternative splicing events of 820 genes were identified, with a majority mediated by intron retention, which represented a major difference in post-transcriptional regulation between T. evansi and T. brucei. Disparities in gene copy number of the variant surface glycoprotein, expression site-associated genes, microRNAs, and RNA-binding protein were clearly observed between the two parasites. The results revealed the genomic determinants of T. evansi, which encoded specific biological characteristics that distinguished them from other related trypanosome species.


April 21, 2020

The red bayberry genome and genetic basis of sex determination.

Morella rubra, red bayberry, is an economically important fruit tree in south China. Here, we assembled the first high-quality genome for both a female and a male individual of red bayberry. The genome size was 313 Mb, and 90% of the sequences were assembled into eight pseudochromosome molecules, with 32 493 predicted genes. By whole-genome comparison between the female and male and association analysis with sequences of bulked and individual DNA samples from female and male, a 59-kb region determining female sex was identified and located on the distal end of pseudochromosome 8; it contains abundant transposable elements and seven putative genes, four of which are related to sex floral development. This 59-kb female-specific region was likely derived from duplication and rearrangement of paralogous genes and has remained non-recombinant. Sex-specific molecular markers developed from candidate genes co-segregated with sex in a genetically diverse female and male germplasm. We propose that sex determination follows the ZW model of female heterogamety. The genome sequence of red bayberry provides a valuable resource for the study of plant sex chromosome evolution and also provides important insights for molecular biology, genetics and modern breeding in the Myricaceae family. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Potential use of the Pteris vittata arsenic hyperaccumulation-regulation network for phytoremediation.

Arsenic accumulation in soil is a global problem typically addressed using phytoremediation methods. Pteris vittata, a model arsenic hyperaccumulator, has great potential as a genetically engineered plant for phytoremediation. However, the lack of omic information on this species has severely limited the identification and application of its arsenic hyperaccumulation and regulation components. In this study, we used an optimized single-molecule real-time (SMRT) strategy to create a de novo full-length transcriptomic-tonoplast proteomic database for this unsequenced fern and to determine the genetic components underlying its arsenic hyperaccumulation-regulation mechanisms. We established a comprehensive network consisting of six major transporter families, two novel resistance pathways, and a regulatory system by examining alternative splicing (AS) and long non-coding RNA (lncRNA) in different tissues following As(III) and As(V) treatment. The database and network established in this study will deepen our understanding of the unique hyperaccumulation and regulation mechanisms of P. vittata, ultimately providing a valuable resource for further research on phytoremediation of arsenic-contaminated soil. Copyright © 2019 Elsevier B.V. All rights reserved.


April 21, 2020

Secretion of an Argonaute protein by a parasitic nematode and the evolution of its siRNA guides.

Extracellular RNA has been proposed to mediate communication between cells and organisms; however, relatively little is understood regarding how specific sequences are selected for export. Here, we describe a specific Argonaute protein (exWAGO) that is secreted in extracellular vesicles (EVs) released by the gastrointestinal nematode Heligmosomoides bakeri, at multiple copies per EV. Phylogenetic and gene expression analyses demonstrate that exWAGO orthologues are highly conserved and abundantly expressed in related parasites but highly diverged in the free-living genus Caenorhabditis. We show that the most abundant small RNAs released from the nematode parasite are not microRNAs as previously thought, but rather secondary small interfering RNAs (siRNAs) that are produced by RNA-dependent RNA polymerases. The siRNAs that are released in EVs have distinct evolutionary properties compared to those resident in free-living or parasitic nematodes. Immunoprecipitation of exWAGO demonstrates that it specifically associates with siRNAs from transposons and newly evolved repetitive elements that are packaged in EVs and released into the host environment. Together, this work demonstrates molecular and evolutionary selectivity in the small RNA sequences that are released in EVs into the host environment and identifies a novel Argonaute protein as the mediator of this selectivity. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

Characterization of the genome of a Nocardia strain isolated from soils in the Qinghai-Tibetan Plateau that specifically degrades crude oil and of this biodegradation.

A strain of Nocardia isolated from crude oil-contaminated soils in the Qinghai-Tibetan Plateau degrades nearly all components of crude oil. This strain was identified as Nocardia soli Y48, and its growth conditions were determined. Complete genome sequencing showed that N. soli Y48 has a 7.3 Mb genome and many genes responsible for hydrocarbon degradation, biosurfactant synthesis, emulsification and other hydrocarbon degradation-related metabolisms. Analysis of the clusters of orthologous groups (COGs) and genomic islands (GIs) revealed that Y48 has undergone significant gene transfer events to adapt to changing environmental conditions (crude oil contamination). The structural features of the genome might provide a competitive edge for the survival of N. soli Y48 in oil-polluted environments and reflect the adaptation of coexisting bacteria to distinct nutritional niches. Copyright © 2018. Published by Elsevier Inc.


April 21, 2020

Full-length transcriptome analysis of Litopenaeus vannamei reveals transcript variants involved in the innate immune system.

To better understand the immune system of shrimp, this study combined PacBio isoform sequencing (Iso-Seq) and Illumina paired-end short-read sequencing methods to discover full-length immune-related molecules of the Pacific white shrimp, Litopenaeus vannamei. A total of 72,648 nonredundant full-length transcripts (unigenes) were generated with an average length of 2545 bp from five main tissues, including the hepatopancreas, cardiac stomach, heart, muscle, and pyloric stomach. These unigenes exhibited a high annotation rate (62,164, 85.57%) when compared against NR, NT, Swiss-Prot, Pfam, GO, KEGG and COG databases. A total of 7544 putative long noncoding RNAs (lncRNAs) were detected and 1164 nonredundant full-length transcripts (449 UniTransModels) participated in the alternative splicing (AS) events. Importantly, a total of 5279 nonredundant full-length unigenes were successfully identified, which were involved in the innate immune system, including 9 immune-related processes, 19 immune-related pathways and 10 other immune-related systems. We also found a wide range of transcript variants, which increased the number and functional complexity of immune molecules; for example, toll-like receptors (TLRs) and interferon regulatory factors (IRFs). A total of 480 differentially expressed genes (DEGs) showed significantly higher or tissue-specific expression in the hepatopancreas compared with the other four tested tissues (FDR < 0.05). Furthermore, the expression levels of six selected immune-related DEGs and putative IRFs were validated using real-time PCR technology, substantiating the reliability of the PacBio Iso-Seq results. In conclusion, our results provide new genetic resources of long-read full-length transcript data and information for identifying immune-related genes, which are an invaluable transcriptomic resource as genomic reference, especially for further exploration of the innate immune and defense mechanisms of shrimp. Copyright © 2019 Elsevier Ltd. All rights reserved.

