Menu
September 22, 2019

De novo transcriptome assembly of the Chinese pearl barley, adlay, by full-length isoform and short-read RNA sequencing.

Adlay (Coix lacryma-jobi) is a tropical grass that has long been used in traditional Chinese medicine and is known for its nutritional benefits. Recent studies have shown that vitamin E compounds in adlay protect against chronic diseases such as cancer and heart disease. However, the molecular basis of adlay’s health benefits remains unknown. Here, we generated adlay gene sets by de novo transcriptome assembly using long-read isoform sequencing (Iso-Seq) and short-read RNA-Sequencing (RNA-Seq). The gene sets obtained from Iso-seq and RNA-seq contained 31,177 genes and 57,901 genes, respectively. We confirmed the validity of the assembled gene sets by experimentally analyzing the levels of prolamin and vitamin E biosynthesis-associated proteins in adlay plant tissues and seeds. We compared the screened adlay genes with known gene families from closely related plant species, such as rice, sorghum and maize. We also identified tissue-specific genes from the adlay leaf, root, and young and mature seed, and experimentally validated the differential expression of 12 randomly-selected genes. Our study of the adlay transcriptome will provide a valuable resource for genetic studies that can enhance adlay breeding programs in the future.


September 22, 2019

Transcriptome sequencing and comparative analysis of differentially-expressed isoforms in the roots of Halogeton glomeratus under salt stress.

Although Halogeton glomeratus (H. glomeratus) has been confirmed to have a unique mechanism to regulate Na+efflux from the cytoplasm and compartmentalize Na+into leaf vacuoles, little is known about the salt tolerance mechanisms of roots under salinity stress. In the present study, transcripts were sequenced using the BGISEQ-500 sequencing platform (BGI, Wuhan, China). After quality control, approximately 24.08 million clean reads were obtained and the average mapping ratio to the reference gene was 70.00%. When comparing salt-treated samples with the control, a total of 550, 590, 1411 and 2063 DEIs were identified at 2, 6, 24 and 72h, respectively. Numerous differentially-expressed isoforms that play important roles in response and adaptation to salt condition are related to metabolic processes, cellular processes, single-organism processes, localization, biological regulation, responses to stimulus, binding, catalytic activity and transporter activity. Fifty-eight salt-induced isoforms were common to different stages of salt stress; most of these DEIs were related to signal transduction and transporters, which maybe the core isoforms regulating Na+uptake and transport in the roots of H. glomeratus. The expression patterns of 18 DEIs that were detected by quantitative real-time polymerase chain reaction were consistent with their respective changes in transcript abundance as identified by RNA-Seq technology. The present study thoroughly explored potential isoforms involved in salt tolerance on H. glomeratus roots at five time points. Our results may serve as an important resource for the H. glomeratus research community, improving our understanding of salt tolerance in halophyte survival under high salinity stress. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

Comparative genomic analysis of Sulfurospirillum cavolei MES reconstructed from the metagenome of an electrosynthetic microbiome.

Sulfurospirillum spp. play an important role in sulfur and nitrogen cycling, and contain metabolic versatility that enables reduction of a wide range of electron acceptors, including thiosulfate, tetrathionate, polysulfide, nitrate, and nitrite. Here we describe the assembly of a Sulfurospirillum genome obtained from the metagenome of an electrosynthetic microbiome. The ubiquity and persistence of this organism in microbial electrosynthesis systems suggest it plays an important role in reactor stability and performance. Understanding why this organism is present and elucidating its genetic repertoire provide a genomic and ecological foundation for future studies where Sulfurospirillum are found, especially in electrode-associated communities. Metabolic comparisons and in-depth analysis of unique genes revealed potential ecological niche-specific capabilities within the Sulfurospirillum genus. The functional similarities common to all genomes, i.e., core genome, and unique gene clusters found only in a single genome were identified. Based upon 16S rRNA gene phylogenetic analysis and average nucleotide identity, the Sulfurospirillum draft genome was found to be most closely related to Sulfurospirillum cavolei. Characterization of the draft genome described herein provides pathway-specific details of the metabolic significance of the newly described Sulfurospirillum cavolei MES and, importantly, yields insight to the ecology of the genus as a whole. Comparison of eleven sequenced Sulfurospirillum genomes revealed a total of 6246 gene clusters in the pan-genome. Of the total gene clusters, 18.5% were shared among all eleven genomes and 50% were unique to a single genome. While most Sulfurospirillum spp. reduce nitrate to ammonium, five of the eleven Sulfurospirillum strains encode for a nitrous oxide reductase (nos) cluster with an atypical nitrous-oxide reductase, suggesting a utility for this genus in reduction of the nitrous oxide, and as a potential sink for this potent greenhouse gas.


September 22, 2019

Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes.

Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 a-, 11 ?-, one d- and five ?-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of a-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The d-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic a-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.


September 22, 2019

Characteristics of ARG-carrying plasmidome in the cultivable microbial community from wastewater treatment system under high oxytetracycline concentration.

Studies on antibiotic production wastewater have shown that even a single antibiotic can select for multidrug resistant bacteria in aquatic environments. It is speculated that plasmids are an important mechanism of multidrug resistance (MDR) under high concentrations of antibiotics. Herein, two metagenomic libraries were constructed with plasmid DNA extracted from cultivable microbial communities in a biological wastewater treatment reactor supplemented with 0 (CONTROL) or 25 mg/L of oxytetracycline (OTC-25). The OTC-25 plasmidome reads were assigned to 72 antibiotic resistance genes (ARGs) conferring resistance to 13 types of antibiotics. Dominant ARGs, encoding resistance to tetracycline, aminoglycoside, sulfonamide, and multidrug resistance genes, were enriched in the plasmidome under 25 mg/L of oxytetracycline. Furthermore, 17 contiguous multiple-ARG carrying contigs (carrying =?2 ARGs) were discovered in the OTC-25 plasmidome, whereas only nine were found in the CONTROL. Mapping of the OTC-25 plasmidome reads to completely sequenced plasmids revealed that the conjugative IncU resistance plasmid pFBAOT6 of Aeromonas caviae, carrying multidrug resistance transporter (pecM), tetracycline resistance genes (tetA, tetR), and transposase genes, might be a potential prevalent resistant plasmid in the OTC-25 plasmidome. Additionally, two novel resistant plasmids (containing contig C301682 carrying multidrug resistant operon mexCD-oprJ and contig C301632 carrying the tet36 and transposases genes) might also be potential prevalent resistant plasmids in the OTC-25 plasmidome. This study will be helpful to better understand the role of plasmids in the development of MDR in water environments under high antibiotic concentrations.


September 22, 2019

Global identification of the full-length transcripts and alternative splicing related to phenolic acid biosynthetic genes in Salvia miltiorrhiza.

Salvianolic acids are among the main bioactive components in Salvia miltiorrhiza, and their biosynthesis has attracted widespread interest. However, previous studies on the biosynthesis of phenolic acids using next-generation sequencing platforms are limited with regard to the assembly of full-length transcripts. Based on hybrid-seq (next-generation and single molecular real-time sequencing) of the S. miltiorrhiza root transcriptome, we experimentally identified 15 full-length transcripts and four alternative splicing events of enzyme-coding genes involved in the biosynthesis of rosmarinic acid. Moreover, we herein demonstrate that lithospermic acid B accumulates in the phloem and xylem of roots, in agreement with the expression patterns of the identified key genes related to rosmarinic acid biosynthesis. According to co-expression patterns, we predicted that six candidate cytochrome P450s and five candidate laccases participate in the salvianolic acid pathway. Our results provide a valuable resource for further investigation into the synthetic biology of phenolic acids in S. miltiorrhiza.


September 22, 2019

De novo assembly of a Chinese soybean genome.

Soybean was domesticated in China and has become one of the most important oilseed crops. Due to bottlenecks in their introduction and dissemination, soybeans from different geographic areas exhibit extensive genetic diversity. Asia is the largest soybean market; therefore, a high-quality soybean reference genome from this area is critical for soybean research and breeding. Here, we report the de novo assembly and sequence analysis of a Chinese soybean genome for “Zhonghuang 13” by a combination of SMRT, Hi-C and optical mapping data. The assembled genome size is 1.025 Gb with a contig N50 of 3.46 Mb and a scaffold N50 of 51.87 Mb. Comparisons between this genome and the previously reported reference genome (cv. Williams 82) uncovered more than 250,000 structure variations. A total of 52,051 protein coding genes and 36,429 transposable elements were annotated for this genome, and a gene co-expression network including 39,967 genes was also established. This high quality Chinese soybean genome and its sequence analysis will provide valuable information for soybean improvement in the future.


September 22, 2019

Precise fecal microbiome of the herbivorous Tibetan antelope inhabiting high-altitude alpine plateau

The metataxonomic approach combining 16S rRNA gene amplicon sequencing using the PacBio technology with the application of the operational phylogenetic unit (OPU) approach, has been used to analyze the fecal microbial composition of the high-altitude and herbivorous Tibetan antelopes. The fecal samples of the antelope were collected in Hoh Xil National Nature Reserve, at an altitude over 4500 m, the largest depopulated zone in Qinghai-Tibetan Plateau, China, where non-native animals or humans may experience life-threatening acute mountain sickness. In total, 104 antelope fecal samples were enrolled in this study, and were clustered into 61,258 operational taxonomic units (OTUs) at an identity of 98.7% and affiliated with 757 OPUs, including 144 known species, 256 potentially new species, 103 potentially higher taxa within known lineages. In addition, 254 comprised sequences not affiliating with any known family, and the closest relatives were unclassified lineages of existing orders or classes. A total of 42 out of 757 OPUs conformed to the core fecal microbiome, of which four major lineages, namely, un-cultured Ruminococcaceae, Lachnospiraceae, Akkermansia and Christensenellaceae were associated with human health or longevity. The current study reveals that the fecal core microbiome of antelope is mainly composited of uncultured bacteria. The most abundant core taxa, namely, uncultured Ruminococcaceae, uncultured Akkermansia, uncultured Bacteroides, uncultured Christensenellaceae, uncultured Mollicutes, and uncultured Lachnospiraceae, may represent new bacterial candidates at high taxa levels, and several may have beneficial roles in health promotion or anti-intestinal dysbiosis. These organisms should be further isolated and evaluated for potential effect on human health and longevity.


September 22, 2019

Recent insights into the tick microbiome gained through next-generation sequencing.

The tick microbiome comprises communities of microorganisms, including viruses, bacteria and eukaryotes, and is being elucidated through modern molecular techniques. The advent of next-generation sequencing (NGS) technologies has enabled the genes and genomes within these microbial communities to be explored in a rapid and cost-effective manner. The advantages of using NGS to investigate microbiomes surpass the traditional non-molecular methods that are limited in their sensitivity, and conventional molecular approaches that are limited in their scalability. In recent years the number of studies using NGS to investigate the microbial diversity and composition of ticks has expanded. Here, we provide a review of NGS strategies for tick microbiome studies and discuss the recent findings from tick NGS investigations, including the bacterial diversity and composition, influential factors, and implications of the tick microbiome.


September 22, 2019

Full-length RNA sequencing reveals unique transcriptome composition in bermudagrass.

Bermudagrass [Cynodon dactylon (L.) Pers.] is an important perennial warm-season turfgrass species with great economic value. However, the reference genome and transcriptome information are still deficient in bermudagrass, which severely impedes functional and molecular breeding studies. In this study, through analyzing a mixture sample of leaves, stolons, shoots, roots and flowers with single-molecule long-read sequencing technology from Pacific Biosciences (PacBio), we reported the first full-length transcriptome dataset of bermudagrass (C. dactylon cultivar Yangjiang) comprising 78,192 unigenes. Among the unigenes, 66,409 were functionally annotated, whereas 27,946 were found to have two or more isoforms. The annotated full-length unigenes provided many new insights into gene sequence characteristics and systematic phylogeny of bermudagrass. By comparison with transcriptome dataset in nine grass species, KEGG pathway analyses further revealed that C4 photosynthesis-related genes, notably the phosphoenolpyruvate carboxylase and pyruvate, phosphate dikinase genes, are specifically enriched in bermudagrass. These results not only explained the possible reason why bermudagrass flourishes in warm areas but also provided a solid basis for future studies in this important turfgrass species. Copyright © 2018 Elsevier Masson SAS. All rights reserved.


September 22, 2019

Multiple regulatory networks are activated during cold stress in Medicago sativa L.

Cultivated alfalfa (Medicago sativa L.) is one of the most important perennial legume forages in the world, and it has considerable potential as a valuable forage crop for livestock. However, the molecular mechanisms underlying alfalfa responses to cold stress are largely unknown. In this study, the transcriptome changes in alfalfa under cold stress at 4 °C for 2, 6, 24, and 48 h (three replicates for each time point) were analyzed using the high-throughput sequencing platform, BGISEQ-500, resulting in the identification of 50,809 annotated unigenes and 5283 differentially expressed genes (DEGs). Metabolic pathway enrichment analysis demonstrated that the DEGs were involved in carbohydrate metabolism, photosynthesis, plant hormone signal transduction, and the biosynthesis of amino acids. Moreover, the physiological changes of glutathione and proline content, catalase, and peroxidase activity were in accordance with dynamic transcript profiles of the relevant genes. Additionally, some transcription factors might play important roles in the alfalfa response to cold stress, as determined by the expression pattern of the related genes during 48 h of cold stress treatment. These findings provide valuable information for identifying and characterizing important components in the cold signaling network in alfalfa and enhancing the understanding of the molecular mechanisms underlying alfalfa responses to cold stress.


September 22, 2019

ISOdb: A comprehensive database of full-length isoforms generated by Iso-Seq.

The accurate landscape of transcript isoforms plays an important role in the understanding of gene function and gene regulation. However, building complete transcripts is very challenging for short reads generated using next-generation sequencing. Fortunately, isoform sequencing (Iso-Seq) using single-molecule sequencing technologies, such as PacBio SMRT, provides long reads spanning entire transcript isoforms which do not require assembly. Therefore, we have developed ISOdb, a comprehensive resource database for hosting and carrying out an in-depth analysis of Iso-Seq datasets and visualising the full-length transcript isoforms. The current version of ISOdb has collected 93 publicly available Iso-Seq samples from eight species and presents the samples in two levels: (1) sample level, including metainformation, long read distribution, isoform numbers, and alternative splicing (AS) events of each sample; (2) gene level, including the total isoforms, novel isoform number, novel AS number, and isoform visualisation of each gene. In addition, ISOdb provides a user interface in the website for uploading sample information to facilitate the collection and analysis of researchers’ datasets. Currently, ISOdb is the first repository that offers comprehensive resources and convenient public access for hosting, analysing, and visualising Iso-Seq data, which is freely available.


September 22, 2019

The genomic and functional landscapes of developmental plasticity in the American cockroach.

Many cockroach species have adapted to urban environments, and some have been serious pests of public health in the tropics and subtropics. Here, we present the 3.38-Gb genome and a consensus gene set of the American cockroach, Periplaneta americana. We report insights from both genomic and functional investigations into the underlying basis of its adaptation to urban environments and developmental plasticity. In comparison with other insects, expansions of gene families in P. americana exist for most core gene families likely associated with environmental adaptation, such as chemoreception and detoxification. Multiple pathways regulating metamorphic development are well conserved, and RNAi experiments inform on key roles of 20-hydroxyecdysone, juvenile hormone, insulin, and decapentaplegic signals in regulating plasticity. Our analyses reveal a high level of sequence identity in genes between the American cockroach and two termite species, advancing it as a valuable model to study the evolutionary relationships between cockroaches and termites.


September 22, 2019

Application of PacBio Single Molecule Real-Time (SMRT) sequencing in bacterial source tracking analysis during milk powder production

This work developed a 16S rRNA-PacBio Single Molecule Real-Time (SMRT) sequencing-based method to identify and track the bacterial community of milk powder (MP) from two kinds of production settings, i.e., small-scale production contained within an in-house environment (minimal milk storage before pasteurization, milk concentration, and spray drying) and a large-scale factory production (prolonged milk storage before direct spray drying). A total of 18 samples were analyzed at the species level. Comparing with the large-scale factory production, only relatively little changes were observed in the bacterial community during the in-house production process, without significant loss in the levels of bioactive minor proteins (namely, lactoferrin, immunoglobulin G, lactoperoxidase, and lysozyme). The two most prevalent species in the in-house production, Bacillus cereus and Bacillus flexus, were likely originated from the raw milk with only small changes in their relative abundances (from 25.97% to 26.40%–28.89% and 27.40%, respectively) throughout the processing (from raw milk to MP). In contrast, large-scale factory production resulted in more obvious variation in the microbial content. This microbial tracking approach is valuable in identifying the contamination source and the specific stage when contamination happens; the implementation of such technique may also enhance food quality assurance systems that are currently used in the dairy industry.


September 22, 2019

SMRT sequencing of full-length transcriptome of flea beetle Agasicles hygrophila (Selman and Vogt).

This study was aimed at generating the full-length transcriptome of flea beetle Agasicles hygrophila (Selman and Vogt) using single-molecule real-time (SMRT) sequencing. Four developmental stages of A. hygrophila, including eggs, larvae, pupae, and adults were harvested for isolating total RNA. The mixed samples were used for SMRT sequencing to generate the full-length transcriptome. Based on the obtained transcriptome data, alternative splicing event, simple sequence repeat (SSR) analysis, coding sequence prediction, transcript functional annotation, and lncRNA prediction were performed. Total 9.45?Gb of clean reads were generated, including 335,045 reads of insert (ROI) and 158,085 full-length non-chimeric (FLNC) reads. Transcript clustering analysis of FLNC reads identified 40,004 consensus isoforms, including 31,015 high-quality ones. After removing redundant reads, 28,982 transcripts were obtained. Total 145 alternative splicing events were predicted. Additionally, 12,753 SSRs and 16,205 coding sequences were identified based on SSR analysis. Furthermore, 24,031 transcripts were annotated in eight functional databases, and 4,198 lncRNAs were predicted. This is the first study to perform SMRT sequencing of the full-length transcriptome of A. hygrophila. The obtained transcriptome may facilitate further exploration of the genetic data of A. hygrophila and uncover the interactions between this insect and the ecosystem.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.