Menu
September 22, 2019

Metagenomic binning and association of plasmids with bacterial host genomes using DNA methylation.

Shotgun metagenomics methods enable characterization of microbial communities in human microbiome and environmental samples. Assembly of metagenome sequences does not output whole genomes, so computational binning methods have been developed to cluster sequences into genome ‘bins’. These methods exploit sequence composition, species abundance, or chromosome organization but cannot fully distinguish closely related species and strains. We present a binning method that incorporates bacterial DNA methylation signatures, which are detected using single-molecule real-time sequencing. Our method takes advantage of these endogenous epigenetic barcodes to resolve individual reads and assembled contigs into species- and strain-level bins. We validate our method using synthetic and real microbiome sequences. In addition to genome binning, we show that our method links plasmids and other mobile genetic elements to their host species in a real microbiome sample. Incorporation of DNA methylation information into shotgun metagenomics analyses will complement existing methods to enable more accurate sequence binning.


September 22, 2019

Comprehensive exploration of the rumen microbial ecosystem with advancements in metagenomics

Ruminant farming and its environmental impact has long remained an economic concern. Metagenomics unravel the vast structural and functional diversity of the rumen microbial community that plays a major role in animal nutrition. Hereby, we summarize rumen metagenomic studies that have enhanced the knowledge of rumen microbe dynamics subsequently leading to development of better feed strategies to improve livestock production and reduce methane emissions.


September 22, 2019

Global dissection of alternative splicing uncovers transcriptional diversity in tissues and associates with the flavonoid pathway in tea plant (Camellia sinensis).

Alternative splicing (AS) regulates mRNA at the post-transcriptional level to change gene function in organisms. However, little is known about the AS and its roles in tea plant (Camellia sinensis), widely cultivated for making a popular beverage tea.In our study, the AS landscape and dynamics were characterized in eight tissues (bud, young leaf, summer mature leaf, winter old leaf, stem, root, flower, fruit) of tea plant by Illumina RNA-Seq and confirmed by Iso-Seq. The most abundant AS (~?20%) was intron retention and involved in RNA processes. The some alternative splicings were found to be tissue specific in stem and root etc. Thirteen co-expressed modules of AS transcripts were identified, which revealed a similar pattern between the bud and young leaves as well as a distinct pattern between seasons. AS events of structural genes including anthocyanidin reductase and MYB transcription factors were involved in biosynthesis of flavonoid, especially in vegetative tissues. The AS isoforms rather than the full-length ones were the major transcripts involved in flavonoid synthesis pathway, and is positively correlated with the catechins content conferring the tea taste. We propose that the AS is an important functional mechanism in regulating flavonoid metabolites.Our study provides the insight into the AS events underlying tea plant’s uniquely different developmental process and highlights the important contribution and efficacy of alternative splicing regulatory function to biosynthesis of flavonoids.


September 22, 2019

Uncovering full-length transcript isoforms of sugarcane cultivar Khon Kaen 3 using single-molecule long-read sequencing.

Sugarcane is an important global food crop and energy resource. To facilitate the sugarcane improvement program, genome and gene information are important for studying traits at the molecular level. Most currently available transcriptome data for sugarcane were generated using second-generation sequencing platforms, which provide short reads. The de novo assembled transcripts from these data are limited in length, and hence may be incomplete and inaccurate, especially for long RNAs.We generated a transcriptome dataset of leaf tissue from a commercial Thai sugarcane cultivar Khon Kaen 3 (KK3) using PacBio RS II single-molecule long-read sequencing by the Iso-Seq method. Short-read RNA-Seq data were generated from the same RNA sample using the Ion Proton platform for reducing base calling errors.A total of 119,339 error-corrected transcripts were generated with the N50 length of 3,611 bp, which is on average longer than any previously reported sugarcane transcriptome dataset. 110,253 sequences (92.4%) contain an open reading frame (ORF) of at least 300 bp long with ORF N50 of 1,416 bp. The mean lengths of 5′ and 3′ untranslated regions in 73,795 sequences with complete ORFs are 1,249 and 1,187 bp, respectively. 4,774 transcripts are putatively novel full-length transcripts which do not match with a previous Iso-Seq study of sugarcane. We annotated the functions of 68,962 putative full-length transcripts with at least 90% coverage when compared with homologous protein coding sequences in other plants.The new catalog of transcripts will be useful for genome annotation, identification of splicing variants, SNP identification, and other research pertaining to the sugarcane improvement program. The putatively novel transcripts suggest unique features of KK3, although more data from different tissues and stages of development are needed to establish a reference transcriptome of this cultivar.


September 22, 2019

Genome-wide transcriptome profiling of the medicinal plant Zanthoxylum planispinum using a single-molecule direct RNA sequencing approach.

High-throughput RNA sequencing has revolutionized transcriptome-based studies of candidate genes, key pathways and gene regulation in non-model organisms. We analyzed full-length cDNA sequences in Zanthoxylum planispinum (Z. planispinum), a medicinal herb in major parts of East Asia. The full-length mRNA derived from tissues of leaf, early fruit and maturing fruit stage were sequenced using PacBio RSII platform to identify isoform transcriptome. We obtained 51,402 unigenes, with average 1781?bp per gene in 82.473?Mb gene lengths. Among 51,402, 3963 unigenes showed variety of isoform. By selection of one representative gene among each of the various isoforms, we finalized 46,306 unique gene set for this herb. We identified 76 cytochrome P450 (CYP450) and related isoforms that are of the wide diversity in the molecular function and biological process. These transcriptome data of Z. planispinum will provide a good resource to study metabolic engineering for the production of valuable medicinal drugs and phytochemicals. Copyright © 2018. Published by Elsevier Inc.


September 22, 2019

Long-read sequencing of the coffee bean transcriptome reveals the diversity of full-length transcripts.

Polyploidization contributes to the complexity of gene expression, resulting in numerous related but different transcripts. This study explored the transcriptome diversity and complexity of the tetraploid Arabica coffee (Coffea arabica) bean. Long-read sequencing (LRS) by Pacbio Isoform sequencing (Iso-seq) was used to obtain full-length transcripts without the difficulty and uncertainty of assembly required for reads from short-read technologies. The tetraploid transcriptome was annotated and compared with data from the sub-genome progenitors. Caffeine and sucrose genes were targeted for case analysis. An isoform-level tetraploid coffee bean reference transcriptome with 95 995 distinct transcripts (average 3236 bp) was obtained. A total of 88 715 sequences (92.42%) were annotated with BLASTx against NCBI non-redundant plant proteins, including 34 719 high-quality annotations. Further BLASTn analysis against NCBI non-redundant nucleotide sequences, Coffea canephora coding sequences with UTR, C. arabica ESTs, and Rfam resulted in 1213 sequences without hits, were potential novel genes in coffee. Longer UTRs were captured, especially in the 5?UTRs, facilitating the identification of upstream open reading frames. The LRS also revealed more and longer transcript variants in key caffeine and sucrose metabolism genes from this polyploid genome. Long sequences (>10 kilo base) were poorly annotated. LRS technology shows the limitation of previous studies. It provides an important tool to produce a reference transcriptome including more of the diversity of full-length transcripts to help understand the biology and support the genetic improvement of polyploid species such as coffee.© The Authors 2017. Published by Oxford University Press.


September 22, 2019

MetaSort untangles metagenome assembly by reducing microbial community complexity.

Most current approaches to analyse metagenomic data rely on reference genomes. Novel microbial communities extend far beyond the coverage of reference databases and de novo metagenome assembly from complex microbial communities remains a great challenge. Here we present a novel experimental and bioinformatic framework, metaSort, for effective construction of bacterial genomes from metagenomic samples. MetaSort provides a sorted mini-metagenome approach based on flow cytometry and single-cell sequencing methodologies, and employs new computational algorithms to efficiently recover high-quality genomes from the sorted mini-metagenome by the complementary of the original metagenome. Through extensive evaluations, we demonstrated that metaSort has an excellent and unbiased performance on genome recovery and assembly. Furthermore, we applied metaSort to an unexplored microflora colonized on the surface of marine kelp and successfully recovered 75 high-quality genomes at one time. This approach will greatly improve access to microbial genomes from complex or novel communities.


September 22, 2019

An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations.

Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop. © 2017 Clavijo et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Characterization of four C1q/TNF-related proteins (CTRPs) from red-lip mullet (Liza haematocheila) and their transcriptional modulation in response to bacterial and pathogen-associated molecular pattern stimuli.

The structural and evolutionary linkage between tumor necrosis factor (TNF) and the globular C1q (gC1q) domain defines the C1q and TNF-related proteins (CTRPs), which are involved in diverse functions such as immune defense, inflammation, apoptosis, autoimmunity, and cell differentiation. In this study, red-lip mullet (Liza haematocheila) CTRP4-like (MuCTRP4-like), CTRP5 (MuCTRP5), CTRP6 (MuCTRP6), and CTRP7 (MuCTRP7) were identified from the red-lip mullet transcriptome database and molecularly characterized. According to in silico analysis, coding sequences of MuCTRP4-like, MuCTRP5, MuCTRP6, and MuCTRP7 consisted of 1128, 753, 729, and 888 bp open reading frames (ORF), respectively and encoded 375, 250, 242, and 295 amino acids, respectively. All CTRPs possessed a putative C1q domain. Additionally, MuCTRP5, MuCTRP6, and MuCTRP7 consisted of a collagen region. Phylogenetic analysis exemplified that MuCTRPs were distinctly clustered with the respective CTRP orthologs. Tissue-specific expression analysis demonstrated that MuCTRP4-like was mostly expressed in the blood and intestine. Moreover, MuCTRP6 was highly expressed in the blood, whereas MuCTRP5 and MuCTRP7 were predominantly expressed in the muscle and stomach, respectively. According to the temporal expression in blood, all MuCTRPs exhibited significant modulations in response to polyinosinic:polycytidylic acid (poly I:C) and Lactococcus garvieae (L. garvieae). MuCTRP4-like, MuCTRP5, and MuCTRP6 showed significant upregulation in response to lipopolysaccharides (LPS). The results of this study suggest the potential involvement of Mullet CTRPs in post-immune responses. Copyright © 2018. Published by Elsevier Ltd.


September 22, 2019

Researches on transcriptome sequencing in the study of traditional Chinese medicine

Due to its incomparable advantages, the application of transcriptome sequencing in the study of traditional Chinese medicine attracts more and more attention of researchers, which greatly promote the development of traditional Chinese medicine. In this paper, the applications of transcriptome sequencing in traditional Chinese medicine were summarized by reviewing recent related papers.


September 22, 2019

Differential increases of specific FMR1 mRNA isoforms in premutation carriers.

Over 40% of male and ~16% of female carriers of a premutation FMR1 allele (55-200 CGG repeats) will develop fragile X-associated tremor/ataxia syndrome, an adult onset neurodegenerative disorder, while about 20% of female carriers will develop fragile X-associated primary ovarian insufficiency. Marked elevation in FMR1 mRNA transcript levels has been observed with premutation alleles, and RNA toxicity due to increased mRNA levels is the leading molecular mechanism proposed for these disorders. However, although the FMR1 gene undergoes alternative splicing, it is unknown whether all or only some of the isoforms are overexpressed in premutation carriers and which isoforms may contribute to the premutation pathology.To address this question, we have applied a long-read sequencing approach using single-molecule real-time (SMRT) sequencing and qRT-PCR. Our SMRT sequencing analysis performed on peripheral blood mononuclear cells, fibroblasts and brain tissue samples derived from premutation carriers and controls revealed the existence of 16 isoforms of 24 predicted variants. Although the relative abundance of all mRNA isoforms was significantly increased in the premutation group, as expected based on the bulk increase in mRNA levels, there was a disproportionate (fourfold to sixfold) increase, relative to the overall increase in mRNA, in the abundance of isoforms spliced at both exons 12 and 14, specifically Iso10 and Iso10b, containing the complete exon 15 and differing only in splicing in exon 17.These findings suggest that RNA toxicity may arise from a relative increase of all FMR1 mRNA isoforms. Interestingly, the Iso10 and Iso10b mRNA isoforms, lacking the C-terminal functional sites for fragile X mental retardation protein function, are the most increased in premutation carriers relative to normal, suggesting a functional relevance in the pathology of FMR1-associated disorders. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.


September 22, 2019

Alternative polyadenylation: methods, findings, and impacts.

Alternative polyadenylation (APA), a phenomenon that RNA molecules with different 3′ ends originate from distinct polyadenylation sites of a single gene, is emerging as a mechanism widely used to regulate gene expression. In the present review, we first summarized various methods prevalently adopted in APA study, mainly focused on the next-generation sequencing (NGS)-based techniques specially designed for APA identification, the related bioinformatics methods, and the strategies for APA study in single cells. Then we summarized the main findings and advances so far based on these methods, including the preferences of alternative polyA (pA) site, the biological processes involved, and the corresponding consequences. We especially categorized the APA changes discovered so far and discussed their potential functions under given conditions, along with the possible underlying molecular mechanisms. With more in-depth studies on extensive samples, more signatures and functions of APA will be revealed, and its diverse roles will gradually heave in sight. Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.


September 22, 2019

Differential TGFß pathway targeting by miR-122 in humans and mice affects liver cancer metastasis.

Downregulation of a predominantly hepatocyte-specific miR-122 is associated with human liver cancer metastasis, whereas miR-122-deficient mice display normal liver function. Here we show a functional conservation of miR-122 in the TGFß pathway: miR-122 target site is present in the mouse but not human TGFßR1, whereas a noncanonical target site is present in the TGFß1 5’UTR in humans and other primates. Experimental switch of the miR-122 target between the receptor TGFßR1 and the ligand TGFß1 changes the metastatic properties of mouse and human liver cancer cells. High expression of TGFß1 in human primary liver tumours is associated with poor survival. We identify over 50 other miRNAs orthogonally targeting ligand/receptor pairs in humans and mice, suggesting that these are evolutionarily common events. These results reveal an evolutionary mechanism for miRNA-mediated gene regulation underlying species-specific physiological or pathological phenotype and provide a potentially valuable strategy for treating liver-associated diseases.


September 22, 2019

Identification and characterization of a carboxypeptidase N1 from red lip mullet (Liza haematocheila); revealing its immune relevance.

Complement system orchestrates the innate and adaptive immunity via the activation, recruitment, and regulation of immune molecules to destroy pathogens. However, regulation of the complement is essential to avoid injuries to the autologous tissues. The present study unveils the characteristic features of an important complement component, anaphylatoxin inactivator from red lip mullet at its molecular and functional level. Mullet carboxypeptidase N1 (MuCPN1) cDNA sequence possessed an open reading frame of 1347 bp, which encoded a protein of 449 amino acids with a predicted molecular weight of 51?kDa. In silico analysis discovered two domains of PM14-Zn carboxypeptidase and a C-terminal domain of M14 N/E carboxypeptidase, two zinc-binding signature motifs, and an N-glycosylation site in the MuCPN1 sequence. Homology analysis revealed that most of the residues in the sequence are conserved among the other selected homologs. Phylogeny analysis showed that MuCPN1 closely cladded with the Maylandia zebra CPN1 and clustered together with the teleostean counterparts. A challenge experiment showed modulated expression of MuCPN1 upon polyinosinic:polycytidylic acid and Lactococcus garviae in head kidney, spleen, gill, and liver tissues. The highest upregulation of MuCPN1 was observed 24?h post infection against poly I:C in each tissue. Moreover, the highest relative expressions upon L. garviae challenge were observed at 24?h post infection in head kidney tissue and 48?h post infection in spleen, gill, and liver tissues. MuCPN1 transfected cells triggered a 2.2-fold increase of nitric oxide (NO) production upon LPS stimulation compared to the un-transfected controls suggesting that MuCPN1 is an active protease which releases arginine from complement C3a, C4a, and C5a. These results have driven certain way towards enhancing the understanding of immune role of MuCPN1 in the complement defense mechanism of red lip mullet. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Analysis of the duodenal microbiotas of weaned piglet fed with epidermal growth factor-expressed Saccharomyces cerevisiae.

The bacterial community of the small intestine is a key factor that has strong influence on the health of gastrointestinal tract (GIT) in mammals during and shortly after weaning. The aim of this study was to analyze the effects of the diets of supplemented with epidermal growth factor (EGF)-expressed Saccharomyces cerevisiae (S. cerevisiae) on the duodenal microbiotas of weaned piglets.Revealed in this study, at day 7, 14 and 21, respectively, the compositional sequencing analysis of the 16S rRNA in the duodenum had no marked difference in microbial diversity from the phylum to species levels between the INVSc1(EV) and other recombinant strains encompassing INVSc1-EE(+), INVSc1-TE(-), and INVSc1-IE(+). Furthermore, the populations of potentially enterobacteria (e.g., Clostridium and Prevotella) and probiotic (e.g., Lactobacilli and Lactococcus) also remained unchanged among recombinant S. cerevisiae groups (P?>?0.05). However, the compositional sequencing analysis of the 16S rRNA in the duodenum revealed significant difference in microbial diversity from phylum to species levels between the control group and recombinant S. cerevisiae groups. In terms of the control group (the lack of S. cerevisiae), these data confirmed that dietary exogenous S. cerevisiae had the feasibility to be used as a supplement for enhancing potentially probiotic (e.g., Lactobacilli and Lactococcus) (P?


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.