Menu
September 22, 2019

Long read reference genome-free reconstruction of a full-length transcriptome from Astragalus membranaceus reveals transcript variants involved in bioactive compound biosynthesis.

Astragalus membranaceus, also known as Huangqi in China, is one of the most widely used medicinal herbs in Traditional Chinese Medicine. Traditional Chinese Medicine formulations from Astragalus membranaceus have been used to treat a wide range of illnesses, such as cardiovascular disease, type 2 diabetes, nephritis and cancers. Pharmacological studies have shown that immunomodulating, anti-hyperglycemic, anti-inflammatory, antioxidant and antiviral activities exist in the extract of Astragalus membranaceus. Therefore, characterising the biosynthesis of bioactive compounds in Astragalus membranaceus, such as Astragalosides, Calycosin and Calycosin-7-O-ß-d-glucoside, is of particular importance for further genetic studies of Astragalus membranaceus. In this study, we reconstructed the Astragalus membranaceus full-length transcriptomes from leaf and root tissues using PacBio Iso-Seq long reads. We identified 27 975 and 22 343 full-length unique transcript models in each tissue respectively. Compared with previous studies that used short read sequencing, our reconstructed transcripts are longer, and are more likely to be full-length and include numerous transcript variants. Moreover, we also re-characterised and identified potential transcript variants of genes involved in Astragalosides, Calycosin and Calycosin-7-O-ß-d-glucoside biosynthesis. In conclusion, our study provides a practical pipeline to characterise the full-length transcriptome for species without a reference genome and a useful genomic resource for exploring the biosynthesis of active compounds in Astragalus membranaceus.


September 22, 2019

Cell-type-specific splicing of Piezo2 regulates mechanotransduction.

Piezo2 is a mechanically activated ion channel required for touch discrimination, vibration detection, and proprioception. Here, we discovered that Piezo2 is extensively spliced, producing different Piezo2 isoforms with distinct properties. Sensory neurons from both mice and humans express a large repertoire of Piezo2 variants, whereas non-neuronal tissues express predominantly a single isoform. Notably, even within sensory ganglia, we demonstrate the splicing of Piezo2 to be cell type specific. Biophysical characterization revealed substantial differences in ion permeability, sensitivity to calcium modulation, and inactivation kinetics among Piezo2 splice variants. Together, our results describe, at the molecular level, a potential mechanism by which transduction is tuned, permitting the detection of a variety of mechanosensory stimuli. Published by Elsevier Inc.


September 22, 2019

Isoform evolution in primates through independent combination of alternative RNA processing events.

Recent RNA-seq technology revealed thousands of splicing events that are under rapid evolution in primates, whereas the reliability of these events, as well as their combination on the isoform level, have not been adequately addressed due to its limited sequencing length. Here, we performed comparative transcriptome analyses in human and rhesus macaque cerebellum using single molecule long-read sequencing (Iso-seq) and matched RNA-seq. Besides 359 million RNA-seq reads, 4,165,527 Iso-seq reads were generated with a mean length of 14,875?bp, covering 11,466 human genes, and 10,159 macaque genes. With Iso-seq data, we substantially expanded the repertoire of alternative RNA processing events in primates, and found that intron retention and alternative polyadenylation are surprisingly more prevalent in primates than previously estimated. We then investigated the combinatorial mode of these alternative events at the whole-transcript level, and found that the combination of these events is largely independent along the transcript, leading to thousands of novel isoforms missed by current annotations. Notably, these novel isoforms are selectively constrained in general, and 1,119 isoforms have even higher expression than the previously annotated major isoforms in human, indicating that the complexity of the human transcriptome is still significantly underestimated. Comparative transcriptome analysis further revealed 502 genes encoding selectively constrained, lineage-specific isoforms in human but not in rhesus macaque, linking them to some lineage-specific functions. Overall, we propose that the independent combination of alternative RNA processing events has contributed to complex isoform evolution in primates, which provides a new foundation for the study of phenotypic difference among primates.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


September 22, 2019

Assessment of an organ-specific de novo transcriptome of the nematode trap-crop, Solanum sisymbriifolium

Solanum sisymbriifolium, also known as “Litchi Tomato” or “Sticky Nightshade,” is an undomesticated and poorly researched plant related to potato and tomato. Unlike the latter species, S. sisymbriifolium induces eggs of the cyst nematode, Globodera pallida, to hatch and migrate into its roots, but then arrests further nematode maturation. In order to provide researchers with a partial blueprint of its genetic make-up so that the mechanism of this response might be identified, we used single molecule real time (SMRT) sequencing to compile a high quality de novo transcriptome of 41,189 unigenes drawn from individually sequenced bud, root, stem, and leaf RNA populations. Functional annotation and BUSCO analysis showed that this transcriptome was surprisingly complete, even though it represented genes expressed at a single time point. By sequencing the 4 organ libraries separately, we found we could get a reliable snapshot of transcript distributions in each organ. A divergent site analysis of the merged transcriptome indicated that this species might have undergone a recent genome duplication and re-diploidization. Further analysis indicated that the plant then retained a disproportionate number of genes associated with photosynthesis and amino acid metabolism in comparison to genes with characteristics of R-proteins or involved in secondary metabolism. The former processes may have given S. sisymbriifolium a bigger competitive advantage than the latter did. Copyright © 2018 Wixom et al.


September 22, 2019

Single molecule RNA sequencing uncovers trans-splicing and improves annotations in Anopheles stephensi.

Single molecule real-time (SMRT) sequencing has recently been used to obtain full-length cDNA sequences that improve genome annotation and reveal RNA isoforms. Here, we used one such method called isoform sequencing from Pacific Biosciences (PacBio) to sequence a cDNA library from the Asian malaria mosquito Anopheles stephensi. More than 600 000 full-length cDNAs, referred to as reads of insert, were identified. Owing to the inherently high error rate of PacBio sequencing, we tested different approaches for error correction. We found that error correction using Illumina RNA sequencing (RNA-seq) generated more data than using the default SMRT pipeline. The full-length error-corrected PacBio reads greatly improved the gene annotation of Anopheles stephensi: 4867 gene models were updated and 1785 alternatively spliced isoforms were added to the annotation. In addition, six trans-splicing events, where exons from different primary transcripts were joined together, were identified in An. stephensi. All six trans-splicing events appear to be conserved in Culicidae, as they are also found in Anopheles gambiae and Aedes aegypti. The proteins encoded by trans-splicing events are also highly conserved and the orthologues of these proteins are cis-spliced in outgroup species, indicating that trans-splicing may arise as a mechanism to rescue genes that broke up during evolution.© 2017 The Royal Entomological Society.


September 22, 2019

Resolving the complexity of human skin metagenomes using single-molecule sequencing.

Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation.The species comprising a microbial community are often difficult to deconvolute due to technical limitations inherent to most short-read sequencing technologies. Here, we leverage new advances in sequencing technology, single-molecule sequencing, to significantly improve reconstruction of a complex human skin microbial community. With this long-read technology, we were able to reconstruct and annotate a closed, high-quality genome of a previously uncharacterized skin species. We demonstrate that hybrid approaches with short-read technology are sufficiently powerful to reconstruct even single-nucleotide polymorphism level variation of species in this a community. Copyright © 2016 Tsai et al.


September 22, 2019

Complete genome sequence of Geobacillus thermodenitrificans T12, a potential host for biotechnological applications.

In attempt to obtain a thermophilic host for the conversion of lignocellulose derived substrates into lactic acid, Geobacillus thermodenitrificans T12 was isolated from a compost heap. It was selected from over 500 isolates as a genetically tractable hemicellulolytic lactic acid producer, requiring little nutrients. The strain is able to ferment glucose and xylose simultaneously and can produce lactic acid from xylan, making it a potential host for biotechnological applications. The genome of strain T12 consists of a 3.64 Mb chromosome and two plasmids of 59 and 56 kb. It has a total of 3.676 genes with an average genomic GC content of 48.7%. The T12 genome encodes a denitrification pathway, allowing for anaerobic respiration. The identity and localization of the responsible genes are similar to those of the denitrification pathways found in strain NG80-2. The hemicellulose utilization (HUS) locus was identified based on sequence homology against G. stearothermophilus T-6. It appeared that T12 has all the genes that are present in strain T-6 except for the arabinan degradation cluster. Instead, the HUS locus of strain T12 contains genes for both an inositol and a pectate degradation pathway. Strain T12 has complete pathways for the synthesis of purine and pyrimidine, all 20 amino acids and several vitamins except D-biotin. The host-defense systems present comprise a Type II and a Type III restriction-modification system, as well as a CRISPR-Cas Type II system. It is concluded that G. thermodenitrificans T12 is a potentially interesting candidate for industrial applications.


September 22, 2019

Genomic insights into the non-histamine production and proteolytic and lipolytic activities of Tetragenococcus halophilus KUD23.

Tetragenococcus halophilus KUD23, a non-histamine producer, was isolated from a traditional Korean high-salt fermented soybean paste, doenjang. The strain was safe in terms of antibiotic susceptibility, hemolytic activity and biofilm formation. It could grow on De Man-Rogosa-Sharpe agar containing 21% (w/v) NaCl, exhibited acid production at 15% NaCl, and had strain-specific proteolytic and lipolytic activities under salt stress. Complete genome analysis of T. halophilus KUD23 and comparative genomic analysis shed light on the genetic background behind these phenotypic characteristics, including non-production of histamine and proteolytic and lipolytic activities.© FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


September 22, 2019

Identification of a biosynthetic gene cluster for the polyene macrolactam sceliphrolactam in a Streptomyces strain isolated from mangrove sediment.

Streptomyces are a genus of Actinobacteria capable of producing structurally diverse natural products. Here we report the isolation and characterization of a biosynthetically talented Streptomyces (Streptomyces sp. SD85) from tropical mangrove sediments. Whole-genome sequencing revealed that Streptomyces sp. SD85 harbors at least 52 biosynthetic gene clusters (BGCs), which constitute 21.2% of the 8.6-Mb genome. When cultivated under lab conditions, Streptomyces sp. SD85 produces sceliphrolactam, a 26-membered polyene macrolactam with unknown biosynthetic origin. Genome mining yielded a putative sceliphrolactam BGC (sce) that encodes a type I modular polyketide synthase (PKS) system, several ß-amino acid starter biosynthetic enzymes, transporters, and transcriptional regulators. Using the CRISPR/Cas9-based gene knockout method, we demonstrated that the sce BGC is essential for sceliphrolactam biosynthesis. Unexpectedly, the PKS system encoded by sce is short of one module required for assembling the 26-membered macrolactam skeleton according to the collinearity rule. With experimental data disfavoring the involvement of a trans-PKS module, the biosynthesis of sceliphrolactam seems to be best rationalized by invoking a mechanism whereby the PKS system employs an iterative module to catalyze two successive chain extensions with different outcomes. The potential violation of the collinearity rule makes the mechanism distinct from those of other polyene macrolactams.


September 22, 2019

Comparative genomics reveals cotton-specific virulence factors in flexible genomic regions in Verticillium dahliae and evidence of horizontal gene transfer from Fusarium.

Verticillium dahliae isolates are most virulent on the host from which they were originally isolated. Mechanisms underlying these dominant host adaptations are currently unknown. We sequenced the genome of V. dahliae Vd991, which is highly virulent on its original host, cotton, and performed comparisons with the reference genomes of JR2 (from tomato) and VdLs.17 (from lettuce). Pathogenicity-related factor prediction, orthology and multigene family classification, transcriptome analyses, phylogenetic analyses, and pathogenicity experiments were performed. The Vd991 genome harbored several exclusive, lineage-specific (LS) genes within LS regions (LSRs). Deletion mutants of the seven genes within one LSR (G-LSR2) in Vd991 were less virulent only on cotton. Integration of G-LSR2 genes individually into JR2 and VdLs.17 resulted in significantly enhanced virulence on cotton but did not affect virulence on tomato or lettuce. Transcription levels of the seven LS genes in Vd991 were higher during the early stages of cotton infection, as compared with other hosts. Phylogenetic analyses suggested that G-LSR2 was acquired from Fusarium oxysporum f. sp. vasinfectum through horizontal gene transfer. Our results provide evidence that horizontal gene transfer from Fusarium to Vd991 contributed significantly to its adaptation to cotton and may represent a significant mechanism in the evolution of an asexual plant pathogen.© 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.


September 22, 2019

Predicting an HLA-DPB1 expression marker based on standard DPB1 genotyping: Linkage analysis of over 32,000 samples.

The risk of acute graft-versus-host disease (GvHD) after hematopoietic stem cell transplantation is increased with donor-recipient HLA-DPB1 allele mismatching. The single-nucleotide polymorphism (SNP) rs9277534 within the 3′ untranslated region (UTR) correlates with HLA-DPB1 allotype expression and serves as a marker for permissive HLA-DPB1 mismatches. Since rs9277534 is not routinely typed, we analyzed 32,681 samples of mostly European ancestry to investigate if the rs9277534 allele can be reliably imputed from standard DPB1 genotyping. We confirmed the previously-defined linkages between rs9277534 and 18 DPB1 alleles and established additional linkages for 46 DPB1 alleles. Based on these linkages, the rs9277534 allele could be predicted for 99.6% of the samples based on DPB1 genotypes (99.99% concordance). We demonstrate that 100% prediction accuracy could be achieved if the prediction utilized exon 3 sequence information. DPB1 genotyping based on exon 2 data alone allows no unambiguous rs9277534 allele prediction but was estimated to maintain 99% accuracy for samples of European descent. We conclude that DPB1 genotyping is sufficient to infer the DPB1 expression marker rs9277534 with high accuracy. This information could be used to select donors with permissive HLA-DPB1 mismatches without directly screening for rs9277534. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.


September 22, 2019

Translating genomics into practice for real-time surveillance and response to carbapenemase-producing Enterobacteriaceae: evidence from a complex multi-institutional KPC outbreak.

Until recently, Klebsiella pneumoniae carbapenemase (KPC)-producing Enterobacteriaceae were rarely identified in Australia. Following an increase in the number of incident cases across the state of Victoria, we undertook a real-time combined genomic and epidemiological investigation. The scope of this study included identifying risk factors and routes of transmission, and investigating the utility of genomics to enhance traditional field epidemiology for informing management of established widespread outbreaks.All KPC-producing Enterobacteriaceae isolates referred to the state reference laboratory from 2012 onwards were included. Whole-genome sequencing was performed in parallel with a detailed descriptive epidemiological investigation of each case, using Illumina sequencing on each isolate. This was complemented with PacBio long-read sequencing on selected isolates to establish high-quality reference sequences and interrogate characteristics of KPC-encoding plasmids.Initial investigations indicated that the outbreak was widespread, with 86 KPC-producing Enterobacteriaceae isolates (K. pneumoniae 92%) identified from 35 different locations across metropolitan and rural Victoria between 2012 and 2015. Initial combined analyses of the epidemiological and genomic data resolved the outbreak into distinct nosocomial transmission networks, and identified healthcare facilities at the epicentre of KPC transmission. New cases were assigned to transmission networks in real-time, allowing focussed infection control efforts. PacBio sequencing confirmed a secondary transmission network arising from inter-species plasmid transmission. Insights from Bayesian transmission inference and analyses of within-host diversity informed the development of state-wide public health and infection control guidelines, including interventions such as an intensive approach to screening contacts following new case detection to minimise unrecognised colonisation.A real-time combined epidemiological and genomic investigation proved critical to identifying and defining multiple transmission networks of KPC Enterobacteriaceae, while data from either investigation alone were inconclusive. The investigation was fundamental to informing infection control measures in real-time and the development of state-wide public health guidelines on carbapenemase-producing Enterobacteriaceae surveillance and management.


September 22, 2019

Identification of candidate genes at the Dp-fl locus conferring resistance against the rosy apple aphid Dysaphis plantaginea

The cultivated apple is susceptible to several pests including the rosy apple aphid (RAA; Dysaphis plantaginea Passerini), control of which is mainly based on chemical treatments. A few cases of resistance to aphids have been described in apple germplasm resources, laying the basis for the development of new resistant cultivars by breeding. The cultivar ‘Florina’ is resistant to RAA, and recently, the Dp-fl locus responsible for its resistance was mapped on linkage group 8 of the apple genome. In this paper, a chromosome walking approach was performed by using a ‘Florina’ bacterial artificial chromosome (BAC) library. The walking started from the available tightly linked molecular markers flanking the resistance region. Various walking steps were performed in order to identify the minimum tiling path of BAC clones covering the Dp-fl region from both the “resistant” and “susceptible” chromosomes of ‘Florina’. A genomic region of about 279 Kb encompassing the Dp-fl resistance locus was fully sequenced by the PacBio technology. Through the development of new polymorphic markers, the mapping interval around the resistance locus was narrowed down to a physical region of 95 Kb. The annotation of this sequence resulted in the identification of four candidate genes putatively involved in the RAA resistance response.


September 22, 2019

Complete genome sequence of Pseudomonas Parafulva PRS09-11288, a biocontrol strain produces the antibiotic phenazine-1-carboxylic acid.

Rhizoctonia solani is a plant pathogenic fungus, which can infect a wide range of economic crops including rice. In this case, biological control of this pathogen is one of the fundmental way to effectively control this pathogen. The Pseudomonas parafulva strain PRS09-11288 was isolated from rice rhizosphere and shows biocontrol ability against R. solani. Here, we analyzed the P. parafulva genome, which is ~?4.7 Mb, with 4310 coding sequences, 76 tRNAs, and 7 rRNAs. Genome analysis identified a phenazine biosynthetic pathway, which can produce antibiotic phenazine-1-carboxylic acid (PCA). This compound is responsible for biocontrol ability against R. solani Kühn, which is one of the most serious fungus disease on rice. Analysis of the phenazine biosynthesis gene mutant, ?phzF, which is very important in this pathway, confirmed the relationship between the pathway and PCA production using LC-MS profiles. The annotated full genome sequence of this strain sheds light on the role of P. parafulva PRS09-11288 as a biocontrol bacterium.


September 22, 2019

Candidatus Nitrosocaldus cavascurensis, an ammonia oxidizing, extremely thermophilic archaeon with a highly mobile genome.

Ammonia oxidizing archaea (AOA) of the phylum Thaumarchaeota are widespread in moderate environments but their occurrence and activity has also been demonstrated in hot springs. Here we present the first enrichment of a thermophilic representative with a sequenced genome, which facilitates the search for adaptive strategies and for traits that shape the evolution of Thaumarchaeota.CandidatusNitrosocaldus cavascurensis has been enriched from a hot spring in Ischia, Italy. It grows optimally at 68°C under chemolithoautotrophic conditions on ammonia or urea converting ammonia stoichiometrically into nitrite with a generation time of approximately 23 h. Phylogenetic analyses based on ribosomal proteins place the organism as a sister group to all known mesophilic AOA. The 1.58 Mb genome ofCa.N. cavascurensis harbors anamoAXCB gene cluster encoding ammonia monooxygenase and genes for a 3-hydroxypropionate/4-hydroxybutyrate pathway for autotrophic carbon fixation, but also genes that indicate potential alternative energy metabolisms. Although abona fidegene for nitrite reductase is missing, the organism is sensitive to NO-scavenging, underlining the potential importance of this compound for AOA metabolism.Ca.N. cavascurensis is distinct from all other AOA in its gene repertoire for replication, cell division and repair. Its genome has an impressive array of mobile genetic elements and other recently acquired gene sets, including conjugative systems, a provirus, transposons and cell appendages. Some of these elements indicate recent exchange with the environment, whereas others seem to have been domesticated and might convey crucial metabolic traits.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.