September 22, 2019

Membrane attack complex-associated molecules from redlip mullet (Liza haematocheila): Molecular characterization and transcriptional evidence of C6, C7, C8β, and C9 in innate immunity.

The redlip mullet (Liza haematocheila) is one of the most economically important fish in Korea and other East Asian countries; it is susceptible to infections by pathogens such as Lactococcus garvieae, Argulus spp., Trichodina spp., and Vibrio spp. Learning about the mechanisms of the complement system of the innate immunity of redlip mullet is important for efforts towards eradicating pathogens. Here, we report a comprehensive study of the terminal complement complex (TCC) components that form the membrane attack complex (MAC) through in-silico characterization and comparative spatial and temporal expression profiling. Five conserved domains (TSP1, LDLa, MACPF, CCP, and FIMAC) were detected in the TCC components, but the CCP and FIMAC domains were absent in MuC8β and MuC9. Expression analysis of four TCC genes from healthy redlip mullets showed the highest expression levels in the liver, whereas limited expression was observed in other tissues; immune-induced expression in the head kidney and spleen revealed significant responses against Lactococcus garvieae and poly I:C injection, suggesting their involvement in MAC formation in response to harmful pathogenic infections. Furthermore, the response to poly I:C may suggest the role of TCC components in the breakdown of the membrane of enveloped viruses. These findings may help to elucidate the mechanisms behind the complement system of teleost innate immunity. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Leveraging multiple transcriptome assembly methods for improved gene structure annotation.

The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand. Here, we show that the accuracy of transcript reconstruction can be boosted by combining multiple methods, and we present a novel algorithm to integrate multiple RNA-seq assemblies into a coherent transcript annotation. Our algorithm can remove redundancies and select the best transcript models according to user-specified metrics, while solving common artifacts such as erroneous transcript chimerisms. We have implemented this method in an open-source Python3 and Cython program, Mikado, available on GitHub.


September 22, 2019

PacBio metabarcoding of Fungi and other eukaryotes: errors, biases and perspectives.

Second-generation, high-throughput sequencing methods have greatly improved our understanding of the ecology of soil microorganisms, yet the short barcodes (< 500 bp) provide limited taxonomic and phylogenetic information for species discrimination and taxonomic assignment. Here, we utilized the third-generation Pacific Biosciences (PacBio) RSII and Sequel instruments to evaluate the suitability of full-length internal transcribed spacer (ITS) barcodes and longer rRNA gene amplicons for metabarcoding Fungi, Oomycetes and other eukaryotes in soil samples. Metabarcoding revealed multiple errors and biases: Taq polymerase substitution errors and mis-incorporating indels in sequencing homopolymers constitute major errors; sequence length biases occur during PCR, library preparation, loading to the sequencing instrument and quality filtering; primer-template mismatches bias the taxonomic profile when using regular and highly degenerate primers. The RSII and Sequel platforms enable the sequencing of amplicons up to 3000 bp, but the sequence quality remains slightly inferior to Illumina sequencing, especially in longer amplicons. The full ITS barcode and flanking rRNA small subunit gene greatly improve taxonomic identification at the species and phylum levels, respectively. We conclude that PacBio sequencing provides a viable alternative for metabarcoding of organisms that are of relatively low diversity, require > 500-bp barcode for reliable identification or when phylogenetic approaches are intended. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.


September 22, 2019

An environmental bacterial taxon with a large and distinct metabolic repertoire.

Cultivated bacteria such as actinomycetes are a highly useful source of biomedically important natural products. However, such ‘talented’ producers represent only a minute fraction of the entire, mostly uncultivated, prokaryotic diversity. The uncultured majority is generally perceived as a large, untapped resource of new drug candidates, but so far it is unknown whether taxa containing talented bacteria indeed exist. Here we report the single-cell- and metagenomics-based discovery of such producers. Two phylotypes of the candidate genus ‘Entotheonella’ with genomes of greater than 9 megabases and multiple, distinct biosynthetic gene clusters co-inhabit the chemically and microbially rich marine sponge Theonella swinhoei. Almost all bioactive polyketides and peptides known from this animal were attributed to a single phylotype. ‘Entotheonella’ spp. are widely distributed in sponges and belong to an environmental taxon proposed here as candidate phylum ‘Tectomicrobia’. The pronounced bioactivities and chemical uniqueness of ‘Entotheonella’ compounds provide significant opportunities for ecological studies and drug discovery.


September 22, 2019

A global survey of alternative splicing in allopolyploid cotton: landscape, complexity and regulation.

Alternative splicing (AS) is a crucial regulatory mechanism in eukaryotes, which acts by greatly increasing transcriptome diversity. The extent and complexity of AS has been revealed in model plants using high-throughput next-generation sequencing. However, this technique is less effective in accurately identifying transcript isoforms in polyploid species because of the high sequence similarity between coexisting subgenomes. Here we characterize AS in the polyploid species cotton. Using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq), we developed an integrated pipeline for Iso-Seq transcriptome data analysis (https://github.com/Nextomics/pipeline-for-isoseq). We identified 176 849 full-length transcript isoforms from 44 968 gene models and updated gene annotation. These data led us to identify 15 102 fibre-specific AS events and estimate that c. 51.4% of homoeologous genes produce divergent isoforms in each subgenome. We reveal that AS allows differential regulation of the same gene by miRNAs at the isoform level. We also show that nucleosome occupancy and DNA methylation play a role in defining exons at the chromatin level. This study provides new insights into the complexity and regulation of AS, and will enhance our understanding of AS in polyploid species. Our methodology for Iso-Seq data analysis will be a useful reference for the study of AS in other species. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.


September 22, 2019

Transcriptional adaptations during long-term persistence of Staphylococcus aureus in the airways of a cystic fibrosis patient.

The lungs of cystic fibrosis (CF) patients are often colonized and/or infected by Staphylococcus aureus for years, mostly by one predominant clone. For long-term survival in this environment, S. aureus needs to adapt during its interactions with host factors, antibiotics, and other pathogens. Here, we study long-term transcriptional as well as genomic adaptations of an isogenic pair of S. aureus isolates from a single patient using RNA sequencing (RNA-Seq) and whole genome sequencing (WGS). Mimicking in vivo conditions, we cultivated the S. aureus isolates using artificial sputum medium before harvesting RNA for subsequent analysis. We confirmed our RNA-Seq data using quantitative real-time (qRT)-PCR and additionally investigated intermediate isolates from the same patient representing in total 13.2 years of persistence in the CF airways. Comparative RNA-Seq analysis of the first and the last (“late”) isolate revealed significant differences in the late isolate after 13.2 years of persistence. Of the 2545 genes expressed in both isolates that were cultivated aerobically, 256 genes were up- and 161 were down-regulated with a minimum 2-fold change (2f). Focusing on 25 highly (≥8f) up- (n=9) or down- (n=16) regulated genes, we identified several genes encoding virulence factors involved in immune evasion, bacterial spread or secretion (e.g. spa, sak, and esxA). Moreover, these genes displayed similar expression trends under aerobic, microaerophilic and anaerobic conditions. Further qRT-PCR experiments on highly up- or down-regulated genes within intermediate S. aureus isolates revealed different gene expression patterns over the years. Sequencing analysis of the differentially expressed genes and their upstream regions in the late S. aureus isolate revealed only a few genomic alterations. Comparative transcriptomic analysis revealed adaptive changes affecting mainly genes involved in host-pathogen interaction. Although the underlying mechanisms were not known, our results suggest adaptive processes beyond genomic mutations triggered by local factors rather than by activation of global regulators. Copyright © 2014 The Authors. Published by Elsevier GmbH. All rights reserved.
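
As an illustration of the fold-change thresholds mentioned in this abstract (2-fold and 8-fold), the following is a minimal sketch of how a gene might be labelled up- or down-regulated between an early and a late isolate. It is not the authors' pipeline: the function name `classify_regulation`, the use of normalized expression values, and the pseudocount of 1 are all assumptions made for this example.

```python
import math


def classify_regulation(early, late, min_fold=2.0, pseudocount=1.0):
    """Label a gene "up", "down", or "unchanged" in the late isolate.

    early, late: normalized expression values (hypothetical inputs).
    min_fold:    minimum fold change, e.g. 2.0 or 8.0.
    pseudocount: assumed offset that avoids division by zero.
    """
    log2_fc = math.log2((late + pseudocount) / (early + pseudocount))
    threshold = math.log2(min_fold)
    if log2_fc >= threshold:
        return "up"
    if log2_fc <= -threshold:
        return "down"
    return "unchanged"
```

Calling the function with `min_fold=8.0` would apply the stricter cut-off used for the subset of highly regulated genes.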


September 22, 2019

Assessing quality of Medicago sativa silage by monitoring bacterial composition with single molecule, real-time sequencing technology and various physiological parameters.

The present study applied the PacBio single molecule, real-time sequencing technology (SMRT) in evaluating the quality of silage production. Specifically, we produced four types of Medicago sativa silages by using four different lactic acid bacteria-based additives (AD-I, AD-II, AD-III and AD-IV). We monitored the changes in pH, organic acids (including butyric acid, the ratio of acetic acid/lactic acid, γ-aminobutyric acid, 4-hydroxybenzoic acid and phenyl lactic acid), mycotoxins, and bacterial microbiota during silage fermentation. Our results showed that the use of the additives was beneficial to the silage fermentation by enhancing the overall reduction in pH and mycotoxins, while increasing the organic acid content. By SMRT analysis of the microbial composition in eight silage samples, we found that the bacterial species number and relative abundances shifted clearly after fermentation. Such changes were specific to the LAB species in the additives. In particular, Bacillus megaterium was the initial dominant species in the raw materials; after the fermentation process, Pediococcus acidilactici and Lactobacillus plantarum became the most prevalent species, both of which were intrinsically present in the LAB additives. Our data demonstrate that the SMRT sequencing platform is applicable in assessing the quality of silage.


September 22, 2019

Comparison of the mitochondrial genomes and steady state transcriptomes of two strains of the trypanosomatid parasite, Leishmania tarentolae.

U-insertion/deletion RNA editing is a post-transcriptional mitochondrial RNA modification phenomenon required for viability of trypanosomatid parasites. Small guide RNAs encoded mainly by the thousands of catenated minicircles contain the information for this editing. We analyzed by NGS technology the mitochondrial genomes and transcriptomes of two strains, the old lab UC strain and the recently isolated LEM125 strain. PacBio sequencing provided complete minicircle sequences which avoided the assembly problem of short reads caused by the conserved regions. Minicircles were identified by a characteristic size, the presence of three short conserved sequences, a region of inherently bent DNA and the presence of single gRNA genes at a fairly defined location. The LEM125 strain contained over 114 minicircles encoding different gRNAs and the UC strain only ~24 minicircles. Some LEM125 minicircles contained no identifiable gRNAs. Approximate copy numbers of the different minicircle classes in the network were determined by the number of PacBio CCS reads that assembled to each class. Mitochondrial RNA libraries from both strains were mapped against the minicircle and maxicircle sequences. Small RNA reads mapped to the putative gRNA genes but also to multiple regions outside the genes on both strands and large RNA reads mapped in many cases over almost the entire minicircle on both strands. These data suggest that minicircle transcription is complete and bidirectional, with 3′ processing yielding the mature gRNAs. Steady state RNAs in varying abundances are derived from all maxicircle genes, including portions of the repetitive divergent region. The relative extents of editing in both strains correlated with the presence of a cascade of cognate gRNAs. These data should provide the foundation for a deeper understanding of this dynamic genetic system as well as the evolutionary variation of editing in different strains.


September 22, 2019

Global transcript structure resolution of high gene density genomes through multi-platform data integration.

Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus transcriptome complexity than is captured by existing annotation. The pervasive nature of overlapping transcription throughout herpesvirus genomes, however, poses substantial problems in resolving transcript structures using these methods alone. We present an approach that combines the unique attributes of Pacific Biosciences Iso-Seq long-read, Illumina short-read and deepCAGE (Cap Analysis of Gene Expression) sequencing to globally resolve polyadenylated isoform structures in replicating Epstein-Barr virus (EBV). Our method, Transcriptome Resolution through Integration of Multi-platform Data (TRIMD), identifies nearly 300 novel EBV transcripts, quadrupling the size of the annotated viral transcriptome. These findings illustrate an array of mechanisms through which EBV achieves functional diversity in its relatively small, compact genome including programmed alternative splicing (e.g. across the IR1 repeats), alternative promoter usage by LMP2 and other latency-associated transcripts, intergenic splicing at the BZLF2 locus, and antisense transcription and pervasive readthrough transcription throughout the genome. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.


September 22, 2019

Alternative TSSs are co-regulated in single cells in the mouse brain.

Alternative transcription start sites (TSSs) have been extensively studied genome-wide for many cell types and have been shown to be important during development and to regulate transcript abundance between cell types. Likewise, single-cell gene expression has been extensively studied for many cell types. However, how single cells use TSSs has not yet been examined. In particular, it is unknown whether alternative TSSs are independently expressed, or whether they are co-activated or even mutually exclusive in single cells. Here, we use a previously published single-cell RNA-seq dataset, comprising thousands of cells, to study alternative TSS usage. We find that alternative TSS usage is a regulated process, and the correlation between two TSSs expressed in single cells of the same cell type is surprisingly high. Our findings indicate that TSSs generally are regulated by common factors rather than being independently regulated or stochastically expressed. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.


September 22, 2019

neoantigenR: An annotation based pipeline for tumor neoantigen identification from sequencing data

Studies indicate that more than 90% of human genes are alternatively spliced, suggesting the complexity of the transcriptome assembly and analysis. The splicing process is often disrupted, resulting in both functional and non-functional end-products (Sveen et al. 2016) in many cancers. Harnessing the immune system to fight against malignant cancers carrying aberrantly mutated or spliced products is becoming a promising approach to cancer therapy. Advances in immune checkpoint blockade have elicited adaptive immune responses with promising clinical responses to treatments against human malignancies (Tumor Neoantigens in Personalized Cancer Immunotherapy 2017). Emerging data suggest that recognition of patient-specific mutation-associated cancer antigens (i.e. from alternative splicing isoforms) may allow scientists to dissect the immune response in the activity of clinical immunotherapies (Schumacher and Schreiber 2015). The advent of high-throughput sequencing technology has provided a comprehensive view of both splicing aberrations and somatic mutations across a range of human malignancies, allowing for a deeper understanding of the interplay of various disease mechanisms. Meanwhile, studies show that the number of transcript isoforms reported to date may be limited by short-read sequencing due to the inherent limitations of transcriptome reconstruction algorithms, whereas long-read sequencing is able to significantly improve the detection of alternative splicing variants since there is no need to assemble full-length transcripts from short reads. The analysis of these high-throughput long-read sequencing data may permit a systematic view of tumor-specific peptide epitopes (also known as neoantigens) that could serve as targets for immunotherapy (Tumor Neoantigens in Personalized Cancer Immunotherapy 2017).
Currently, there is no software pipeline available that can efficiently produce mutation-associated cancer antigens from raw high-throughput sequencing data on patient tumor DNA (The Problem with Neoantigen Prediction 2017). To address this issue, we introduce an R package that enables the discovery of peptide epitope candidates, which are the tumor-specific peptide fragments containing potential functional neoantigens. These peptide epitopes consist of structural variants including insertions, deletions, alternative sequences, and peptides from nonsynonymous mutations. Analysis of these precursor candidates with widely used tools such as netMHC allows for the accurate in-silico prediction of neoantigens. The pipeline named neoantigeR is currently hosted at https://github.com/ICBI/neoantigeR.


September 22, 2019

The genome of an underwater architect, the caddisfly Stenopsyche tienmushanensis Hwang (Insecta: Trichoptera).

Caddisflies (Insecta: Trichoptera) are a highly adapted freshwater group of insects split from a common ancestor with Lepidoptera. They are the most diverse (>16,000 species) of the strictly aquatic insect orders and are widely employed as bio-indicators in water quality assessment and monitoring. Among the numerous adaptations to aquatic habitats, caddisfly larvae use silk and materials from the environment (e.g., stones, sticks, leaf matter) to build composite structures such as fixed retreats and portable cases. Understanding how caddisflies have adapted to aquatic habitats will help explain the evolution and subsequent diversification of the group. We sequenced a retreat-builder caddisfly Stenopsyche tienmushanensis Hwang and assembled a high-quality genome from both Illumina and Pacific Biosciences (PacBio) sequencing. In total, 601.2 M Illumina reads (90.2 Gb) and 16.9 M PacBio subreads (89.0 Gb) were generated. The 451.5 Mb assembled genome has a contig N50 of 1.29 Mb, has a longest contig of 4.76 Mb, and covers 97.65% of the 1,658 insect single-copy genes as assessed by Benchmarking Universal Single-Copy Orthologs. The genome comprises 36.76% repetitive elements. A total of 14,672 predicted protein-coding genes were identified. The genome revealed gene expansions in specific groups of the cytochrome P450 family and olfactory binding proteins, suggesting potential genomic features associated with pollutant tolerance and mate finding. In addition, the complete gene complex of the highly repetitive H-fibroin, the major protein component of caddisfly larval silk, was assembled. We report the draft genome of Stenopsyche tienmushanensis, the highest-quality caddisfly genome so far. The genome information will be an important resource for the study of caddisflies and may shed light on the evolution of aquatic insects.
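
For readers unfamiliar with the contig N50 statistic quoted in this abstract, the sketch below shows the standard definition: the length of the contig at which the running total of contig lengths, taken from longest to shortest, first reaches half of the total assembly size. The function name `contig_n50` and the toy input are illustrative assumptions and are not taken from the paper.

```python
def contig_n50(contig_lengths):
    """Return the N50 of a collection of contig lengths (in bp).

    Contigs are sorted from longest to shortest; N50 is the length of
    the contig at which the cumulative sum first reaches at least half
    of the total assembly length.
    """
    if not contig_lengths:
        raise ValueError("at least one contig length is required")
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


# Toy assembly of five contigs totalling 20 bp.
toy_n50 = contig_n50([2, 3, 4, 5, 6])
```

In the toy example the two longest contigs (6 and 5) together cover 11 of 20 bp, so the N50 is 5.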


September 22, 2019

Subaerial biofilms on granitic historic buildings: microbial diversity and development of phototrophic multi-species cultures.

Microbial communities of natural subaerial biofilms developed on granitic historic buildings of a World Heritage Site (Santiago de Compostela, NW Spain) were characterized and cultured in liquid BG11 medium. Environmental barcoding through next-generation sequencing (Pacific Biosciences) revealed that the biofilms were mainly composed of species of Chlorophyta (green algae) and Ascomycota (fungi) commonly associated with rock substrata. Richness and diversity were higher for the fungal than for the algal assemblages and fungi showed higher heterogeneity among samples. Cultures derived from natural biofilms showed the establishment of stable microbial communities mainly composed of Chlorophyta and Cyanobacteria. Although most taxa found in these cultures were not common in the original biofilms, they are likely common pioneer colonizers of building stone surfaces, including granite. Stable phototrophic multi-species cultures of known microbial diversity were thus obtained and their reliability to emulate natural colonization on granite should be confirmed in further experiments.


September 22, 2019

Improved performance of the PacBio SMRT technology for 16S rDNA sequencing.

Improved sequencing accuracy was obtained with 16S amplicons from environmental samples and a known pure culture when upgraded Pacific Biosciences (PacBio) hardware and enzymes were used for the single molecule, real-time (SMRT) sequencing platform. The new PacBio RS II system with P4/C2 chemistry, when used with previously constructed libraries (Mosher et al., 2013), surpassed the accuracy of the Roche/454 pyrosequencing platform. With accurate read lengths of >1400 base pairs, the PacBio system opens up the possibility of identifying microorganisms to the species level in environmental samples. Copyright © 2014 Elsevier B.V. All rights reserved.


September 22, 2019

The state of long non-coding RNA biology.

Transcriptomic studies have demonstrated that the vast majority of the genomes of mammals and other complex organisms is expressed in highly dynamic and cell-specific patterns to produce large numbers of intergenic, antisense and intronic long non-protein-coding RNAs (lncRNAs). Despite well characterized examples, their scaling with developmental complexity, and many demonstrations of their association with cellular processes, development and diseases, lncRNAs are still to be widely accepted as major players in gene regulation. This may reflect an underappreciation of the extent and precision of the epigenetic control of differentiation and development, where lncRNAs appear to have a central role, likely as organizational and guide molecules: most lncRNAs are nuclear-localized and chromatin-associated, with some involved in the formation of specialized subcellular domains. I suggest that a reassessment of the conceptual framework of genetic information and gene expression in the 4-dimensional ontogeny of spatially organized multicellular organisms is required. Together with this and further studies on their biology, the key challenges now are to determine the structure-function relationships of lncRNAs, which may be aided by emerging evidence of their modular structure, the role of RNA editing and modification in enabling epigenetic plasticity, and the role of RNA signaling in transgenerational inheritance of experience.

