Menu
September 22, 2019

Separation and parallel sequencing of the genomes and transcriptomes of single cells using G&T-seq.

Parallel sequencing of a single cell’s genome and transcriptome provides a powerful tool for dissecting genetic variation and its relationship with gene expression. Here we present a detailed protocol for G&T-seq, a method for separation and parallel sequencing of genomic DNA and full-length polyA(+) mRNA from single cells. We provide step-by-step instructions for the isolation and lysis of single cells; the physical separation of polyA(+) mRNA from genomic DNA using a modified oligo-dT bead capture and the respective whole-transcriptome and whole-genome amplifications; and library preparation and sequence analyses of these amplification products. The method allows the detection of thousands of transcripts in parallel with the genetic variants captured by the DNA-seq data from the same single cell. G&T-seq differs from other currently available methods for parallel DNA and RNA sequencing from single cells, as it involves physical separation of the DNA and RNA and does not require bespoke microfluidics platforms. The process can be implemented manually or through automation. When performed manually, paired genome and transcriptome sequencing libraries from eight single cells can be produced in ~3 d by researchers experienced in molecular laboratory work. For users with experience in the programming and operation of liquid-handling robots, paired DNA and RNA libraries from 96 single cells can be produced in the same time frame. Sequence analysis and integration of single-cell G&T-seq DNA and RNA data requires a high level of bioinformatics expertise and familiarity with a wide range of informatics tools.


September 22, 2019

PhyloPythiaS+: a self-training method for the rapid reconstruction of low-ranking taxonomic bins from metagenomes.

Background. Metagenomics is an approach for characterizing environmental microbial communities in situ, it allows their functional and taxonomic characterization and to recover sequences from uncultured taxa. This is often achieved by a combination of sequence assembly and binning, where sequences are grouped into ‘bins’ representing taxa of the underlying microbial community. Assignment to low-ranking taxonomic bins is an important challenge for binning methods as is scalability to Gb-sized datasets generated with deep sequencing techniques. One of the best available methods for species bins recovery from deep-branching phyla is the expert-trained PhyloPythiaS package, where a human expert decides on the taxa to incorporate in the model and identifies ‘training’ sequences based on marker genes directly from the sample. Due to the manual effort involved, this approach does not scale to multiple metagenome samples and requires substantial expertise, which researchers who are new to the area do not have. Results. We have developed PhyloPythiaS+, a successor to our PhyloPythia(S) software. The new (+) component performs the work previously done by the human expert. PhyloPythiaS+ also includes a new k-mer counting algorithm, which accelerated the simultaneous counting of 4-6-mers used for taxonomic binning 100-fold and reduced the overall execution time of the software by a factor of three. Our software allows to analyze Gb-sized metagenomes with inexpensive hardware, and to recover species or genera-level bins with low error rates in a fully automated fashion. PhyloPythiaS+ was compared to MEGAN, taxator-tk, Kraken and the generic PhyloPythiaS model. The results showed that PhyloPythiaS+ performs especially well for samples originating from novel environments in comparison to the other methods. Availability. PhyloPythiaS+ in a virtual machine is available for installation under Windows, Unix systems or OS X on: https://github.com/algbioi/ppsp/wiki.


September 22, 2019

Androgen receptor variant AR-V9 is co-expressed with AR-V7 in prostate cancer metastases and predicts abiraterone resistance.

Purpose: Androgen receptor (AR) variant AR-V7 is a ligand-independent transcription factor that promotes prostate cancer resistance to AR-targeted therapies.  Accordingly, efforts are underway to develop strategies for monitoring and inhibiting AR-V7 in castration-resistant prostate cancer (CRPC).  The purpose of this study was to understand whether other AR variants may be co-expressed with AR-V7 and promote resistance to AR-targeted therapies. Experimental Design:  We utilized complementary short- and long-read sequencing of intact AR mRNA isoforms to characterize AR expression in CRPC models.  Co-expression of AR-V7 and AR-V9 mRNA in CRPC metastases and circulating tumor cells was assessed by RNA-seq and RT-PCR, respectively.  Expression of AR-V9 protein in CRPC models was evaluated with polyclonal antisera.  Multivariate analysis was performed to test whether AR variant mRNA expression in metastatic tissues was associated with a 12-week progression-free survival endpoint in a prospective clinical trial of 78 CRPC-stage patients initiating therapy with the androgen synthesis inhibitor, abiraterone acetate. Results: AR-V9 was frequently co-expressed with AR-V7.  Both AR variant species were found to share a common 3′ terminal cryptic exon, which rendered AR-V9 susceptible to experimental manipulations that were previously-thought to target AR-V7 uniquely.  AR-V9 promoted ligand-independent growth of prostate cancer cells.  High AR-V9 mRNA expression in CRPC metastases was predictive of primary resistance to abiraterone acetate (HR = 4.0, 95% CI = 1.31-12.2, P = 0.02).   Conclusions:  AR-V9 may be an important component of therapeutic resistance in CRPC. Copyright ©2017, American Association for Cancer Research.


September 22, 2019

L_RNA_scaffolder: scaffolding genomes with transcripts.

Generation of large mate-pair libraries is necessary for de novo genome assembly but the procedure is complex and time-consuming. Furthermore, in some complex genomes, it is hard to increase the N50 length even with large mate-pair libraries, which leads to low transcript coverage. Thus, it is necessary to develop other simple scaffolding approaches, to at least solve the elongation of transcribed fragments.We describe L_RNA_scaffolder, a novel genome scaffolding method that uses long transcriptome reads to order, orient and combine genomic fragments into larger sequences. To demonstrate the accuracy of the method, the zebrafish genome was scaffolded. With expanded human transcriptome data, the N50 of human genome was doubled and L_RNA_scaffolder out-performed most scaffolding results by existing scaffolders which employ mate-pair libraries. In these two examples, the transcript coverage was almost complete, especially for long transcripts. We applied L_RNA_scaffolder to the highly polymorphic pearl oyster draft genome and the gene model length significantly increased.The simplicity and high-throughput of RNA-seq data makes this approach suitable for genome scaffolding. L_RNA_scaffolder is available at http://www.fishbrowser.org/software/L_RNA_scaffolder.


September 22, 2019

Daily HIV pre-exposure prophylaxis (PrEP) with tenofovir disoproxil fumarate-emtricitabine reduced Streptococcus and increased Erysipelotrichaceae in rectal microbiota.

Daily PrEP is highly effective at preventing HIV-1 acquisition, but risks of long-term tenofovir disoproxil fumarate plus emtricitabine (TDF-FTC) include renal decline and bone mineral density decrease in addition to initial gastrointestinal side effects. We investigated the impact of TDF-FTC on the enteric microbiome using rectal swabs collected from healthy MSM before PrEP initiation and after 48 to 72 weeks of adherent PrEP use. The V4 region of the 16S ribosomal RNA gene sequencing showed that Streptococcus was significantly reduced from 12.0% to 1.2% (p?=?0.036) and Erysipelotrichaceae family was significantly increased from 0.79% to 3.3% (p?=?0.028) after 48-72 weeks of daily PrEP. Catenibacterium mitsuokai, Holdemanella biformis and Turicibacter sanguinis were increased within the Erysipelotrichaceae family and Streptococcus agalactiae, Streptococcus oralis, Streptococcus mitis were reduced. These changes were not associated with host factors including PrEP duration, age, race, tenofovir diphosphate blood level, any drug use and drug abuse, suggesting that the observed microbiome shifts were likely induced by daily PrEP use. Long-term PrEP resulted in increases of Catenibacterium mitsuokai and Holdemanella biformis, which have been associated with gut microbiome dysbiosis. Our observations can aid in characterizing PrEP’s side effects, which is likely to improve PrEP adherence, and thus HIV-1 prevention.


September 22, 2019

Bayesian nonparametric discovery of isoforms and individual specific quantification.

Most human protein-coding genes can be transcribed into multiple distinct mRNA isoforms. These alternative splicing patterns encourage molecular diversity, and dysregulation of isoform expression plays an important role in disease etiology. However, isoforms are difficult to characterize from short-read RNA-seq data because they share identical subsequences and occur in different frequencies across tissues and samples. Here, we develop BIISQ, a Bayesian nonparametric model for isoform discovery and individual specific quantification from short-read RNA-seq data. BIISQ does not require isoform reference sequences but instead estimates an isoform catalog shared across samples. We use stochastic variational inference for efficient posterior estimates and demonstrate superior precision and recall for simulations compared to state-of-the-art isoform reconstruction methods. BIISQ shows the most gains for low abundance isoforms, with 36% more isoforms correctly inferred at low coverage versus a multi-sample method and 170% more versus single-sample methods. We estimate isoforms in the GEUVADIS RNA-seq data and validate inferred isoforms by associating genetic variants with isoform ratios.


September 22, 2019

The influence of energy harvesting strategies on performance and microbial community for sediment microbial fuel cells

Sediment microbial fuel cells (SMFCs) are being developed as potential energy sources where remote sensing and monitoring would be useful. Several energy harvesting techniques for SMFCs have emerged, but effects of these different strategies on startup, performance, and microbial community are not well understood. We investigated these effects by comparing a continuous energy harvesting (CEH) strategy with an intermittent energy harvesting (IEH) strategy. During startup, IEH systems immediately produced higher power and were cathode limited. CEH systems exhibited a gradual power increase and were anode-limited during startup. Both system types produced similar amounts of steady-state power, 1.5 mW ft-2 (16 mW m-2) when optimized. However, an IEH system using unoptimized settings could not be subsequently switched to optimal settings and produce expected power levels. The choice of energy harvester did not appear to significantly affect steady-state community structure. Anodes were dominated by ?- and d-proteobacteria while a- and ?-proteobacteria dominated cathodes. The results suggest performance and community structure are unaffected by energy harvesting strategy, but that startup conditions influence the initial amount of harvested energy and steady-state performance, suggesting future investigations into optimizing startup of these systems are critical for rapidly generating maximum power.


September 22, 2019

Transcriptome profiling using single-molecule direct RNA sequencing approach for in-depth understanding of genes in secondary metabolism pathways of Camellia sinensis.

Characteristic secondary metabolites, including flavonoids, theanine and caffeine, are important components of Camellia sinensis, and their biosynthesis has attracted widespread interest. Previous studies on the biosynthesis of these major secondary metabolites using next-generation sequencing technologies limited the accurately prediction of full-length (FL) splice isoforms. Herein, we applied single-molecule sequencing to pooled tea plant tissues, to provide a more complete transcriptome of C. sinensis. Moreover, we identified 94 FL transcripts and four alternative splicing events for enzyme-coding genes involved in the biosynthesis of flavonoids, theanine and caffeine. According to the comparison between long-read isoforms and assemble transcripts, we improved the quality and accuracy of genes sequenced by short-read next-generation sequencing technology. The resulting FL transcripts, together with the improved assembled transcripts and identified alternative splicing events, enhance our understanding of genes involved in the biosynthesis of characteristic secondary metabolites in C. sinensis.


September 22, 2019

PacBio full-length transcriptome profiling of insect mitochondrial gene expression.

In this study, we sequenced the first full-length insect transcriptome using the Erthesina fullo Thunberg based on the PacBio platform. We constructed the first quantitative transcription map of animal mitochondrial genomes and built a straightforward and concise methodology to investigate mitochondrial gene transcription, RNA processing, mRNA maturation and several other related topics. Most of the results were consistent with the previous studies, while to the best of our knowledge some findings were reported for the first time in this study. The new findings included the high levels of mitochondrial gene expression, the 3′ polyadenylation and possible 5′ m(7)G caps of rRNAs, the isoform diversity of 12S rRNA, the polycistronic transcripts and natural antisense transcripts of mitochondrial genes et al. These findings could challenge and enrich fundamental concepts of mitochondrial gene transcription and RNA processing, particularly of the rRNA primary (sequence) structure. The methodology constructed in this study can also be used to study gene expression or RNA processing of nuclear genomes.


September 22, 2019

Bacterial community structure in simultaneous nitrification, denitrification and organic matter removal process treating saline mustard tuber wastewater as revealed by 16S rRNA sequencing.

A simultaneous nitrification, denitrification and organic matter removal (SNDOR) process in sequencing batch biofilm reactor (SBBR) was established to treat saline mustard tuber wastewater (MTWW) in this study. An average COD removal efficiency of 86.48% and total nitrogen removal efficiency of 86.48% were achieved at 30gNaClL(-1) during 100days’ operation. The underlying mechanisms were investigated by PacBio SMRT DNA sequencing (V1-V9) to analyze the microbial community structures and its variation from low salinity at 10gNaClL(-1) to high salinity at 30gNaClL(-1). Results showed elevated salinity did not affect biological performance but reduced microbial diversity in SBBR, and halophilic bacteria gradually predominated by succession. Despite of high C/N, autotrophic ammonia-oxidizing bacteria (AOB) Nitrosomonas and ammonia-oxidizing archaea (AOA) Candidatus Nitrososphaera both contributed to ammonium oxidation. As salinity increasing, nitrite-oxidizing bacteria (NOB) were significantly inhibited, partial nitrification and denitrification (PND) process gradually contributed to nitrogen removal. Copyright © 2016 Elsevier Ltd. All rights reserved.


September 22, 2019

Computational identification of novel genes: current and future perspectives.

While it has long been thought that all genomic novelties are derived from the existing material, many genes lacking homology to known genes were found in recent genome projects. Some of these novel genes were proposed to have evolved de novo, ie, out of noncoding sequences, whereas some have been shown to follow a duplication and divergence process. Their discovery called for an extension of the historical hypotheses about gene origination. Besides the theoretical breakthrough, increasing evidence accumulated that novel genes play important roles in evolutionary processes, including adaptation and speciation events. Different techniques are available to identify genes and classify them as novel. Their classification as novel is usually based on their similarity to known genes, or lack thereof, detected by comparative genomics or against databases. Computational approaches are further prime methods that can be based on existing models or leveraging biological evidences from experiments. Identification of novel genes remains however a challenging task. With the constant software and technologies updates, no gold standard, and no available benchmark, evaluation and characterization of genomic novelty is a vibrant field. In this review, the classical and state-of-the-art tools for gene prediction are introduced. The current methods for novel gene detection are presented; the methodological strategies and their limits are discussed along with perspective approaches for further studies.


September 22, 2019

The role of MHC-E in T cell immunity is conserved among humans, rhesus macaques, and cynomolgus macaques.

MHC-E is a highly conserved nonclassical MHC class Ib molecule that predominantly binds and presents MHC class Ia leader sequence-derived peptides for NK cell regulation. However, MHC-E also binds pathogen-derived peptide Ags for presentation to CD8+ T cells. Given this role in adaptive immunity and its highly monomorphic nature in the human population, HLA-E is an attractive target for novel vaccine and immunotherapeutic modalities. Development of HLA-E-targeted therapies will require a physiologically relevant animal model that recapitulates HLA-E-restricted T cell biology. In this study, we investigated MHC-E immunobiology in two common nonhuman primate species, Indian-origin rhesus macaques (RM) and Mauritian-origin cynomolgus macaques (MCM). Compared to humans and MCM, RM expressed a greater number of MHC-E alleles at both the population and individual level. Despite this difference, human, RM, and MCM MHC-E molecules were expressed at similar levels across immune cell subsets, equivalently upregulated by viral pathogens, and bound and presented identical peptides to CD8+ T cells. Indeed, SIV-specific, Mamu-E-restricted CD8+ T cells from RM recognized antigenic peptides presented by all MHC-E molecules tested, including cross-species recognition of human and MCM SIV-infected CD4+ T cells. Thus, MHC-E is functionally conserved among humans, RM, and MCM, and both RM and MCM represent physiologically relevant animal models of HLA-E-restricted T cell immunobiology. Copyright © 2017 by The American Association of Immunologists, Inc.


September 22, 2019

Evaluating the mobility potential of antibiotic resistance genes in environmental resistomes without metagenomics.

Antibiotic resistance genes are ubiquitous in the environment. However, only a fraction of them are mobile and able to spread to pathogenic bacteria. Until now, studying the mobility of antibiotic resistance genes in environmental resistomes has been challenging due to inadequate sensitivity and difficulties in contig assembly of metagenome based methods. We developed a new cost and labor efficient method based on Inverse PCR and long read sequencing for studying mobility potential of environmental resistance genes. We applied Inverse PCR on sediment samples and identified 79 different MGE clusters associated with the studied resistance genes, including novel mobile genetic elements, co-selected resistance genes and a new putative antibiotic resistance gene. The results show that the method can be used in antibiotic resistance early warning systems. In comparison to metagenomics, Inverse PCR was markedly more sensitive and provided more data on resistance gene mobility and co-selected resistances.


September 22, 2019

Koumiss consumption alleviates symptoms of patients with chronic atrophic gastritis: A possible link To modulation of gut microbiota

Intestinal dysbiosisis closely related to a variety of medical conditions, especially gastrointestinal diseases. The present study aimed to investigate the effects of koumiss on chronic atrophic gastritis (CAG) in an out-patient clinical trial (n = 10; all female subjects aged 41-55; body mass index ranging from 19.5 to 25.8). Each patient consumed three servings of koumiss per day (i.e. 250 ml daily before each of 3 meals) for a 60-day period. The improvement of patients’ symptoms was monitored by comparing the total scores of symptoms before and after the treatment. Meanwhile, the changes in the patients’ fecal microbiota composition and specific blood parameters were determined. After the 60-day koumiss administration, significant symptom improvements were observed, as evidenced by the reduction of the total symptoms score, and changes in blood platelet and cholesterol levels. The changes in patients’ fecal microbiota composition were found. The patients’ fecal microbiota fell into two distinct enterotypes, Bacteroides dorei/ Bacteroides uniformis (BB-enterotype) and Prevotella copri (P-enterotype). Significant less Bacteroides uniformis was found in the BB-enterotype patient group, while significant more butyrate-producing bacteria (e.g. Eubacterium rectale and Faecalibacterium prausnitzii) were found in the P-enterotype patient group, following koumiss administration. After stopping koumiss consumption, the relative abundance of some biomarker taxa returned to the original level, suggesting that the gut microbiota modulatory effect was not permanent and that continuous koumiss administration was required to maintain the therapeutic effect. In conclusion, koumiss consumption could alleviate the symptoms of CAG patients. Our results may help understand the mechanism of koumiss in alleviating CAG disease symptoms, facilitating the development of such products with desired therapeutic functions.


September 22, 2019

Single-molecule long-read transcriptome profiling of Platysternon megacephalum mitochondrial genome with gene rearrangement and control region duplication.

Platysternon megacephalum is the sole living representative of the poorly studied turtle lineage Platysternidae. Their mitochondrial genome has been subject to gene rearrangement and control region duplication, resulting in a unique mitochondrial gene order in vertebrates. In this study, we sequenced the first full-length turtle (P. megacephalum) liver transcriptome using single-molecule real-time sequencing to study the transcriptional mechanisms of its mitochondrial genome. ND5 and ND6 anti-sense (ND6AS) forms a single transcript with the same expression in the human mitochondrial genome, but here we demonstrated differential expression of the rearranged ND5 and ND6AS genes in P. megacephalum. And some polycistronic transcripts were also reported in this study. Notably, we detected some novel long non-coding RNAs with alternative polyadenylation from the duplicated control region, and a novel ND6AS transcript composed of a long non-coding sequence, ND6AS, and tRNA-GluAS. These results provide the first description of a mtDNA transcriptome with gene rearrangement and control region duplication. These findings further our understanding of the fundamental concepts of mitochondrial gene transcription and RNA processing, and provide a new insight into the mechanism of transcription regulation of the mitochondrial genome.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.