Menu
September 22, 2019  |  

A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.

It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.


September 22, 2019  |  

Alternative splice variants of AID are not stoichiometrically present at the protein level in chronic lymphocytic leukemia

Activation-induced deaminase (AID) is a DNA-mutating enzyme that mediates class-switch recombination as well as somatic hypermutation of antibody genes in B cells. Due to off-target activity, AID is implicated in lymphoma development by introducing genome-wide DNA damage and initiating chromosomal translocations such as c-myc/IgH. Several alternative splice transcripts of AID have been reported in activated B cells as well as malignant B cells such as chronic lymphocytic leukemia (CLL). As most commercially available antibodies fail to recognize alternative splice variants, their abundance in vivo, and hence their biological significance, has not been determined. In this study, we assessed the protein levels of AID splice isoforms by introducing an AID splice reporter construct into cell lines and primary CLL cells from patients as well as from WT and TCL1(tg) C57BL/6 mice (where TCL1 is T-cell leukemia/lymphoma 1). The splice construct is 5′-fused to a GFP-tag, which is preserved in all splice isoforms and allows detection of translated protein. Summarizing, we show a thorough quantification of alternatively spliced AID transcripts and demonstrate that the corresponding protein abundances, especially those of splice variants AID-ivs3 and AID-?E4, are not stoichiometrically equivalent. Our data suggest that enhanced proteasomal degradation of low-abundance proteins might be causative for this discrepancy. © 2013 The Authors. European Journal of Immunology published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


September 22, 2019  |  

The human microbiome and understanding the 16S rRNA gene in translational nursing science.

As more is understood regarding the human microbiome, it is increasingly important for nurse scientists and healthcare practitioners to analyze these microbial communities and their role in health and disease. 16S rRNA sequencing is a key methodology in identifying these bacterial populations that has recently transitioned from use primarily in research to having increased utility in clinical settings.The objectives of this review are to (a) describe 16S rRNA sequencing and its role in answering research questions important to nursing science; (b) provide an overview of the oral, lung, and gut microbiomes and relevant research; and (c) identify future implications for microbiome research and 16S sequencing in translational nursing science.Sequencing using the 16S rRNA gene has revolutionized research and allowed scientists to easily and reliably characterize complex bacterial communities. This type of research has recently entered the clinical setting, one of the best examples involving the use of 16S sequencing to identify resistant pathogens, thereby improving the accuracy of bacterial identification in infection control. Clinical microbiota research and related requisite methods are of particular relevance to nurse scientists-individuals uniquely positioned to utilize these techniques in future studies in clinical settings.


September 22, 2019  |  

Quantitative profiling of Drosophila melanogaster Dscam1 isoforms reveals no changes in splicing after bacterial exposure.

The hypervariable Dscam1 (Down syndrome cell adhesion molecule 1) gene can produce thousands of different ectodomain isoforms via mutually exclusive alternative splicing. Dscam1 appears to be involved in the immune response of some insects and crustaceans. It has been proposed that the diverse isoforms may be involved in the recognition of, or the defence against, diverse parasite epitopes, although evidence to support this is sparse. A prediction that can be generated from this hypothesis is that the gene expression of specific exons and/or isoforms is influenced by exposure to an immune elicitor. To test this hypothesis, we for the first time, use a long read RNA sequencing method to directly investigate the Dscam1 splicing pattern after exposing adult Drosophila melanogaster and a S2 cell line to live Escherichia coli. After bacterial exposure both models showed increased expression of immune-related genes, indicating that the immune system had been activated. However there were no changes in total Dscam1 mRNA expression. RNA sequencing further showed that there were no significant changes in individual exon expression and no changes in isoform splicing patterns in response to bacterial exposure. Therefore our studies do not support a change of D. melanogaster Dscam1 isoform diversity in response to live E. coli. Nevertheless, in future this approach could be used to identify potentially immune-related Dscam1 splicing regulation in other host species or in response to other pathogens.


September 22, 2019  |  

Interaction between the microbiome and TP53 in human lung cancer.

Lung cancer is the leading cancer diagnosis worldwide and the number one cause of cancer deaths. Exposure to cigarette smoke, the primary risk factor in lung cancer, reduces epithelial barrier integrity and increases susceptibility to infections. Herein, we hypothesize that somatic mutations together with cigarette smoke generate a dysbiotic microbiota that is associated with lung carcinogenesis. Using lung tissue from 33 controls and 143 cancer cases, we conduct 16S ribosomal RNA (rRNA) bacterial gene sequencing, with RNA-sequencing data from lung cancer cases in The Cancer Genome Atlas serving as the validation cohort.Overall, we demonstrate a lower alpha diversity in normal lung as compared to non-tumor adjacent or tumor tissue. In squamous cell carcinoma specifically, a separate group of taxa are identified, in which Acidovorax is enriched in smokers. Acidovorax temporans is identified within tumor sections by fluorescent in situ hybridization and confirmed by two separate 16S rRNA strategies. Further, these taxa, including Acidovorax, exhibit higher abundance among the subset of squamous cell carcinoma cases with TP53 mutations, an association not seen in adenocarcinomas.The results of this comprehensive study show both microbiome-gene and microbiome-exposure interactions in squamous cell carcinoma lung cancer tissue. Specifically, tumors harboring TP53 mutations, which can impair epithelial function, have a unique bacterial consortium that is higher in relative abundance in smoking-associated tumors of this type. Given the significant need for clinical diagnostic tools in lung cancer, this study may provide novel biomarkers for early detection.


September 22, 2019  |  

Clonal distribution of BCR-ABL1 mutations and splice isoforms by single-molecule long-read RNA sequencing.

The evolution of mutations in the BCR-ABL1 fusion gene transcript renders CML patients resistant to tyrosine kinase inhibitor (TKI) based therapy. Thus screening for BCR-ABL1 mutations is recommended particularly in patients experiencing poor response to treatment. Herein we describe a novel approach for the detection and surveillance of BCR-ABL1 mutations in CML patients.To detect mutations in the BCR-ABL1 transcript we developed an assay based on the Pacific Biosciences (PacBio) sequencing technology, which allows for single-molecule long-read sequencing of BCR-ABL1 fusion transcript molecules. Samples from six patients with poor response to therapy were analyzed both at diagnosis and follow-up. cDNA was generated from total RNA and a 1,6 kb fragment encompassing the BCR-ABL1 transcript was amplified using long range PCR. To estimate the sensitivity of the assay, a serial dilution experiment was performed.Over 10,000 full-length BCR-ABL1 sequences were obtained for all samples studied. Through the serial dilution analysis, mutations in CML patient samples could be detected down to a level of at least 1%. Notably, the assay was determined to be sufficiently sensitive even in patients harboring a low abundance of BCR-ABL1 levels. The PacBio sequencing successfully identified all mutations seen by standard methods. Importantly, we identified several mutations that escaped detection by the clinical routine analysis. Resistance mutations were found in all but one of the patients. Due to the long reads afforded by PacBio sequencing, compound mutations present in the same molecule were readily distinguished from independent alterations arising in different molecules. Moreover, several transcript isoforms of the BCR-ABL1 transcript were identified in two of the CML patients. Finally, our assay allowed for a quick turn around time allowing samples to be reported upon within 2 days.In summary the PacBio sequencing assay can be applied to detect BCR-ABL1 resistance mutations in both diagnostic and follow-up CML patient samples using a simple protocol applicable to routine diagnosis. The method besides its sensitivity, gives a complete view of the clonal distribution of mutations, which is of importance when making therapy decisions.


September 22, 2019  |  

Major histocompatibility complex haplotyping and long-amplicon allele discovery in cynomolgus macaques from Chinese breeding facilities.

Very little is currently known about the major histocompatibility complex (MHC) region of cynomolgus macaques (Macaca fascicularis; Mafa) from Chinese breeding centers. We performed comprehensive MHC class I haplotype analysis of 100 cynomolgus macaques from two different centers, with animals from different reported original geographic origins (Vietnamese, Cambodian, and Cambodian/Indonesian mixed-origin). Many of the samples were of known relation to each other (sire, dam, and progeny sets), making it possible to characterize lineage-level haplotypes in these animals. We identified 52 Mafa-A and 74 Mafa-B haplotypes in this cohort, many of which were restricted to specific sample origins. We also characterized full-length MHC class I transcripts using Pacific Biosciences (PacBio) RS II single-molecule real-time (SMRT) sequencing. This technology allows for complete read-through of unfragmented MHC class I transcripts (~1100 bp in length), so no assembly is required to unambiguously resolve novel full-length sequences. Overall, we identified 311 total full-length transcripts in a subset of 72 cynomolgus macaques from these Chinese breeding facilities; 130 of these sequences were novel and an additional 115 extended existing short database sequences to span the complete open reading frame. This significantly expands the number of Mafa-A, Mafa-B, and Mafa-I full-length alleles in the official cynomolgus macaque MHC class I database. The PacBio technique described here represents a general method for full-length allele discovery and genotyping that can be extended to other complex immune loci such as MHC class II, killer immunoglobulin-like receptors, and Fc gamma receptors.


September 22, 2019  |  

Novel exons and splice variants in the human antibody heavy chain identified by single cell and single molecule sequencing.

Antibody heavy chains contain a variable and a constant region. The constant region of the antibody heavy chain is encoded by multiple groups of exons which define the isotype and therefore many functional characteristics of the antibody. We performed both single B cell RNAseq and long read single molecule sequencing of antibody heavy chain transcripts and were able to identify novel exons for IGHA1 and IGHA2 as well as novel isoforms for IGHM antibody heavy chain.


September 22, 2019  |  

Improving eukaryotic genome annotation using single molecule mRNA sequencing.

The advantages of Pacific Biosciences (PacBio) single-molecule real-time (SMRT) technology include long reads, low systematic bias, and high consensus read accuracy. Here we use these attributes to improve on the genome annotation of the parasitic hookworm Ancylostoma ceylanicum using PacBio RNA-Seq.We sequenced 192,888 circular consensus sequences (CCS) derived from cDNAs generated using the CloneTech SMARTer system. These SMARTer-SMRT libraries were normalized and size-selected providing a robust population of expressed structural genes for subsequent genome annotation. We demonstrate PacBio mRNA sequences based genome annotation improvement, compared to genome annotation using conventional sequencing-by-synthesis alone, by identifying 1609 (9.2%) new genes, extended the length of 3965 (26.7%) genes and increased the total genomic exon length by 1.9 Mb (12.4%). Non-coding sequence representation (primarily from UTRs based on dT reverse transcription priming) was particularly improved, increasing in total length by fifteen-fold, by increasing both the length and number of UTR exons. In addition, the UTR data provided by these CCS allowed for the identification of a novel SL2 splice leader sequence for A. ceylanicum and an increase in the number and proportion of functionally annotated genes. RNA-seq data also confirmed some of the newly annotated genes and gene features.Overall, PacBio data has supported a significant improvement in gene annotation in this genome, and is an appealing alternative or complementary technique for genome annotation to the other transcript sequencing technologies.


September 22, 2019  |  

The Florida manatee (Trichechus manatus latirostris) immunoglobulin heavy chain suggests the importance of clan III variable segments in repertoire diversity.

Manatees are a vulnerable, charismatic sentinel species from the evolutionarily divergent Afrotheria. Manatee health and resistance to infectious disease is of great concern to conservation groups, but little is known about their immune system. To develop manatee-specific tools for monitoring health, we first must have a general knowledge of how the immunoglobulin heavy (IgH) chain locus is organized and transcriptionally expressed. Using the genomic scaffolds of the Florida manatee (Trichechus manatus latirostris), we characterized the potential IgH segmental diversity and constant region isotypic diversity and performed the first Afrotherian repertoire analysis. The Florida manatee has low V(D)J combinatorial diversity (3744 potential combinations) and few constant region isotypes. They also lack clan III V segments, which may have caused reduced VH segment numbers. However, we found productive somatic hypermutation concentrated in the complementarity determining regions. In conclusion, manatees have limited IGHV clan and combinatorial diversity. This suggests that clan III V segments are essential for maintaining IgH locus diversity. Copyright © 2017 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

A manganese superoxide dismutase (MnSOD) from red lip mullet, Liza haematocheila: Evaluation of molecular structure, immune response, and antioxidant function.

Manganese superoxide dismutase (MnSOD) is a nuclear-encoded antioxidant metalloenzyme. The main function of this enzyme is to dismutase the toxic superoxide anion (O2-) into less toxic hydrogen peroxide (H2O2) and oxygen (O2). Structural analysis of mullet MnSOD (MuMnSOD) was performed using different bioinformatics tools. Pairwise alignment revealed that the protein sequence matched to that derived from Larimichthys crocea with a 95.2% sequence identity. Phylogenetic tree analysis showed that the MuMnSOD was included in the category of teleosts. Multiple sequence alignment showed that a SOD Fe-N domain, SOD Fe-C domain, and Mn/Fe SOD signature were highly conserved among the other examined MnSOD orthologs. Quantitative real-time PCR showed that the highest MuMnSOD mRNA expression level was in blood cells. The highest expression level of MuMnSOD was observed in response to treatment with both Lactococcus garvieae and lipopolysaccharide (LPS) at 6?h post treatment in the head kidney and blood. Potential ROS-scavenging ability of the purified recombinant protein (rMuMnSOD) was examined by the xanthine oxidase assay (XOD assay). The optimum temperature and pH for XOD activity were found to be 25?°C and pH 7, respectively. Relative XOD activity was significantly increased with the dose of rMuMnSOD, revealing its dose dependency. Activity of rMuMnSOD was inhibited by potassium cyanide (KCN) and N-N’-diethyl-dithiocarbamate (DDC). Moreover, expression of MuMnSOD resulted in considerable growth retardation of both gram-positive and gram-negative bacteria. Results of the current study suggest that MuMnSOD acts as an antioxidant enzyme and participates in the immune response in mullet. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

Human and rhesus macaque KIR haplotypes defined by their transcriptomes.

The killer-cell Ig-like receptors (KIRs) play a central role in the immune recognition in infection, pregnancy, and transplantation through their interactions with MHC class I molecules. KIR genes display abundant copy number variation as well as high levels of polymorphism. As a result, it is challenging to characterize this structurally dynamic region. KIR haplotypes have been analyzed in different species using conventional characterization methods, such as Sanger sequencing and Roche/454 pyrosequencing. However, these methods are time-consuming and often failed to define complete haplotypes, or do not reach allele-level resolution. In addition, most analyses were performed on genomic DNA, and thus were lacking substantial information about transcription and its corresponding modifications. In this paper, we present a single-molecule real-time sequencing approach, using Pacific Biosciences Sequel platform to characterize the KIR transcriptomes in human and rhesus macaque (Macaca mulatta) families. This high-resolution approach allowed the identification of novel Mamu-KIR alleles, the extension of reported allele sequences, and the determination of human and macaque KIR haplotypes. In addition, multiple recombinant KIR genes were discovered, all located on contracted haplotypes, which were likely the result of chromosomal rearrangements. The relatively high number of contracted haplotypes discovered might be indicative of selection on small KIR repertoires and/or novel fusion gene products. This next-generation method provides an improved high-resolution characterization of the KIR cluster in humans and macaques, which eventually may aid in a better understanding and interpretation of KIR allele-associated diseases, as well as the immune response in transplantation and reproduction. Copyright © 2018 by The American Association of Immunologists, Inc.


September 22, 2019  |  

Electrosynthesis of commodity chemicals by an autotrophic microbial community.

A microbial community originating from brewery waste produced methane, acetate, and hydrogen when selected on a granular graphite cathode poised at -590 mV versus the standard hydrogen electrode (SHE) with CO(2) as the only carbon source. This is the first report on the simultaneous electrosynthesis of these commodity chemicals and the first description of electroacetogenesis by a microbial community. Deep sequencing of the active community 16S rRNA revealed a dynamic microbial community composed of an invariant Archaea population of Methanobacterium spp. and a shifting Bacteria population. Acetobacterium spp. were the most abundant Bacteria on the cathode when acetogenesis dominated. Methane was generally the dominant product with rates increasing from <1 to 7 mM day(-1) (per cathode liquid volume) and was concomitantly produced with acetate and hydrogen. Acetogenesis increased to >4 mM day(-1) (accumulated to 28.5 mM over 12 days), and methanogenesis ceased following the addition of 2-bromoethanesulfonic acid. Traces of hydrogen accumulated during initial selection and subsequently accelerated to >11 mM day(-1) (versus 0.045 mM day(-1) abiotic production). The hypothesis of electrosynthetic biocatalysis occurring at the microbe-electrode interface was supported by a catalytic wave (midpoint potential of -460 mV versus SHE) in cyclic voltammetry scans of the biocathode, the lack of redox active components in the medium, and the generation of comparatively high amounts of products (even after medium exchange). In addition, the volumetric production rates of these three commodity chemicals are marked improvements for electrosynthesis, advancing the process toward economic feasibility.


September 22, 2019  |  

Accurate characterization of the IFITM locus using MiSeq and PacBio sequencing shows genetic variation in Galliformes.

Interferon inducible transmembrane (IFITM) proteins are effectors of the immune system widely characterized for their role in restricting infection by diverse enveloped and non-enveloped viruses. The chicken IFITM (chIFITM) genes are clustered on chromosome 5 and to date four genes have been annotated, namely chIFITM1, chIFITM3, chIFITM5 and chIFITM10. However, due to poor assembly of this locus in the Gallus Gallus v4 genome, accurate characterization has so far proven problematic. Recently, a new chicken reference genome assembly Gallus Gallus v5 was generated using Sanger, 454, Illumina and PacBio sequencing technologies identifying considerable differences in the chIFITM locus over the previous genome releases.We re-sequenced the locus using both Illumina MiSeq and PacBio RS II sequencing technologies and we mapped RNA-seq data from the European Nucleotide Archive (ENA) to this finalized chIFITM locus. Using SureSelect probes capture probes designed to the finalized chIFITM locus, we sequenced the locus of a different chicken breed, namely a White Leghorn, and a turkey.We confirmed the Gallus Gallus v5 consensus except for two insertions of 5 and 1 base pair within the chIFITM3 and B4GALNT4 genes, respectively, and a single base pair deletion within the B4GALNT4 gene. The pull down revealed a single amino acid substitution of A63V in the CIL domain of IFITM2 compared to Red Jungle fowl and 13, 13 and 11 differences between IFITM1, 2 and 3 of chickens and turkeys, respectively. RNA-seq shows chIFITM2 and chIFITM3 expression in numerous tissue types of different chicken breeds and avian cell lines, while the expression of the putative chIFITM1 is limited to the testis, caecum and ileum tissues.Locus resequencing using these capture probes and RNA-seq based expression analysis will allow the further characterization of genetic diversity within Galliformes.


September 22, 2019  |  

The full transcription map of mouse papillomavirus type 1 (MmuPV1) in mouse wart tissues.

Mouse papillomavirus type 1 (MmuPV1) provides, for the first time, the opportunity to study infection and pathogenesis of papillomaviruses in the context of laboratory mice. In this report, we define the transcriptome of MmuPV1 genome present in papillomas arising in experimentally infected mice using a combination of RNA-seq, PacBio Iso-seq, 5′ RACE, 3′ RACE, primer-walking RT-PCR, RNase protection, Northern blot and in situ hybridization analyses. We demonstrate that the MmuPV1 genome is transcribed unidirectionally from five major promoters (P) or transcription start sites (TSS) and polyadenylates its transcripts at two major polyadenylation (pA) sites. We designate the P7503, P360 and P859 as “early” promoters because they give rise to transcripts mostly utilizing the polyadenylation signal at nt 3844 and therefore can only encode early genes, and P7107 and P533 as “late” promoters because they give rise to transcripts utilizing polyadenylation signals at either nt 3844 or nt 7047, the latter being able to encode late, capsid proteins. MmuPV1 genome contains five splice donor sites and three acceptor sites that produce thirty-six RNA isoforms deduced to express seven predicted early gene products (E6, E7, E1, E1^M1, E1^M2, E2 and E8^E2) and three predicted late gene products (E1^E4, L2 and L1). The majority of the viral early transcripts are spliced once from nt 757 to 3139, while viral late transcripts, which are predicted to encode L1, are spliced twice, first from nt 7243 to either nt 3139 (P7107) or nt 757 to 3139 (P533) and second from nt 3431 to nt 5372. Thirteen of these viral transcripts were detectable by Northern blot analysis, with the P533-derived late E1^E4 transcripts being the most abundant. The late transcripts could be detected in highly differentiated keratinocytes of MmuPV1-infected tissues as early as ten days after MmuPV1 inoculation and correlated with detection of L1 protein and viral DNA amplification. In mature warts, detection of L1 was also found in more poorly differentiated cells, as previously reported. Subclinical infections were also observed. The comprehensive transcription map of MmuPV1 generated in this study provides further evidence that MmuPV1 is similar to high-risk cutaneous beta human papillomaviruses. The knowledge revealed will facilitate the use of MmuPV1 as an animal virus model for understanding of human papillomavirus gene expression, pathogenesis and immunology.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.