Menu
September 22, 2019

The state of play in higher eukaryote gene annotation.

A genome sequence is worthless if it cannot be deciphered; therefore, efforts to describe – or ‘annotate’ – genes began as soon as DNA sequences became available. Whereas early work focused on individual protein-coding genes, the modern genomic ocean is a complex maelstrom of alternative splicing, non-coding transcription and pseudogenes. Scientists – from clinicians to evolutionary biologists – need to navigate these waters, and this has led to the design of high-throughput, computationally driven annotation projects. The catalogues that are being produced are key resources for genome exploration, especially as they become integrated with expression, epigenomic and variation data sets. Their creation, however, remains challenging.


September 22, 2019

Targeted combinatorial alternative splicing generates brain region-specific repertoires of neurexins.

Molecular diversity of surface receptors has been hypothesized to provide a mechanism for selective synaptic connectivity. Neurexins are highly diversified receptors that drive the morphological and functional differentiation of synapses. Using a single cDNA sequencing approach, we detected 1,364 unique neurexin-a and 37 neurexin-ß mRNAs produced by alternative splicing of neurexin pre-mRNAs. This molecular diversity results from near-exhaustive combinatorial use of alternative splice insertions in Nrxn1a and Nrxn2a. By contrast, Nrxn3a exhibits several highly stereotyped exon selections that incorporate novel elements for posttranscriptional regulation of a subset of transcripts. Complexity of Nrxn1a repertoires correlates with the cellular complexity of neuronal tissues, and a specific subset of isoforms is enriched in a purified cell type. Our analysis defines the molecular diversity of a critical synaptic receptor and provides evidence that neurexin diversity is linked to cellular diversity in the nervous system. Copyright © 2014 Elsevier Inc. All rights reserved.


September 22, 2019

Full-length transcriptome of Misgurnus anguillicaudatus provides insights into evolution of genus Misgurnus.

Reconstruction and annotation of transcripts, particularly for a species without reference genome, plays a critical role in gene discovery, investigation of genomic signatures, and genome annotation in the pre-genomic era. This study generated 33,330 full-length transcripts of diploid M. anguillicaudatus using PacBio SMRT Sequencing. A total of 6,918 gene families were identified with two or more isoforms, and 26,683 complete ORFs with an average length of 1,497?bp were detected. Totally, 1,208 high-confidence lncRNAs were identified, and most of these appeared to be precursor transcripts of miRNAs or snoRNAs. Phylogenetic tree of the Misgurnus species was inferred based on the 1,905 single copy orthologous genes. The tetraploid and diploid M. anguillicaudatus grouped into a clade, and M. bipartitus showed a closer relationship with the M. anguillicaudatus. The overall evolutionary rates of tetraploid M. anguillicaudatus were significantly higher than those of other Misgurnus species. Meanwhile, 28 positively selected genes were identified in M. anguillicaudatus clade. These positively selected genes may play critical roles in the adaptation to various habitat environments for M. anguillicaudatus. This study could facilitate further exploration of the genomic signatures of M. anguillicaudatus and provide potential insights into unveiling the evolutionary history of tetraploid loach.


September 22, 2019

Application of circular consensus sequencing and network analysis to characterize the bovine IgG repertoire.

Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Next generation sequencing methods provide unique approaches to a number of immuno-based research areas including antibody discovery and engineering, disease surveillance, and host immune response to vaccines. In particular, single-molecule circular consensus sequencing permits the sequencing of antibody repertoires at previously unattainable depths of coverage and accuracy. We approached the bovine immunoglobulin G (IgG) repertoire with the objective of characterizing diversity of expressed IgG transcripts. Here we present single-molecule real-time sequencing data of expressed IgG heavy-chain repertoires of four individual cattle. We describe the diversity observed within antigen binding regions and visualize this diversity using a network-based approach.We generated 49,945 high quality cDNA sequences, each spanning the entire IgG variable region from four Bos taurus calves. From these sequences we identified 49,521 antigen binding regions using the automated Paratome web server. Approximately 9% of all unique complementarity determining 2 (CDR2) sequences were of variable lengths. A bimodal distribution of unique CDR3 sequence lengths was observed, with common lengths of 5-6 and 21-25 amino acids. The average number of cysteine residues in CDR3s increased with CDR3 length and we observed that cysteine residues were centrally located in CDR3s. We identified 19 extremely long CDR3 sequences (up to 62 amino acids in length) within IgG transcripts. Network analyses revealed distinct patterns among the expressed IgG antigen binding repertoires of the examined individuals.We utilized circular consensus sequencing technology to provide baseline data of the expressed bovine IgG repertoire that can be used for future studies important to livestock research. Somatic mutation resulting in base insertions and deletions in CDR2 further diversifies the bovine antibody repertoire. In contrast to previous studies, our data indicate that unusually long CDR3 sequences are not unique to IgM antibodies in cattle. Centrally located cysteine residues in bovine CDR3s provide further evidence that disulfide bond formation is likely of structural importance. We hypothesize that network or cluster-based analyses of expressed antibody repertoires from controlled challenge experiments will help identify novel natural antigen binding solutions to specific pathogens of interest.


September 22, 2019

A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.

It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.


September 22, 2019

Investigating bacterial population structure and dynamics in traditional koumiss from Inner Mongolia using single molecule real-time sequencing.

Koumiss is considered as a complete dairy product high in nutrients and with medicinal properties. The bacterial communities involved in production of koumiss play a crucial role in the fermentation cycle. To reveal bacterial biodiversity in koumiss and the dynamics of succession in bacterial populations during fermentation, 22 samples were collected from 5 sampling sites and the full length of the 16S ribosomal RNA genes sequenced using single molecule real-time sequencing technology. One hundred forty-eight species were identified from 82 bacterial genera and 8 phyla. These results suggested that the structural difference in the bacterial community could be attributed to geographical location. The most significant difference in bacterial composition occurred in samples from group D compared with other groups. The sampling location of group D was distant from the city and maintained the primitive local nomadic life. The dynamics of succession in bacterial communities showed that Lactobacillus helveticus increased in abundance from 0 to 9h and reached its peak at 9h and then decreased. In contrast, Enterococcus faecalis, Enterococcus durans, and Enterococcus casseliflavus increased gradually throughout the fermentation process, and reached a maximum after 24h. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Analyses of intestinal microbiota: culture versus sequencing.

Analyzing human as well as animal microbiota composition has gained growing interest because structural components and metabolites of microorganisms fundamentally influence all aspects of host physiology. Originally dominated by culture-dependent methods for exploring these ecosystems, the development of molecular techniques such as high throughput sequencing has dramatically increased our knowledge. Because many studies of the microbiota are based on the bacterial 16S ribosomal RNA (rRNA) gene targets, they can, at least in principle, be compared to determine the role of the microbiome composition for developmental processes, host metabolism, and physiology as well as different diseases. In our review, we will summarize differences and pitfalls in current experimental protocols, including all steps from nucleic acid extraction to bioinformatical analysis which may produce variation that outweighs subtle biological differences. Future developments, such as integration of metabolomic, transcriptomic, and metagenomic data sets and standardization of the procedures, will be discussed. © The Author 2015. Published by Oxford University Press on behalf of the Institute for Laboratory Animal Research. All rights reserved. For permissions, please email: journals.permissions@oup.com.


September 22, 2019

Alternative splice variants of AID are not stoichiometrically present at the protein level in chronic lymphocytic leukemia

Activation-induced deaminase (AID) is a DNA-mutating enzyme that mediates class-switch recombination as well as somatic hypermutation of antibody genes in B cells. Due to off-target activity, AID is implicated in lymphoma development by introducing genome-wide DNA damage and initiating chromosomal translocations such as c-myc/IgH. Several alternative splice transcripts of AID have been reported in activated B cells as well as malignant B cells such as chronic lymphocytic leukemia (CLL). As most commercially available antibodies fail to recognize alternative splice variants, their abundance in vivo, and hence their biological significance, has not been determined. In this study, we assessed the protein levels of AID splice isoforms by introducing an AID splice reporter construct into cell lines and primary CLL cells from patients as well as from WT and TCL1(tg) C57BL/6 mice (where TCL1 is T-cell leukemia/lymphoma 1). The splice construct is 5′-fused to a GFP-tag, which is preserved in all splice isoforms and allows detection of translated protein. Summarizing, we show a thorough quantification of alternatively spliced AID transcripts and demonstrate that the corresponding protein abundances, especially those of splice variants AID-ivs3 and AID-?E4, are not stoichiometrically equivalent. Our data suggest that enhanced proteasomal degradation of low-abundance proteins might be causative for this discrepancy. © 2013 The Authors. European Journal of Immunology published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


September 22, 2019

Molecular mechanisms of acclimatization to phosphorus starvation and recovery underlying full-length transcriptome profiling in barley (Hordeum vulgare L.).

A lack of phosphorus (P) in plants can severely constrain growth and development. Barley, one of the earliest domesticated crops, is extensively planted in poor soil around the world. To date, the molecular mechanisms of enduring low phosphorus, at the transcriptional level, in barley are still unclear. In the present study, two different barley genotypes (GN121 and GN42)-with contrasting phosphorus efficiency-were used to reveal adaptations to low phosphorus stress, at three time points, at the morphological, physiological, biochemical, and transcriptome level. GN121 growth was less affected by phosphorus starvation and recovery than that of GN42. The biomass and inorganic phosphorus concentration of GN121 and GN42 declined under the low phosphorus-induced stress and increased after recovery with normal phosphorus. However, the range of these parameters was higher in GN42 than in GN121. Subsequently, a more complete genome annotation was obtained by correcting with the data sequenced on Illumina HiSeq X 10 and PacBio RSII SMRT platform. A total of 6,182 and 5,270 differentially expressed genes (DEGs) were identified in GN121 and GN42, respectively. The majority of these DEGs were involved in phosphorus metabolism such as phospholipid degradation, hydrolysis of phosphoric enzymes, sucrose synthesis, phosphorylation/dephosphorylation and post-transcriptional regulation; expression of these genes was significantly different between GN121 and GN42. Specifically, six and seven DEGs were annotated as phosphorus transporters in roots and leaves, respectively. Furthermore, a putative model was constructed relying on key metabolic pathways related to phosphorus to illustrate the higher phosphorus efficiency of GN121 compared to GN42 under low phosphorus conditions. Results from this study provide a multi-transcriptome database and candidate genes for further study on phosphorus use efficiency (PUE).


September 22, 2019

SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes.

Numerous methods have been developed to analyse RNA sequencing (RNA-seq) data, but most rely on the availability of a reference genome, making them unsuitable for non-model organisms. Here we present superTranscripts, a substitute for a reference genome, where each gene with multiple transcripts is represented by a single sequence. The Lace software is provided to construct superTranscripts from any set of transcripts, including de novo assemblies. We demonstrate how superTranscripts enable visualisation, variant detection and differential isoform detection in non-model organisms. We further use Lace to combine reference and assembled transcriptomes for chicken and recover hundreds of gaps in the reference genome.


September 22, 2019

Somatic APP gene recombination in Alzheimer’s disease and normal neurons.

The diversity and complexity of the human brain are widely assumed to be encoded within a constant genome. Somatic gene recombination, which changes germline DNA sequences to increase molecular diversity, could theoretically alter this code but has not been documented in the brain, to our knowledge. Here we describe recombination of the Alzheimer’s disease-related gene APP, which encodes amyloid precursor protein, in human neurons, occurring mosaically as thousands of variant ‘genomic cDNAs’ (gencDNAs). gencDNAs lacked introns and ranged from full-length cDNA copies of expressed, brain-specific RNA splice variants to myriad smaller forms that contained intra-exonic junctions, insertions, deletions, and/or single nucleotide variations. DNA in situ hybridization identified gencDNAs within single neurons that were distinct from wild-type loci and absent from non-neuronal cells. Mechanistic studies supported neuronal ‘retro-insertion’ of RNA to produce gencDNAs; this process involved transcription, DNA breaks, reverse transcriptase activity, and age. Neurons from individuals with sporadic Alzheimer’s disease showed increased gencDNA diversity, including eleven mutations known to be associated with familial Alzheimer’s disease that were absent from healthy neurons. Neuronal gene recombination may allow ‘recording’ of neural activity for selective ‘playback’ of preferred gene variants whose expression bypasses splicing; this has implications for cellular diversity, learning and memory, plasticity, and diseases of the human brain.


September 22, 2019

Assessment of the physicochemical properties and bacterial composition of Lactobacillus plantarum and Enterococcus faecium-fermented Astragalus membranaceus using single molecule, real-time sequencing technology.

We investigated if fermentation with probiotic cultures could improve the production of health-promoting biological compounds in Astragalus membranaceus. We tested the probiotics Enterococcus faecium, Lactobacillus plantarum and Enterococcus faecium?+?Lactobacillus plantarum and applied PacBio single molecule, real-time sequencing technology (SMRT) to evaluate the quality of Astragalus fermentation. We found that the production rates of acetic acid, methylacetic acid, aethyl acetic acid and lactic acid using E. faecium?+?L. plantarum were 1866.24?mg/kg on day 15, 203.80?mg/kg on day 30, 996.04?mg/kg on day 15, and 3081.99?mg/kg on day 20, respectively. Other production rates were: polysaccharides, 9.43%, 8.51%, and 7.59% on day 10; saponins, 19.6912?mg/g, 21.6630?mg/g and 20.2084?mg/g on day 15; and flavonoids, 1.9032?mg/g, 2.0835?mg/g, and 1.7086?mg/g on day 20 using E. faecium, L. plantarum and E. faecium?+?L. plantarum, respectively. SMRT was used to analyze microbial composition, and we found that E. faecium and L. plantarum were the most prevalent species after fermentation for 3 days. E. faecium?+?L. plantarum gave more positive effects than single strains in the Astragalus solid state fermentation process. Our data demonstrated that the SMRT sequencing platform is applicable to quality assessment of Astragalus fermentation.


September 22, 2019

The human microbiome and understanding the 16S rRNA gene in translational nursing science.

As more is understood regarding the human microbiome, it is increasingly important for nurse scientists and healthcare practitioners to analyze these microbial communities and their role in health and disease. 16S rRNA sequencing is a key methodology in identifying these bacterial populations that has recently transitioned from use primarily in research to having increased utility in clinical settings.The objectives of this review are to (a) describe 16S rRNA sequencing and its role in answering research questions important to nursing science; (b) provide an overview of the oral, lung, and gut microbiomes and relevant research; and (c) identify future implications for microbiome research and 16S sequencing in translational nursing science.Sequencing using the 16S rRNA gene has revolutionized research and allowed scientists to easily and reliably characterize complex bacterial communities. This type of research has recently entered the clinical setting, one of the best examples involving the use of 16S sequencing to identify resistant pathogens, thereby improving the accuracy of bacterial identification in infection control. Clinical microbiota research and related requisite methods are of particular relevance to nurse scientists-individuals uniquely positioned to utilize these techniques in future studies in clinical settings.


September 22, 2019

Localized electron transfer rates and microelectrode-based enrichment of microbial communities within a phototrophic microbial mat.

Phototrophic microbial mats frequently exhibit sharp, light-dependent redox gradients that regulate microbial respiration on specific electron acceptors as a function of depth. In this work, a benthic phototrophic microbial mat from Hot Lake, a hypersaline, epsomitic lake located near Oroville in north-central Washington, was used to develop a microscale electrochemical method to study local electron transfer processes within the mat. To characterize the physicochemical variables influencing electron transfer, we initially quantified redox potential, pH, and dissolved oxygen gradients by depth in the mat under photic and aphotic conditions. We further demonstrated that power output of a mat fuel cell was light-dependent. To study local electron transfer processes, we deployed a microscale electrode (microelectrode) with tip size ~20 µm. To enrich a subset of microorganisms capable of interacting with the microelectrode, we anodically polarized the microelectrode at depth in the mat. Subsequently, to characterize the microelectrode-associated community and compare it to the neighboring mat community, we performed amplicon sequencing of the V1-V3 region of the 16S gene. Differences in Bray-Curtis beta diversity, illustrated by large changes in relative abundance at the phylum level, suggested successful enrichment of specific mat community members on the microelectrode surface. The microelectrode-associated community exhibited substantially reduced alpha diversity and elevated relative abundances of Prosthecochloris, Loktanella, Catellibacterium, other unclassified members of Rhodobacteraceae, Thiomicrospira, and Limnobacter, compared with the community at an equivalent depth in the mat. Our results suggest that local electron transfer to an anodically polarized microelectrode selected for a specific microbial population, with substantially more abundance and diversity of sulfur-oxidizing phylotypes compared with the neighboring mat community.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.