September 22, 2019  |  

Universal alternative splicing of noncoding exons.

The human transcriptome is so large, diverse, and dynamic that, even after a decade of investigation by RNA sequencing (RNA-seq), we have yet to resolve its true dimensions. RNA-seq suffers from an expression-dependent bias that impedes characterization of low-abundance transcripts. We performed targeted single-molecule and short-read RNA-seq to survey the transcriptional landscape of a single human chromosome (Hsa21) at unprecedented resolution. Our analysis reaches the lower limits of the transcriptome, identifying a fundamental distinction between protein-coding and noncoding gene content: almost every noncoding exon undergoes alternative splicing, producing a seemingly limitless variety of isoforms. Analysis of syntenic regions of the mouse genome shows that few noncoding exons are shared between human and mouse, yet human splicing profiles are recapitulated on Hsa21 in mouse cells, indicative of regulation by a deeply conserved splicing code. We propose that noncoding exons are functionally modular, with alternative splicing generating an enormous repertoire of potential regulatory RNAs and a rich transcriptional reservoir for gene evolution. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.


September 22, 2019  |  

Cartography of neurexin alternative splicing mapped by single-molecule long-read mRNA sequencing.

Neurexins are evolutionarily conserved presynaptic cell-adhesion molecules that are essential for normal synapse formation and synaptic transmission. Indirect evidence has indicated that extensive alternative splicing of neurexin mRNAs may produce hundreds if not thousands of neurexin isoforms, but no direct evidence for such diversity has been available. Here we use unbiased long-read sequencing of full-length neurexin (Nrxn)1α, Nrxn1β, Nrxn2β, Nrxn3α, and Nrxn3β mRNAs to systematically assess how many sites of alternative splicing are used in neurexins with a significant frequency, and whether alternative splicing events at these sites are independent of each other. In sequencing more than 25,000 full-length mRNAs, we identified a novel, abundantly used alternatively spliced exon of Nrxn1α and Nrxn3α (referred to as alternatively spliced sequence 6) that encodes a 9-residue insertion in the flexible hinge region between the fifth LNS (laminin-α, neurexin, sex hormone-binding globulin) domain and the third EGF-like sequence. In addition, we observed several larger-scale events of alternative splicing that deleted multiple domains and were much less frequent than the canonical six sites of alternative splicing in neurexins. All of the six canonical events of alternative splicing appear to be independent of each other, suggesting that neurexins may exhibit an even larger isoform diversity than previously envisioned and comprise thousands of variants. Our data are consistent with the notion that α-neurexins represent extracellular protein-interaction scaffolds in which different LNS and EGF domains mediate distinct interactions that affect diverse functions and are independently regulated by independent events of alternative splicing.


September 22, 2019  |  

PacBio sequencing and its applications.

Single-molecule, real-time sequencing developed by Pacific BioSciences offers longer read lengths than the second-generation sequencing (SGS) technologies, making it well-suited for unsolved problems in genome, transcriptome, and epigenetics research. The highly contiguous de novo assemblies using PacBio sequencing can close gaps in current reference assemblies and characterize structural variation (SV) in personal genomes. With longer reads, we can sequence through extended repetitive regions and detect mutations, many of which are associated with diseases. Moreover, PacBio transcriptome sequencing is advantageous for the identification of gene isoforms and facilitates reliable discoveries of novel genes and novel isoforms of annotated genes, due to its ability to sequence full-length transcripts or fragments with significant lengths. Additionally, PacBio’s sequencing technique provides information that is useful for the direct detection of base modifications, such as methylation. In addition to using PacBio sequencing alone, many hybrid sequencing strategies have been developed to make use of more accurate short reads in conjunction with PacBio long reads. In general, hybrid sequencing strategies are more affordable and scalable, especially for small laboratories, than using PacBio sequencing alone. The advent of PacBio sequencing has made available much information that could not be obtained via SGS alone. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd. All rights reserved.


September 22, 2019  |  

Quantitative isoform-profiling of highly diversified recognition molecules.

Complex biological systems rely on cell surface cues that govern cellular self-recognition and selective interactions with appropriate partners. Molecular diversification of cell surface recognition molecules through DNA recombination and complex alternative splicing has emerged as an important principle for encoding such interactions. However, the lack of tools to specifically detect and quantify receptor protein isoforms is a major impediment to functional studies. Here we developed a workflow for targeted mass spectrometry by selected reaction monitoring (SRM) that permits quantitative assessment of highly diversified protein families. We apply this workflow to dissect the molecular diversity of the neuronal neurexin receptors and uncover an alternative splicing-dependent recognition code for synaptic ligands.


September 22, 2019  |  

Diversified microbiota of meconium is affected by maternal diabetes status.

This study aimed to assess the diversity of the meconium microbiome and determine whether the bacterial community is affected by maternal diabetes status. The first intestinal discharge (meconium) was collected from 23 newborns stratified by maternal diabetes status: 4 mothers had pre-gestational type 2 diabetes mellitus (DM), including one mother with dizygotic twins, 5 developed gestational diabetes mellitus (GDM), and 13 had no diabetes. The meconium microbiome was profiled using multi-barcode 16S rRNA sequencing followed by taxonomic assignment and diversity analysis. None of the meconium samples was sterile; all contained diversified microbiota. Compared with adult feces, the meconium showed lower species diversity, higher sample-to-sample variation, enrichment of Proteobacteria, and reduction of Bacteroidetes. Among the meconium samples, the taxonomy analyses suggested that the overall bacterial content differed significantly by maternal diabetes status, with the microbiome of the DM group showing higher alpha-diversity than that of the no-diabetes or GDM groups. No global difference was found between babies delivered vaginally and those delivered via Cesarean section. Regression analysis showed that the most robust predictor of meconium microbiota composition was maternal diabetes status that preceded pregnancy. Specifically, Bacteroidetes (phylum) and Parabacteroides (genus) were enriched in the meconium of the DM group compared to the no-diabetes group. Our study provides evidence that meconium contains diversified microbiota and is not affected by the mode of delivery. It also suggests that the meconium microbiome of infants born to mothers with DM is enriched for the same bacterial taxa as those reported in the fecal microbiome of adult DM patients.


September 22, 2019  |  

Bat biology, genomes, and the Bat1K project: To generate chromosome-level genomes for all living bat species.

Bats are unique among mammals, possessing some of the rarest mammalian adaptations, including true self-powered flight, laryngeal echolocation, exceptional longevity, unique immunity, contracted genomes, and vocal learning. They provide key ecosystem services, pollinating tropical plants, dispersing seeds, and controlling insect pest populations, thus driving healthy ecosystems. They account for more than 20% of all living mammalian diversity, and their crown-group evolutionary history dates back to the Eocene. Despite their great numbers and diversity, many species are threatened and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n~1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any interested individuals committed to a better understanding of the genetic and evolutionary mechanisms that underlie the unique adaptations of bats. Our aim is to catalog the unique genetic diversity present in all living bats to better understand the molecular basis of their unique adaptations; uncover their evolutionary history; link genotype with phenotype; and ultimately better understand, promote, and conserve bats. Here we review the unique adaptations of bats and highlight how chromosome-level genome assemblies can uncover the molecular basis of these traits. We present a novel sequencing and assembly strategy and review the striking societal and scientific benefits that will result from the Bat1K initiative.


September 22, 2019  |  

Multiple large inversions and breakpoint rewiring of gene expression in the evolution of the fire ant social supergene.

Supergenes consist of co-adapted loci that segregate together and are associated with adaptive traits. In the fire ant Solenopsis invicta, two ‘social’ supergene variants regulate differences in colony queen number and other traits. Suppressed recombination in this system is maintained, in part, by a greater than 9 Mb inversion, but the supergene is larger. Has the supergene in S. invicta undergone multiple large inversions? The initial gene content of the inverted allele of a supergene would be the same as that of the wild-type allele. So, how did the inversion increase in frequency? To address these questions, we cloned one extreme breakpoint in the fire ant supergene. In doing so, we found a second large (greater than 800 Kb) rearrangement. Furthermore, we determined the temporal order of the two big inversions based on the translocation pattern of a third small fragment. Because the S. invicta supergene lacks evolutionary strata, our finding of multiple inversions may support an introgression model of the supergene. Finally, we showed that one of the inversions swapped the promoter of a breakpoint-adjacent gene, which might have conferred a selective advantage relative to the non-inverted allele. Our findings provide a rare example of gene alterations arising directly from an inversion event. © 2018 The Author(s).


September 22, 2019  |  

Multiplex assessment of protein variant abundance by massively parallel sequencing.

Determining the pathogenicity of genetic variants is a critical challenge, and functional assessment is often the only option. Experimentally characterizing millions of possible missense variants in thousands of clinically important genes requires generalizable, scalable assays. We describe variant abundance by massively parallel sequencing (VAMP-seq), which measures the effects of thousands of missense variants of a protein on intracellular abundance simultaneously. We apply VAMP-seq to quantify the abundance of 7,801 single-amino-acid variants of PTEN and TPMT, proteins in which functional variants are clinically actionable. We identify 1,138 PTEN and 777 TPMT variants that result in low protein abundance, and may be pathogenic or alter drug metabolism, respectively. We observe selection for low-abundance PTEN variants in cancer, and show that p.Pro38Ser, which accounts for ~10% of PTEN missense variants in melanoma, functions via a dominant-negative mechanism. Finally, we demonstrate that VAMP-seq is applicable to other genes, highlighting its generalizability.


September 22, 2019  |  

Human copy number variants are enriched in regions of low mappability.

Copy number variants (CNVs) are known to affect a large portion of the human genome and have been implicated in many diseases. Although whole-genome sequencing (WGS) can help identify CNVs, most analytical methods suffer from limited sensitivity and specificity, especially in regions of low mappability. To address this, we use PopSV, a CNV caller that relies on multiple samples to control for technical variation. We demonstrate that our calls are stable across different types of repeat-rich regions and validate the accuracy of our predictions using orthogonal approaches. Applying PopSV to 640 human genomes, we find that low-mappability regions are approximately 5 times more likely to harbor germline CNVs, in stark contrast to the nearly uniform distribution observed for somatic CNVs in 95 cancer genomes. In addition to known enrichments in segmental duplication and near centromeres and telomeres, we also report that CNVs are enriched in specific types of satellite and in some of the most recent families of transposable elements. Finally, using this comprehensive approach, we identify 3455 regions with recurrent CNVs that were missing from existing catalogs. In particular, we identify 347 genes with a novel exonic CNV in low-mappability regions, including 29 genes previously associated with disease.


September 22, 2019  |  

Methylation of the reelin gene promoter in peripheral blood and its relationship with the cognitive function of schizophrenia patients.

Expression of the reelin gene (RELN) is decreased in the brain of schizophrenia patients, which may underlie observed cognitive abnormalities. It has been suggested that this decrease is caused by hypermethylation of the RELN promoter. The aim of the study was to investigate methylation of the RELN promoter in the peripheral blood of schizophrenia patients and its association with their cognitive deficits. A modified SMRT-BS (single-molecule real-time bisulfite sequencing) was used. We determined the methylation rate of 170 CpG sites within a 1465 bp DNA region containing the entire CpG island in the RELN promoter in 51 schizophrenia patients and 52 healthy controls. All subjects completed a battery of neuropsychological tests. There were no DNA methylation changes associated with schizophrenia. Most CpG sites were unmethylated in both groups. At the same time, the methylation level varied between different regions within the promoter. The methylation level in the area from -258 to -151 bp relative to the RELN transcription start site was a significant predictor of the index of patients’ cognitive functioning when sex, age, smoking, education, and polymorphism rs1858815 were taken into account. The positive correlation between the methylation rate in this region and the cognitive index suggests that hypomethylation of the RELN promoter could contribute to the development of cognitive deficits in schizophrenia.


September 21, 2019  |  

Detecting AGG interruptions in females with a FMR1 premutation by long-read Single-Molecule Sequencing: A 1 year clinical experience.

Fragile X syndrome arises from the FMR1 CGG expansion of a premutation (55-200 repeats) to a full mutation allele (>200 repeats) and is the most frequent cause of inherited X-linked intellectual disability. The risk for a premutation to expand to a full mutation allele depends on the repeat length and on the AGG triplets interrupting this repeat. In genetic counseling it is important to have information on both these parameters to provide an accurate risk estimate to women who carry a premutation allele and are considering having children. For example, if the risk is small a woman might opt for a natural pregnancy followed by prenatal diagnosis, while she might opt for preimplantation genetic diagnosis (PGD) if the risk is high. Unfortunately, the detection of AGG interruptions was previously hampered by technical difficulties that complicated their use in diagnostics. Therefore we recently developed, validated and implemented a new methodology which uses long-read single-molecule sequencing to identify AGG interruptions in females with an FMR1 premutation. Here we report on the assets of AGG interruption detection by sequencing and the impact of implementing the assay on genetic counseling.
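
The two parameters named in this abstract (CGG repeat length and the number of AGG interruptions) can be illustrated with a minimal Python sketch. It is not the authors' assay or analysis pipeline. The function names, the label for alleles below 55 repeats, and the assumption that the repeat tract is supplied as an in-frame string of triplets are all hypothetical; only the 55-200 and >200 repeat thresholds come from the abstract.

```python
def classify_fmr1_allele(cgg_repeats: int) -> str:
    """Classify an allele using only the thresholds quoted in the abstract."""
    if cgg_repeats < 0:
        raise ValueError("repeat count cannot be negative")
    if cgg_repeats > 200:
        return "full mutation"
    if cgg_repeats >= 55:
        return "premutation"
    # The abstract gives no categories below 55 repeats, so none are invented here.
    return "below premutation range"


def count_agg_interruptions(repeat_tract: str) -> int:
    """Count in-frame AGG triplets in a repeat tract (hypothetical input format).

    The tract is assumed to start on a triplet boundary; trailing bases that
    do not fill a complete triplet are ignored.
    """
    tract = repeat_tract.upper()
    usable = len(tract) - len(tract) % 3
    triplets = [tract[i:i + 3] for i in range(0, usable, 3)]
    return sum(1 for triplet in triplets if triplet == "AGG")


# Illustrative tract: 9 CGG, AGG, 9 CGG, AGG, 40 CGG (made-up example data).
example_tract = "CGG" * 9 + "AGG" + "CGG" * 9 + "AGG" + "CGG" * 40
total_triplets = len(example_tract) // 3
print(classify_fmr1_allele(total_triplets), count_agg_interruptions(example_tract))
```

How these two values translate into an expansion risk estimate is not stated in the abstract, so the sketch stops at reporting them.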

