Menu
September 22, 2019

Transcriptional fates of human-specific segmental duplications in brain.

Despite the importance of duplicate genes for evolutionary adaptation, accurate gene annotation is often incomplete, incorrect, or lacking in regions of segmental duplication. We developed an approach combining long-read sequencing and hybridization capture to yield full-length transcript information and confidently distinguish between nearly identical genes/paralogs. We used biotinylated probes to enrich for full-length cDNA from duplicated regions, which were then amplified, size-fractionated, and sequenced using single-molecule, long-read sequencing technology, permitting us to distinguish between highly identical genes by virtue of multiple paralogous sequence variants. We examined 19 gene families as expressed in developing and adult human brain, selected for their high sequence identity (average >99%) and overlap with human-specific segmental duplications (SDs). We characterized the transcriptional differences between related paralogs to better understand the birth-death process of duplicate genes and particularly how the process leads to gene innovation. In 48% of the cases, we find that the expressed duplicates have changed substantially from their ancestral models due to novel sites of transcription initiation, splicing, and polyadenylation, as well as fusion transcripts that connect duplication-derived exons with neighboring genes. We detect unannotated open reading frames in genes currently annotated as pseudogenes, while relegating other duplicates to nonfunctional status. Our method significantly improves gene annotation, specifically defining full-length transcripts, isoforms, and open reading frames for new genes in highly identical SDs. The approach will be more broadly applicable to genes in structurally complex regions of other genomes where the duplication process creates novel genes important for adaptive traits.© 2018 Dougherty et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

The state of play in higher eukaryote gene annotation.

A genome sequence is worthless if it cannot be deciphered; therefore, efforts to describe – or ‘annotate’ – genes began as soon as DNA sequences became available. Whereas early work focused on individual protein-coding genes, the modern genomic ocean is a complex maelstrom of alternative splicing, non-coding transcription and pseudogenes. Scientists – from clinicians to evolutionary biologists – need to navigate these waters, and this has led to the design of high-throughput, computationally driven annotation projects. The catalogues that are being produced are key resources for genome exploration, especially as they become integrated with expression, epigenomic and variation data sets. Their creation, however, remains challenging.


September 22, 2019

Multiscale patterns and drivers of arbuscular mycorrhizal fungal communities in the roots and root-associated soil of a wild perennial herb.

Arbuscular mycorrhizal (AM) fungi form diverse communities and are known to influence above-ground community dynamics and biodiversity. However, the multiscale patterns and drivers of AM fungal composition and diversity are still poorly understood. We sequenced DNA markers from roots and root-associated soil from Plantago lanceolata plants collected across multiple spatial scales to allow comparison of AM fungal communities among neighbouring plants, plant subpopulations, nearby plant populations, and regions. We also measured soil nutrients, temperature, humidity, and community composition of neighbouring plants and nonAM root-associated fungi. AM fungal communities were already highly dissimilar among neighbouring plants (c. 30 cm apart), albeit with a high variation in the degree of similarity at this small spatial scale. AM fungal communities were increasingly, and more consistently, dissimilar at larger spatial scales. Spatial structure and environmental drivers explained a similar percentage of the variation, from 7% to 25%. A large fraction of the variation remained unexplained, which may be a result of unmeasured environmental variables, species interactions and stochastic processes. We conclude that AM fungal communities are highly variable among nearby plants. AM fungi may therefore play a major role in maintaining small-scale variation in community dynamics and biodiversity.© 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.


September 22, 2019

Ecological genomics of tropical trees: how local population size and allelic diversity of resistance genes relate to immune responses, cosusceptibility to pathogens, and negative density dependence

In tropical forests, rarer species show increased sensitivity to species-specific soil pathogens and more negative effects of conspecific density on seedling survival (NDD). These patterns suggest a connection between ecology and immunity, perhaps because small population size disproportionately reduces genetic diversity of hyperdiverse loci such as immunity genes. In an experiment examining seedling roots from six species in one tropical tree community, we found that smaller populations have reduced amino acid diversity in pathogen resistance (R) genes but not the transcriptome in general. Normalized R gene amino acid diversity varied with local abundance and prior measures of differences in sensitivity to conspecific soil and NDD. After exposure to live soil, species with lower R gene diversity had reduced defence gene induction, more cosusceptibility of maternal cohorts to colonization by potentially pathogenic fungi, reduced root growth arrest (an R gene-mediated response) and their root-associated fungi showed lower induction of self-defence (antioxidants). Local abundance was not related to the ability to induce immune responses when pathogen recognition was bypassed by application of salicylic acid, a phytohormone that activates defence responses downstream of R gene signalling. These initial results support the hypothesis that smaller local tree populations have reduced R gene diversity and recognition-dependent immune responses, along with greater cosusceptibility to species-specific pathogens that may facilitate disease transmission and NDD. Locally rare species may be less able to increase their equilibrium abundance without genetic boosts to defence via immigration of novel R gene alleles from a larger and more diverse regional population.


September 22, 2019

Universal alternative splicing of noncoding exons.

The human transcriptome is so large, diverse, and dynamic that, even after a decade of investigation by RNA sequencing (RNA-seq), we have yet to resolve its true dimensions. RNA-seq suffers from an expression-dependent bias that impedes characterization of low-abundance transcripts. We performed targeted single-molecule and short-read RNA-seq to survey the transcriptional landscape of a single human chromosome (Hsa21) at unprecedented resolution. Our analysis reaches the lower limits of the transcriptome, identifying a fundamental distinction between protein-coding and noncoding gene content: almost every noncoding exon undergoes alternative splicing, producing a seemingly limitless variety of isoforms. Analysis of syntenic regions of the mouse genome shows that few noncoding exons are shared between human and mouse, yet human splicing profiles are recapitulated on Hsa21 in mouse cells, indicative of regulation by a deeply conserved splicing code. We propose that noncoding exons are functionally modular, with alternative splicing generating an enormous repertoire of potential regulatory RNAs and a rich transcriptional reservoir for gene evolution. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Interpreting microbial biosynthesis in the genomic age: Biological and practical considerations.

Genome mining has become an increasingly powerful, scalable, and economically accessible tool for the study of natural product biosynthesis and drug discovery. However, there remain important biological and practical problems that can complicate or obscure biosynthetic analysis in genomic and metagenomic sequencing projects. Here, we focus on limitations of available technology as well as computational and experimental strategies to overcome them. We review the unique challenges and approaches in the study of symbiotic and uncultured systems, as well as those associated with biosynthetic gene cluster (BGC) assembly and product prediction. Finally, to explore sequencing parameters that affect the recovery and contiguity of large and repetitive BGCs assembled de novo, we simulate Illumina and PacBio sequencing of the Salinispora tropica genome focusing on assembly of the salinilactam (slm) BGC.


September 22, 2019

First insights into the nature and evolution of antisense transcription in nematodes.

The development of multicellular organisms is coordinated by various gene regulatory mechanisms that ensure correct spatio-temporal patterns of gene expression. Recently, the role of antisense transcription in gene regulation has moved into focus of research. To characterize genome-wide patterns of antisense transcription and to study their evolutionary conservation, we sequenced a strand-specific RNA-seq library of the nematode Pristionchus pacificus.We identified 1112 antisense configurations of which the largest group represents 465 antisense transcripts (ASTs) that are fully embedded in introns of their host genes. We find that most ASTs show homology to protein-coding genes and are overrepresented in proteomic data. Together with the finding, that expression levels of ASTs and host genes are uncorrelated, this indicates that most ASTs in P. pacificus do not represent non-coding RNAs and do not exhibit regulatory functions on their host genes. We studied the evolution of antisense gene pairs across 20 nematode genomes, showing that the majority of pairs is lineage-specific and even the highly conserved vps-4, ddx-27, and sel-2 loci show abundant structural changes including duplications, deletions, intron gains and loss of antisense transcription. In contrast, host genes in general, are remarkably conserved and encode exceptionally long introns leading to unusually large blocks of conserved synteny.Our study has shown that in P. pacificus antisense transcription as such does not define non-coding RNAs but is rather a feature of highly conserved genes with long introns. We hypothesize that the presence of regulatory elements imposes evolutionary constraint on the intron length, but simultaneously, their large size makes them a likely target for translocation of genomic elements including protein-coding genes that eventually end up as ASTs.


September 22, 2019

Plant 24-nt reproductive phasiRNAs from intramolecular duplex mRNAs in diverse monocots.

In grasses, two pathways that generate diverse and numerous 21-nt (premeiotic) and 24-nt (meiotic) phased siRNAs are highly enriched in anthers, the male reproductive organs. These “phasiRNAs” are analogous to mammalian piRNAs, yet their functions and evolutionary origins remain largely unknown. The 24-nt meiotic phasiRNAs have only been described in grasses, wherein their biogenesis is dependent on a specialized Dicer (DCL5). To assess how evolution gave rise to this pathway, we examined reproductive phasiRNA pathways in nongrass monocots: garden asparagus, daylily, and lily. The common ancestors of these species diverged approximately 115-117 million years ago (MYA). We found that premeiotic 21-nt and meiotic 24-nt phasiRNAs were abundant in all three species and displayed spatial localization and temporal dynamics similar to grasses. The miR2275-triggered pathway was also present, yielding 24-nt reproductive phasiRNAs, and thus originated more than 117 MYA. In asparagus, unlike in grasses, these siRNAs are largely derived from inverted repeats (IRs); analyses in lily identified thousands of precursor loci, and many were also predicted to form foldback substrates for Dicer processing. Additionally, reproductive phasiRNAs were present in female reproductive organs and thus may function in both male and female germinal development. These data describe several distinct mechanisms of production for 24-nt meiotic phasiRNAs and provide new insights into the evolution of reproductive phasiRNA pathways in monocots.© 2018 Kakrana et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Melanization of mycorrhizal fungal necromass structures microbial decomposer communities

Mycorrhizal fungal necromass is increasingly recognized as an important contributor to soil organic carbon pools, particularly in forest ecosystems. While its decomposition rate is primarily determined by biochemical composition, how traits such as melanin content affect the structure of necromass decomposer communities remains poorly understood. To assess the role of biochemical traits on microbial decomposer community composition and functioning, we incubated melanized and non-melanized necromass of the mycorrhizal fungus Meliniomyces bicolor in Pinus- and Quercus-dominated forests in Minnesota, USA and then assessed the associated fungal and bacterial decomposer communities after 1, 2 and 3 months using high-throughput sequencing. Melanized necromass decomposed significantly slower than non-melanized necromass in both forests. The structure of the microbial decomposer communities depended significantly on necromass melanin content, although the effect was stronger for fungi than bacteria. On non-melanized necromass, fungal communities were dominated by r-selected ascomycete and mucoromycete microfungi early and then replaced by basidiomycete ectomycorrhizal fungi, while on melanized necromass these groups were co-dominant throughout the incubation. Bacterial communities were dominated by both specialist mycophageous and generalist taxa. Synthesis. Our results indicate that necromass biochemistry not only strongly affects rates of decomposition but also the structure of the associated decomposer communities. Furthermore, the observed colonization patterns suggest that fungi, and particularly ectomycorrhizal fungi, may play a more important role in necromass decomposition than previously recognized.


September 22, 2019

Cow-to-mouse fecal transplantations suggest intestinal microbiome as one cause of mastitis.

Mastitis, which affects nearly all lactating mammals including human, is generally thought to be caused by local infection of the mammary glands. For treatment, antibiotics are commonly prescribed, which however are of concern in both treatment efficacy and neonate safety. Here, using bovine mastitis which is the most costly disease in the dairy industry as a model, we showed that intestinal microbiota alone can lead to mastitis.Fecal microbiota transplantation (FMT) from mastitis, but not healthy cows, to germ-free (GF) mice resulted in mastitis symptoms in mammary gland and inflammations in serum, spleen, and colon. Probiotic intake in parallel with FMT from diseased cows led to relieved mastitis symptoms in mice, by shifting the murine intestinal microbiota to a state that is functionally distinct from either healthy or diseased microbiota yet structurally similar to the latter. Despite conservation in mastitis symptoms, diseased cows and mice shared few mastitis-associated bacterial organismal or functional markers, suggesting striking divergence in mastitis-associated intestinal microbiota among lactating mammals. Moreover, an “amplification effect” of disease-health distinction in both microbiota structure and function was apparent during the cow-to-mouse FMT.Hence, dysbiosis of intestinal microbiota may be one cause of mastitis, and probiotics that restore intestinal microbiota function are an effective and safe strategy to treat mastitis.


September 22, 2019

Quantitative metaproteomics highlight the metabolic contributions of uncultured phylotypes in a thermophilic anaerobic digester.

In this study, we used multiple meta-omic approaches to characterize the microbial community and the active metabolic pathways of a stable industrial biogas reactor with food waste as the dominant feedstock, operating at thermophilic temperatures (60°C) and elevated levels of free ammonia (367 mg/liter NH3-N). The microbial community was strongly dominated (76% of all 16S rRNA amplicon sequences) by populations closely related to the proteolytic bacterium Coprothermobacter proteolyticus. Multiple Coprothermobacter-affiliated strains were detected, introducing an additional level of complexity seldom explored in biogas studies. Genome reconstructions provided metabolic insight into the microbes that performed biomass deconstruction and fermentation, including the deeply branching phyla Dictyoglomi and Planctomycetes and the candidate phylum “Atribacteria” These biomass degraders were complemented by a synergistic network of microorganisms that convert key fermentation intermediates (fatty acids) via syntrophic interactions with hydrogenotrophic methanogens to ultimately produce methane. Interpretation of the proteomics data also suggested activity of a Methanosaeta phylotype acclimatized to high ammonia levels. In particular, we report multiple novel phylotypes proposed as syntrophic acetate oxidizers, which also exert expression of enzymes needed for both the Wood-Ljungdahl pathway and ß-oxidation of fatty acids to acetyl coenzyme A. Such an arrangement differs from known syntrophic oxidizing bacteria and presents an interesting hypothesis for future studies. Collectively, these findings provide increased insight into active metabolic roles of uncultured phylotypes and presents new synergistic relationships, both of which may contribute to the stability of the biogas reactor.Biogas production through anaerobic digestion of organic waste provides an attractive source of renewable energy and a sustainable waste management strategy. A comprehensive understanding of the microbial community that drives anaerobic digesters is essential to ensure stable and efficient energy production. Here, we characterize the intricate microbial networks and metabolic pathways in a thermophilic biogas reactor. We discuss the impact of frequently encountered microbial populations as well as the metabolism of newly discovered novel phylotypes that seem to play distinct roles within key microbial stages of anaerobic digestion in this stable high-temperature system. In particular, we draft a metabolic scenario whereby multiple uncultured syntrophic acetate-oxidizing bacteria are capable of syntrophically oxidizing acetate as well as longer-chain fatty acids (via the ß-oxidation and Wood-Ljundahl pathways) to hydrogen and carbon dioxide, which methanogens subsequently convert to methane. Copyright © 2016 American Society for Microbiology.


September 22, 2019

A single-cell genome for Thiovulum sp.

We determined a significant fraction of the genome sequence of a representative of Thiovulum, the uncultivated genus of colorless sulfur Epsilonproteobacteria, by analyzing the genome sequences of four individual cells collected from phototrophic mats from Elkhorn Slough, California. These cells were isolated utilizing a microfluidic laser-tweezing system, and their genomes were amplified by multiple-displacement amplification prior to sequencing. Thiovulum is a gradient bacterium found at oxic-anoxic marine interfaces and noted for its distinctive morphology and rapid swimming motility. The genomic sequences of the four individual cells were assembled into a composite genome consisting of 221 contigs covering 2.083 Mb including 2,162 genes. This single-cell genome represents a genomic view of the physiological capabilities of isolated Thiovulum cells. Thiovulum is the second-fastest bacterium ever observed, swimming at 615 µm/s, and this genome shows that this rapid swimming motility is a result of a standard flagellar machinery that has been extensively characterized in other bacteria. This suggests that standard flagella are capable of propelling bacterial cells at speeds much faster than typically thought. Analysis of the genome suggests that naturally occurring Thiovulum populations are more diverse than previously recognized and that studies performed in the past probably address a wide range of unrecognized genotypic and phenotypic diversities of Thiovulum. The genome presented in this article provides a basis for future isolation-independent studies of Thiovulum, where single-cell and metagenomic tools can be used to differentiate between different Thiovulum genotypes.


September 22, 2019

Genomics and host specialization of honey bee and bumble bee gut symbionts.

Gilliamella apicola and Snodgrassella alvi are dominant members of the honey bee (Apis spp.) and bumble bee (Bombus spp.) gut microbiota. We generated complete genomes of the type strains G. apicola wkB1(T) and S. alvi wkB2(T) (isolated from Apis), as well as draft genomes for four other strains from Bombus. G. apicola and S. alvi were found to occupy very different metabolic niches: The former is a saccharolytic fermenter, whereas the latter is an oxidizer of carboxylic acids. Together, they may form a syntrophic network for partitioning of metabolic resources. Both species possessed numerous genes [type 6 secretion systems, repeats in toxin (RTX) toxins, RHS proteins, adhesins, and type IV pili] that likely mediate cell-cell interactions and gut colonization. Variation in these genes could account for the host fidelity of strains observed in previous phylogenetic studies. Here, we also show the first experimental evidence, to our knowledge, for this specificity in vivo: Strains of S. alvi were able to colonize their native bee host but not bees of another genus. Consistent with specific, long-term host association, comparative genomic analysis revealed a deep divergence and little or no gene flow between Apis and Bombus gut symbionts. However, within a host type (Apis or Bombus), we detected signs of horizontal gene transfer between G. apicola and S. alvi, demonstrating the importance of the broader gut community in shaping the evolution of any one member. Our results show that host specificity is likely driven by multiple factors, including direct host-microbe interactions, microbe-microbe interactions, and social transmission.


September 22, 2019

The Florida manatee (Trichechus manatus latirostris) immunoglobulin heavy chain suggests the importance of clan III variable segments in repertoire diversity.

Manatees are a vulnerable, charismatic sentinel species from the evolutionarily divergent Afrotheria. Manatee health and resistance to infectious disease is of great concern to conservation groups, but little is known about their immune system. To develop manatee-specific tools for monitoring health, we first must have a general knowledge of how the immunoglobulin heavy (IgH) chain locus is organized and transcriptionally expressed. Using the genomic scaffolds of the Florida manatee (Trichechus manatus latirostris), we characterized the potential IgH segmental diversity and constant region isotypic diversity and performed the first Afrotherian repertoire analysis. The Florida manatee has low V(D)J combinatorial diversity (3744 potential combinations) and few constant region isotypes. They also lack clan III V segments, which may have caused reduced VH segment numbers. However, we found productive somatic hypermutation concentrated in the complementarity determining regions. In conclusion, manatees have limited IGHV clan and combinatorial diversity. This suggests that clan III V segments are essential for maintaining IgH locus diversity. Copyright © 2017 Elsevier Ltd. All rights reserved.


September 22, 2019

Tracking alternatively spliced isoforms from long reads by SpliceHunter.

Alternative splicing increases the functional complexity of a genome by generating multiple isoforms and potentially proteins from the same gene. Vast amounts of alternative splicing events are routinely detected by short read deep sequencing technologies but their functional interpretation is hampered by an uncertain transcript context. Emerging long-read sequencing technologies provide a more complete picture of full-length transcript sequences. We introduce SpliceHunter, a tool for the computational interpretation of long reads generated by for example Pacific Biosciences instruments. SpliceHunter defines and tracks isoforms and novel transcription units across time points, compares their splicing pattern to a reference annotation, and translates them into potential protein sequences.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.