Menu
September 22, 2019

Long-read sequencing of human cytomegalovirus transcriptome reveals RNA isoforms carrying distinct coding potentials.

The human cytomegalovirus (HCMV) is a ubiquitous, human pathogenic herpesvirus. The complete viral genome is transcriptionally active during infection; however, a large part of its transcriptome has yet to be annotated. In this work, we applied the amplified isoform sequencing technique from Pacific Biosciences to characterize the lytic transcriptome of HCMV strain Towne varS. We developed a pipeline for transcript annotation using long-read sequencing data. We identified 248 transcriptional start sites, 116 transcriptional termination sites and 80 splicing events. Using this information, we have annotated 291 previously undescribed or only partially annotated transcript isoforms, including eight novel antisense transcripts and their isoforms, as well as a novel transcript (RS2) in the short repeat region, partially antisense to RS1. Similarly to other organisms, we discovered a high transcriptional diversity in HCMV, with many transcripts only slightly differing from one another. Comparing our transcriptome profiling results to an earlier ribosome footprint analysis, we have concluded that the majority of the transcripts contain multiple translationally active ORFs, and also that most isoforms contain unique combinations of ORFs. Based on these results, we propose that one important function of this transcriptional diversity may be to provide a regulatory mechanism at the level of translation.


September 22, 2019

Soil bacterial communities are shaped by temporal and environmental filtering: evidence from a long-term chronosequence.

Soil microbial communities are abundant, hyper-diverse and mediate global biogeochemical cycles, but we do not yet understand the processes mediating their assembly. Current hypothetical frameworks suggest temporal (e.g. dispersal limitation) and environmental (e.g. soil pH) filters shape microbial community composition; however, there is limited empirical evidence supporting this framework in the hyper-diverse soil environment, particularly at large spatial (i.e. regional to continental) and temporal (i.e. 100 to 1000 years) scales. Here, we present evidence from a long-term chronosequence (4000 years) that temporal and environmental filters do indeed shape soil bacterial community composition. Furthermore, nearly 20 years of environmental monitoring allowed us to control for potentially confounding environmental variation. Soil bacterial communities were phylogenetically distinct across the chronosequence. We determined that temporal and environmental factors accounted for significant portions of bacterial phylogenetic structure using distance-based linear models. Environmental factors together accounted for the majority of phylogenetic structure, namely, soil temperature (19%), pH (17%) and litter carbon:nitrogen (C:N; 17%). However, of all individual factors, time since deglaciation accounted for the greatest proportion of bacterial phylogenetic structure (20%). Taken together, our results provide empirical evidence that temporal and environmental filters act together to structure soil bacterial communities across large spatial and long-term temporal scales. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019

Improved full-length killer cell immunoglobulin-like receptor transcript discovery in Mauritian cynomolgus macaques.

Killer cell immunoglobulin-like receptors (KIRs) modulate disease progression of pathogens including HIV, malaria, and hepatitis C. Cynomolgus and rhesus macaques are widely used as nonhuman primate models to study human pathogens, and so, considerable effort has been put into characterizing their KIR genetics. However, previous studies have relied on cDNA cloning and Sanger sequencing that lack the throughput of current sequencing platforms. In this study, we present a high throughput, full-length allele discovery method utilizing Pacific Biosciences circular consensus sequencing (CCS). We also describe a new approach to Macaque Exome Sequencing (MES) and the development of the Rhexome1.0, an adapted target capture reagent that includes macaque-specific capture probe sets. By using sequence reads generated by whole genome sequencing (WGS) and MES to inform primer design, we were able to increase the sensitivity of KIR allele discovery. We demonstrate this increased sensitivity by defining nine novel alleles within a cohort of Mauritian cynomolgus macaques (MCM), a geographically isolated population with restricted KIR genetics that was thought to be completely characterized. Finally, we describe an approach to genotyping KIRs directly from sequence reads generated using WGS/MES reads. The findings presented here expand our understanding of KIR genetics in MCM by associating new genes with all eight KIR haplotypes and demonstrating the existence of at least one KIR3DS gene associated with every haplotype.


September 22, 2019

De novo assembly of a Chinese soybean genome.

Soybean was domesticated in China and has become one of the most important oilseed crops. Due to bottlenecks in their introduction and dissemination, soybeans from different geographic areas exhibit extensive genetic diversity. Asia is the largest soybean market; therefore, a high-quality soybean reference genome from this area is critical for soybean research and breeding. Here, we report the de novo assembly and sequence analysis of a Chinese soybean genome for “Zhonghuang 13” by a combination of SMRT, Hi-C and optical mapping data. The assembled genome size is 1.025 Gb with a contig N50 of 3.46 Mb and a scaffold N50 of 51.87 Mb. Comparisons between this genome and the previously reported reference genome (cv. Williams 82) uncovered more than 250,000 structure variations. A total of 52,051 protein coding genes and 36,429 transposable elements were annotated for this genome, and a gene co-expression network including 39,967 genes was also established. This high quality Chinese soybean genome and its sequence analysis will provide valuable information for soybean improvement in the future.


September 22, 2019

The small peptide world in long noncoding RNAs.

Long noncoding RNAs (lncRNAs) are a group of transcripts that are longer than 200 nucleotides (nt) without coding potential. Over the past decade, tens of thousands of novel lncRNAs have been annotated in animal and plant genomes because of advanced high-throughput RNA sequencing technologies and with the aid of coding transcript classifiers. Further, a considerable number of reports have revealed the existence of stable, functional small peptides (also known as micropeptides), translated from lncRNAs. In this review, we discuss the methods of lncRNA classification, the investigations regarding their coding potential and the functional significance of the peptides they encode.


September 22, 2019

Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.


September 22, 2019

Complete genome sequence of Endomicrobium proavitum, a free-living relative of the intracellular symbionts of termite gut flagellates (phylum Elusimicrobia).

We sequenced the complete genome of Endomicrobium proavitum strain Rsa215, the first isolate of the class Endomicrobia (phylum Elusimicrobia). It is the closest free-living relative of the endosymbionts of termite gut flagellates and thereby provides an excellent model for studying the evolutionary processes during the establishment of an intracellular symbiosis. Copyright © 2015 Zheng and Brune.


September 22, 2019

Predominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing.

Divergence of alternative splicing represents one of the major driving forces to shape phenotypic diversity during evolution. However, the extent to which these divergences could be explained by the evolving cis-regulatory versus trans-acting factors remains unresolved. To globally investigate the relative contributions of the two factors for the first time in mammals, we measured splicing difference between C57BL/6J and SPRET/EiJ mouse strains and allele-specific splicing pattern in their F1 hybrid. Out of 11,818 alternative splicing events expressed in the cultured fibroblast cells, we identified 796 with significant difference between the parental strains. After integrating allele-specific data from F1 hybrid, we demonstrated that these events could be predominately attributed to cis-regulatory variants, including those residing at and beyond canonical splicing sites. Contrary to previous observations in Drosophila, such predominant contribution was consistently observed across different types of alternative splicing. Further analysis of liver tissues from the same mouse strains and reanalysis of published datasets on other strains showed similar trends, implying in general the predominant contribution of cis-regulatory changes in the evolution of mouse alternative splicing. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.


September 22, 2019

Precise fecal microbiome of the herbivorous Tibetan antelope inhabiting high-altitude alpine plateau

The metataxonomic approach combining 16S rRNA gene amplicon sequencing using the PacBio technology with the application of the operational phylogenetic unit (OPU) approach, has been used to analyze the fecal microbial composition of the high-altitude and herbivorous Tibetan antelopes. The fecal samples of the antelope were collected in Hoh Xil National Nature Reserve, at an altitude over 4500 m, the largest depopulated zone in Qinghai-Tibetan Plateau, China, where non-native animals or humans may experience life-threatening acute mountain sickness. In total, 104 antelope fecal samples were enrolled in this study, and were clustered into 61,258 operational taxonomic units (OTUs) at an identity of 98.7% and affiliated with 757 OPUs, including 144 known species, 256 potentially new species, 103 potentially higher taxa within known lineages. In addition, 254 comprised sequences not affiliating with any known family, and the closest relatives were unclassified lineages of existing orders or classes. A total of 42 out of 757 OPUs conformed to the core fecal microbiome, of which four major lineages, namely, un-cultured Ruminococcaceae, Lachnospiraceae, Akkermansia and Christensenellaceae were associated with human health or longevity. The current study reveals that the fecal core microbiome of antelope is mainly composited of uncultured bacteria. The most abundant core taxa, namely, uncultured Ruminococcaceae, uncultured Akkermansia, uncultured Bacteroides, uncultured Christensenellaceae, uncultured Mollicutes, and uncultured Lachnospiraceae, may represent new bacterial candidates at high taxa levels, and several may have beneficial roles in health promotion or anti-intestinal dysbiosis. These organisms should be further isolated and evaluated for potential effect on human health and longevity.


September 22, 2019

Recent insights into the tick microbiome gained through next-generation sequencing.

The tick microbiome comprises communities of microorganisms, including viruses, bacteria and eukaryotes, and is being elucidated through modern molecular techniques. The advent of next-generation sequencing (NGS) technologies has enabled the genes and genomes within these microbial communities to be explored in a rapid and cost-effective manner. The advantages of using NGS to investigate microbiomes surpass the traditional non-molecular methods that are limited in their sensitivity, and conventional molecular approaches that are limited in their scalability. In recent years the number of studies using NGS to investigate the microbial diversity and composition of ticks has expanded. Here, we provide a review of NGS strategies for tick microbiome studies and discuss the recent findings from tick NGS investigations, including the bacterial diversity and composition, influential factors, and implications of the tick microbiome.


September 22, 2019

Long reads: their purpose and place.

In recent years long-read technologies have moved from being a niche and specialist field to a point of relative maturity likely to feature frequently in the genomic landscape. Analogous to next generation sequencing, the cost of sequencing using long-read technologies has materially dropped whilst the instrument throughput continues to increase. Together these changes present the prospect of sequencing large numbers of individuals with the aim of fully characterizing genomes at high resolution. In this article, we will endeavour to present an introduction to long-read technologies showing: what long reads are; how they are distinct from short reads; why long reads are useful and how they are being used. We will highlight the recent developments in this field, and the applications and potential of these technologies in medical research, and clinical diagnostics and therapeutics.


September 22, 2019

Soil microbial communities and elk foraging intensity: implications for soil biogeochemical cycling in the sagebrush steppe.

Foraging intensity of large herbivores may exert an indirect top-down ecological force on soil microbial communities via changes in plant litter inputs. We investigated the responses of the soil microbial community to elk (Cervus elaphus) winter range occupancy across a long-term foraging exclusion experiment in the sagebrush steppe of the North American Rocky Mountains, combining phylogenetic analysis of fungi and bacteria with shotgun metagenomics and extracellular enzyme assays. Winter foraging intensity was associated with reduced bacterial richness and increasingly distinct bacterial communities. Although fungal communities did not respond linearly to foraging intensity, a greater ß-diversity response to winter foraging exclusion was observed. Furthermore, winter foraging exclusion increased soil cellulolytic and hemicellulolytic enzyme potential and higher foraging intensity reduced chitinolytic gene abundance. Thus, future changes in winter range occupancy may shape biogeochemical processes via shifts in microbial communities and subsequent changes to their physiological capacities to cycle soil C and N.© 2017 John Wiley & Sons Ltd/CNRS.


September 22, 2019

Full-length RNA sequencing reveals unique transcriptome composition in bermudagrass.

Bermudagrass [Cynodon dactylon (L.) Pers.] is an important perennial warm-season turfgrass species with great economic value. However, the reference genome and transcriptome information are still deficient in bermudagrass, which severely impedes functional and molecular breeding studies. In this study, through analyzing a mixture sample of leaves, stolons, shoots, roots and flowers with single-molecule long-read sequencing technology from Pacific Biosciences (PacBio), we reported the first full-length transcriptome dataset of bermudagrass (C. dactylon cultivar Yangjiang) comprising 78,192 unigenes. Among the unigenes, 66,409 were functionally annotated, whereas 27,946 were found to have two or more isoforms. The annotated full-length unigenes provided many new insights into gene sequence characteristics and systematic phylogeny of bermudagrass. By comparison with transcriptome dataset in nine grass species, KEGG pathway analyses further revealed that C4 photosynthesis-related genes, notably the phosphoenolpyruvate carboxylase and pyruvate, phosphate dikinase genes, are specifically enriched in bermudagrass. These results not only explained the possible reason why bermudagrass flourishes in warm areas but also provided a solid basis for future studies in this important turfgrass species. Copyright © 2018 Elsevier Masson SAS. All rights reserved.


September 22, 2019

Divergent brain gene expression profiles between alternative behavioural helper types in a cooperative breeder.

Juveniles of the cooperatively breeding cichlid fish Neolamprologus pulcher either consistently provide help in form of alloparental egg care (“cleaners”) or consistently abstain from helping (“noncleaners”). These phenotypes are not based on heritable genetic differences. Instead, they arise during ontogeny, which should lead to differences in brain structure or physiology, a currently untested prediction. We compared brain gene expression profiles of cleaners and noncleaners in two experimental conditions, a helping opportunity and a control condition. We aimed to identify (a) expression differences between cleaners and noncleaners in the control, (b) changes in gene expression induced by the opportunity and (c) differences in plasticity of gene expression between cleaners and noncleaners. Control cleaners and noncleaners differed in the expression of a single gene, irx2, which regulates neural differentiation. During the opportunity, cleaners and noncleaners had three upregulated genes in common, which were implicated in neuroplasticity, hormonal signalling and cell proliferation. Thus, the stimulus in the opportunity was sufficiently salient. Cleaners also showed higher expression of seven additional genes that were unique to the opportunity. One of these cleaner-specific genes is implicated in neuropeptide metabolism, indicating that this process is associated with cleaning performance. This suggests that the two types employed different pathways to integrate social information, preparing them for accelerated reaction to future opportunities. Interestingly, three developmental genes were downregulated between the control and the opportunity in cleaners only. Our results indicate that the two behavioural types responded differently to the helping opportunity and that only cleaners responded by downregulating developmental genes.© 2018 John Wiley & Sons Ltd.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.