Menu
September 22, 2019

Long-read sequencing of human cytomegalovirus transcriptome reveals RNA isoforms carrying distinct coding potentials.

The human cytomegalovirus (HCMV) is a ubiquitous, human pathogenic herpesvirus. The complete viral genome is transcriptionally active during infection; however, a large part of its transcriptome has yet to be annotated. In this work, we applied the amplified isoform sequencing technique from Pacific Biosciences to characterize the lytic transcriptome of HCMV strain Towne varS. We developed a pipeline for transcript annotation using long-read sequencing data. We identified 248 transcriptional start sites, 116 transcriptional termination sites and 80 splicing events. Using this information, we have annotated 291 previously undescribed or only partially annotated transcript isoforms, including eight novel antisense transcripts and their isoforms, as well as a novel transcript (RS2) in the short repeat region, partially antisense to RS1. Similarly to other organisms, we discovered a high transcriptional diversity in HCMV, with many transcripts only slightly differing from one another. Comparing our transcriptome profiling results to an earlier ribosome footprint analysis, we have concluded that the majority of the transcripts contain multiple translationally active ORFs, and also that most isoforms contain unique combinations of ORFs. Based on these results, we propose that one important function of this transcriptional diversity may be to provide a regulatory mechanism at the level of translation.


September 22, 2019

Recent insights into the tick microbiome gained through next-generation sequencing.

The tick microbiome comprises communities of microorganisms, including viruses, bacteria and eukaryotes, and is being elucidated through modern molecular techniques. The advent of next-generation sequencing (NGS) technologies has enabled the genes and genomes within these microbial communities to be explored in a rapid and cost-effective manner. The advantages of using NGS to investigate microbiomes surpass the traditional non-molecular methods that are limited in their sensitivity, and conventional molecular approaches that are limited in their scalability. In recent years the number of studies using NGS to investigate the microbial diversity and composition of ticks has expanded. Here, we provide a review of NGS strategies for tick microbiome studies and discuss the recent findings from tick NGS investigations, including the bacterial diversity and composition, influential factors, and implications of the tick microbiome.


September 22, 2019

Full-length RNA sequencing reveals unique transcriptome composition in bermudagrass.

Bermudagrass [Cynodon dactylon (L.) Pers.] is an important perennial warm-season turfgrass species with great economic value. However, the reference genome and transcriptome information are still deficient in bermudagrass, which severely impedes functional and molecular breeding studies. In this study, through analyzing a mixture sample of leaves, stolons, shoots, roots and flowers with single-molecule long-read sequencing technology from Pacific Biosciences (PacBio), we reported the first full-length transcriptome dataset of bermudagrass (C. dactylon cultivar Yangjiang) comprising 78,192 unigenes. Among the unigenes, 66,409 were functionally annotated, whereas 27,946 were found to have two or more isoforms. The annotated full-length unigenes provided many new insights into gene sequence characteristics and systematic phylogeny of bermudagrass. By comparison with transcriptome dataset in nine grass species, KEGG pathway analyses further revealed that C4 photosynthesis-related genes, notably the phosphoenolpyruvate carboxylase and pyruvate, phosphate dikinase genes, are specifically enriched in bermudagrass. These results not only explained the possible reason why bermudagrass flourishes in warm areas but also provided a solid basis for future studies in this important turfgrass species. Copyright © 2018 Elsevier Masson SAS. All rights reserved.


September 22, 2019

Identification of a novel fusion transcript between human relaxin-1 (RLN1) and human relaxin-2 (RLN2) in prostate cancer.

Simultaneous expression of highly homologous RLN1 and RLN2 genes in prostate impairs their accurate delineation. We used PacBio SMRT sequencing and RNA-Seq in LNCaP cells in order to dissect the expression of RLN1 and RLN2 variants. We identified a novel fusion transcript comprising the RLN1 and RLN2 genes and found evidence of its expression in the normal and prostate cancer tissues. The RLN1-RLN2 fusion putatively encodes RLN2 isoform with the deleted secretory signal peptide. The identification of the fusion transcript provided information to determine unique RLN1-RLN2 fusion and RLN1 regions. The RLN1-RLN2 fusion was co-expressed with RLN1 in LNCaP cells, but the two gene products were inversely regulated by androgens. We showed that RLN1 is underrepresented in common PCa cell lines in comparison to normal and PCa tissue. The current study brings a highly relevant update to the relaxin field, and will encourage further studies of RLN1 and RLN2 in PCa and broader. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.


September 22, 2019

Multiplex amplicon sequencing for microbe identification in community-based culture collections.

Microbiome analysis using metagenomic sequencing has revealed a vast microbial diversity associated with plants. Identifying the molecular functions associated with microbiome-plant interaction is a significant challenge concerning the development of microbiome-derived technologies applied to agriculture. An alternative to accelerate the discovery of the microbiome benefits to plants is to construct microbial culture collections concomitant with accessing microbial community structure and abundance. However, traditional methods of isolation, cultivation, and identification of microbes are time-consuming and expensive. Here we describe a method for identification of microbes in culture collections constructed by picking colonies from primary platings that may contain single or multiple microorganisms, which we named community-based culture collections (CBC). A multiplexing 16S rRNA gene amplicon sequencing based on two-step PCR amplifications with tagged primers for plates, rows, and columns allowed the identification of the microbial composition regardless if the well contains single or multiple microorganisms. The multiplexing system enables pooling amplicons into a single tube. The sequencing performed on the PacBio platform led to recovery near-full-length 16S rRNA gene sequences allowing accurate identification of microorganism composition in each plate well. Cross-referencing with plant microbiome structure and abundance allowed the estimation of diversity and abundance representation of microorganism in the CBC.


September 22, 2019

High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.


September 22, 2019

Multiplatform next-generation sequencing identifies novel RNA molecules and transcript isoforms of the endogenous retrovirus isolated from cultured cells.

In this study, we applied short- and long-read RNA sequencing techniques, as well as PCR analysis to investigate the transcriptome of the porcine endogenous retrovirus (PERV) expressed from cultured porcine kidney cell line PK-15. This analysis has revealed six novel transcripts and eight transcript isoforms, including five length and three splice variants. We were able to establish whether a deletion in a transcript is the result of the splicing of mRNAs or of genomic deletion in one of the PERV clones. Additionally, we re-annotated the formerly identified RNA molecules. Our analysis revealed a higher complexity of PERV transcriptome than it was earlier believed.© FEMS 2018. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


September 22, 2019

Prey range and genome evolution of Halobacteriovorax marinus predatory bacteria from an estuary

Halobacteriovorax strains are saltwater-adapted predatory bacteria that attack Gram-negative bacteria and may play an important role in shaping microbial communities. To understand how Halobacteriovorax strains impact ecosystems and develop them as biocontrol agents, it is important to characterize variation in predation phenotypes and investigate Halobacteriovorax genome evolution. We isolated Halobacteriovorax marinus BE01 from an estuary in Rhode Island using Vibrio from the same site as prey. Small, fast-moving, attack-phase BE01 cells attach to and invade prey cells, consistent with the intraperiplasmic predation strategy of the H. marinus type strain, SJ. BE01 is a prey generalist, forming plaques on Vibrio strains from the estuary, Pseudomonas from soil, and Escherichia coli. Genome analysis revealed extremely high conservation of gene order and amino acid sequences between BE01 and SJ, suggesting strong selective pressure to maintain the genome in this H. marinus lineage. Despite this, we identified two regions of gene content difference that likely resulted from horizontal gene transfer. Analysis of modal codon usage frequencies supports the hypothesis that these regions were acquired from bacteria with different codon usage biases than H. marinus. In one of these regions, BE01 and SJ carry different genes associated with mobile genetic elements. Acquired functions in BE01 include the dnd operon, which encodes a pathway for DNA modification, and a suite of genes involved in membrane synthesis and regulation of gene expression that was likely acquired from another Halobacteriovorax lineage. This analysis provides further evidence that horizontal gene transfer plays an important role in genome evolution in predatory bacteria. IMPORTANCE Predatory bacteria attack and digest other bacteria and therefore may play a role in shaping microbial communities. To investigate phenotypic and genotypic variation in saltwater-adapted predatory bacteria, we isolated Halobacteriovorax marinus BE01 from an estuary in Rhode Island, assayed whether it could attack different prey bacteria, and sequenced and analyzed its genome. We found that BE01 is a prey generalist, attacking bacteria from different phylogenetic groups and environments. Gene order and amino acid sequences are highly conserved between BE01 and the H. marinus type strain, SJ. By comparative genomics, we detected two regions of gene content difference that likely occurred via horizontal gene transfer events. Acquired genes encode functions such as modification of DNA, membrane synthesis and regulation of gene expression. Understanding genome evolution and variation in predation phenotypes among predatory bacteria will inform their development as biocontrol agents and clarify how they impact microbial communities.


September 22, 2019

Extreme haplotype variation in the desiccation-tolerant clubmoss Selaginella lepidophylla.

Plant genome size varies by four orders of magnitude, and most of this variation stems from dynamic changes in repetitive DNA content. Here we report the small 109?Mb genome of Selaginella lepidophylla, a clubmoss with extreme desiccation tolerance. Single-molecule sequencing enables accurate haplotype assembly of a single heterozygous S. lepidophylla plant, revealing extensive structural variation. We observe numerous haplotype-specific deletions consisting of largely repetitive and heavily methylated sequences, with enrichment in young Gypsy LTR retrotransposons. Such elements are active but rapidly deleted, suggesting “bloat and purge” to maintain a small genome size. Unlike all other land plant lineages, Selaginella has no evidence of a whole-genome duplication event in its evolutionary history, but instead shows unique tandem gene duplication patterns reflecting adaptation to extreme drying. Gene expression changes during desiccation in S. lepidophylla mirror patterns observed across angiosperm resurrection plants.


September 22, 2019

An ancient integration in a plant NLR is maintained as a trans-species polymorphism

Plant immune receptors are under constant selective pressure to maintain resistance to plant pathogens. Nucleotide-binding leucine-rich repeat (NLR) proteins are one class of cytoplasmic immune receptors whose genes commonly show signatures of adaptive evolution. While it is known that balancing selection contributes to maintaining high intraspecific allelic diversity, the evolutionary mechanism that influences the transmission of alleles during speciation remains unclear. The barley Mla locus has over 30 described alleles conferring isolate-specific resistance to barley powdery mildew and contains three NLR families (RGH1, RGH2, and RGH3). We discovered (using sequence capture and RNAseq) the presence of a novel integrated Exo70 domain in RGH2 in the Mla3 haplotype. Allelic variation across barley accessions includes presence/absence of the integrated domain in RGH2. Expanding our search to several Poaceae species, we found shared interspecific conservation in the RGH2-Exo70 integration. We hypothesise that balancing selection has maintained allelic variation at Mla as a trans-species polymorphism over 24 My, thus contributing to and preserving interspecific allelic diversity during speciation.


September 22, 2019

Comparative heterochromatin profiling reveals conserved and unique epigenome signatures linked to adaptation and development of malaria parasites.

Heterochromatin-dependent gene silencing is central to the adaptation and survival of Plasmodium falciparum malaria parasites, allowing clonally variant gene expression during blood infection in humans. By assessing genome-wide heterochromatin protein 1 (HP1) occupancy, we present a comprehensive analysis of heterochromatin landscapes across different Plasmodium species, strains, and life cycle stages. Common targets of epigenetic silencing include fast-evolving multi-gene families encoding surface antigens and a small set of conserved HP1-associated genes with regulatory potential. Many P. falciparum heterochromatic genes are marked in a strain-specific manner, increasing the parasite’s adaptive capacity. Whereas heterochromatin is strictly maintained during mitotic proliferation of asexual blood stage parasites, substantial heterochromatin reorganization occurs in differentiating gametocytes and appears crucial for the activation of key gametocyte-specific genes and adaptation of erythrocyte remodeling machinery. Collectively, these findings provide a catalog of heterochromatic genes and reveal conserved and specialized features of epigenetic control across the genus Plasmodium. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

RTS,S/AS01 malaria vaccine mismatch observed among Plasmodium falciparum isolates from southern and central Africa and globally.

The RTS,S/AS01 malaria vaccine encompasses the central repeats and C-terminal of Plasmodium falciparum circumsporozoite protein (PfCSP). Although no Phase II clinical trial studies observed evidence of strain-specific immunity, recent studies show a decrease in vaccine efficacy against non-vaccine strain parasites. In light of goals to reduce malaria morbidity, anticipating the effectiveness of RTS,S/AS01 is critical to planning widespread vaccine introduction. We deep sequenced C-terminal Pfcsp from 77 individuals living along the international border in Luapula Province, Zambia and Haut-Katanga Province, the Democratic Republic of the Congo (DRC) and compared translated amino acid haplotypes to the 3D7 vaccine strain. Only 5.2% of the 193 PfCSP sequences from the Zambia-DRC border region matched 3D7 at all 84 amino acids. To further contextualize the genetic diversity sampled in this study with global PfCSP diversity, we analyzed an additional 3,809 Pfcsp sequences from the Pf3k database and constructed a haplotype network representing 15 countries from Africa and Asia. The diversity observed in our samples was similar to the diversity observed in the global haplotype network. These observations underscore the need for additional research assessing genetic diversity in P. falciparum and the impact of PfCSP diversity on RTS,S/AS01 efficacy.


September 22, 2019

Biosynthesis of the 15-membered ring depsipeptide neoantimycin.

Antimycins are a family of natural products possessing outstanding biological activities and unique structures, which have intrigued chemists for over a half century. Of particular interest are the ring-expanded antimycins that show promising anticancer potential and whose biosynthesis remains uncharacterized. Specifically, neoantimycin and its analogs have been shown to be effective regulators of the oncogenic proteins GRP78/BiP and K-Ras. The neoantimycin structural skeleton is built on a 15-membered tetralactone ring containing one methyl, one hydroxy, one benzyl, and three alkyl moieties, as well as an amide linkage to a conserved 3-formamidosalicylic acid moiety. Although the biosynthetic gene cluster for neoantimycins was recently identified, the enzymatic logic that governs the synthesis of neoantimycins has not yet been revealed. In this work, the neoantimycin gene cluster is identified, and an updated sequence and annotation is provided delineating a nonribosomal peptide synthetase/polyketide synthase (NRPS/PKS) hybrid scaffold. Using cosmid expression and CRISPR/Cas-based genome editing, several heterologous expression strains for neoantimycin production are constructed in two separate Streptomyces species. A combination of in vivo and in vitro analysis is further used to completely characterize the biosynthesis of neoantimycins including the megasynthases and trans-acting domains. This work establishes a set of highly tractable hosts for producing and engineering neoantimycins and their C11 oxidized analogs, paving the way for neoantimycin-based drug discovery and development.


September 22, 2019

Multiple large inversions and breakpoint rewiring of gene expression in the evolution of the fire ant social supergene.

Supergenes consist of co-adapted loci that segregate together and are associated with adaptive traits. In the fire ant Solenopsis invicta, two ‘social’ supergene variants regulate differences in colony queen number and other traits. Suppressed recombination in this system is maintained, in part, by a greater than 9 Mb inversion, but the supergene is larger. Has the supergene in S. invicta undergone multiple large inversions? The initial gene content of the inverted allele of a supergene would be the same as that of the wild-type allele. So, how did the inversion increase in frequency? To address these questions, we cloned one extreme breakpoint in the fire ant supergene. In doing so, we found a second large (greater than 800 Kb) rearrangement. Furthermore, we determined the temporal order of the two big inversions based on the translocation pattern of a third small fragment. Because the S. invicta supergene lacks evolutionary strata, our finding of multiple inversions may support an introgression model of the supergene. Finally, we showed that one of the inversions swapped the promoter of a breakpoint-adjacent gene, which might have conferred a selective advantage relative to the non-inverted allele. Our findings provide a rare example of gene alterations arising directly from an inversion event.© 2018 The Author(s).


September 22, 2019

Genetic diversity of Cryptosporidium hominis in a Bangladeshi community as revealed by whole genome sequencing.

We studied the genetic diversity of Cryptosporidium hominis infections in slum-dwelling infants from Dhaka over a 2-year period. Cryptosporidium hominis infections were common during the monsoon, and were genetically diverse as measured by gp60 genotyping and whole-genome resequencing. Recombination in the parasite was evidenced by the decay of linkage disequilibrium in the genome over <300 bp. Regions of the genome with high levels of polymorphism were also identified. Yet to be determined is if genomic diversity is responsible in part for the high rate of reinfection, seasonality, and varied clinical presentations of cryptosporidiosis in this population.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.