September 22, 2019  |  

Long-read sequencing of human cytomegalovirus transcriptome reveals RNA isoforms carrying distinct coding potentials.

The human cytomegalovirus (HCMV) is a ubiquitous, human pathogenic herpesvirus. The complete viral genome is transcriptionally active during infection; however, a large part of its transcriptome has yet to be annotated. In this work, we applied the amplified isoform sequencing technique from Pacific Biosciences to characterize the lytic transcriptome of HCMV strain Towne varS. We developed a pipeline for transcript annotation using long-read sequencing data. We identified 248 transcriptional start sites, 116 transcriptional termination sites and 80 splicing events. Using this information, we have annotated 291 previously undescribed or only partially annotated transcript isoforms, including eight novel antisense transcripts and their isoforms, as well as a novel transcript (RS2) in the short repeat region, partially antisense to RS1. Similarly to other organisms, we discovered a high transcriptional diversity in HCMV, with many transcripts only slightly differing from one another. Comparing our transcriptome profiling results to an earlier ribosome footprint analysis, we have concluded that the majority of the transcripts contain multiple translationally active ORFs, and also that most isoforms contain unique combinations of ORFs. Based on these results, we propose that one important function of this transcriptional diversity may be to provide a regulatory mechanism at the level of translation.


September 22, 2019  |  

Multiplatform next-generation sequencing identifies novel RNA molecules and transcript isoforms of the endogenous retrovirus isolated from cultured cells.

In this study, we applied short- and long-read RNA sequencing techniques, as well as PCR analysis, to investigate the transcriptome of the porcine endogenous retrovirus (PERV) expressed from the cultured porcine kidney cell line PK-15. This analysis revealed six novel transcripts and eight transcript isoforms, including five length variants and three splice variants. We were able to establish whether a deletion in a transcript is the result of mRNA splicing or of a genomic deletion in one of the PERV clones. Additionally, we re-annotated the formerly identified RNA molecules. Our analysis revealed that the PERV transcriptome is more complex than previously believed. © FEMS 2018. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


September 22, 2019  |  

A survey of localized sequence rearrangements in human DNA.

Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur.


September 22, 2019  |  

Construction and characterization of bacterial artificial chromosomes harboring the full-length genome of a highly attenuated vaccinia virus LC16m8.

LC16m8 (m8), a highly attenuated vaccinia virus (VAC) strain, was developed as a smallpox vaccine, and its safety and immunogenicity have been confirmed. Here, we aimed to develop a system that recovers infectious m8 from a bacterial artificial chromosome (BAC) that retains the full-length viral genomic DNA (m8-BAC system). The infectious virus was successfully recovered from a VAC-BAC plasmid, named pLC16m8-BAC. Furthermore, the bacterial replicon-free virus was generated by intramolecular homologous recombination and was successfully recovered from a modified VAC-BAC plasmid, named pLC16m8.8S-BAC. Also, the growth of the recovered virus was indistinguishable from that of authentic m8. The full genome sequence of the plasmid, which harbors inverted terminal repeats (ITRs) identical to those of authentic m8, was determined by long-read next-generation sequencing (NGS). The ITR contains 18 to 32 copies of the 70 base pair tandem repeat and 30 to 45 copies of the 54 base pair tandem repeat, and the number of tandem repeats differed between the left and right ITRs. Since the virus recovered from pLC16m8.8S-BAC was expected to retain a viral genome identical to that of m8, including the ITR, a reference-based alignment following short-read NGS was performed to validate the sequence of the recovered virus. Based on the pattern of coverage depth in the ITR, no remarkable differences were observed between the virus and m8, and the remaining regions were confirmed to be identical as well. In summary, this new system can recover a virus that is geno- and phenotypically indistinguishable from authentic m8.


September 22, 2019  |  

Xanthomonas citri jumbo phage XacN1 exhibits a wide host range and high complement of tRNA genes.

Xanthomonas virus (phage) XacN1 is a novel jumbo myovirus infecting Xanthomonas citri, the causative agent of Asian citrus canker. Its linear 384,670 bp double-stranded DNA genome encodes 592 proteins and presents the longest (66 kbp) direct terminal repeats (DTRs) among sequenced viral genomes. The DTRs harbor 56 tRNA genes, which correspond to all 20 amino acids and represent the largest number of tRNA genes reported in a viral genome. Codon usage analysis revealed a propensity for the phage-encoded tRNAs to target codons that are highly used by the phage but less frequently by its host. The existence of these tRNA genes and seven additional translation-related genes, as well as a chaperonin gene found in the XacN1 genome, suggests a relative independence of phage replication from host molecular machinery, leading to a prediction of a wide host range for this jumbo phage. We confirmed the prediction by showing a wider host range of XacN1 than other X. citri phages in an infection test against a panel of host strains. Phylogenetic analyses revealed a clade of phages composed of XacN1 and ten other jumbo phages, indicating an evolutionarily stable large genome size for this group of phages.


September 22, 2019  |  

CliqueSNV: Scalable reconstruction of intra-host viral populations from NGS reads

Highly mutable RNA viruses such as influenza A virus, human immunodeficiency virus and hepatitis C virus exist in infected hosts as highly heterogeneous populations of closely related genomic variants. The presence of low-frequency variants with few mutations with respect to major strains may result in immune escape, emergence of drug resistance, and an increase in virulence and infectivity. Next-generation sequencing technologies permit sequencing of intra-host viral populations at very great depth, thus providing an opportunity to access low-frequency variants. Long read lengths offered by single-molecule sequencing technologies allow all viral variants to be sequenced in a single pass. However, high sequencing error rates limit the ability to study heterogeneous viral populations composed of rare, closely related variants. In this article, we present CliqueSNV, a novel reference-based method for reconstruction of viral variants from NGS data. It efficiently constructs an allele graph based on linkage between single nucleotide variations and identifies true viral variants by merging cliques of that graph using combinatorial optimization techniques. The new method outperforms existing methods in both accuracy and running time on experimental and simulated NGS data for titrated levels of known viral variants. For PacBio reads, it accurately reconstructs variants with frequency as low as 0.1%. For Illumina reads, it fully reconstructs main variants. The open source implementation of CliqueSNV is freely available for download at https://github.com/vyacheslav-tsivina/CliqueSNV


September 22, 2019  |  

Epigenetic landscape influences the liver cancer genome architecture.

The accumulation of different types of genetic alterations, such as nucleotide substitutions, structural rearrangements and viral genome integrations, and of epigenetic alterations contributes to carcinogenesis. Here, we report correlation between the occurrence of epigenetic features and genetic aberrations by whole-genome bisulfite, whole-genome shotgun, long-read, and virus capture sequencing of 373 liver cancers. Somatic substitutions and rearrangement breakpoints are enriched in tumor-specific hypo-methylated regions with inactive chromatin marks and actively transcribed highly methylated regions in the cancer genome. Individual mutation signatures depend on chromatin status; in particular, signatures with a higher transcriptional strand bias occur within active chromatin areas. Hepatitis B virus (HBV) integration sites are frequently detected within inactive chromatin regions in cancer cells, as a consequence of negative selection for integrations in active chromatin regions. Ultra-high structural instability and preserved unmethylation of integrated HBV genomes are observed. We conclude that both precancerous and somatic epigenetic features contribute to the cancer genome architecture.


September 22, 2019  |  

The Egyptian rousette genome reveals unexpected features of bat antiviral immunity.

Bats harbor many viruses asymptomatically, including several notorious for causing extreme virulence in humans. To identify differences between antiviral mechanisms in humans and bats, we sequenced, assembled, and analyzed the genome of Rousettus aegyptiacus, a natural reservoir of Marburg virus and the only known reservoir for any filovirus. We found an expanded and diversified KLRC/KLRD family of natural killer cell receptors, MHC class I genes, and type I interferons, which dramatically differ from their functional counterparts in other mammals. Such concerted evolution of key components of bat immunity is strongly suggestive of novel modes of antiviral defense. An evaluation of the theoretical function of these genes suggests that an inhibitory immune state may exist in bats. Based on our findings, we hypothesize that tolerance of viral infection, rather than enhanced potency of antiviral defenses, may be a key mechanism by which bats asymptomatically host viruses that are pathogenic in humans. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019  |  

A genome comparison of T7-like Podoviruses that infect Caulobacter crescentus.

Bacteriophages remain an understudied component of bacterial communities. Therefore, our laboratory has initiated an effort to isolate large numbers of bacteriophages that infect Caulobacter crescentus to provide an estimate of the diversity of bacteriophages that infect this common environmental bacterium. The majority of the new isolates are phicbkviruses, a genus of giant viruses that appear to be Caulobacter specific. However, we have also isolated several Podoviruses with icosahedral heads and small tails. One of these Podoviruses, designated Lullwater, is similar to two previously isolated Caulobacter phages, Cd1 and Percy. All three have genomes that are approximately 45 kb and contain approximately 30 genes. The gene order is conserved among the three genomes, with one of the genes coding for a DNA polymerase that has homology to the family of T7 DNA polymerases. Phylogenetic trees based on either the DNA polymerase or the RNA polymerase amino acid sequences suggest that the three phages represent a new branch of the T7virus tree. Based on these similarities, we concluded that Cd1, Lullwater, and Percy comprise a new group in the T7virus genus.


September 22, 2019  |  

Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus.

We sequenced the Hyposidra talaca NPV (HytaNPV) double-stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculoviruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis obliqua, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motifs consistent with the temporal expression of the genes observed in the RNA-seq data.


September 22, 2019  |  

Diversity and evolution of the emerging Pandoraviridae family.

With DNA genomes reaching 2.5 Mb packed in particles of bacterium-like shape and dimension, the first two Acanthamoeba-infecting pandoraviruses have remained the most complex viruses known since their discovery in 2013. Our isolation of three new strains from distant locations and environments is now used to perform the first comparative genomics analysis of the emerging worldwide-distributed Pandoraviridae family. Thorough annotation of the genomes combining transcriptomic, proteomic, and bioinformatic analyses reveals many non-coding transcripts and significantly reduces the former set of predicted protein-coding genes. Here we show that the pandoraviruses exhibit an open pan-genome, the enormous size of which is not adequately explained by gene duplications or horizontal transfers. As most of the strain-specific genes have no extant homolog and exhibit statistical features comparable to intergenic regions, we suggest that de novo gene creation could contribute to the evolution of the giant pandoravirus genomes.


September 22, 2019  |  

Unexpected invasion of miniature inverted-repeat transposable elements in viral genomes

Transposable elements (TEs) are common and often present with high copy numbers in cellular genomes. Unlike in cellular organisms, TEs were previously thought to be either rare or absent in viruses. Almost all reported TEs display only one or two copies per viral genome. In addition, the discovery of pandoraviruses with genomes up to 2.5-Mb emphasizes the need for biologists to rethink the fundamental nature of the relationship between viruses and cellular life.


September 22, 2019  |  

Comprehensive evaluation of the host responses to infection with differentially virulent classical swine fever virus strains in pigs.

Classical swine fever virus (CSFV) infection causes highly variable clinical syndromes, from chronic or latent infection to acute death, and it is generally acknowledged that the course of disease is affected by both virus and host factors. To compare host immune responses to differentially virulent CSFV strains in pigs, fifteen 8-week-old specific-pathogen-free pigs were randomly divided into four groups and inoculated with the CSFV Shimen strain (a highly virulent strain), the HLJZZ2014 strain (a moderately virulent strain), the C-strain (an avirulent strain), and DMEM (mock control), respectively. Infection with the Shimen or HLJZZ2014 strain resulted in fever, clinical signs and histopathological lesions, which were not observed in the C-strain-inoculated pigs, though low viral genome copies were detected in the peripheral blood and tissue samples. The data showed that the virulence of the strains affected the duration and intensity of the disease rather than the tissue tropism of the virus. Furthermore, leukopenia, lymphocytopenia, differentiation of T-cells, and the secretion of cytokines associated with inflammation or apoptosis, such as interferon alpha (IFN-α), tumor necrosis factor alpha (TNF-α), interleukin 2 (IL-2), IL-4, IL-6, and IL-10, were induced by the virulent CSFV infection, with the differences reflected in the onset and extent of the regulation. Taken together, our results revealed that the major differences among the three strains resided in the kinetics of host response to the infection: severe and immediate with the highly virulent strain, while progressive and delayed with the moderately virulent one. This comparative study will help to dissect the pathogenesis of CSFV. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019  |  

Hepacivirus A infection in horses defines distinct envelope hypervariable regions and elucidates potential roles of viral strain and adaptive immune status in determining envelope diversity and infection outcome.

Hepacivirus A (also known as nonprimate hepacivirus and equine hepacivirus) is a hepatotropic virus that can cause both transient and persistent infections in horses. The evolution of intrahost viral populations (quasispecies) has not been studied in detail for hepacivirus A, and its roles in immune evasion and persistence are unknown. To address these knowledge gaps, we first evaluated the envelope gene (E1 and E2) diversity of two different hepacivirus A strains (WSU and CU) in longitudinal blood samples from experimentally infected adult horses, juvenile horses (foals), and foals with severe combined immunodeficiency (SCID). Persistent infection with the WSU strain was associated with significantly greater quasispecies diversity than that observed in horses who spontaneously cleared infection (P = 0.0002) or in SCID foals (P < 0.0001). In contrast, the CU strain was able to persist despite significantly lower (P < 0.0001) and relatively static envelope diversity. These findings indicate that envelope diversity is a poor predictor of hepacivirus A infection outcomes and could be dependent on strain-specific factors. Next, entropy analysis was performed on all E1/E2 genes entered into GenBank. This analysis defined three novel hypervariable regions (HVRs) in E2, at residues 391 to 402 (HVR1), 450 to 461 (HVR2), and 550 to 562 (HVR3). For the experimentally infected horses, entropy analysis focusing on the HVRs demonstrated that these regions were under increased selective pressure during persistent infection. Increased diversity in the HVRs was also temporally associated with seroconversion in some horses, suggesting that these regions may be targets of neutralizing antibody and may play a role in immune evasion.

IMPORTANCE Hepacivirus C (hepatitis C virus) is estimated to infect 150 million people worldwide and is a leading cause of cirrhosis and hepatocellular carcinoma. In contrast, its closest relative, hepacivirus A, causes relatively mild disease in horses and is frequently cleared. The relationship between quasispecies evolution and infection outcome has not been explored for hepacivirus A. To address this knowledge gap, we examined envelope gene diversity in horses with resolving and persistent infections. Interestingly, two strain-specific patterns of quasispecies diversity emerged. Persistence of the WSU strain was associated with increased quasispecies diversity and the accumulation of amino acid changes within three novel hypervariable regions following seroconversion. These findings provided evidence that envelope gene mutation is influenced by adaptive immune pressure and may contribute to hepacivirus persistence. However, the CU strain persisted despite relative evolutionary stasis, suggesting that some hepacivirus strains may use alternative mechanisms to persist in the host. Copyright © 2018 American Society for Microbiology.
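The entropy analysis mentioned in this abstract is commonly computed as per-position Shannon entropy over an alignment. The sketch below shows that general calculation only. The abstract does not specify the authors' exact procedure, so the function names and the toy sequences are assumptions for illustration, and gap handling and any windowing are left out.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues in one alignment column."""
    counts = Counter(column)
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropy(seqs):
    """Per-position entropy for equal-length aligned sequences."""
    # zip(*seqs) walks the alignment column by column.
    return [column_entropy(col) for col in zip(*seqs)]


if __name__ == "__main__":
    # Toy aligned peptides (hypothetical, not from the study).
    toy_alignment = ["MKTA", "MKSA", "MRTA", "MKGA"]
    for pos, h in enumerate(alignment_entropy(toy_alignment), start=1):
        print(pos, round(h, 3))
```

Positions where every sequence carries the same residue score zero, while runs of high-entropy positions are the kind of signal that would mark a candidate hypervariable region.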


September 22, 2019  |  

Antiviral adaptive immunity and tolerance in the mosquito Aedes aegypti

Mosquitoes spread pathogenic arboviruses while themselves tolerating infection. We here characterize an immunity pathway providing long-term antiviral protection and define how this pathway discriminates between self and non-self. Mosquitoes use viral RNAs to create virus-derived cDNAs (vDNAs) central to the antiviral response. vDNA molecules are acquired through a process of reverse transcription and recombination directed by endogenous retrotransposons. These vDNAs are thought to integrate in the host genome as endogenous viral elements (EVEs). Sequencing of pre-integrated vDNA revealed that the acquisition process exquisitely distinguishes viral from host RNA, providing one layer of self-non-self discrimination. Importantly, we show EVE-derived piRNAs have antiviral activity and are loaded onto Piwi4 to inhibit virus replication. In a second layer of self-non-self discrimination, Piwi4 preferentially loads EVE-derived piRNAs, discriminating against transposon-targeting piRNAs. Our findings define a fundamental virus-specific immunity pathway in mosquitoes that uses EVEs as a potent and specific antiviral transgenerational mechanism.

