Influenza virus-infected cells vary widely in their expression of viral genes and only occasionally activate innate immunity. Here, we develop a new method to assess how the genetic variation in viral populations contributes to this heterogeneity. We do this by determining the transcriptome and full-length sequences of all viral genes in single cells infected with a nominally “pure” stock of influenza virus. Most cells are infected by virions with defects, some of which increase the frequency of innate-immune activation. These immunostimulatory defects are diverse and include mutations that perturb the function of the viral polymerase protein PB1, large internal deletions in viral genes, and failure to express the virus’s interferon antagonist NS1. However, immune activation remains stochastic in cells infected by virions with these defects and occasionally is triggered even by virions that express unmutated copies of all genes. Our work shows that the diverse spectrum of defects in influenza virus populations contributes to-but does not completely explain-the heterogeneity in viral gene expression and immune activation in single infected cells.IMPORTANCE Because influenza virus has a high mutation rate, many cells are infected by mutated virions. But so far, it has been impossible to fully characterize the sequence of the virion infecting any given cell, since conventional techniques such as flow cytometry and single-cell transcriptome sequencing (scRNA-seq) only detect if a protein or transcript is present, not its sequence. Here we develop a new approach that uses long-read PacBio sequencing to determine the sequences of virions infecting single cells. We show that viral genetic variation explains some but not all of the cell-to-cell variability in viral gene expression and innate immune induction. Overall, our study provides the first complete picture of how viral mutations affect the course of infection in single cells.Copyright © 2019 Russell et al.
SMRT sequencing revealed the diversity and characteristics of defective interfering RNAs in influenza A (H7N9) virus infection.
Influenza defective interfering (DI) particles are replication-incompetent viruses carrying large internal deletion in the genome. The loss of essential genetic information causes abortive viral replication, which can be rescued by co-infection with a helper virus that possesses an intact genome. Despite reports of DI particles present in seasonal influenza A H1N1 infections, their existence in human infections by the avian influenza A viruses, such as H7N9, has not been studied. Here we report the ubiquitous presence of DI-RNAs in nasopharyngeal aspirates of H7N9-infected patients. Single Molecule Real Time (SMRT) sequencing was first applied and long-read sequencing analysis showed that a variety of H7N9 DI-RNA species were present in the patient samples and human bronchial epithelial cells. In several abundantly expressed DI-RNA species, long overlapping sequences have been identified around at the breakpoint region and the other side of deleted region. Influenza DI-RNA is known as a defective viral RNA with single large internal deletion. Beneficial to the long-read property of SMRT sequencing, double and triple internal deletions were identified in half of the DI-RNA species. In addition, we examined the expression of DI-RNAs in mice infected with sublethal dose of H7N9 virus at different time points. Interestingly, DI-RNAs were abundantly expressed as early as day 2 post-infection. Taken together, we reveal the diversity and characteristics of DI-RNAs found in H7N9-infected patients, cells and animals. Further investigations on this overwhelming generation of DI-RNA may provide important insights into the understanding of H7N9 viral replication and pathogenesis.
PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II’s sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
Discovery of a divergent HPIV4 from respiratory secretions using second and third generation metagenomic sequencing.
Molecular detection of viruses has been aided by high-throughput sequencing, permitting the genomic characterization of emerging strains. In this study, we comprehensively screened 500 respiratory secretions from children with upper and/or lower respiratory tract infections for viral pathogens. The viruses detected are described, including a divergent human parainfluenza virus type 4 from GS FLX pyrosequencing of 92 specimens. Complete full-genome characterization of the virus followed, using Single Molecule, Real-Time (SMRT) sequencing. Subsequent “primer walking” combined with Sanger sequencing validated the RS platform’s utility in viral sequencing from complex clinical samples. Comparative genomics reveals the divergent strain clusters with the only completely sequenced HPIV4a subtype. However, it also exhibits various structural features present in one of the HPIV4b reference strains, opening questions regarding their lifecycle and evolutionary relationships among these viruses. Clinical data from patients infected with the strain, as well as viral prevalence estimates using real-time PCR, is also described.
Viral infection perturbs host cells and can be used to uncover regulatory mechanisms controlling cellular responses and susceptibility to infections. Using cell biological, biochemical, and genetic tools, we reveal that influenza A virus (IAV) infection induces global transcriptional defects at the 3′ ends of active host genes and RNA polymerase II (RNAPII) run-through into extragenic regions. Deregulated RNAPII leads to expression of aberrant RNAs (3′ extensions and host-gene fusions) that ultimately cause global transcriptional downregulation of physiological transcripts, an effect influencing antiviral response and virulence. This phenomenon occurs with multiple strains of IAV, is dependent on influenza NS1 protein, and can be modulated by SUMOylation of an intrinsically disordered region (IDR) of NS1 expressed by the 1918 pandemic IAV strain. Our data identify a strategy used by IAV to suppress host gene expression and indicate that polymorphisms in IDRs of viral proteins can affect the outcome of an infection.
Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics.
Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio’s single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT sequencing is revolutionizing constitutional, reproductive, cancer, microbial and viral genetic testing.© The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.
Influenza A virus is characterized by high genetic diversity. However, most of what is known about influenza evolution has come from consensus sequences sampled at the epidemiological scale that only represent the dominant virus lineage within each infected host. Less is known about the extent of within-host virus diversity and what proportion of this diversity is transmitted between individuals. To characterize virus variants that achieve sustainable transmission in new hosts, we examined within-host virus genetic diversity in household donor-recipient pairs from the first wave of the 2009 H1N1 pandemic when seasonal H3N2 was co-circulating. Although the same variants were found in multiple members of the community, the relative frequencies of variants fluctuated, with patterns of genetic variation more similar within than between households. We estimated the effective population size of influenza A virus across donor-recipient pairs to be approximately 100-200 contributing members, which enabled the transmission of multiple lineages, including antigenic variants.
Complete genome sequence of Haemophilus influenzae strain 375 from the middle ear of a pediatric patient with otitis media.
Originally isolated from a pediatric patient with otitis media, Haemophilus influenzae strain 375 (Hi375) has been extensively studied as a model system for intracellular invasion of airway epithelial cells and other pathogenesis traits. Here, we report its complete genome sequence and methylome. Copyright © 2014 Mell et al.
Mammals express the sialic acids N-acetylneuraminic acid (Neu5Ac) and N-glycolylneuraminic acid (Neu5Gc) on cell surfaces, where they act as receptors for pathogens, including influenza A virus (IAV). Neu5Gc is synthesized from Neu5Ac by the enzyme cytidine monophosphate-N-acetylneuraminic acid hydroxylase (CMAH). In humans, this enzyme is inactive and only Neu5Ac is produced. Ferrets are susceptible to human-adapted IAV strains and have been the dominant animal model for IAV studies. Here we show that ferrets, like humans, do not synthesize Neu5Gc. Genomic analysis reveals an ancient, nine-exon deletion in the ferret CMAH gene that is shared by the Pinnipedia and Musteloidia members of the Carnivora. Interactions between two human strains of IAV with the sialyllactose receptor (sialic acid-a2,6Gal) confirm that the type of terminal sialic acid contributes significantly to IAV receptor specificity. Our results indicate that exclusive expression of Neu5Ac contributes to the susceptibility of ferrets to human-adapted IAV strains.
Long single-molecule reads can resolve the complexity of the influenza virus composed of rare, closely related mutant variants
As a result of a high rate of mutations and recombination events, an RNA-virus exists as a heterogeneous “swarm” of mutant variants. The long read length offered by single-molecule sequencing technologies allows each mutant variant to be sequenced in a single pass. However, high error rate limits the ability to reconstruct heterogeneous viral population composed of rare, related mutant variants. In this paper, we present 2SNV, a method able to tolerate the high error-rate of the single-molecule protocol and reconstruct mutant variants. 2SNV uses linkage between single nucleotide variations to efficiently distinguish them from read errors. To benchmark the sensitivity of 2SNV, we performed a single-molecule sequencing experiment on a sample containing a titrated level of known viral mutant variants. Our method is able to accurately reconstruct clone with frequency of 0.2 % and distinguish clones that differed in only two nucleotides distantly located on the genome. 2SNV outperforms existing methods for full-length viral mutant reconstruction. The open source implementation of 2SNV is freely available for download at http://?alan.?cs.?gsu.?edu/?NGS/???q=?content/?2snv.