Menu
April 21, 2020  |  

Long metabarcoding of the eukaryotic rDNA operon to phylogenetically and taxonomically resolve environmental diversity

High-throughput environmental DNA metabarcoding has revolutionized the analysis of microbial diversity, but this approach is generally restricted to amplicon sizes below 500 base pairs. These short regions contain limited phylogenetic signal, which makes it impractical to use environmental DNA in full phylogenetic inferences. However, new long-read sequencing technologies such as the Pacific Biosciences platform may provide sufficiently large sequence lengths to overcome the poor phylogenetic resolution of short amplicons. To test this idea, we amplified soil DNA and used PacBio Circular Consensus Sequencing (CCS) to obtain a ~4500 bp region of the eukaryotic rDNA operon spanning most of the small (18S) and large subunit (28S) ribosomal RNA genes. The CCS reads were first treated with a novel curation workflow that generated 650 high-quality OTUs containing the physically linked 18S and 28S regions of the long amplicons. In order to assign taxonomy to these OTUs, we developed a phylogeny-aware approach based on the 18S region that showed greater accuracy and sensitivity than similarity-based and phylogenetic placement-based methods using shorter reads. The taxonomically-annotated OTUs were then combined with available 18S and 28S reference sequences to infer a well-resolved phylogeny spanning all major groups of eukaryotes, allowing to accurately derive the evolutionary origin of environmental diversity. A total of 1019 sequences were included, of which a majority (58%) corresponded to the new long environmental CCS reads. Comparisons to the 18S-only region of our amplicons revealed that the combined 18S-28S genes globally increased the phylogenetic resolution, recovering specific groupings otherwise missing. The long-reads also allowed to directly investigate the relationships among environmental sequences themselves, which represents a key advantage over the placement of short reads on a reference phylogeny. Altogether, our results show that long amplicons can be treated in a full phylogenetic framework to provide greater taxonomic resolution and a robust evolutionary perspective to environmental DNA.


April 21, 2020  |  

Disruption of the kringle 1 domain of prothrombin leads to late onset mortality in zebrafish

The ability to prevent blood loss in response to injury is a critical, evolutionarily conserved function of all vertebrates. Prothrombin (F2) contributes to both primary and secondary hemostasis through the activation of platelets and the conversion of soluble fibrinogen to insoluble fibrin, respectively. Complete prothrombin deficiency has never been observed in humans and is incompatible with life in mice, limiting the ability to understand the entirety of prothrombin’s in vivo functions. We have previously demonstrated the ability of zebrafish to tolerate loss of both pro- and anticoagulant factors that are embryonic lethal in mammals, making them an ideal model for the study of prothrombin deficiency. Using genome editing with TALENs, we have generated a null allele in zebrafish f2. Homozygous mutant embryos develop normally into early adulthood, but demonstrate eventual complete mortality with the majority of fish succumbing to internal hemorrhage by 2 months of age. We show that despite the extended survival, the mutants are unable to form occlusive thrombi in both the venous and arterial systems as early as 3-5 days of life, and we were able to phenocopy this early hemostatic defect using direct oral anticoagulants. When the equivalent mutation was engineered into the homologous residues of human prothrombin, there were severe reductions in secretion and activation, suggesting a possible role for kringle 1 in thrombin maturation, and the possibility that the F1.2 fragment has a functional role in exerting the procoagulant effects of thrombin. Together, our data demonstrate the conserved function of thrombin in zebrafish, as well as the requirement for kringle 1 for biosynthesis and activation by prothrombinase. Understanding how zebrafish are able to develop normally and survive into early adulthood without prothrombin will provide important insight into its pleiotropic functions as well as the management of patients with bleeding disorders.


April 21, 2020  |  

Insights into the bacterial species and communities of a full-scale anaerobic/anoxic/oxic wastewater treatment plant by using third-generation sequencing.

For the first time, full-length 16S rRNA sequencing method was applied to disclose the bacterial species and communities of a full-scale wastewater treatment plant using an anaerobic/anoxic/oxic (A/A/O) process in Wuhan, China. The compositions of the bacteria at phylum and class levels in the activated sludge were similar to which revealed by Illumina Miseq sequencing. At genus and species levels, third-generation sequencing showed great merits and accuracy. Typical functional taxa classified to ammonia-oxidizing bacteria (AOB), nitrite-oxidizing bacteria (NOB), denitrifying bacteria (DB), anaerobic ammonium oxidation bacteria (ANAMMOXB) and polyphosphate-accumulating organisms (PAOs) were presented, which were Nitrosomonas (1.11%), Nitrospira (3.56%), Pseudomonas (3.88%), Planctomycetes (13.80%), Comamonadaceae (1.83%), respectively. Pseudomonas (3.88%) and Nitrospira (3.56%) were the most predominating two genera, mainly containing Pseudomonas extremaustralis (1.69%), Nitrospira defluvii (3.13%), respectively. Bacteria regarding to nitrogen and phosphorus removal at species level were put forward. The predicted functions proved that the A/A/O process was efficient regarding nitrogen and organics removal. Copyright © 2019 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.


April 21, 2020  |  

Towards PacBio-based pan-eukaryote metabarcoding using full-length ITS sequences.

Development of high-throughput sequencing techniques have greatly benefited our understanding about microbial ecology; yet the methods producing short reads suffer from species-level resolution and uncertainty of identification. Here we optimize PacBio-based metabarcoding protocols covering the Internal Transcribed Spacer (ITS region) and partial Small Subunit (SSU) of the rRNA gene for species-level identification of all eukaryotes, with a specific focus on Fungi (including Glomeromycota) and Stramenopila (particularly Oomycota). Based on tests on composite soil samples and mock communities, we propose best suitable degenerate primers, ITS9munngs + ITS4ngsUni for eukaryotes and selected groups therein and discuss pros and cons of long read-based identification of eukaryotes. This article is protected by copyright. All rights reserved.


April 21, 2020  |  

Mogamulizumab Treatment Elicits Autoantibodies Attacking the Skin in Patients with Adult T-Cell Leukemia-Lymphoma.

Purpose: The anti-CCR4 mAb, mogamulizumab, offers therapeutic benefit to patients with adult T-cell leukemia-lymphoma (ATL), but skin-related adverse events (AE) such as erythema multiforme occur frequently. The purpose of this study was to determine the mechanisms by which mogamulizumab causes skin-related AEs in patients with ATL.Experimental Design: We investigated whether autoantibodies were present in patients’ sera using flow cytometry to determine binding to keratinocytes and melanocytes (n = 17), and immunofluorescence analysis of tissue sections. We analyzed the IgM heavy chain repertoire in peripheral blood mononuclear cells before and after mogamulizumab or other chemotherapy by next-generation sequencing (NGS; n = 16).Results: Autoantibodies recognizing human keratinocytes or melanocytes were found in the sera of 6 of 8 patients suffering from mogamulizumab-induced erythema multiforme. In one patient, complement-dependent cytotoxicity (CDC) mediated by autoantibodies against keratinocytes or melanocytes was proportionally related to the severity of the erythema multiforme. The presence of autoantibodies in the epidermis was confirmed in all biopsy specimens of mogamulizumab-induced erythema multiforme (n = 12). Furthermore, colocalization of autoantibodies and C1q, suggesting the activation of CDC, was observed in 67% (8/12). In contrast, no autoantibody or C1q was found in ATL tumor skin lesions (n = 13). Consistent with these findings, NGS demonstrated that IgM germline genes had newly emerged and expanded, resulting in IgM repertoire skewing at the time of erythema multiforme.Conclusions: Mogamulizumab elicits autoantibodies playing an important role in skin-related AEs, possibly associated with regulatory T-cell depletion. This is the first report demonstrating the presence of skin-directed autoantibodies after mogamulizumab treatment. ©2019 American Association for Cancer Research.


April 21, 2020  |  

CRISPR/Cas9-targeted enrichment and long-read sequencing of the Fuchs endothelial corneal dystrophy-associated TCF4 triplet repeat.

To demonstrate the utility of an amplification-free long-read sequencing method to characterize the Fuchs endothelial corneal dystrophy (FECD)-associated intronic TCF4 triplet repeat (CTG18.1).We applied an amplification-free method, utilizing the CRISPR/Cas9 system, in combination with PacBio single-molecule real-time (SMRT) long-read sequencing, to study CTG18.1. FECD patient samples displaying a diverse range of CTG18.1 allele lengths and zygosity status (n?=?11) were analyzed. A robust data analysis pipeline was developed to effectively filter, align, and interrogate CTG18.1-specific reads. All results were compared with conventional polymerase chain reaction (PCR)-based fragment analysis.CRISPR-guided SMRT sequencing of CTG18.1 provided accurate genotyping information for all samples and phasing was possible for 18/22 alleles sequenced. Repeat length instability was observed for all expanded (=50 repeats) phased CTG18.1 alleles analyzed. Furthermore, higher levels of repeat instability were associated with increased CTG18.1 allele length (mode length =91 repeats) indicating that expanded alleles behave dynamically.CRISPR-guided SMRT sequencing of CTG18.1 has revealed novel insights into CTG18.1 length instability. Furthermore, this study provides a framework to improve the molecular diagnostic accuracy for CTG18.1-mediated FECD, which we anticipate will become increasingly important as gene-directed therapies are developed for this common age-related and sight threatening disease.


April 21, 2020  |  

Broadly Neutralizing Antibodies Targeting New Sites of Vulnerability in Hepatitis C Virus E1E2.

Increasing evidence indicates that broadly neutralizing antibodies (bNAbs) play an important role in immune-mediated control of hepatitis C virus (HCV) infection, but the relative contribution of neutralizing antibodies targeting antigenic sites across the HCV envelope (E1 and E2) proteins is unclear. Here, we isolated thirteen E1E2-specific monoclonal antibodies (MAbs) from B cells of a single HCV-infected individual who cleared one genotype 1a infection and then became persistently infected with a second genotype 1a strain. These MAbs bound six distinct discontinuous antigenic sites on the E1 protein, the E2 protein, or the E1E2 heterodimer. Three antigenic sites, designated AS108, AS112 (an N-terminal E1 site), and AS146, were distinct from previously described antigenic regions (ARs) 1 to 5 and E1 sites. Antibodies targeting four sites (AR3, AR4-5, AS108, and AS146) were broadly neutralizing. These MAbs also displayed distinct patterns of relative neutralizing potency (i.e., neutralization profiles) across a panel of diverse HCV strains, which led to complementary neutralizing breadth when they were tested in combination. Overall, this study demonstrates that HCV bNAb epitopes are not restricted to previously described antigenic sites, expanding the number of sites that could be targeted for vaccine development.IMPORTANCE Worldwide, more than 70 million people are infected with hepatitis C virus (HCV), which is a leading cause of hepatocellular carcinoma and liver transplantation. Despite the development of potent direct acting antivirals (DAAs) for HCV treatment, a vaccine is urgently needed due to the high cost of treatment and the possibility of reinfection after cure. Induction of multiple broadly neutralizing antibodies (bNAbs) that target distinct epitopes on the HCV envelope proteins is one approach to vaccine development. However, antigenic sites targeted by bNAbs in individuals with spontaneous control of HCV have not been fully defined. In this study, we characterize 13 monoclonal antibodies (MAbs) from a single person who cleared an HCV infection without treatment, and we identify 3 new sites targeted by neutralizing antibodies. The sites targeted by these MAbs could inform HCV vaccine development. Copyright © 2019 American Society for Microbiology.


April 21, 2020  |  

Single-Cell Virus Sequencing of Influenza Infections That Trigger Innate Immunity.

Influenza virus-infected cells vary widely in their expression of viral genes and only occasionally activate innate immunity. Here, we develop a new method to assess how the genetic variation in viral populations contributes to this heterogeneity. We do this by determining the transcriptome and full-length sequences of all viral genes in single cells infected with a nominally “pure” stock of influenza virus. Most cells are infected by virions with defects, some of which increase the frequency of innate-immune activation. These immunostimulatory defects are diverse and include mutations that perturb the function of the viral polymerase protein PB1, large internal deletions in viral genes, and failure to express the virus’s interferon antagonist NS1. However, immune activation remains stochastic in cells infected by virions with these defects and occasionally is triggered even by virions that express unmutated copies of all genes. Our work shows that the diverse spectrum of defects in influenza virus populations contributes to-but does not completely explain-the heterogeneity in viral gene expression and immune activation in single infected cells.IMPORTANCE Because influenza virus has a high mutation rate, many cells are infected by mutated virions. But so far, it has been impossible to fully characterize the sequence of the virion infecting any given cell, since conventional techniques such as flow cytometry and single-cell transcriptome sequencing (scRNA-seq) only detect if a protein or transcript is present, not its sequence. Here we develop a new approach that uses long-read PacBio sequencing to determine the sequences of virions infecting single cells. We show that viral genetic variation explains some but not all of the cell-to-cell variability in viral gene expression and innate immune induction. Overall, our study provides the first complete picture of how viral mutations affect the course of infection in single cells.Copyright © 2019 Russell et al.


April 21, 2020  |  

A Highly Unusual V1 Region of Env in an Elite Controller of HIV Infection.

HIV elite controllers represent a remarkable minority of patients who maintain normal CD4+ T-cell counts and low or undetectable viral loads for decades in the absence of antiretroviral therapy. To examine the possible contribution of virus attenuation to elite control, we obtained a primary HIV-1 isolate from an elite controller who had been infected for 19?years, the last 10 of which were in the absence of antiretroviral therapy. Full-length sequencing of this isolate revealed a highly unusual V1 domain in Envelope (Env). The V1 domain in this HIV-1 strain was 49 amino acids, placing it in the top 1% of lengths among the 6,112 Env sequences in the Los Alamos National Laboratory online database. Furthermore, it included two additional N-glycosylation sites and a pair of cysteines suggestive of an extra disulfide loop. Virus with this Env retained good infectivity and replicative capacity; however, analysis of recombinant viruses suggested that other sequences in Env were adapted to accommodate the unusual V1 domain. While the long V1 domain did not confer resistance to neutralization by monoclonal antibodies of the V1/V2-glycan-dependent class, it did confer resistance to neutralization by monoclonal antibodies of the V3-glycan-dependent class. Our findings support results in the literature that suggest a role for long V1 regions in shielding HIV-1 from recognition by V3-directed broadly neutralizing antibodies. In the case of the elite controller described here, it seems likely that selective pressures from the humoral immune system were responsible for driving the highly unusual polymorphisms present in this HIV-1 Envelope.IMPORTANCE Elite controllers have long provided an avenue for researchers to reveal mechanisms underlying control of HIV-1. While the role of host genetic factors in facilitating elite control is well known, the possibility of infection by attenuated strains of HIV-1 has been much less studied. Here we describe an unusual viral feature found in an elite controller of HIV-1 infection and demonstrate its role in conferring escape from monoclonal antibodies of the V3-glycan class. Our results suggest that extreme variation may be needed by HIV-1 to escape neutralization by some antibody specificities. Copyright © 2019 Silver et al.


April 21, 2020  |  

Full-Length Multi-Barcoding: DNA Barcoding from Single Ingredient to Complex Mixtures.

DNA barcoding has been used for decades, although it has mostly been applied to somesingle-species. Traditional Chinese medicine (TCM), which is mainly used in the form ofcombination-one type of the multi-species, identification is crucial for clinical usage.Next-generation Sequencing (NGS) has been used to address this authentication issue for the pastfew years, but conventional NGS technology is hampered in application due to its short sequencingreads and systematic errors. Here, a novel method, Full-length multi-barcoding (FLMB) vialong-read sequencing, is employed for the identification of biological compositions in herbalcompound formulas in adequate and well controlled studies. By directly sequencing the full-lengthamplicons of ITS2 and psbA-trnH through single-molecule real-time (SMRT) technology, thebiological composition of a classical prescription Sheng-Mai-San (SMS) was analyzed. At the sametime, clone-dependent Sanger sequencing was carried out as a parallel control. Further, anotherformula-Sanwei-Jili-San (SJS)-was analyzed with genes of ITS2 and CO1. All the ingredients inthe samples of SMS and SJS were successfully authenticated at the species level, and 11 exogenousspecies were also checked, some of which were considered as common contaminations in theseproducts. Methodology analysis demonstrated that this method was sensitive, accurate andreliable. FLMB, a superior but feasible approach for the identification of biological complexmixture, was established and elucidated, which shows perfect interpretation for DNA barcodingthat could lead its application in multi-species mixtures.


April 21, 2020  |  

Genome sequence of the corn leaf aphid (Rhopalosiphum maidis Fitch).

The corn leaf aphid (Rhopalosiphum maidis Fitch) is the most economically damaging aphid pest on maize (Zea mays), one of the world’s most important grain crops. In addition to causing direct damage by removing photoassimilates, R. maidis transmits several destructive maize viruses, including maize yellow dwarf virus, barley yellow dwarf virus, sugarcane mosaic virus, and cucumber mosaic virus.The genome of a parthenogenetically reproducing R. maidis clone was assembled with a combination of Pacific Biosciences (207-fold coverage) and Illumina (83-fold coverage) sequencing. The 689 assembled contigs, which have an N50 size of 9.0 megabases (Mb) and a low level of heterozygosity, were clustered using Phase Genomics Hi-C interaction maps. Consistent with the commonly observed 2n = 8 karyotype of R. maidis, most of the contigs (473 spanning 321 Mb) were successfully oriented into 4 scaffolds. The genome assembly captured the full length of 95.8% of the core eukaryotic genes, indicating that it is highly complete. Repetitive sequences accounted for 21.2% of the assembly, and a total of 17,629 protein-coding genes were predicted with integrated evidence from ab initio and homology-based gene predictions and transcriptome sequences generated with both Pacific Biosciences and Illumina. An analysis of likely horizontally transferred genes identified 2 from bacteria, 7 from fungi, 2 from protozoa, and 9 from algae. Repeat elements, transposons, and genes encoding likely detoxification enzymes (cytochrome P450s, glutathione S-transferases, carboxylesterases, uridine diphosphate-glucosyltransferases, and ABC transporters) were identified in the genome sequence. Other than Buchnera aphidicola (642,929 base pairs, 602 genes), no endosymbiont bacteria were found in R. maidis.A high-quality R. maidis genome was assembled at the chromosome level. This genome sequence will enable further research related to ecological interactions, virus transmission, pesticide resistance, and other aspects of R. maidis biology. It also serves as a valuable resource for comparative investigation of other aphid species. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020  |  

A personalized platform identifies trametinib plus zoledronate for a patient with KRAS-mutant metastatic colorectal cancer.

Colorectal cancer remains a leading source of cancer mortality worldwide. Initial response is often followed by emergent resistance that is poorly responsive to targeted therapies, reflecting currently undruggable cancer drivers such as KRAS and overall genomic complexity. Here, we report a novel approach to developing a personalized therapy for a patient with treatment-resistant metastatic KRAS-mutant colorectal cancer. An extensive genomic analysis of the tumor’s genomic landscape identified nine key drivers. A transgenic model that altered orthologs of these nine genes in the Drosophila hindgut was developed; a robotics-based screen using this platform identified trametinib plus zoledronate as a candidate treatment combination. Treating the patient led to a significant response: Target and nontarget lesions displayed a strong partial response and remained stable for 11 months. By addressing a disease’s genomic complexity, this personalized approach may provide an alternative treatment option for recalcitrant disease such as KRAS-mutant colorectal cancer.


April 21, 2020  |  

A new full-length circular DNA sequencing method for viral-sized genomes reveals that RNAi transgenic plants provoke a shift in geminivirus populations in the field.

We present a new method, CIDER-Seq (Circular DNA Enrichment sequencing) for the unbiased enrichment and long-read sequencing of viral-sized circular DNA molecules. We used CIDER-Seq to produce single-read full-length virus genomes for the first time. CIDER-Seq combines PCR-free virus enrichment with Single Molecule Real Time sequencing and a new sequence de-concatenation algorithm. We apply our technique to produce >1200 full-length, highly accurate geminivirus genomes from RNAi-transgenic and control plants in a field trial in Kenya. Using CIDER-Seq we can demonstrate for the first time that the expression of antiviral double-stranded RNA (dsRNA) in transgenic plants causes a consistent shift in virus populations towards species sharing low homology to the transgene derived dsRNA. Our method and its application in an economically important crop plant opens new possibilities in periodic virus sequence surveillance and accurate profiling of diverse circular DNA elements.


April 21, 2020  |  

Full-length 16S rRNA gene classification of Atlantic salmon bacteria and effects of using different 16S variable regions on community structure analysis.

Understanding fish-microbial relationships may be of great value for fish producers as fish growth, development and welfare are influenced by the microbial community associated with the rearing systems and fish surfaces. Accurate methods to generate and analyze these microbial communities would be an important tool to help improve understanding of microbial effects in the industry. In this study, we performed taxonomic classification and determination of operational taxonomic units on Atlantic salmon microbiota by taking advantage of full-length 16S rRNA gene sequences. Skin mucus was dominated by the genera Flavobacterium and Psychrobacter. Intestinal samples were dominated by the genera Carnobacterium, Aeromonas, Mycoplasma and by sequences assigned to the order Clostridiales. Applying Sanger sequencing on the full-length bacterial 16S rRNA gene from the pool of 46 isolates obtained in this study showed a clear assignment of the PacBio full-length bacterial 16S rRNA gene sequences down to the genus level. One of the bottlenecks in comparing microbial profiles is that different studies use different 16S rRNA gene regions. Comparisons of sequence assignments between full-length and in silico derived variable 16S rRNA gene regions showed different microbial profiles with variable effects between phylogenetic groups and taxonomic ranks. © 2019 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.


April 21, 2020  |  

Long-read amplicon denoising.

Long-read next-generation amplicon sequencing shows promise for studying complete genes or genomes from complex and diverse populations. Current long-read sequencing technologies have challenging error profiles, hindering data processing and incorporation into downstream analyses. Here we consider the problem of how to reconstruct, free of sequencing error, the true sequence variants and their associated frequencies from PacBio reads. Called ‘amplicon denoising’, this problem has been extensively studied for short-read sequencing technologies, but current solutions do not always successfully generalize to long reads with high indel error rates. We introduce two methods: one that runs nearly instantly and is very accurate for medium length reads and high template coverage, and another, slower method that is more robust when reads are very long or coverage is lower. On two Mock Virus Community datasets with ground truth, each sequenced on a different PacBio instrument, and on a number of simulated datasets, we compare our two approaches to each other and to existing algorithms. We outperform all tested methods in accuracy, with competitive run times even for our slower method, successfully discriminating templates that differ by a just single nucleotide. Julia implementations of Fast Amplicon Denoising (FAD) and Robust Amplicon Denoising (RAD), and a webserver interface, are freely available. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.