Menu
July 19, 2019

Characterization of hepatitis C virus (HCV) envelope diversification from acute to chronic infection within a sexually transmitted HCV cluster by using single-molecule, real-time sequencing.

In contrast to other available next-generation sequencing platforms, PacBio single-molecule, real-time (SMRT) sequencing has the advantage of generating long reads albeit with a relatively higher error rate in unprocessed data. Using this platform, we longitudinally sampled and sequenced the hepatitis C virus (HCV) envelope genome region (1,680 nucleotides [nt]) from individuals belonging to a cluster of sexually transmitted cases. All five subjects were coinfected with HIV-1 and a closely related strain of HCV genotype 4d. In total, 50 samples were analyzed by using SMRT sequencing. By using 7 passes of circular consensus sequencing, the error rate was reduced to 0.37%, and the median number of sequences was 612 per sample. A further reduction of insertions was achieved by alignment against a sample-specific reference sequence. However, in vitro recombination during PCR amplification could not be excluded. Phylogenetic analysis supported close relationships among HCV sequences from the four male subjects and subsequent transmission from one subject to his female partner. Transmission was characterized by a strong genetic bottleneck. Viral genetic diversity was low during acute infection and increased upon progression to chronicity but subsequently fluctuated during chronic infection, caused by the alternate detection of distinct coexisting lineages. SMRT sequencing combines long reads with sufficient depth for many phylogenetic analyses and can therefore provide insights into within-host HCV evolutionary dynamics without the need for haplotype reconstruction using statistical algorithms.IMPORTANCE Next-generation sequencing has revolutionized the study of genetically variable RNA virus populations, but for phylogenetic and evolutionary analyses, longer sequences than those generated by most available platforms, while minimizing the intrinsic error rate, are desired. Here, we demonstrate for the first time that PacBio SMRT sequencing technology can be used to generate full-length HCV envelope sequences at the single-molecule level, providing a data set with large sequencing depth for the characterization of intrahost viral dynamics. The selection of consensus reads derived from at least 7 full circular consensus sequencing rounds significantly reduced the intrinsic high error rate of this method. We used this method to genetically characterize a unique transmission cluster of sexually transmitted HCV infections, providing insight into the distinct evolutionary pathways in each patient over time and identifying the transmission-associated genetic bottleneck as well as fluctuations in viral genetic diversity over time, accompanied by dynamic shifts in viral subpopulations. Copyright © 2017 American Society for Microbiology.


July 19, 2019

Selective graft-versus-leukemia depends on magnitude and diversity of the alloreactive T cell response.

Patients with leukemia who receive a T cell-depleted allogeneic stem cell graft followed by postponed donor lymphocyte infusion (DLI) can experience graft-versus-leukemia (GVL) reactivity, with a lower risk of graft-versus-host disease (GVHD). Here, we have investigated the magnitude, diversity, and specificity of alloreactive CD8 T cells in patients who developed GVL reactivity after DLI in the absence or presence of GVHD. We observed a lower magnitude and diversity of CD8 T cells for minor histocompatibility antigens (MiHAs) in patients with selective GVL reactivity without GVHD. Furthermore, we demonstrated that MiHA-specific T cell clones from patients with selective GVL reactivity showed lower reactivity against nonhematopoietic cells, even when pretreated with inflammatory cytokines. Expression analysis of MiHA-encoding genes showed that similar types of antigens were recognized in both patient groups, but in patients who developed GVHD, T cell reactivity was skewed to target broadly expressed MiHAs. As an inflammatory environment can render nonhematopoietic cells susceptible to T cell recognition, prevention of such circumstances favors induction of selective GVL reactivity without development of GVHD.


July 19, 2019

SMRT genome assembly corrects reference errors, resolving the genetic basis of virulence in Mycobacterium tuberculosis.

The genetic basis of virulence in Mycobacterium tuberculosis has been investigated through genome comparisons of virulent (H37Rv) and attenuated (H37Ra) sister strains. Such analysis, however, relies heavily on the accuracy of the sequences. While the H37Rv reference genome has had several corrections to date, that of H37Ra is unmodified since its original publication.Here, we report the assembly and finishing of the H37Ra genome from single-molecule, real-time (SMRT) sequencing. Our assembly reveals that the number of H37Ra-specific variants is less than half of what the Sanger-based H37Ra reference sequence indicates, undermining and, in some cases, invalidating the conclusions of several studies. PE_PPE family genes, which are intractable to commonly-used sequencing platforms because of their repetitive and GC-rich nature, are overrepresented in the set of genes in which all reported H37Ra-specific variants are contradicted. Further, one of the sequencing errors in H37Ra masks a true variant in common with the clinical strain CDC1551 which, when considered in the context of previous work, corresponds to a sequencing error in the H37Rv reference genome.Our results constrain the set of genomic differences possibly affecting virulence by more than half, which focuses laboratory investigation on pertinent targets and demonstrates the power of SMRT sequencing for producing high-quality reference genomes.


July 19, 2019

IG and TR single chain fragment variable (scFv) sequence analysis: a new advanced functionality of IMGT/V-QUEST and IMGT/HighV-QUEST.

IMGT®, the international ImMunoGeneTics information system® ( http://www.imgt.org ), was created in 1989 in Montpellier, France (CNRS and Montpellier University) to manage the huge and complex diversity of the antigen receptors, and is at the origin of immunoinformatics, a science at the interface between immunogenetics and bioinformatics. Immunoglobulins (IG) or antibodies and T cell receptors (TR) are managed and described in the IMGT® databases and tools at the level of receptor, chain and domain. The analysis of the IG and TR variable (V) domain rearranged nucleotide sequences is performed by IMGT/V-QUEST (online since 1997, 50 sequences per batch) and, for next generation sequencing (NGS), by IMGT/HighV-QUEST, the high throughput version of IMGT/V-QUEST (portal begun in 2010, 500,000 sequences per batch). In vitro combinatorial libraries of engineered antibody single chain Fragment variable (scFv) which mimic the in vivo natural diversity of the immune adaptive responses are extensively screened for the discovery of novel antigen binding specificities. However the analysis of NGS full length scFv (~850 bp) represents a challenge as they contain two V domains connected by a linker and there is no tool for the analysis of two V domains in a single chain.The functionality “Analyis of single chain Fragment variable (scFv)” has been implemented in IMGT/V-QUEST and, for NGS, in IMGT/HighV-QUEST for the analysis of the two V domains of IG and TR scFv. It proceeds in five steps: search for a first closest V-REGION, full characterization of the first V-(D)-J-REGION, then search for a second V-REGION and full characterization of the second V-(D)-J-REGION, and finally linker delimitation.For each sequence or NGS read, positions of the 5’V-DOMAIN, linker and 3’V-DOMAIN in the scFv are provided in the ‘V-orientated’ sense. Each V-DOMAIN is fully characterized (gene identification, sequence description, junction analysis, characterization of mutations and amino changes). The functionality is generic and can analyse any IG or TR single chain nucleotide sequence containing two V domains, provided that the corresponding species IMGT reference directory is available.The “Analysis of single chain Fragment variable (scFv)” implemented in IMGT/V-QUEST and, for NGS, in IMGT/HighV-QUEST provides the identification and full characterization of the two V domains of full-length scFv (~850 bp) nucleotide sequences from combinatorial libraries. The analysis can also be performed on concatenated paired chains of expressed antigen receptor IG or TR repertoires.


July 19, 2019

Diversity of the TLR4 immunity receptor in Czech native cattle breeds revealed using the Pacific Biosciences sequencing platform.

The allelic variants of immunity genes in historical breeds likely reflect local infection pressure and therefore represent a reservoir for breeding. Screening to determine the diversity of the Toll-like receptor gene TLR4 was conducted in two conserved cattle breeds: Czech Red and Czech Red Pied. High-throughput sequencing of pooled PCR amplicons using the PacBio platform revealed polymorphisms, which were subsequently confirmed via genotyping techniques. Eight SNPs found in coding and adjacent regions were grouped into 18 haplotypes, representing a significant portion of the known diversity in the global breed panel and presumably exceeding diversity in production populations. Notably, the ancient Czech Red breed appeared to possess greater haplotype diversity than the Czech Red Pied breed, a Simmental variant, although the haplotype frequencies might have been distorted by significant crossbreeding and bottlenecks in the history of Czech Red cattle. The differences in haplotype frequencies validated the phenotypic distinctness of the local breeds. Due to the availability of Czech Red Pied production herds, the effect of intensive breeding on TLR diversity can be evaluated in this model. The advantages of the Pacific Biosciences technology for the resequencing of long PCR fragments with subsequent direct phasing were independently validated.


July 19, 2019

Defective HIV-1 proviruses are expressed and can be recognized by cytotoxic T lymphocytes, which shape the proviral landscape.

Despite antiretroviral therapy, HIV-1 persists in memory CD4(+) T cells, creating a barrier to cure. The majority of HIV-1 proviruses are defective and considered clinically irrelevant. Using cells from HIV-1-infected individuals and reconstructed patient-derived defective proviruses, we show that defective proviruses can be transcribed into RNAs that are spliced and translated. Proviruses with defective major splice donors (MSDs) can activate novel splice sites to produce HIV-1 transcripts, and cells with these proviruses can be recognized by HIV-1-specific cytotoxic T lymphocytes (CTLs). Further, cells with proviruses containing lethal mutations upstream of CTL epitopes can also be recognized by CTLs, potentially through aberrant translation. Thus, CTLs may change the landscape of HIV-1 proviruses by preferentially targeting cells with specific types of defective proviruses. Additionally, the expression of defective proviruses will need to be considered in the measurement of HIV-1 latency reversal. Copyright © 2017 Elsevier Inc. All rights reserved.


July 19, 2019

Increased risk of low birth weight in women with placental malaria associated with P. falciparum VAR2CSA clade.

Pregnancy associated malaria (PAM) causes adverse pregnancy and birth outcomes owing to Plasmodium falciparum accumulation in the placenta. Placental accumulation is mediated by P. falciparum protein VAR2CSA, a leading PAM-specific vaccine target. The extent of its antigen diversity and impact on clinical outcomes remain poorly understood. Through amplicon deep-sequencing placental malaria samples from women in Malawi and Benin, we assessed sequence diversity of VAR2CSA’s ID1-DBL2x region, containing putative vaccine targets and estimated associations of specific clades with adverse birth outcomes. Overall, var2csa diversity was high and haplotypes subdivided into five clades, the largest two defined by homology to parasites strains, 3D7 or FCR3. Across both cohorts, compared to women infected with only FCR3-like variants, women infected with only 3D7-like variants delivered infants with lower birthweight (difference: -267.99?g; 95% Confidence Interval [CI]: -466.43?g,-69.55?g) and higher odds of low birthweight (<2500?g) (Odds Ratio [OR] 5.41; 95% CI:0.99,29.52) and small-for-gestational-age (OR: 3.65; 95% CI: 1.01,13.38). In two distinct malaria-endemic African settings, parasites harboring 3D7-like variants of VAR2CSA were associated with worse birth outcomes, supporting differential effects of infection with specific parasite strains. The immense diversity coupled with differential clinical effects of this diversity suggest that an effective VAR2CSA-based vaccine may require multivalent activity.


July 19, 2019

Characterisation of MHC class I genes in the koala.

Koala (Phascolarctos cinereus) populations are on the decline across the majority of Australia’s mainland. Two major diseases threatening the long-term survival of affected koala populations are caused by obligate intracellular pathogens: Chlamydia and koala retrovirus (KoRV). To improve our understanding of the koala immune system, we characterised their major histocompatibility complex (MHC) class I genes, which are centrally involved in presenting foreign peptides derived from intracellular pathogens to cytotoxic T cells. A total of 11 class I genes were identified in the koala genome. Three genes, Phci-UA, UB and UC, showed relatively high genetic variability and were expressed in all 12 examined tissues, whereas the other eight genes had tissue-specific expression and limited polymorphism. Evidence of diversifying selection was detected in Phci-UA and UC, while gene conversion may have played a role in creating new alleles at Phci-UB. We propose that Phci-UA, UB and UC are likely classical MHC genes of koalas, and further research is needed to understand their role in koala chlamydial and KoRV infections.


July 19, 2019

Genome and methylome variation in Helicobacter pylori with a cag pathogenicity island during early stages of human infection.

Helicobacter pylori is remarkable for its genetic variation. Yet little isknown about its genetic changes during early stages of human infection, as the bacteria adapt to their new environment. We analyzed genome and methylome variations in a fully virulent strain of H pylori strain during experimental infection.We performed a randomized Phase 1 and 2, observer-blind, placebo-controlled, study of 12 healthy, H pylori-negative adults in Germany from October 2008 through March 2010. The volunteers were given a prophylactic vaccine candidate (n=7) or placebo (n=5) and then challenged with H pylori strain BCM-300. Biopsy samples were collected and H pylori were isolated. Genomes of the challenge strain and 12 re-isolates, obtained 12 weeks after (or in 1 case, 62 weeks after) infection were sequenced by single-molecule, real-time technology, which, in parallel, permitted determination of genome-wide methylation patterns for all strains. Functional effects of genetic changes observed in H pylori strains during human infection were assessed by measuring release of interleukin 8 from AGS cells (to detect cag PAI function), neutral red uptake (to detect vacuolating cytotoxin activity), and adhesion assays.The observed mutation rate was in agreement with rates previously determined from patients with chronic H pylori infections, without evidence of a mutation burst. A loss; of cag PAI function was observed in 3 re-isolates. In addition, 3 re-isolates from the vaccine; group acquired mutations in the vacuolating cytotoxin gene vacA, resulting in loss of; vacuolization activity from gastric epithelial cells. We observed inter-strain variation in; methylomes due to phase variation in genes encoding methyltransferases.We analyzed adaptation of a fully virulent strain of H pylori to 12 differentvolunteers to obtain a robust estimate of the frequency of genetic and epigenetic changes inthe absence of inter-strain recombination. Our findings indicate that the large amount of; genetic variation in H pylori poses a challenge to vaccine development. ClinicalTrials.gov no: NCT00736476. Copyright © 2017 AGA Institute. Published by Elsevier Inc. All rights reserved.


July 19, 2019

Pacific Biosciences sequencing and IMGT/HighV-QUEST analysis of full-length single chain fragment variable from an in vivo selected phage-display combinatorial Library.

Phage-display selection of immunoglobulin (IG) or antibody single chain Fragment variable (scFv) from combinatorial libraries is widely used for identifying new antibodies for novel targets. Next-generation sequencing (NGS) has recently emerged as a new method for the high throughput characterization of IG and T cell receptor (TR) immune repertoires bothin vivoandin vitro. However, challenges remain for the NGS sequencing of scFv from combinatorial libraries owing to the scFv length (>800?bp) and the presence of two variable domains [variable heavy (VH) and variable light (VL) for IG] associated by a peptide linker in a single chain. Here, we show that single-molecule real-time (SMRT) sequencing with the Pacific Biosciences RS II platform allows for the generation of full-length scFv reads obtained from anin vivoselection of scFv-phages in an animal model of atherosclerosis. We first amplified the DNA of the phagemid inserts from scFv-phages eluted from an aortic section at the third round of thein vivoselection. From this amplified DNA, 450,558 reads were obtained from 15 SMRT cells. Highly accurate circular consensus sequences from these reads were generated, filtered by quality and then analyzed by IMGT/HighV-QUEST with the functionality for scFv. Full-length scFv were identified and characterized in 348,659 reads. Full-length scFv sequencing is an absolute requirement for analyzing the associated VH and VL domains enriched during thein vivopanning rounds. In order to further validate the ability of SMRT sequencing to provide high quality, full-length scFv sequences, we tracked the reads of an scFv-phage clone P3 previously identified by biological assays and Sanger sequencing. Sixty P3 reads showed 100% identity with the full-length scFv of 767?bp, 53 of them covering the whole insert of 977?bp, which encompassed the primer sequences. The remaining seven reads were identical over a shortened length of 939?bp that excludes the vicinity of primers at both ends. Interestingly these reads were obtained from each of the 15 SMRT cells. Thus, the SMRT sequencing method and the IMGT/HighV-QUEST functionality for scFv provides a straightforward protocol for characterization of full-length scFv from combinatorial phage libraries.


July 19, 2019

Ultradeep single-molecule real-time sequencing of HIV envelope reveals complete compartmentalization of highly macrophage-tropic R5 proviral variants in brain and CXCR4-using variants in immune and peripheral tissues.

Despite combined antiretroviral therapy (cART), HIV+ patients still develop neurological disorders, which may be due to persistent HIV infection and selective evolution in brain tissues. Single-molecule real-time (SMRT) sequencing technology offers an improved opportunity to study the relationship among HIV isolates in the brain and lymphoid tissues because it is capable of generating thousands of long sequence reads in a single run. Here, we used SMRT sequencing to generate ~?50,000 high-quality full-length HIV envelope sequences (>?2200 bp) from seven autopsy tissues from an HIV+/cART+ subject, including three brain and four non-brain sites. Sanger sequencing was used for comparison with SMRT data and to clone functional pseudoviruses for in vitro tropism assays. Phylogenetic analysis demonstrated that brain-derived HIV was compartmentalized from HIV outside the brain and that the variants from each of the three brain tissues grouped independently. Variants from all peripheral tissues were intermixed on the tree but independent of the brain clades. Due to the large number of sequences, a clustering analysis at three similarity thresholds (99, 99.5, and 99.9%) was also performed. All brain sequences clustered exclusive of any non-brain sequences at all thresholds; however, frontal lobe sequences clustered independently of occipital and parietal lobes. Translated sequences revealed potentially functional differences between brain and non-brain sequences in the location of putative N-linked glycosylation sites (N-sites), V1 length, V3 charge, and the number of V4 N-sites. All brain sequences were predicted to use the CCR5 co-receptor, while most non-brain sequences were predicted to use CXCR4 co-receptor. Tropism results were confirmed by in vitro infection assays. The study is the first to use a SMRT sequencing approach to study HIV compartmentalization in tissues and supports other reports of limited trafficking between brain and non-brain sequences during cART. Due to the long sequence length, we could observe changes along the entire envelope gene, likely caused by differential selective pressure in the brain that may contribute to neurological disease.


July 19, 2019

Utility of DNA, RNA, protein, and functional approaches to solve cryptic immunodeficiencies.

We report a female infant identified by newborn screening for severe combined immunodeficiencies (NBS SCID) with T cell lymphopenia (TCL). The patient had persistently elevated alpha-fetoprotein (AFP) with IgA deficiency, and elevated IgM. Gene sequencing for a SCID panel was uninformative. We sought to determine the cause of the immunodeficiency in this infant.We performed whole-exome sequencing (WES) on the patient and parents to identify a genetic diagnosis. Based on the WES result, we developed a novel flow cytometric panel for rapid assessment of DNA repair defects using blood samples. We also performed whole transcriptome sequencing (WTS) on fibroblast RNA from the patient and father for abnormal transcript analysis.WES revealed a pathogenic paternally inherited indel in ATM. We used the flow panel to assess several proteins in the DNA repair pathway in lymphocyte subsets. The patient had absent phosphorylation of ATM, resulting in absent or aberrant phosphorylation of downstream proteins, including ?H2AX. However, ataxia-telangiectasia (AT) is an autosomal recessive condition, and the abnormal functional data did not correspond with a single ATM variant. WTS revealed in-frame reciprocal fusion transcripts involving ATM and SLC35F2 indicating a chromosome 11 inversion within 11q22.3, of maternal origin. Inversion breakpoints were identified within ATM intron 16 and SLC35F2 intron 7.We identified a novel ATM-breaking chromosome 11 inversion in trans with a pathogenic indel (compound heterozygote) resulting in non-functional ATM protein, consistent with a diagnosis of AT. Utilization of several molecular and functional assays allowed successful resolution of this case.


July 19, 2019

The Florida manatee (Trichechus manatus latirostris) T cell receptor loci exhibit V subgroup synteny and chain-specific evolution.

The Florida manatee (Trichechus manatus latirostris) has limited diversity in the immunoglobulin heavy chain. We therefore investigated the antigen receptor loci of the other arm of the adaptive immune system: the T cell receptor. Manatees are the first species from Afrotheria, a basal eutherian superorder, to have an in-depth characterization of all T cell receptor loci. By annotating the genome and expressed transcripts, we found that each chain has distinct features that correlates to their individual functions. The genomic organization also plays a role in modulating sequence conservation between species. There were extensive V subgroup synteny blocks in the TRA and TRB loci between T. m. latirostris and human. Increased genomic locus complexity correlated to increased locus synteny. We also identified evidence for a VHD pseudogene for the first time in a eutherian mammal. These findings emphasize the value of including species within this basal eutherian radiation in comparative studies. Copyright © 2018. Published by Elsevier Ltd.


July 19, 2019

High-Throughput Single-Cell Sequencing of both TCR-ß Alleles.

Allelic exclusion is a vital mechanism for the generation of monospecificity to foreign Ags in B and T lymphocytes. In this study, we developed a high-throughput barcoded method to simultaneously analyze the VDJ recombination status of both mouse TCR-ß alleles in hundreds of single cells using next-generation sequencing. Copyright © 2018 by The American Association of Immunologists, Inc.


July 19, 2019

Adaptation and conservation insights from the koala genome.

The koala, the only extant species of the marsupial family Phascolarctidae, is classified as ‘vulnerable’ due to habitat loss and widespread disease. We sequenced the koala genome, producing a complete and contiguous marsupial reference genome, including centromeres. We reveal that the koala’s ability to detoxify eucalypt foliage may be due to expansions within a cytochrome P450 gene family, and its ability to smell, taste and moderate ingestion of plant secondary metabolites may be due to expansions in the vomeronasal and taste receptors. We characterized novel lactation proteins that protect young in the pouch and annotated immune genes important for response to chlamydial disease. Historical demography showed a substantial population crash coincident with the decline of Australian megafauna, while contemporary populations had biogeographic boundaries and increased inbreeding in populations affected by historic translocations. We identified genetically diverse populations that require habitat corridors and instituting of translocation programs to aid the koala’s survival in the wild.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.