Menu
July 19, 2019

Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships.

Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33-35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution.


July 19, 2019

Fc? receptors: genetic variation, function, and disease.

Fc? receptors (Fc?Rs) are key immune receptors responsible for the effective control of both humoral and innate immunity and are central to maintaining the balance between generating appropriate responses to infection and preventing autoimmunity. When this balance is lost, pathology results in increased susceptibility to cancer, autoimmunity, and infection. In contrast, optimal Fc?R engagement facilitates effective disease resolution and response to monoclonal antibody immunotherapy. The underlying genetics of the Fc?R gene family are a central component of this careful balance. Complex in humans and generated through ancestral duplication events, here we review the evolution of the gene family in mammals, the potential importance of copy number, and functionally relevant single nucleotide polymorphisms, as well as discussing current approaches and limitations when exploring genetic variation in this region. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.


July 19, 2019

Long-read Single-Molecule Real-Time (SMRT) full gene sequencing of cytochrome P450-2D6 (CYP2D6).

The CYP2D6 enzyme metabolizes ~25% of common medications, yet homologous pseudogenes and copy-number variants (CNVs) make interrogating the polymorphic CYP2D6 gene with short-read sequencing challenging. Therefore, we developed a novel long-read, full gene CYP2D6 single-molecule real-time (SMRT) sequencing method using the Pacific Biosciences platform. Long-range PCR and CYP2D6 SMRT sequencing of 10 previously genotyped controls identified expected star (*) alleles, but also enabled suballele resolution, diplotype refinement, and discovery of novel alleles. Coupled with an optimized variant calling pipeline, CYP2D6 SMRT sequencing was highly reproducible as triplicate intra- and inter-run non-reference genotype results were completely concordant. Importantly, targeted SMRT sequencing of upstream and downstream CYP2D6 gene copies characterized the duplicated allele in 15 control samples with CYP2D6 CNVs. The utility of CYP2D6 SMRT sequencing was further underscored by identifying the diplotypes of 14 samples with discordant or unclear CYP2D6 configurations from previous targeted genotyping, which again included suballele resolution, duplicated allele characterization, and discovery of a novel allele and tandem arrangement (CYP2D6*36+*41). Taken together, long-read CYP2D6 SMRT sequencing is an innovative, reproducible, and validated method for full-gene characterization, duplication allele-specific analysis and novel allele discovery, which will likely improve CYP2D6 metabolizer phenotype prediction for both research and clinical testing applications. This article is protected by copyright. All rights reserved.This article is protected by copyright. All rights reserved.


July 19, 2019

Detection and screening of chromosomal rearrangements in uterine leiomyomas by long-distance inverse PCR.

Genome instability is a hallmark of many tumors and recently, next-generation sequencing methods have enabled analyses of tumor genomes at an unprecedented level. Studying rearrangement-prone chromosomal regions (putative “breakpoint hotspots”) in detail, however, necessitates molecular assays that can detect de novo DNA fusions arising from these hotspots. Here we demonstrate the utility of a long-distance inverse PCR-based method for the detection and screening of de novo DNA rearrangements in uterine leiomyomas, one of the most common types of human neoplasm. This assay allows in principle any genomic region suspected of instability to be queried for DNA rearrangements originating there. No prior knowledge of the identity of the fusion partner chromosome is needed. We used this method to screen uterine leiomyomas for rearrangements at genomic locations known to be rearrangement-prone in this tumor type: upstream HMGA2 and within RAD51B. We identified a novel DNA rearrangement upstream of HMGA2 that had gone undetected in an earlier whole-genome sequencing study. In more than 30 additional uterine leiomyoma samples, not analyzed by whole-genome sequencing previously, no rearrangements were observed within the 1,107 bp and 1,996 bp assayed in the RAD51B and HMGA2 rearrangement hotspots. Our findings show that long-distance inverse PCR is a robust, sensitive, and cost-effective method for the detection and screening of DNA rearrangements from solid tumors that should be useful for many diagnostic applications. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.


July 19, 2019

A diamond in the ruff.

Reference genomes are only as valuable as the scientific questions they can address. The ruff genome sequence papers exemplify three of the most important aspects of a useful genome: new biological insights, a high-quality resource and population variation data.


July 19, 2019

TCR sequencing of single cells reactive to DQ2.5-glia-a2 and DQ2.5-glia-?2 reveals clonal expansion and epitope-specific V-gene usage.

CD4+ T cells recognizing dietary gluten epitopes in the context of disease-associated human leukocyte antigen (HLA)-DQ2 or HLA-DQ8 molecules are the key players in celiac disease pathogenesis. Here, we conducted a large-scale single-cell paired T-cell receptor (TCR) sequencing study to characterize the TCR repertoire for two homologous immunodominant gluten epitopes, DQ2.5-glia-a2 and DQ2.5-glia-?2, in blood of celiac disease patients after oral gluten challenge. Despite sequence similarity of the epitopes, the TCR repertoires are unique but shared several overall features. We demonstrate that clonally expanded T cells dominate the T-cell responses to both epitopes. Moreover, we find V-gene bias of TRAV26, TRAV4, and TRBV7 in DQ2.5-glia-a2 reactive TCRs, while DQ2.5-glia-?2 TCRs displayed significant bias toward TRAV4 and TRBV4. The knowledge that antigen-specific TCR repertoire in chronic inflammatory diseases tends to be dominated by a few expanded clones that use the same TCR V-gene segments across patients is important information for HLA-associated diseases where the antigen is unknown.


July 19, 2019

Major improvements to the Heliconius melpomene genome assembly used to confirm 10 chromosome fusion events in 6 million years of butterfly evolution.

The Heliconius butterflies are a widely studied adaptive radiation of 46 species spread across Central and South America, several of which are known to hybridize in the wild. Here, we present a substantially improved assembly of the Heliconius melpomene genome, developed using novel methods that should be applicable to improving other genome assemblies produced using short read sequencing. First, we whole-genome-sequenced a pedigree to produce a linkage map incorporating 99% of the genome. Second, we incorporated haplotype scaffolds extensively to produce a more complete haploid version of the draft genome. Third, we incorporated ~20x coverage of Pacific Biosciences sequencing, and scaffolded the haploid genome using an assembly of this long-read sequence. These improvements result in a genome of 795 scaffolds, 275 Mb in length, with an N50 length of 2.1 Mb, an N50 number of 34, and with 99% of the genome placed, and 84% anchored on chromosomes. We use the new genome assembly to confirm that the Heliconius genome underwent 10 chromosome fusions since the split with its sister genus Eueides, over a period of about 6 million yr. Copyright © 2016 Davey et al.


July 19, 2019

Long-read sequence assembly of the gorilla genome.

Accurate sequence and assembly of genomes is a critical first step for studies of genetic variation. We generated a high-quality assembly of the gorilla genome using single-molecule, real-time sequence technology and a string graph de novo assembly algorithm. The new assembly improves contiguity by two to three orders of magnitude with respect to previously released assemblies, recovering 87% of missing reference exons and incomplete gene models. Although regions of large, high-identity segmental duplications remain largely unresolved, this comprehensive assembly provides new biological insight into genetic diversity, structural variation, gene loss, and representation of repeat structures within the gorilla genome. The approach provides a path forward for the routine assembly of mammalian genomes at a level approaching that of the current quality of the human genome. Copyright © 2016, American Association for the Advancement of Science.


July 19, 2019

Towards better precision medicine: PacBio single-molecule long reads resolve the interpretation of HIV drug resistant mutation profiles at explicit quasispecies (haplotype) level.

Development of HIV-1 drug resistance mutations (HDRMs) is one of the major reasons for the clinical failure of antiretroviral therapy. Treatment success rates can be improved by applying personalized anti-HIV regimens based on a patient’s HDRM profile. However, the sensitivity and specificity of the HDRM profile is limited by the methods used for detection. Sanger-based sequencing technology has traditionally been used for determining HDRM profiles at the single nucleotide variant (SNV) level, but with a sensitivity of only = 20% in the HIV population of a patient. Next Generation Sequencing (NGS) technologies offer greater detection sensitivity (~ 1%) and larger scope (hundreds of samples per run). However, NGS technologies produce reads that are too short to enable the detection of the physical linkages of individual SNVs across the haplotype of each HIV strain present. In this article, we demonstrate that the single-molecule long reads generated using the Third Generation Sequencer (TGS), PacBio RS II, along with the appropriate bioinformatics analysis method, can resolve the HDRM profile at a more advanced quasispecies level. The case studies on patients’ HIV samples showed that the quasispecies view produced using the PacBio method offered greater detection sensitivity and was more comprehensive for understanding HDRM situations, which is complement to both Sanger and NGS technologies. In conclusion, the PacBio method, providing a promising new quasispecies level of HDRM profiling, may effect an important change in the field of HIV drug resistance research.


July 19, 2019

An incomplete understanding of human genetic variation.

Deciphering the genetic basis of human disease requires a comprehensive knowledge of genetic variants irrespective of their class or frequency. Although an impressive number of human genetic variants have been catalogued, a large fraction of the genetic difference that distinguishes two human genomes is still not understood at the base-pair level. This is because the emphasis has been on single-nucleotide variation as opposed to less tractable and more complex genetic variants, including indels and structural variants. The latter, we propose, will have a large impact on human phenotypes but require a more systematic assessment of genomes at deeper coverage and alternate sequencing and mapping technologies. Copyright © 2016 by the Genetics Society of America.


July 19, 2019

Mitotic intragenic recombination: A mechanism of survival for several congenital disorders of glycosylation.

Congenital disorders of glycosylation (CDGs) are disorders of abnormal protein glycosylation that affect multiple organ systems. Because most CDGs have been described in only a few individuals, our understanding of the associated phenotypes and the mechanisms of individual survival are limited. In the process of studying two siblings, aged 6 and 11 years, with MOGS-CDG and biallelic MOGS (mannosyl-oligosaccharide glucosidase) mutations (GenBank: NM_006302.2; c.[65C>A; 329G>A] p.[Ala22Glu; Arg110His]; c.[370C>T] p.[Gln124(*)]), we noted that their survival was much longer than the previous report of MOGS-CDG, in a child who died at 74 days of age. Upon mutation analysis, we detected multiple MOGS genotypes including wild-type alleles in their cultured fibroblast and peripheral blood DNA. Further analysis of DNA from cultured fibroblasts of six individuals with compound heterozygous mutations of PMM2 (PMM2-CDG), MPI (MPI-CDG), ALG3 (ALG3-CDG), ALG12 (ALG12-CDG), DPAGT1 (DPAGT1-CDG), and ALG1 (ALG1-CDG) also identified multiple genotypes including wild-type alleles for each. Droplet digital PCR showed a ratio of nearly 1:1 wild-type to mutant alleles for most, but not all, mutations. This suggests that mitotic recombination contributes to the survival and the variable expressivity of individuals with compound heterozygous CDGs. This also provides an explanation for prior observations of a reduced frequency of homozygous mutations and might contribute to increased levels of residual enzyme activity in cultured fibroblasts of individuals with MPI- and PMM2-CDGs. Copyright © 2016 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.


July 19, 2019

Nested Russian doll-like genetic mobility drives rapid dissemination of the Carbapenem resistance gene blaKPC

The recent widespread emergence of carbapenem resistance in Enterobacteriaceae is a major public health concern, as carbapenems are a therapy of last resort against this family of common bacterial pathogens. Resistance genes can mobilize via various mechanisms, including conjugation and transposition; however, the importance of this mobility in short-term evolution, such as within nosocomial outbreaks, is unknown. Using a combination of short- and long-read whole-genome sequencing of 281 blaKPC-positive Enterobacteriaceae isolates from a single hospital over 5 years, we demonstrate rapid dissemination of this carbapenem resistance gene to multiple species, strains, and plasmids. Mobility of blaKPC occurs at multiple nested genetic levels, with transmission of blaKPC strains between individuals, frequent transfer of blaKPC plasmids between strains/species, and frequent transposition of blaKPC transposon Tn4401 between plasmids. We also identify a common insertion site for Tn4401 within various Tn2-like elements, suggesting that homologous recombination between Tn2-like elements has enhanced the spread of Tn4401 between different plasmid vectors. Furthermore, while short-read sequencing has known limitations for plasmid assembly, various studies have attempted to overcome this by the use of reference-based methods. We also demonstrate that, as a consequence of the genetic mobility observed in this study, plasmid structures can be extremely dynamic, and therefore these reference-based methods, as well as traditional partial typing methods, can produce very misleading conclusions. Overall, our findings demonstrate that nonclonal resistance gene dissemination can be extremely rapid, presenting significant challenges for public health surveillance and achieving effective control of antibiotic resistance. Copyright © 2016 Sheppard et al.


July 19, 2019

Continuous evolution of Bacillus thuringiensis toxins overcomes insect resistance.

The Bacillus thuringiensis d-endotoxins (Bt toxins) are widely used insecticidal proteins in engineered crops that provide agricultural, economic, and environmental benefits. The development of insect resistance to Bt toxins endangers their long-term effectiveness. Here we have developed a phage-assisted continuous evolution selection that rapidly evolves high-affinity protein-protein interactions, and applied this system to evolve variants of the Bt toxin Cry1Ac that bind a cadherin-like receptor from the insect pest Trichoplusia ni (TnCAD) that is not natively bound by wild-type Cry1Ac. The resulting evolved Cry1Ac variants bind TnCAD with high affinity (dissociation constant Kd?=?11-41?nM), kill TnCAD-expressing insect cells that are not susceptible to wild-type Cry1Ac, and kill Cry1Ac-resistant T. ni insects up to 335-fold more potently than wild-type Cry1Ac. Our findings establish that the evolution of Bt toxins with novel insect cell receptor affinity can overcome insect Bt toxin resistance and confer lethality approaching that of the wild-type Bt toxin against non-resistant insects.


July 19, 2019

Initial assessment of the molecular epidemiology of blaNDM-1 in Colombia.

We report complete genome sequences of fourblaNDM-1-harboring Gram-negative multidrug resistant (MDR) isolates from Colombia. TheblaNDM-1genes were located 193Kb-Inc FIA, 178Kb-Inc A/C2 and 47Kb (unknown Inc type) plasmids. MLST revealed that isolates belong to ST10 (Escherichia coli), ST392 (Klebsiella pneumoniae), and ST322 and ST464 (Acinetobacter baumanniiandA. nosocomialis, respectively). Our analysis identified that the Inc A/C2 plasmid inE. colicontained a novel complex transposon (Tn125and Tn5393with 3 copies ofblaNDM-1) and a recombination “hotspot” for the acquisition of new resistance determinants. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 19, 2019

Genome structural diversity among 31 Bordetella pertussis isolates from two recent U.S. whooping cough statewide epidemics

During 2010 and 2012, California and Vermont, respectively, experienced statewide epidemics of pertussis with differences seen in the demographic affected, case clinical presentation, and molecular epidemiology of the circulating strains. To overcome limitations of the current molecular typing methods for pertussis, we utilized whole-genome sequencing to gain a broader understanding of how current circulating strains are causing large epidemics. Through the use of combined next-generation sequencing technologies, this study compared de novo, single-contig genome assemblies from 31 out of 33 Bordetella pertussis isolates collected during two separate pertussis statewide epidemics and 2 resequenced vaccine strains. Final genome architecture assemblies were verified with whole-genome optical mapping. Sixteen distinct genome rearrangement profiles were observed in epidemic isolate genomes, all of which were distinct from the genome structures of the two resequenced vaccine strains. These rearrangements appear to be mediated by repetitive sequence elements, such as high-copy-number mobile genetic elements and rRNA operons. Additionally, novel and previously identified single nucleotide polymorphisms were detected in 10 virulence-related genes in the epidemic isolates. Whole-genome variation analysis identified state-specific variants, and coding regions bearing nonsynonymous mutations were classified into functional annotated orthologous groups. Comprehensive studies on whole genomes are needed to understand the resurgence of pertussis and develop novel tools to better characterize the molecular epidemiology of evolving B.~pertussis populations.IMPORTANCE Pertussis, or whooping cough, is the most poorly controlled vaccine-preventable bacterial disease in the United States, which has experienced a resurgence for more than a decade. Once viewed as a monomorphic pathogen, B.~pertussis strains circulating during epidemics exhibit diversity visible on a genome structural level, previously undetectable by traditional sequence analysis using short-read technologies. For the first time, we combine short- and long-read sequencing platforms with restriction optical mapping for single-contig, de novo assembly of 31 isolates to investigate two geographically and temporally independent U.S. pertussis epidemics. These complete genomes reshape our understanding of B.~pertussis evolution and strengthen molecular epidemiology toward one day understanding the resurgence of pertussis.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.