July 7, 2019

Resolving complex structural genomic rearrangements using a randomized approach.

Complex chromosomal rearrangements are structural genomic alterations involving multiple instances of deletions, duplications, inversions, or translocations that co-occur either on the same chromosome or represent different overlapping events on homologous chromosomes. We present SVelter, an algorithm that identifies regions of the genome suspected to harbor a complex event and then resolves the structure by iteratively rearranging the local genome structure, in a randomized fashion, with each structure scored against characteristics of the observed sequencing data. SVelter is able to accurately reconstruct complex chromosomal rearrangements when compared to well-characterized genomes that have been deeply sequenced with both short and long reads.
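
The randomized search described in this abstract can be illustrated with a toy sketch. This is not SVelter's implementation: the block representation, the three moves, the penalty-based score, the 5% chance of accepting a worse structure, and every function name below are assumptions chosen for illustration. The real algorithm scores candidates against characteristics of the observed sequencing data, which here are replaced by hypothetical per-block copy counts and a set of blocks observed as inverted.

```python
import random

def propose(structure, rng):
    """Apply one random rearrangement (delete, duplicate, or invert) to one block."""
    new = list(structure)
    i = rng.randrange(len(new))
    name, strand = new[i]
    move = rng.choice(["delete", "duplicate", "invert"])
    if move == "delete" and len(new) > 1:  # never delete the last remaining block
        del new[i]
    elif move == "duplicate":
        new.insert(i + 1, (name, strand))
    elif move == "invert":
        new[i] = (name, "-" if strand == "+" else "+")
    return new

def score(structure, observed_copies, observed_inverted):
    """Toy score: 0 means the structure matches the (hypothetical) observations."""
    penalty = 0
    # Penalize copy numbers that disagree with the observed depth per block.
    for name, expected in observed_copies.items():
        count = sum(1 for n, _ in structure if n == name)
        penalty += abs(count - expected)
    # Penalize orientations that disagree with the observed inversions.
    for name, strand in structure:
        if (strand == "-") != (name in observed_inverted):
            penalty += 1
    return -penalty

def resolve(reference_blocks, observed_copies, observed_inverted,
            iterations=2000, seed=0):
    """Randomly rearrange the local structure, keeping the best-scoring candidate."""
    rng = random.Random(seed)
    current = [(b, "+") for b in reference_blocks]
    current_score = score(current, observed_copies, observed_inverted)
    best, best_score = current, current_score
    for _ in range(iterations):
        candidate = propose(current, rng)
        candidate_score = score(candidate, observed_copies, observed_inverted)
        # Always accept ties and improvements; occasionally accept a worse structure.
        if candidate_score >= current_score or rng.random() < 0.05:
            current, current_score = candidate, candidate_score
            if current_score > best_score:
                best, best_score = current, current_score
    return best, best_score

if __name__ == "__main__":
    # Hypothetical observation: block "b" is duplicated and block "c" is inverted.
    structure, final_score = resolve("abc", {"a": 1, "b": 2, "c": 1}, {"c"})
    print(structure, final_score)
```

Because the best-scoring structure is tracked separately from the current one, the occasional acceptance of a worse candidate lets the search leave a local optimum without losing the best result seen so far.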


July 7, 2019

Environmental changes bridge evolutionary valleys.

In the basic fitness landscape metaphor for molecular evolution, evolutionary pathways are presumed to follow uphill steps of increasing fitness. How evolution can cross fitness valleys is an open question. One possibility is that environmental changes alter the fitness landscape such that low-fitness sequences reside on a hill in alternate environments. We experimentally test this hypothesis on the antibiotic resistance gene TEM-15 β-lactamase by comparing four evolutionary strategies shaped by environmental changes. The strategy that included initial steps of selecting for low antibiotic resistance (negative selection) produced superior alleles compared with the other three strategies. We comprehensively examined possible evolutionary pathways leading to one such high-fitness allele and found that an initially deleterious mutation is key to the allele’s evolutionary history. This mutation is an initial gateway to an otherwise relatively inaccessible area of sequence space and participates in higher-order, positive epistasis with a number of neutral to slightly beneficial mutations. The ability of negative selection and environmental changes to provide access to novel fitness peaks has important implications for natural evolutionary mechanisms and applied directed evolution.


July 7, 2019

A phylogenetic and phenotypic analysis of Salmonella enterica serovar Weltevreden, an emerging agent of diarrheal disease in tropical regions.

Salmonella enterica serovar Weltevreden (S. Weltevreden) is an emerging cause of diarrheal and invasive disease in humans residing in tropical regions. Despite the regional and international emergence of this Salmonella serovar, relatively little is known about its genetic diversity, genomics or virulence potential in model systems. Here we used whole genome sequencing and bioinformatics analyses to define the phylogenetic structure of a diverse global selection of S. Weltevreden. Phylogenetic analysis of more than 100 isolates demonstrated that the population of S. Weltevreden can be segregated into two main phylogenetic clusters, one associated predominantly with continental Southeast Asia and the other more internationally dispersed. Subcluster analysis suggested the local evolution of S. Weltevreden within specific geographical regions. Four of the isolates were sequenced using long read sequencing to produce high quality reference genomes. Phenotypic analysis in Hep-2 cells and in a murine infection model indicated that S. Weltevreden were significantly attenuated in these models compared to the classical S. Typhimurium reference strain SL1344. Our work outlines novel insights into this important emerging pathogen and provides a baseline understanding for future research studies.


July 7, 2019

Precision medicine and rare genetic variants.

Interindividual variability in drug metabolism and drug toxicity persists as a major problem for drug development and treatment. Increased or decreased capacity for drug elimination or drug action reduces drug efficacy and places substantial economic burdens on society (e.g., due to treatment of adverse drug reactions) [1]. To a great extent this variation is based on genetic differences, and indeed many drugs now carry pharmacogenomic labels regarding mandatory or informative genetic tests that have to/can be performed before prescription (http://www.fda.gov/drugs/scienceresearch/researchareas/pharmacogenetics/ucm083378.htm). These labels are based on the most common allelic variants in germline or somatic genes with importance for drug metabolism that encode phase I or phase II enzymes, transporters, or drug targets. In many cases, particularly in oncology, these labels are major determinants of successful treatment. However, the question arises as to what extent these labels are useful for future precision medicine encompassing specific patients carrying mutations not commonly seen in the whole population.


July 7, 2019

Population structure and antimicrobial resistance profiles of Streptococcus suis serotype 2 sequence type 25 strains

Strains of serotype 2 Streptococcus suis are responsible for swine and human infections. Different serotype 2 genetic backgrounds have been defined using multilocus sequence typing (MLST). However, little is known about the genetic diversity within each MLST sequence type (ST). Here, we used whole-genome sequencing to test the hypothesis that S. suis serotype 2 strains of the ST25 lineage are genetically heterogeneous. We evaluated 51 serotype 2 ST25 S. suis strains isolated from diseased pigs and humans in Canada, the United States of America, and Thailand. Whole-genome sequencing revealed numerous large-scale rearrangements in the ST25 genome, compared to the genomes of ST1 and ST28 S. suis strains, which result, among other changes, in disruption of a pilus island locus. We report that recombination and lateral gene transfer contribute to ST25 genetic diversity. Phylogenetic analysis identified two main and distinct Thai and North American clades grouping most strains investigated. These clades also possessed distinct patterns of antimicrobial resistance genes, which correlated with acquisition of different integrative and conjugative elements (ICEs). Some of these ICEs were found to be integrated at a recombination hot spot, previously identified as the site of integration of the 89K pathogenicity island in serotype 2 ST7 S. suis strains. Our results highlight the limitations of MLST for phylogenetic analysis of S. suis, and the importance of lateral gene transfer and recombination as drivers of diversity in this swine pathogen and zoonotic agent.


July 7, 2019

The Cer-cqu gene cluster determines three key players in a β-diketone synthase polyketide pathway synthesizing aliphatics in epicuticular waxes.

Aliphatic compounds on plant surfaces, called epicuticular waxes, are the first line of defense against pathogens and pests, contribute to reducing water loss and determine other important phenotypes. Aliphatics can form crystals affecting light refraction, resulting in a color change and allowing identification of mutants in their synthesis or transport. The present study discloses three such Eceriferum (cer) genes in barley – Cer-c, Cer-q and Cer-u – known to be tightly linked and functioning in a biochemical pathway forming dominating amounts of β-diketone and hydroxy-β-diketones plus some esterified alkan-2-ols. These aliphatics are present in many Triticeae as well as dicotyledons such as Eucalyptus and Dianthus. Recently developed genomic resources and mapping populations in barley defined these genes to a small region on chromosome arm 2HS. Exploiting Cer-c and -u potential functions pinpointed five candidates, of which three were missing in apparent cer-cqu triple mutants. Sequencing more than 50 independent mutants for each gene confirmed their identification. Cer-c is a chalcone synthase-like polyketide synthase, designated diketone synthase (DKS), Cer-q is a lipase/carboxyl transferase and Cer-u is a P450 enzyme. All were highly expressed in pertinent leaf sheath tissue of wild type. A physical map revealed the order Cer-c, Cer-u, Cer-q with the flanking genes 101 kb apart, confirming they are a gene cluster, Cer-cqu. Homology-based modeling suggests that many of the mutant alleles affect overall protein structure or specific active site residues. The rich diversity of identified mutations will facilitate future studies of three key enzymes involved in synthesis of plant apoplast waxes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.


July 7, 2019

Indica rice genome assembly, annotation and mining of blast disease resistance genes.

Rice is a major staple food crop in the world. Over 80% of the rice cultivation area is under indica rice. Currently, genomic resources are lacking for indica as compared to japonica rice. In this study, we generated deep-sequencing data (Illumina and Pacific Biosciences sequencing) for one of the indica rice cultivars, HR-12 from India. We assembled over 86% (389 Mb) of the rice genome and annotated 56,284 protein-coding genes from the HR-12 genome using Illumina and PacBio sequencing. Comprehensive comparative analyses between indica and japonica subspecies genomes revealed a large number of indica-specific variants including SSRs, SNPs and InDels. To mine disease resistance genes, we sequenced a few indica rice cultivars that are reported to be highly resistant (Tetep and Tadukan) and susceptible (HR-12 and Co-39) against blast fungal isolates in many countries including India. Whole genome sequencing of rice genotypes revealed a high rate of mutations in defense-related genes (NB-ARC, LRR and PK domains) in resistant cultivars as compared to susceptible cultivars. This study has identified the R-genes Pi-ta and Pi54 from the durable resistant indica cultivars Tetep and Tadukan, which can be used in marker-assisted selection in rice breeding programs. This is the first report of a whole genome sequencing approach to characterize Indian rice germplasm. The genomic resources from our work will have a greater impact in understanding global rice diversity, genetics and molecular breeding.


July 7, 2019

Filling in the gap of human chromosome 4: Single Molecule Real Time sequencing of macrosatellite repeats in the facioscapulohumeral muscular dystrophy locus.

A majority of facioscapulohumeral muscular dystrophy (FSHD) is caused by contraction of macrosatellite repeats called D4Z4 that are located in the subtelomeric region of human chromosome 4q35. Sequencing the FSHD locus has been technically challenging due to its length and the nearly identical nature of its repeat elements. Here we report sequencing and partial assembly of a BAC clone carrying an entire FSHD locus by a single molecule real time (SMRT) sequencing technology, which could produce long reads of up to about 18 kb containing D4Z4 repeats. De novo assembly by Hierarchical Genome Assembly Process 1 (HGAP.1) yielded a contig of 41 kb containing all but a part of the most distal D4Z4 element. The validity of the sequence model was confirmed by an independent approach employing anchored multiple sequence alignment by Kalign using reads containing unique flanking sequences. Our data will provide a basis for further optimization of sequencing and assembly conditions of D4Z4.


July 7, 2019

Single-locus enrichment without amplification for sequencing and direct detection of epigenetic modifications.

A gene-level targeted enrichment method for direct detection of epigenetic modifications is described. The approach is demonstrated on the CGG-repeat region of the FMR1 gene, for which large repeat expansions, hitherto refractory to sequencing, are known to cause fragile X syndrome. In addition to achieving a single-locus enrichment of nearly 700,000-fold, the elimination of all amplification steps removes PCR-induced bias in the repeat count and preserves the native epigenetic modifications of the DNA. In conjunction with the single-molecule real-time sequencing approach, this enrichment method enables direct readout of the methylation status and the CGG repeat number of the FMR1 allele(s) for a clonally derived cell line. The current method avoids potential biases introduced through chemical modification and/or amplification methods for indirect detection of CpG methylation events.


July 7, 2019

Comparative genomic analyses of the Moraxella catarrhalis serosensitive and seroresistant lineages demonstrate their independent evolution.

The bacterial species Moraxella catarrhalis has been hypothesized as being composed of two distinct lineages (referred to as the seroresistant [SR] and serosensitive [SS]) with separate evolutionary histories based on several molecular typing methods, whereas 16S ribotyping has suggested an additional split within the SS lineage. Previously, we characterized whole-genome sequences of 12 SR-lineage isolates, which revealed a relatively small supragenome when compared with other opportunistic nasopharyngeal pathogens, suggestive of a relatively short evolutionary history. Here, we performed whole-genome sequencing on 18 strains from both ribotypes of the SS lineage, an additional SR strain, as well as four previously identified highly divergent strains based on multilocus sequence typing analyses. All 35 strains were subjected to a battery of comparative genomic analyses which clearly show that there are three lineages: the SR, SS, and the divergent. The SR and SS lineages are closely related, but distinct from each other based on three different methods of comparison: allelic differences observed among core genes; possession of lineage-specific sets of core and distributed genes; and an alignment of concatenated core sequences irrespective of gene annotation. All these methods show that the SS lineage has much longer interstrain branches than the SR lineage, indicating that this lineage has likely been evolving either longer or faster than the SR lineage. There is evidence of extensive horizontal gene transfer (HGT) within both of these lineages, and to a lesser degree between them. In particular, we identified very high rates of HGT between these two lineages for β-lactamase genes. The four divergent strains are sui generis, being much more distantly related to both the SR and SS groups than these other two groups are to each other. Based on average nucleotide identities, gene content, GC content, and genome size, this group could be considered as a separate taxonomic group. The SR and SS lineages, although distinct, clearly form a single species based on multiple criteria including a large common core genome, average nucleotide identity values, GC content, and genome size. Although neither of these lineages arose from within the other based on phylogenetic analyses, the question of how and when these lineages split and then subsequently reunited in the human nasopharynx is explored. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Stability of the encoding plasmids and surface expression of CS6 differs in enterotoxigenic Escherichia coli (ETEC) encoding different heat-stable (ST) enterotoxins (STh and STp).

Enterotoxigenic Escherichia coli (ETEC), one of the most common causes of diarrhea among infants and children in developing countries, causes disease by expression of either or both of the enterotoxins heat-labile (LT) and heat-stable (ST; divided into human-type [STh] and porcine-type [STp] variants), and colonization factors (CFs), among which CS6 is one of the most prevalent. In this study we show that ETEC isolates expressing CS6+STh have higher copy numbers of the cssABCD operon encoding CS6 than those expressing CS6+STp. Long-term cultivation of up to ten overnight passages of ETEC isolates harboring CS6+STh (n = 10) or CS6+STp (n = 15) showed instability of phenotypic expression of CS6 in a majority of the CS6+STp isolates, whereas most of the CS6+STh isolates retained CS6 expression. The observed instability was correlated with loss of genes cssA and cssD as examined by PCR. Mobilization of the CS6 plasmid from an unstable CS6+STp isolate into a laboratory E. coli strain resulted in loss of the plasmid after a single overnight passage, whereas the plasmid from a CS6+STh strain was retained in the laboratory strain during 10 passages. A sequence comparison between the CS6 plasmids from a stable and an unstable ETEC isolate revealed that genes necessary for plasmid stabilization, for example pemI, pemK, stbA, stbB and parM, were not present in the unstable ETEC isolate. Our results indicate that stable retention of CS6 may in part be affected by the stability of the plasmid on which both CS6 and STp or STh are located.


July 7, 2019

Evolutionary history of the global emergence of the Escherichia coli epidemic clone ST131.

Escherichia coli sequence type 131 (ST131) has emerged globally as the most predominant extraintestinal pathogenic lineage within this clinically important species, and its association with fluoroquinolone and extended-spectrum cephalosporin resistance impacts significantly on treatment. The evolutionary histories of this lineage, and of important antimicrobial resistance elements within it, remain unclearly defined. This study of the largest worldwide collection (n = 215) of sequenced ST131 E. coli isolates to date demonstrates that the clonal expansion of two previously recognized antimicrobial-resistant clades, C1/H30R and C2/H30Rx, started around 25 years ago, consistent with the widespread introduction of fluoroquinolones and extended-spectrum cephalosporins in clinical medicine. These two clades appear to have emerged in the United States, with the expansion of the C2/H30Rx clade driven by the acquisition of a blaCTX-M-15-containing IncFII-like plasmid that has subsequently undergone extensive rearrangement. Several other evolutionary processes influencing the trajectory of this drug-resistant lineage are described, including sporadic acquisitions of CTX-M resistance plasmids and chromosomal integration of blaCTX-M within subclusters followed by vertical evolution. These processes are also occurring for another family of CTX-M gene variants more recently observed among ST131, the blaCTX-M-14/14-like group. The complexity of the evolutionary history of ST131 has important implications for antimicrobial resistance surveillance, epidemiological analysis, and control of emerging clinical lineages of E. coli. These data also highlight the global imperative to reduce specific antibiotic selection pressures and demonstrate the important and varied roles played by plasmids and other mobile genetic elements in the perpetuation of antimicrobial resistance within lineages.
IMPORTANCE: Escherichia coli, perennially a major bacterial pathogen, is becoming increasingly difficult to manage due to emerging resistance to all preferred antimicrobials. Resistance is concentrated within specific E. coli lineages, such as sequence type 131 (ST131). Clarification of the genetic basis for clonally associated resistance is key to devising intervention strategies. We used high-resolution genomic analysis of a large global collection of ST131 isolates to define the evolutionary history of extended-spectrum beta-lactamase production in ST131. We documented diverse contributory genetic processes, including stable chromosomal integrations of resistance genes, persistence and evolution of mobile resistance elements within sublineages, and sporadic acquisition of different resistance elements. Both global distribution and regional segregation were evident. The diversity of resistance element acquisition and propagation within ST131 indicates a need for control and surveillance strategies that target both bacterial strains and mobile genetic elements. Copyright © 2016 Stoesser et al.


July 7, 2019

Third-generation sequencing and the future of genomics

Third-generation long-range DNA sequencing and mapping technologies are creating a renaissance in high-quality genome sequencing. Unlike second-generation sequencing, which produces short reads a few hundred base-pairs long, third-generation single-molecule technologies generate over 10,000 bp reads or map over 100,000 bp molecules. We analyze how increased read lengths can be used to address long-standing problems in de novo genome assembly, structural variation analysis and haplotype phasing.


July 7, 2019

Complete genome sequence of the engineered Escherichia coli SHuffle strains and their wild-type parents.

SHuffle strains are genetically engineered Escherichia coli strains that are capable of oxidizing cysteines within proteins to form disulfide bonds. Here we present the complete genome of both the K-12 and B versions of SHuffle strains along with their parental ancestors. These strains have been of significant use to both the general scientific community and the biotech industry, interested in producing novel disulfide-bonded proteins that were hitherto unable to be expressed in standard E. coli expression strains.


July 7, 2019

First report of cfr-encoding plasmids in the pandemic sequence type (ST) 22 methicillin-resistant Staphylococcus aureus Staphylococcal cassette chromosome mec type-IV clone.

Linezolid is often the drug of last resort for serious methicillin-resistant Staphylococcus aureus (MRSA) infections. Linezolid resistance is mediated by mutations in 23S rRNA and genes for ribosomal proteins, cfr encoding phenicol, lincosamide, oxazolidinone, pleuromutilin and streptogramin A (PhLOPSA) resistance, its homologue cfr(B) or optrA conferring oxazolidinone and phenicol resistance. Linezolid resistance is rare in S. aureus, and cfr even rarer. This study investigated the clonality and linezolid resistance mechanisms of two MRSA isolates from patients in separate Irish hospitals. Isolates were subjected to cfr PCR, PhLOPSA susceptibility testing, 23S rRNA PCR and sequencing, DNA microarray profiling, spa typing, pulsed-field gel electrophoresis (PFGE), plasmid curing and conjugative transfer. Whole-genome sequencing was used for single nucleotide variant (SNV) analysis, multilocus-sequence typing, L-protein mutation identification, cfr-plasmid sequence analysis and optrA and cfr(B) detection. Isolates M12/0145 and M13/0401 exhibited linezolid MICs of 64 and 16 mg/liter, respectively, and harbored identical 23S rRNA and L22 mutations, but M12/0145 exhibited the mutation in 2/6 23S rRNA alleles compared to 1/5 in M13/0401. Both isolates were ST22-MRSA-IV/t032, harbored cfr, exhibited the PhLOPSA phenotype and lacked optrA and cfr(B). They differed by five PFGE bands and 603 SNVs. Isolate M12/0145 harbored cfr and fexA on a 41-kb conjugative pSCFS3-type plasmid, whereas M13/0401 harbored cfr and lsa(B) on a novel 27-kb plasmid. This is the first report of cfr in the pandemic ST22-MRSA-IV clone. Different cfr plasmids and mutations associated with linezolid resistance in genotypically distinct ST22-MRSA-IV isolates highlight that prudent management of linezolid use is essential. Copyright © 2016 Shore et al.

