July 7, 2019  |  

Distinct Salmonella enteritidis lineages associated with enterocolitis in high-income settings and invasive disease in low-income settings.

An epidemiological paradox surrounds Salmonella enterica serovar Enteritidis. In high-income settings, it has been responsible for an epidemic of poultry-associated, self-limiting enterocolitis, whereas in sub-Saharan Africa it is a major cause of invasive nontyphoidal Salmonella disease, associated with high case fatality. By whole-genome sequence analysis of 675 isolates of S. Enteritidis from 45 countries, we show the existence of a global epidemic clade and two new clades of S. Enteritidis that are geographically restricted to distinct regions of Africa. The African isolates display genomic degradation, a novel prophage repertoire, and an expanded multidrug resistance plasmid. S. Enteritidis is a further example of a Salmonella serotype that displays niche plasticity, with distinct clades that enable it to become a prominent cause of gastroenteritis in association with the industrial production of eggs and of multidrug-resistant, bloodstream-invasive infection in Africa.


July 7, 2019  |  

Key experimental evidence of chromosomal DNA transfer among selected tuberculosis-causing mycobacteria.

Horizontal gene transfer (HGT) is a major driving force of bacterial diversification and evolution. For tuberculosis-causing mycobacteria, the impact of HGT in the emergence and distribution of dominant lineages remains a matter of debate. Here, by using fluorescence-assisted mating assays and whole genome sequencing, we present unique experimental evidence of chromosomal DNA transfer between tubercle bacilli of the early-branching Mycobacterium canettii clade. We found that the obtained recombinants had received multiple donor-derived DNA fragments in the size range of 100 bp to 118 kbp, fragments large enough to contain whole operons. Although the transfer frequency between M. canettii strains was low and no transfer could be observed among classical Mycobacterium tuberculosis complex (MTBC) strains, our study provides the proof of concept for genetic exchange in tubercle bacilli. This outstanding, now experimentally validated phenomenon presumably played a key role in the early evolution of the MTBC toward pathogenicity. Moreover, our findings also provide important information for the risk evaluation of potential transfer of drug resistance and fitness mutations among clinically relevant mycobacterial strains.


July 7, 2019  |  

An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes.

Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity.


July 7, 2019  |  

Chromosome assembly of large and complex genomes using multiple references

Despite the rapid development of sequencing technologies, assembly of mammalian-scale genomes into complete chromosomes remains one of the most challenging problems in bioinformatics. To help address this difficulty, we developed Ragout, a reference-assisted assembly tool that now works for large and complex genomes. Taking one or more target assemblies (generated from an NGS assembler) and one or multiple related reference genomes, Ragout infers the evolutionary relationships between the genomes and builds the final assemblies using a genome rearrangement approach. Using Ragout, we transformed NGS assemblies of 15 different Mus musculus and one Mus spretus genomes into sets of complete chromosomes, leaving less than 5% of sequence unlocalized per set. Various benchmarks, including PCR testing and realigning of long PacBio reads, suggest only a small number of structural errors in the final assemblies, comparable with direct assembly approaches. Additionally, we applied Ragout to Mus caroli and Mus pahari genomes, which exhibit karyotype-scale variations compared to other genomes from the Muridae family. Chromosome color maps confirmed most large-scale rearrangements that Ragout detected.


July 7, 2019  |  

Genomic sequencing-based mutational enrichment analysis identifies motility genes in a genetically intractable gut microbe.

A major roadblock to understanding how microbes in the gastrointestinal tract colonize and influence the physiology of their hosts is our inability to genetically manipulate new bacterial species and experimentally assess the function of their genes. We describe the application of population-based genomic sequencing after chemical mutagenesis to map bacterial genes responsible for motility in Exiguobacterium acetylicum, a representative intestinal Firmicutes bacterium that is intractable to molecular genetic manipulation. We derived strong associations between mutations in 57 E. acetylicum genes and impaired motility. Surprisingly, less than half of these genes were annotated as motility-related based on sequence homologies. We confirmed the genetic link between individual mutations and loss of motility for several of these genes by performing a large-scale analysis of spontaneous suppressor mutations. In the process, we reannotated genes belonging to a broad family of diguanylate cyclases and phosphodiesterases to highlight their specific role in motility and assigned functions to uncharacterized genes. Furthermore, we generated isogenic strains that allowed us to establish that Exiguobacterium motility is important for the colonization of its vertebrate host. These results indicate that genetic dissection of a complex trait, functional annotation of new genes, and the generation of mutant strains to define the role of genes in complex environments can be accomplished in bacteria without the development of species-specific molecular genetic tools.


July 7, 2019  |  

Complete genome sequence of Streptomyces formicae KY5, the formicamycin producer.

Here we report the complete genome of the new species Streptomyces formicae KY5 isolated from Tetraponera fungus-growing ants. S. formicae was sequenced using the PacBio and 454 platforms to generate a single linear chromosome with terminal inverted repeats. Illumina MiSeq sequencing was used to correct base changes resulting from the high error rate associated with PacBio. The genome is 9.6 Mbp, has a GC content of 71.38% and contains 8162 protein-coding sequences. Predictive analysis shows this strain encodes at least 45 gene clusters for the biosynthesis of secondary metabolites, including a type 2 polyketide synthase-encoding cluster for the antibacterial formicamycins. Streptomyces formicae KY5 is a new, taxonomically distinct Streptomyces species and this complete genome sequence provides an important marker in the genus Streptomyces. Copyright © 2017 The Author(s). Published by Elsevier B.V. All rights reserved.


July 7, 2019  |  

scanPAV: a pipeline for extracting presence-absence variations in genome pairs.

The recent technological advances in genome sequencing techniques have resulted in an exponential increase in the number of sequenced human and non-human genomes. The ever-increasing number of assemblies generated by novel de novo pipelines and strategies demands the development of new software to evaluate assembly quality and completeness. One way to determine the completeness of an assembly is by detecting its presence-absence variations (PAVs) with respect to a reference, where PAVs between two assemblies are defined as the sequences present in one assembly but entirely missing in the other one. Beyond assembly error or technology bias, PAVs can also reveal real genome polymorphism, a consequence of species or individual evolution, or horizontal transfer from viruses and bacteria. We present scanPAV, a pipeline for pairwise assembly comparison to identify and extract sequences present in one assembly but not the other. In this note, we use the GRCh38 reference assembly to assess the completeness of six human genome assemblies from various assembly strategies and sequencing technologies including Illumina short reads, 10x Genomics linked-reads, PacBio and Oxford Nanopore long reads, and Bionano optical maps. We also discuss the PAV polymorphism of seven Tasmanian devil whole genome assemblies of normal animal tissues and devil facial tumour 1 (DFT1) and 2 (DFT2) samples, and the identification of bacterial sequences as contamination in some of the tumorous assemblies. The pipeline is available under the MIT License at https://github.com/wtsi-hpag/scanPAV. Supplementary data are available at Bioinformatics online.
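The PAV definition in this abstract (sequence present in one assembly but entirely missing in the other) can be illustrated with a small toy sketch. This is not how scanPAV itself works: the function name `absent_regions`, the exact-match k-mer lookup, and the parameters `k` and `min_len` are all assumptions made for illustration, whereas a real pipeline would rely on sequence alignment and tolerate mismatches.

```python
from typing import List, Tuple


def absent_regions(query: str, reference: str, k: int = 4,
                   min_len: int = 6) -> List[Tuple[int, int, str]]:
    """Return (start, end, sequence) for stretches of `query` that share
    no exact k-mer with `reference`. Coordinates are 0-based, end-exclusive.

    Hypothetical helper for illustration only; not part of scanPAV.
    """
    # Index every k-mer of the reference for constant-time lookup.
    ref_kmers = {reference[i:i + k] for i in range(len(reference) - k + 1)}

    # Mark each query base that lies inside at least one shared k-mer.
    covered = [False] * len(query)
    for i in range(len(query) - k + 1):
        if query[i:i + k] in ref_kmers:
            for j in range(i, i + k):
                covered[j] = True

    # Collect runs of uncovered bases; the trailing True is a sentinel
    # that closes a run reaching the end of the query.
    regions = []
    start = None
    for pos, is_covered in enumerate(covered + [True]):
        if not is_covered and start is None:
            start = pos
        elif is_covered and start is not None:
            if pos - start >= min_len:
                regions.append((start, pos, query[start:pos]))
            start = None
    return regions


# Toy data: the query carries an 8-base insertion absent from the reference.
reference = "ACGTTGCAAGCT"
query = "ACGTTG" + "TATATATA" + "CAAGCT"
print(absent_regions(query, reference))   # sequence present only in query
print(absent_regions(reference, query))   # sequence present only in reference
```

Running the comparison in both directions mirrors the pairwise nature of the definition: a PAV is reported relative to whichever assembly lacks the sequence.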


July 7, 2019  |  

PlasmidTron: assembling the cause of phenotypes and genotypes from NGS data.

Increasingly rich metadata are now being linked to samples that have been whole-genome sequenced. However, much of this information is ignored. This is because linking these metadata to genes, or regions of the genome, usually relies on knowing the gene sequence(s) responsible for the particular trait being measured and looking for its presence or absence in that genome. An example of this would be the spread of antimicrobial resistance genes carried on mobile genetic elements (MGEs). However, although it is possible to routinely identify the resistance gene, identifying the unknown MGE upon which it is carried can be much more difficult if the starting point is short-read whole-genome sequence data. The reason for this is that MGEs are often full of repeats and so assemble poorly, leading to fragmented consensus sequences. Since mobile DNA, which can carry many clinically and ecologically important genes, has a different evolutionary history from the host, its distribution across the host population will, by definition, be independent of the host phylogeny. It is possible to use this phenomenon in a genome-wide association study to identify both the genes associated with the specific trait and also the DNA linked to that gene, for example the flanking sequence of the plasmid vector on which it is encoded, which follows the same patterns of distribution as the marker gene/sequence itself. We present PlasmidTron, which utilizes the phenotypic data normally available in bacterial population studies, such as antibiograms, virulence factors, or geographical information, to identify traits that are likely to be present on DNA that can randomly reassort across defined bacterial populations. It is also possible to use this methodology to associate unknown genes/sequences (e.g. plasmid backbones) with a specific molecular signature or marker (e.g. resistance gene presence or absence) using PlasmidTron.
PlasmidTron uses a k-mer-based approach to identify reads associated with a phylogenetically unlinked phenotype. These reads are then assembled de novo to produce contigs in a fast manner that scales to large datasets. PlasmidTron is written in Python 3 and is available under the open source licence GNU GPL3 from https://github.com/sanger-pathogens/plasmidtron.
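The k-mer idea described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not PlasmidTron's actual code or interface: the function names (`kmers`, `collect_kmers`, `select_reads`), the `min_fraction` threshold, and the toy reads are all hypothetical, and the de novo assembly step that follows read selection is omitted.

```python
from typing import Iterable, List, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def collect_kmers(reads: Iterable[str], k: int) -> Set[str]:
    """Pool the k-mers of every read in a sample group."""
    found: Set[str] = set()
    for read in reads:
        found |= kmers(read, k)
    return found


def select_reads(reads: Iterable[str], marker_kmers: Set[str], k: int,
                 min_fraction: float = 0.5) -> List[str]:
    """Keep reads in which at least min_fraction of k-mers are markers.

    min_fraction is an illustrative threshold, not a documented setting.
    """
    selected = []
    for read in reads:
        read_kmers = kmers(read, k)
        if not read_kmers:
            continue  # read shorter than k
        shared = len(read_kmers & marker_kmers)
        if shared / len(read_kmers) >= min_fraction:
            selected.append(read)
    return selected


# Toy data: samples with the trait carry an extra (plasmid-like) read.
k = 4
trait_reads = ["AAAAACCCCC", "GGGGGTTTTT"]
nontrait_reads = ["AAAAACCCCC"]

# Marker k-mers occur in trait samples but never in non-trait samples.
markers = collect_kmers(trait_reads, k) - collect_kmers(nontrait_reads, k)
print(select_reads(trait_reads, markers, k))
```

In this toy case only the read unique to the trait-positive sample survives the filter; in practice the selected reads would then be handed to an assembler to produce contigs.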


July 7, 2019  |  

Complete genome sequence of herpes simplex virus 2 strain 333.

Herpes simplex virus 2, or human herpesvirus 2, is a ubiquitous human pathogen that causes genital ulcerations and establishes latency in sacral root ganglia. We fully sequenced and manually curated the viral genome sequence of herpes simplex virus 2, strain 333 using Pacific Biosciences and Illumina sequencing technologies.


July 7, 2019  |  

New variant of multidrug-resistant Salmonella enterica serovar Typhimurium associated with invasive disease in immunocompromised patients in Vietnam.

Nontyphoidal Salmonella (NTS), particularly Salmonella enterica serovar Typhimurium, is among the leading etiologic agents of bacterial enterocolitis globally and a well-characterized cause of invasive disease (iNTS) in sub-Saharan Africa. In contrast, S. Typhimurium is poorly defined in Southeast Asia, a known hot spot for zoonotic disease with a recently described burden of iNTS disease. Here, we aimed to add insight into the epidemiology and potential impact of zoonotic transfer and antimicrobial resistance (AMR) in S. Typhimurium associated with iNTS and enterocolitis in Vietnam. We performed whole-genome sequencing and phylogenetic reconstruction on 85 human (enterocolitis, carriage, and iNTS) and 113 animal S. Typhimurium isolates collected in Vietnam. We found limited evidence for the zoonotic transmission of S. Typhimurium. However, we describe a chain of events where a pandemic monophasic variant of S. Typhimurium (serovar I:4,[5],12:i:- sequence type 34 [ST34]) has been introduced into Vietnam, reacquired a phase 2 flagellum, and acquired an IncHI2 multidrug-resistant plasmid. Notably, these novel biphasic ST34 S. Typhimurium variants were significantly associated with iNTS in Vietnamese HIV-infected patients. Our study represents the first characterization of novel iNTS organisms isolated outside sub-Saharan Africa and outlines a new pathway for the emergence of alternative Salmonella variants into susceptible human populations.
IMPORTANCE: Salmonella Typhimurium is a major diarrheal pathogen and associated with invasive nontyphoid Salmonella (iNTS) disease in vulnerable populations. We present the first characterization of iNTS organisms in Southeast Asia and describe a different evolutionary trajectory from that of organisms causing iNTS in sub-Saharan Africa. In Vietnam, the globally distributed monophasic variant of Salmonella Typhimurium, the serovar I:4,[5],12:i:- ST34 clone, has reacquired a phase 2 flagellum and gained a multidrug-resistant plasmid to become associated with iNTS disease in HIV-infected patients. We document distinct communities of S. Typhimurium and I:4,[5],12:i:- in animals and humans in Vietnam, despite the greater mixing of these host populations here. These data highlight the importance of whole-genome sequencing surveillance in a One Health context in understanding the evolution and spread of resistant bacterial infections. Copyright © 2018 Mather et al.


July 7, 2019  |  

Bridging gaps in transposable element research with single-molecule and single-cell technologies

More than half of the genomic landscape in humans and many other organisms is composed of repetitive DNA, which mostly derives from transposable elements (TEs) and viruses. Recent technological advances permit improved assessment of the repetitive content across genomes, and newly developed molecular assays have revealed important roles of TEs and viruses in host genome evolution and organization. To review the current understanding of TE biology and to promote new interdisciplinary strategies for the TE research community, leading experts gathered for the 2nd Uppsala Transposon Symposium on October 4–5, 2018 in Uppsala, Sweden. Using cutting-edge single-molecule and single-cell approaches, research on TEs and other repeats has entered a new era in biological and biomedical research.

