Menu
July 7, 2019

Complete genome sequence of the sesame pathogen Ralstonia solanacearum strain SEPPX 05.

Ralstonia solanacearum is a soil-borne phytopathogen associated with bacterial wilt disease of sesame. R. solanacearum is the predominant agent causing damping-off from tropical to temperate regions. Because bacterial wilt has decreased the sesame industry yield, we sequenced the SEPPX05 genome using PacBio and Illumina HiSeq 2500 systems and revealed that R. solanacearum strain SEPPX05 carries a bipartite genome consisting of a 3,930,849 bp chromosome and a 2,066,085 bp megaplasmid with 66.84% G+C content that harbors 5,427 coding sequences. Based on the whole genome, phylogenetic analysis showed that strain SEPPX05 is grouped with two phylotype I strains (EP1 and GMI1000). Pan-genomic analysis shows that R. solanacearum is a complex species with high biological diversity and was able to colonize various environments during evolution. Despite deletions, insertions, and inversions, most genes of strain SEPPX05 have relatively high levels of synteny compared with strain GMI1000. We identified 104 genes involved in virulence-related factors in the SEPPX05 genome and eight absent genes encoding T3Es of GMI1000. Comparing SEPPX05 with other species, we found highly conserved secretion systems central to modulating interactions of host bacteria. These data may provide important clues for understanding underlying pathogenic mechanisms of R. solanacearum and help in the control of sesame bacterial wilt.


July 7, 2019

Host genetic variation strongly influences the microbiome structure and function in fungal fruiting-bodies.

Despite increasing knowledge on host-associated microbiomes, little is known about mechanisms underlying fungus-microbiome interactions. This study aimed to examine the relative importance of host genetic, geographic and environmental variations in structuring fungus-associated microbiomes. We analyzed the taxonomic composition and function of microbiomes inhabiting fungal fruiting-bodies in relation to host genetic variation, soil pH and geographic distance between samples. For this, we sequenced the metagenomes of 40 fruiting-bodies collected from six fairy rings (i.e., genets) of a saprotrophic fungus Marasmius oreades. Our analyses revealed that fine genetic variations between host fungi could strongly affect their associated microbiome, explaining, respectively, 25% and 37% of the variation in microbiome structure and function, whereas geographic distance and soil pH remained of secondary importance. These results, together with the smaller genome size of fungi compared to other eukaryotes, suggest that fruiting-bodies are suitable for further genome-centric studies on host-microbiome interactions.© 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


July 7, 2019

Draft genome sequence of the phytopathogenic fungus Ganoderma boninense, the causal agent of basal stem rot disease on oil palm.

Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24?Mb, 495 scaffolds, and 26,226 predicted coding sequences. Copyright © 2018 Utomo et al.


July 7, 2019

A fast approximate algorithm for mapping long reads to large reference databases.

Emerging single-molecule sequencing technologies from Pacific Biosciences and Oxford Nanopore have revived interest in long-read mapping algorithms. Alignment-based seed-and-extend methods demonstrate good accuracy, but face limited scalability, while faster alignment-free methods typically trade decreased precision for efficiency. In this article, we combine a fast approximate read mapping algorithm based on minimizers with a novel MinHash identity estimation technique to achieve both scalability and precision. In contrast to prior methods, we develop a mathematical framework that defines the types of mapping targets we uncover, establish probabilistic estimates of p-value and sensitivity, and demonstrate tolerance for alignment error rates up to 20%. With this framework, our algorithm automatically adapts to different minimum length and identity requirements and provides both positional and identity estimates for each mapping reported. For mapping human PacBio reads to the hg38 reference, our method is 290?×?faster than Burrows-Wheeler Aligner-MEM with a lower memory footprint and recall rate of 96%. We further demonstrate the scalability of our method by mapping noisy PacBio reads (each =5?kbp in length) to the complete NCBI RefSeq database containing 838 Gbp of sequence and >60,000 genomes.


July 7, 2019

Satellite DNA evolution: old ideas, new approaches.

A substantial portion of the genomes of most multicellular eukaryotes consists of large arrays of tandemly repeated sequence, collectively called satellite DNA. The processes generating and maintaining different satellite DNA abundances across lineages are important to understand as satellites have been linked to chromosome mis-segregation, disease phenotypes, and reproductive isolation between species. While much theory has been developed to describe satellite evolution, empirical tests of these models have fallen short because of the challenges in assessing satellite repeat regions of the genome. Advances in computational tools and sequencing technologies now enable identification and quantification of satellite sequences genome-wide. Here, we describe some of these tools and how their applications are furthering our knowledge of satellite evolution and function. Copyright © 2018 Elsevier Ltd. All rights reserved.


July 7, 2019

Tigmint: correcting assembly errors using linked reads from large molecules.

Genome sequencing yields the sequence of many short snippets of DNA (reads) from a genome. Genome assembly attempts to reconstruct the original genome from which these reads were derived. This task is difficult due to gaps and errors in the sequencing data, repetitive sequence in the underlying genome, and heterozygosity. As a result, assembly errors are common. In the absence of a reference genome, these misassemblies may be identified by comparing the sequencing data to the assembly and looking for discrepancies between the two. Once identified, these misassemblies may be corrected, improving the quality of the assembled sequence. Although tools exist to identify and correct misassemblies using Illumina paired-end and mate-pair sequencing, no such tool yet exists that makes use of the long distance information of the large molecules provided by linked reads, such as those offered by the 10x Genomics Chromium platform. We have developed the tool Tigmint to address this gap.To demonstrate the effectiveness of Tigmint, we applied it to assemblies of a human genome using short reads assembled with ABySS 2.0 and other assemblers. Tigmint reduced the number of misassemblies identified by QUAST in the ABySS assembly by 216 (27%). While scaffolding with ARCS alone more than doubled the scaffold NGA50 of the assembly from 3 to 8 Mbp, the combination of Tigmint and ARCS improved the scaffold NGA50 of the assembly over five-fold to 16.4 Mbp. This notable improvement in contiguity highlights the utility of assembly correction in refining assemblies. We demonstrate the utility of Tigmint in correcting the assemblies of multiple tools, as well as in using Chromium reads to correct and scaffold assemblies of long single-molecule sequencing.Scaffolding an assembly that has been corrected with Tigmint yields a final assembly that is both more correct and substantially more contiguous than an assembly that has not been corrected. Using single-molecule sequencing in combination with linked reads enables a genome sequence assembly that achieves both a high sequence contiguity as well as high scaffold contiguity, a feat not currently achievable with either technology alone.


July 7, 2019

PlasmidTron: assembling the cause of phenotypes and genotypes from NGS data.

Increasingly rich metadata are now being linked to samples that have been whole-genome sequenced. However, much of this information is ignored. This is because linking this metadata to genes, or regions of the genome, usually relies on knowing the gene sequence(s) responsible for the particular trait being measured and looking for its presence or absence in that genome. Examples of this would be the spread of antimicrobial resistance genes carried on mobile genetic elements (MGEs). However, although it is possible to routinely identify the resistance gene, identifying the unknown MGE upon which it is carried can be much more difficult if the starting point is short-read whole-genome sequence data. The reason for this is that MGEs are often full of repeats and so assemble poorly, leading to fragmented consensus sequences. Since mobile DNA, which can carry many clinically and ecologically important genes, has a different evolutionary history from the host, its distribution across the host population will, by definition, be independent of the host phylogeny. It is possible to use this phenomenon in a genome-wide association study to identify both the genes associated with the specific trait and also the DNA linked to that gene, for example the flanking sequence of the plasmid vector on which it is encoded, which follows the same patterns of distribution as the marker gene/sequence itself. We present PlasmidTron, which utilizes the phenotypic data normally available in bacterial population studies, such as antibiograms, virulence factors, or geographical information, to identify traits that are likely to be present on DNA that can randomly reassort across defined bacterial populations. It is also possible to use this methodology to associate unknown genes/sequences (e.g. plasmid backbones) with a specific molecular signature or marker (e.g. resistance gene presence or absence) using PlasmidTron. PlasmidTron uses a k-mer-based approach to identify reads associated with a phylogenetically unlinked phenotype. These reads are then assembled de novo to produce contigs in a fast and scalable-to-large manner. PlasmidTron is written in Python 3 and is available under the open source licence GNU GPL3 from https://github.com/sanger-pathogens/plasmidtron.


July 7, 2019

Draft genome sequence of a Shewanella halifaxensis strain isolated from the intestine of marine red seabream (Pagrus major), which includes an integrative conjugative element with macrolide resistance genes.

Shewanella halifaxensis strain 6JANF4-E-4 was isolated from the intestine of a red seabream (Pagrus major). Here, we report the draft genome sequence of this bacterium, which includes an integrative conjugative element of the SXT/R391 family, where the macrolide resistance determinants mef(C) and mph(G) exist. Copyright © 2018 Sugimoto et al.


July 7, 2019

Complete genome sequence of Lactobacillus plantarum subsp. plantarum strain LB1-2, Iiolated from the hindgut of European honeybees, Apis mellifera L., from the Philippines.

Lactobacillus plantarum subsp. plantarum strain LB1-2, isolated from the hindgut of European honeybees in the Philippines, is active against Paenibacillus larvae and has broad activity against several Gram-positive and Gram-negative bacteria. The complete genome sequence reported herein contains gene clusters for multiple bacteriocins and extensive gene inventories for carbohydrate metabolism. Copyright © 2018 Ilagan-Cruzada et al.


July 7, 2019

Complete genome sequence of a VIM-1- producing Salmonella enterica subsp. enterica serovar Infantis isolate derived from minced pork meat.

Carbapenems are considered last-resort antibiotics used to treat human infections caused by multidrug-resistant bacteria. In 2011, VIM-1 carbapenemase-producing Salmonella enterica subsp. enterica serovar Infantis strains were isolated from livestock for the first time in Germany. Here, we announce the complete genome sequence of the first German blaVIM-1-harboring Salmonella Infantis isolate (15-SA01028) originating from food. Copyright © 2018 Borowiak et al.


July 7, 2019

The sequenced angiosperm genomes and genome databases.

Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.


July 7, 2019

Molecular preadaptation to antimony resistance in Leishmania donovani on the Indian subcontinent.

Antimonials (Sb) were used for decades for chemotherapy of visceral leishmaniasis (VL). Now abandoned in the Indian subcontinent (ISC) because of Leishmania donovani resistance, this drug offers a unique model for understanding drug resistance dynamics. In a previous phylogenomic study, we found two distinct populations of L. donovani: the core group (CG) in the Gangetic plains and ISC1 in the Nepalese highlands. Sb resistance was only encountered within the CG, and a series of potential markers were identified. Here, we analyzed the development of resistance to trivalent antimonials (SbIII) upon experimental selection in ISC1 and CG strains. We observed that (i) baseline SbIII susceptibility of parasites was higher in ISC1 than in the CG, (ii) time to SbIII resistance was higher for ISC1 parasites than for CG strains, and (iii) untargeted genomic and metabolomic analyses revealed molecular changes along the selection process: these were more numerous in ISC1 than in the CG. Altogether these observations led to the hypothesis that CG parasites are preadapted to SbIII resistance. This hypothesis was experimentally confirmed by showing that only wild-type CG strains could survive a direct exposure to the maximal concentration of SbIII The main driver of this preadaptation was shown to be MRPA, a gene involved in SbIII sequestration and amplified in an intrachromosomal amplicon in all CG strains characterized so far. This amplicon emerged around 1850 in the CG, well before the implementation of antimonials for VL chemotherapy, and we discuss here several hypotheses of selective pressure that could have accompanied its emergence.IMPORTANCE The “antibiotic resistance crisis” is a major challenge for scientists and medical professionals. This steady rise in drug-resistant pathogens also extends to parasitic diseases, with antimony being the first anti-Leishmania drug that fell in the Indian subcontinent (ISC). Leishmaniasis is a major but neglected infectious disease with limited therapeutic options. Therefore, understanding how parasites became resistant to antimonials is of commanding importance. In this study, we experimentally characterized the dynamics of this resistance acquisition and show for the first time that some Leishmania populations of the ISC were preadapted to antimony resistance, likely driven by environmental factors or by drugs used in the 19th century. Copyright © 2018 Dumetz et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.