Menu
July 7, 2019

A fast approximate algorithm for mapping long reads to large reference databases.

Emerging single-molecule sequencing technologies from Pacific Biosciences and Oxford Nanopore have revived interest in long-read mapping algorithms. Alignment-based seed-and-extend methods demonstrate good accuracy, but face limited scalability, while faster alignment-free methods typically trade decreased precision for efficiency. In this article, we combine a fast approximate read mapping algorithm based on minimizers with a novel MinHash identity estimation technique to achieve both scalability and precision. In contrast to prior methods, we develop a mathematical framework that defines the types of mapping targets we uncover, establish probabilistic estimates of p-value and sensitivity, and demonstrate tolerance for alignment error rates up to 20%. With this framework, our algorithm automatically adapts to different minimum length and identity requirements and provides both positional and identity estimates for each mapping reported. For mapping human PacBio reads to the hg38 reference, our method is 290?×?faster than Burrows-Wheeler Aligner-MEM with a lower memory footprint and recall rate of 96%. We further demonstrate the scalability of our method by mapping noisy PacBio reads (each =5?kbp in length) to the complete NCBI RefSeq database containing 838 Gbp of sequence and >60,000 genomes.


July 7, 2019

Rhodobacter sp. Rb3, an aerobic anoxygenic phototroph which thrives in the polyextreme ecosystem of the Salar de Huasco, in the Chilean Altiplano.

The Salar de Huasco is an evaporitic basin located in the Chilean Altiplano, which presents extreme environmental conditions for life, i.e. high altitude (3800 m.a.s.l.), negative water balance, a wide salinity range, high daily temperature changes and the occurrence of the highest registered solar radiation on the planet (>?1200 W m-2). This ecosystem is considered as a natural laboratory to understand different adaptations of microorganisms to extreme conditions. Rhodobacter, an anoxygenic aerobic phototrophic bacterial genus, represents one of the most abundant groups reported based on taxonomic diversity surveys in this ecosystem. The bacterial mat isolate Rhodobacter sp. strain Rb3 was used to study adaptation mechanisms to stress-inducing factors potentially explaining its success in a polyextreme ecosystem. We found that the Rhodobacter sp. Rb3 genome was characterized by a high abundance of genes involved in stress tolerance and adaptation strategies, among which DNA repair and oxidative stress were the most conspicuous. Moreover, many other molecular mechanisms associated with oxidative stress, photooxidation and antioxidants; DNA repair and protection; motility, chemotaxis and biofilm synthesis; osmotic stress, metal, metalloid and toxic anions resistance; antimicrobial resistance and multidrug pumps; sporulation; cold shock and heat shock stress; mobile genetic elements and toxin-antitoxin system were detected and identified as potential survival mechanism features in Rhodobacter sp. Rb3. In total, these results reveal a wide set of strategies used by the isolate to adapt and thrive under environmental stress conditions as a model of polyextreme environmental resistome.


July 7, 2019

PlasmidTron: assembling the cause of phenotypes and genotypes from NGS data.

Increasingly rich metadata are now being linked to samples that have been whole-genome sequenced. However, much of this information is ignored. This is because linking this metadata to genes, or regions of the genome, usually relies on knowing the gene sequence(s) responsible for the particular trait being measured and looking for its presence or absence in that genome. Examples of this would be the spread of antimicrobial resistance genes carried on mobile genetic elements (MGEs). However, although it is possible to routinely identify the resistance gene, identifying the unknown MGE upon which it is carried can be much more difficult if the starting point is short-read whole-genome sequence data. The reason for this is that MGEs are often full of repeats and so assemble poorly, leading to fragmented consensus sequences. Since mobile DNA, which can carry many clinically and ecologically important genes, has a different evolutionary history from the host, its distribution across the host population will, by definition, be independent of the host phylogeny. It is possible to use this phenomenon in a genome-wide association study to identify both the genes associated with the specific trait and also the DNA linked to that gene, for example the flanking sequence of the plasmid vector on which it is encoded, which follows the same patterns of distribution as the marker gene/sequence itself. We present PlasmidTron, which utilizes the phenotypic data normally available in bacterial population studies, such as antibiograms, virulence factors, or geographical information, to identify traits that are likely to be present on DNA that can randomly reassort across defined bacterial populations. It is also possible to use this methodology to associate unknown genes/sequences (e.g. plasmid backbones) with a specific molecular signature or marker (e.g. resistance gene presence or absence) using PlasmidTron. PlasmidTron uses a k-mer-based approach to identify reads associated with a phylogenetically unlinked phenotype. These reads are then assembled de novo to produce contigs in a fast and scalable-to-large manner. PlasmidTron is written in Python 3 and is available under the open source licence GNU GPL3 from https://github.com/sanger-pathogens/plasmidtron.


July 7, 2019

Complete genome sequences of two Bacillus pumilus strains from Cuatrociénegas, Coahuila, Mexico.

We assembled the complete genome sequences of Bacillus pumilus strains 145 and 150a from Cuatrociénegas, Mexico. We detected genes codifying for proteins potentially involved in antagonism (bacteriocins) and defense mechanisms (abortive infection bacteriophage proteins and 4-azaleucine resistance). Both strains harbored prophage sequences. Our results provide insights into understanding the establishment of microbial interactions. Copyright © 2018 Zarza et al.


July 7, 2019

Reevaluation of the complete genome sequence of Magnetospirillum gryphiswaldense MSR-1 with Single-Molecule Real-Time Sequencing data.

Magnetospirillum gryphiswaldense is a key organism for understanding magnetosome formation and magnetotaxis. As earlier studies suggested a high genomic plasticity, we (re)sequenced the type strain MSR-1 and the laboratory strain R3/S1. Both sequences differ by only 11 point mutations, but organization of the magnetosome island deviates from that of previous genome sequences. Copyright © 2018 Uebe et al.


July 7, 2019

Complete genome sequence of a VIM-1- producing Salmonella enterica subsp. enterica serovar Infantis isolate derived from minced pork meat.

Carbapenems are considered last-resort antibiotics used to treat human infections caused by multidrug-resistant bacteria. In 2011, VIM-1 carbapenemase-producing Salmonella enterica subsp. enterica serovar Infantis strains were isolated from livestock for the first time in Germany. Here, we announce the complete genome sequence of the first German blaVIM-1-harboring Salmonella Infantis isolate (15-SA01028) originating from food. Copyright © 2018 Borowiak et al.


July 7, 2019

Whole-genome sequence of phage-resistant strain Escherichia coli DH5a.

The genomes of many strains of Escherichia coli have been sequenced, as this organism is a classic model bacterium. Here, we report the genome sequence of Escherichia coli DH5a, which is resistant to a T4 bacteriophage (CCTCC AB 2015375), while its other homologous E. coli strains, such as E. coli BL21, DH10B, and MG1655, are not resistant to phage invasions. Thus, understanding of the genome of the DH5a strain, along with comparative analysis of its genome sequence along with other sequences of E. coli strains, may help to reveal the bacteriophage resistance mechanism of E. coli. Copyright © 2018 Chen et al.


July 7, 2019

Complete genome sequence of Escherichia albertii strain 1551-2, a potential extracellular and intracellular pathogen.

Escherichia albertii has recently been recognized as an emerging human and bird enteric pathogen. Here, we report the complete chromosome sequence of a clinical isolate of E. albertii strain 1551-2, which may provide information about the pathogenic potential of this new species and the mechanisms of evolution of Escherichia species. Copyright © 2018 Romão et al.


July 7, 2019

Complete genome sequence of Lelliottia nimipressuralis type strain SGAir0187, isolated from tropical air collected in Singapore.

Lelliottia nimipressuralis type strain SGAir0187 was isolated from tropical air samples collected in Singapore. The genome was assembled with an average coverage of 180-fold using Pacific Biosciences long reads and Illumina MiSeq paired-end reads. The genome measures 4.8?Mb and contains 4,424 protein-coding genes, 83 tRNAs, and 25 rRNAs. Copyright © 2018 Heinle et al.


July 7, 2019

First detection of a blaCTX-M-15-carrying plasmid in Vibrio alginolyticus.

Vibrio alginolyticus is a gram-negative halophilic bacterium, widely distributed in sea-water and seafood all over the world and is the main pathogenic bacteria of marine animals such as fish, shrimp and shellfish. Besides, it is also an important human pathogen causing eye, ear and wound infections, as well as gastroenteritis, septicemia, and necrotizing fasciitis [1]. Resistance to extended-spectrum cephalosporins is rarely ob- served in V. alginolyticus. Here, we report for the first time the identification of a foodborne V. alginolyticus strain Vb0506 carrying plasmid encoding blaCTX-M-15.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.