Menu
July 7, 2019

Efficient cardinality estimation for k-mers in large DNA sequencing data sets

We present an open implementation of the HyperLogLog cardinality estimation sketch for counting fixed-length substrings of DNA strings (k-mers). The HyperLogLog sketch implementation is in C++ with a Python interface, and is distributed as part of the khmer software package. khmer is freely available from urlhttps://github.com/dib-lab/khmer under a BSD License. The features presented here are included in version 1.4 and later.


July 7, 2019

Atypical Salmonella enterica serovars in murine and human infection models: Is it time to reassess our approach to the study of salmonellosis?

Nontyphoidal Salmonella species are globally disseminated pathogens and the predominant cause of gastroenteritis. The pathogenesis of salmonellosis has been extensively studied using in vivo murine models and cell lines typically challenged with Salmonella Typhimurium. Although serovars Enteritidis and Typhimurium are responsible for the most of human infections reported to the CDC, several other serovars also contribute to clinical cases of salmonellosis. Despite their epidemiological importance, little is known about their infection phenotypes. Here, we report the virulence characteristics and genomes of 10 atypical S. enterica serovars linked to multistate foodborne outbreaks in the United States. We show that the murine RAW 264.7 macrophage model of infection is unsuitable for inferring human relevant differences in nontyphoidal Salmonella infections whereas differentiated human THP-1 macrophages allowed these isolates to be further characterised in a more relevant, human context.


July 7, 2019

Characterization of the first cultured representative of Verrucomicrobia subdivision 5 indicates the proposal of a novel phylum.

The recently isolated strain L21-Fru-AB(T) represents moderately halophilic, obligately anaerobic and saccharolytic bacteria that thrive in the suboxic transition zones of hypersaline microbial mats. Phylogenetic analyses based on 16S rRNA genes, RpoB proteins and gene content indicated that strain L21-Fru-AB(T) represents a novel species and genus affiliated with a distinct phylum-level lineage originally designated Verrucomicrobia subdivision 5. A survey of environmental 16S rRNA gene sequences revealed that members of this newly recognized phylum are wide-spread and ecologically important in various anoxic environments ranging from hypersaline sediments to wastewater and the intestine of animals. Characteristic phenotypic traits of the novel strain included the formation of extracellular polymeric substances, a Gram-negative cell wall containing peptidoglycan and the absence of odd-numbered cellular fatty acids. Unusual metabolic features deduced from analysis of the genome sequence were the production of sucrose as osmoprotectant, an atypical glycolytic pathway lacking pyruvate kinase and the synthesis of isoprenoids via mevalonate. On the basis of the analyses of phenotypic, genomic and environmental data, it is proposed that strain L21-Fru-AB(T) and related bacteria are specifically adapted to the utilization of sulfated glycopolymers produced in microbial mats or biofilms.


July 7, 2019

Structural and functional analysis of the finished genome of the recently isolated toxic Anabaena sp. WA102.

Very few closed genomes of the cyanobacteria that commonly produce toxic blooms in lakes and reservoirs are available, limiting our understanding of the properties of these organisms. A new anatoxin-a-producing member of the Nostocaceae, Anabaena sp. WA102, was isolated from a freshwater lake in Washington State, USA, in 2013 and maintained in non-axenic culture.The Anabaena sp. WA102 5.7 Mbp genome assembly has been closed with long-read, single-molecule sequencing and separately a draft genome assembly has been produced with short-read sequencing technology. The closed and draft genome assemblies are compared, showing a correlation between long repeats in the genome and the many gaps in the short-read assembly. Anabaena sp. WA102 encodes anatoxin-a biosynthetic genes, as does its close relative Anabaena sp. AL93 (also introduced in this study). These strains are distinguished by differences in the genes for light-harvesting phycobilins, with Anabaena sp. AL93 possessing a phycoerythrocyanin operon. Biologically relevant structural variants in the Anabaena sp. WA102 genome were detected only by long-read sequencing: a tandem triplication of the anaBCD promoter region in the anatoxin-a synthase gene cluster (not triplicated in Anabaena sp. AL93) and a 5-kbp deletion variant present in two-thirds of the population. The genome has a large number of mobile elements (160). Strikingly, there was no synteny with the genome of its nearest fully assembled relative, Anabaena sp. 90.Structural and functional genome analyses indicate that Anabaena sp. WA102 has a flexible genome. Genome closure, which can be readily achieved with long-read sequencing, reveals large scale (e.g., gene order) and local structural features that should be considered in understanding genome evolution and function.


July 7, 2019

A roadmap for gene system development in Clostridium.

Clostridium species are both heroes and villains. Some cause serious human and animal diseases, those present in the gut microbiota generally contribute to health and wellbeing, while others represent useful industrial chassis for the production of chemicals and fuels. To understand, counter or exploit, there is a fundamental requirement for effective systems that may be used for directed or random genome modifications. We have formulated a simple roadmap whereby the necessary gene systems maybe developed and deployed. At its heart is the use of ‘pseudo-suicide’ vectors and the creation of a pyrE mutant (a uracil auxotroph), initially aided by ClosTron technology, but ultimately made using a special form of allelic exchange termed ACE (Allele-Coupled Exchange). All mutants, regardless of the mutagen employed, are made in this host. This is because through the use of ACE vectors, mutants can be rapidly complemented concomitant with correction of the pyrE allele and restoration of uracil prototrophy. This avoids the phenotypic effects frequently observed with high copy number plasmids and dispenses with the need to add antibiotic to ensure plasmid retention. Once available, the pyrE host may be used to stably insert all manner of application specific modules. Examples include, a sigma factor to allow deployment of a mariner transposon, hydrolases involved in biomass deconstruction and therapeutic genes in cancer delivery vehicles. To date, provided DNA transfer is obtained, we have not encountered any clostridial species where this technology cannot be applied. These include, Clostridium difficile, Clostridium acetobutylicum, Clostridium beijerinckii, Clostridium botulinum, Clostridium perfringens, Clostridium sporogenes, Clostridium pasteurianum, Clostridium ljungdahlii, Clostridium autoethanogenum and even Geobacillus thermoglucosidasius. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.


July 7, 2019

Ploidy influences the functional attributes of de novo lager yeast hybrids.

The genomes of hybrid organisms, such as lager yeast (Saccharomyces cerevisiae × Saccharomyces eubayanus), contain orthologous genes, the functionality and effect of which may differ depending on their origin and copy number. How the parental subgenomes in lager yeast contribute to important phenotypic traits such as fermentation performance, aroma production, and stress tolerance remains poorly understood. Here, three de novo lager yeast hybrids with different ploidy levels (allodiploid, allotriploid, and allotetraploid) were generated through hybridization techniques without genetic modification. The hybrids were characterized in fermentations of both high gravity wort (15 °P) and very high gravity wort (25 °P), which were monitored for aroma compound and sugar concentrations. The hybrid strains with higher DNA content performed better during fermentation and produced higher concentrations of flavor-active esters in both worts. The hybrid strains also outperformed both the parent strains. Genome sequencing revealed that several genes related to the formation of flavor-active esters (ATF1, ATF2¸ EHT1, EEB1, and BAT1) were present in higher copy numbers in the higher ploidy hybrid strains. A direct relationship between gene copy number and transcript level was also observed. The measured ester concentrations and transcript levels also suggest that the functionality of the S. cerevisiae- and S. eubayanus-derived gene products differs. The results contribute to our understanding of the complex molecular mechanisms that determine phenotypes in lager yeast hybrids and are expected to facilitate targeted strain development through interspecific hybridization.


July 7, 2019

Reply to Bemm et al. and Arakawa: Identifying foreign genes in independent Hypsibius dujardini genome assemblies.

Our report (1) describing the discovery of extensive horizontal gene transfer in a tardigrade genome has raised questions from other groups who were sequencing the Hypsibius dujardini genome in parallel or who have done new experiments and analyses since our report (2??–5). Bemm et al. (2) now report filtering our data for likely contaminants, resulting in a new, prefiltered genome assembly. Arakawa (3) has sequenced genomes of starved, washed, individual animals that had been treated with antibiotics for 48 h, and used this genomic sequence and RNA-Seq data to identify likely bona fide tardigrade contigs. Two other reports have contributed data and analysis: Delmont and Eren (4) used a newly published analysis and visualization platform, Anvi’o (6), to identify likely contaminants in our genome assembly, and Koutsovoulos et al. (5) applied useful taxon-annotated GC coverage plots (Blobplots) (7) to our data and reported an independent genome assembly.


July 7, 2019

Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.


July 7, 2019

Co-utilization of glucose and xylose by evolved Thermus thermophilus LC113 strain elucidated by (13)C metabolic flux analysis and whole genome sequencing.

We evolved Thermus thermophilus to efficiently co-utilize glucose and xylose, the two most abundant sugars in lignocellulosic biomass, at high temperatures without carbon catabolite repression. To generate the strain, T. thermophilus HB8 was first evolved on glucose to improve its growth characteristics, followed by evolution on xylose. The resulting strain, T. thermophilus LC113, was characterized in growth studies, by whole genome sequencing, and (13)C-metabolic flux analysis ((13)C-MFA) with [1,6-(13)C]glucose, [5-(13)C]xylose, and [1,6-(13)C]glucose+[5-(13)C]xylose as isotopic tracers. Compared to the starting strain, the evolved strain had an increased growth rate (~2-fold), increased biomass yield, increased tolerance to high temperatures up to 90°C, and gained the ability to grow on xylose in minimal medium. At the optimal growth temperature of 81°C, the maximum growth rate on glucose and xylose was 0.44 and 0.46h(-1), respectively. In medium containing glucose and xylose the strain efficiently co-utilized the two sugars. (13)C-MFA results provided insights into the metabolism of T. thermophilus LC113 that allows efficient co-utilization of glucose and xylose. Specifically, (13)C-MFA revealed that metabolic fluxes in the upper part of metabolism adjust flexibly to sugar availability, while fluxes in the lower part of metabolism remain relatively constant. Whole genome sequence analysis revealed two large structural changes that can help explain the physiology of the evolved strain: a duplication of a chromosome region that contains many sugar transporters, and a 5x multiplication of a region on the pVV8 plasmid that contains xylose isomerase and xylulokinase genes, the first two enzymes of xylose catabolism. Taken together, (13)C-MFA and genome sequence analysis provided complementary insights into the physiology of the evolved strain. Copyright © 2016 International Metabolic Engineering Society. Published by Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of Vibrio alginolyticus ATCC 33787(T) isolated from seawater with three native megaplasmids.

Vibrio alginolyticus, an opportunistic pathogen, is commonly associated with vibriosis in fish and shellfish and can also cause superficial and ear infections in humans. V. alginolyticus ATCC 33787(T) was originally isolated from seawater and has been used as one of the type strains for exploring the virulence factors of marine bacteria and for developing vaccine against vibriosis. Here we sequenced and assembled the whole genome of this strain, and identified three megaplasmids and three Type VI secretion systems, thus providing useful information for the study of virulence factors and for the development of vaccine for Vibrio. Copyright © 2016. Published by Elsevier B.V.


July 7, 2019

Complete genome sequence of the crude oil-degrading thermophilic bacterium Geobacillus sp. JS12.

Here, we report the complete genome sequence of Geobacillus sp. JS12, isolated from composts located in Namhae, Korea, which shows extracellular lipolytic activities at high temperatures. An array of genes related to the utilization of lipids was identified by whole genome analysis. The genome sequence of the strain JS12 provides basic information for wider exploitation of thermostable industrial lipases. Copyright © 2016 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of the Streptomyces sp. strain CdTB01, a bacterium tolerant to cadmium.

Streptomyces sp. Strain CdTB01, which is tolerant to high concentrations of heavy metals, particularly cadmium, was isolated from soil contaminated with heavy metals. Two contigs with total genome size of 10.19Mb were identified in the whole genome sequencing and assembly, and numerous homologous genes known to be involved in heavy metal resistance were found in the genome. Copyright © 2016 Elsevier B.V. All rights reserved.


July 7, 2019

Pseudomonas cerasi sp. nov. (non Griffin, 1911) isolated from diseased tissue of cherry.

Eight isolates of Gram-negative fluorescent bacteria (58(T), 122, 374, 791, 963, 966, 970a and 1021) were obtained from diseased tissue of cherry trees from different regions of Poland. The symptoms resembled those of bacterial canker. Based on an analysis of 16S rDNA sequences the isolates shared the highest over 99.9% similarity with Pseudomonas ficuserectae JCM 2400(T) and P. congelans DSM 14939(T). Phylogenetic analysis using housekeeping genes gyrB, rpoD and rpoB revealed that they form a separate cluster and confirmed their closest relation to P. syringae NCPPB 281(T) and P. congelans LMG 21466(T). DNA-DNA hybridization between the cherry isolate 58(T) and the type strains of these two closely related species revealed relatedness values of 58.2% and 41.9%, respectively. This was further supported by Average Nucleotide Identity (ANIb) and Genome-to-Genome Distance (GGDC) between the whole genome sequences of strain LMG 28609(T) and closely related Pseudomonas species. The major cellular fatty acids are 16:0 and summed feature 3 (16:1 ?7c/15:0 iso 2OH). Phenotypic characteristics differentiated the novel isolates from other closely related species. The G+C content of the genomic DNA of strain 58(T) was 59%. The diversity was proved by PCR MP and BOX PCR, eliminating the possibility that they constitute a clonal population. Based on the evidence of this polyphasic taxonomic study the eight strains are considered to represent a novel species of the genus Pseudomonas for which the name P. cerasi sp. nov. (non Griffin, 1911) is proposed. The type strain of this species is 58(T) (=LMG 28609(T)=CFBP 8305(T)). Copyright © 2016 Elsevier GmbH. All rights reserved.


July 7, 2019

TERRA promotes telomerase-mediated telomere elongation in Schizosaccharomyces pombe.

Telomerase-mediated telomere elongation provides cell populations with the ability to proliferate indefinitely. Telomerase is capable of recognizing and extending the shortest telomeres in cells; nevertheless, how this mechanism is executed remains unclear. Here, we show that, in the fission yeast Schizosaccharomyces pombe, shortened telomeres are highly transcribed into the evolutionarily conserved long noncoding RNA TERRA A fraction of TERRA produced upon telomere shortening is polyadenylated and largely devoid of telomeric repeats, and furthermore, telomerase physically interacts with this polyadenylated TERRA in vivo We also show that experimentally enhanced transcription of a manipulated telomere promotes its association with telomerase and concomitant elongation. Our data represent the first direct evidence that TERRA stimulates telomerase recruitment and activity at chromosome ends in an organism with human-like telomeres. © 2016 The Authors.


July 7, 2019

Evaluation of an optimal epidemiologic typing scheme for Legionella pneumophila with whole genome sequence data using validation guidelines.

Sequence-based typing (SBT), analogous to multi-locus sequence typing (MLST), is the current gold-standard typing method for investigation of legionellosis outbreaks caused by Legionella pneumophila However, as common sequence types (STs) cause many infections, some investigations remain unresolved. Here, various whole genome sequencing (WGS)-based methods were evaluated according to published guidelines, including: i) single nucleotide polymorphism (SNP)-based; ii) extended multi-locus sequence typing (MLST) using different numbers of genes; iii) gene presence/absence, and iv) kmer-based. L. pneumophila serogroup 1 isolates (n=106) from the standard “typing panel”, previously used by the European Society for Clinical Microbiology Study Group on Legionella Infections (ESGLI) were tested together with another 229 isolates.Over 98% isolates were considered typable using the mapping- and kmer-based methods. Percentages of isolates with complete extended MLST profiles ranged from 99.1% (50-gene) to 86.8% (1455-gene) whilst only 41.5% produced a full profile with the gene presence/absence scheme. Replicates demonstrated that all methods offer 100% reproducibility. Indices of discrimination range from 0.972 (ribosomal MLST) to 0.999 (SNP-based), and all values are higher than that achieved with SBT (0.940). Epidemiological concordance is generally inversely related to discriminatory power. We propose that an extended MLST scheme with ~50 genes provides optimal epidemiological concordance whilst substantially improving the discrimination offered by SBT, and can be used as part of a hierarchical typing scheme that should maintain backwards compatibility and increase discrimination where necessary. This analysis will be useful for the ESGLI to design a scheme that has the potential to become the new gold standard typing method for L. pneumophila. Copyright © 2016 David et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.