Menu
July 7, 2019

Complete genome sequence of Microcystis aeruginosa NIES-2481 and common genomic features of group G M. aeruginosa.

Microcystis aeruginosa is a freshwater bloom-forming cyanobacterium that is distributed worldwide. M. aeruginosa can be divided into at least 8 phylogenetic groups (A-G and X) at the intraspecific level. Here, we report the complete genome sequence of M. aeruginosa NIES-2481, which was isolated from Lake Kasumigaura, Japan, and is assigned to group G. The complete genome sequence of M. aeruginosa NIES-2481 comprises a 4.29-Mbp circular chromosome and a 147,539-bp plasmid; the circular chromosome and the plasmid contain 4,332 and 167 protein-coding genes, respectively. Comparative analysis with the complete genome of M. aeruginosa NIES-2549, which belongs to the same group with NIES-2481, showed that the genome size is the smallest level in previously sequenced M. aeruginosa strains, and the genomes do not contain a microcystin biosynthetic gene cluster in common. Synteny analysis revealed only small-scale rearrangements between the two genomes.


July 7, 2019

Complete genome sequence of “Thiodictyon syntrophicum” sp. nov. strain Cad16T, a photolithoautotrophic purple sulfur bacterium isolated from the alpine meromictic Lake Cadagno.

Thiodictyon syntrophicum sp. nov. strain Cad16T is a photoautotrophic purple sulfur bacterium belonging to the family of Chromatiaceae in the class of Gammaproteobacteria. The type strain Cad16T was isolated from the chemocline of the alpine meromictic Lake Cadagno in Switzerland. Strain Cad16T represents a key species within this sulfur-driven bacterial ecosystem with respect to carbon fixation. The 7.74-Mbp genome of strain Cad16T has been sequenced and annotated. It encodes 6237 predicted protein sequences and 59 RNA sequences. Phylogenetic comparison based on 16S rRNA revealed that Thiodictyon elegans strain DSM 232T the most closely related species. Genes involved in sulfur oxidation, central carbon metabolism and transmembrane transport were found. Noteworthy, clusters of genes encoding the photosynthetic machinery and pigment biosynthesis are found on the 0.48 Mb plasmid pTs485. We provide a detailed insight into the Cad16T genome and analyze it in the context of the microbial ecosystem of Lake Cadagno.


July 7, 2019

Complete genome sequence of Gordonia sp. YC-JH1, a bacterium efficiently degrading a wide range of phthalic acid esters.

Phthalic acid esters (PAEs) are a family of recalcitrant pollutants mainly used as plasticizer. The strain Gordonia sp.YC-JH1, isolated from petroleum-contaminated soil, is capable of efficiently degrading a wide range of PAEs. In order to pertinently investigate the genetic mechanism of PAEs catabolism by strain YC-JH1, its complete genome sequencing has been performed by SMRT sequencing technology. The genome comprises a circular chromosome and a plasmid with a size of 4,101,557 bp and 91,767 bp respectively. Based on the genome sequence, 3563 protein-coding genes are predicted, of which the genes responsible for PAEs degradation are identified, including the two genes of PAEs hydrolase and the gene clusters for phthalic acid and protocatechuic acid degradation. The genome information provides genomic basis of PAEs degradation to allow the complete metabolism of PAEs. The wide substrate spectrum and its genetic basis of this strain should expand its application potential for environments bioremediation, provide novel gene resources involved in PAEs degradation for biotechnology and gene engineering, and contribute to shed light on the mechanism of PAEs metabolism. Copyright © 2018. Published by Elsevier B.V.


July 7, 2019

Thauera sinica sp. nov., a phenol derivative-degrading bacterium isolated from activated sludge.

A bacterial strain, K11T, capable of degrading phenol derivatives was isolated from activated sludge of a sewage treatment plant in China. This strain, which can degrade more than ten phenol derivatives, was identified as a Gram-stain negative, rod-shaped, asporogenous, facultative anaerobic bacterium with a polar flagellum. The strain was found to grow in tryptic soy broth in the presence of 0-2.5% (w/v) NaCl (optimum 0-1%), at 4-43 °C (optimum 30-35 °C) and pH 4.5-10.5 (optimum 7.5-8). Comparative analysis of nearly full-length 16S rRNA gene sequences showed that this strain belongs to the genus Thauera. The 16S rRNA gene sequence was found to show high similarity (97.5%) to that of Thauera chlorobenzoica 3CB-1T, with lesser similarity to other recognised Thauera strains. The G+C content of the DNA of the strain was determined to be 67.8 mol%. The DNA-DNA hybridization value between K11T and Thauera aromatica DSM6984T was 10.4 ± 4.5%. The genomic OrthoANI values of K11T with the other nine type strains of genus Thauera were less than 81.1%. Chemotaxonomic analysis of strain K11T revealed that Q-8 is the predominant quinone; the polar lipids contain phosphatidylglycerol, diphosphatidylglycerol, phosphatidylethanolamine, two unidentified phospholipids and five uncharacterised lipids; the major cellular fatty acid was identified as summed feature 3 (C16:1 ?7c and/or iso-C15:0 2-OH; 45.9%), followed by C16:0 (20.5%) and C18:1 ?7c (15.8%). Based on the phenotypic and phylogenetic evidence, DNA-DNA hybridisation, OrthoANI, chemotaxonomic analysis and results of the physiological and biochemical tests, a new species named Thauera sinica sp. nov. is proposed with strain K11T (= CGMCC 1.15731T = KACC 19216T) designated as the type strain.


July 7, 2019

RIFRAF: a frame-resolving consensus algorithm.

Protein coding genes can be studied using long-read next generation sequencing. However, high rates of indel sequencing errors are problematic, corrupting the reading frame. Even the consensus of multiple independent sequence reads retains indel errors. To solve this problem, we introduce Reference-Informed Frame-Resolving multiple-Alignment Free template inference algorithm (RIFRAF), a sequence consensus algorithm that takes a set of error-prone reads and a reference sequence and infers an accurate in-frame consensus. RIFRAF uses a novel structure, analogous to a two-layer hidden Markov model: the consensus is optimized to maximize alignment scores with both the set of noisy reads and with a reference. The template-to-reads component of the model encodes the preponderance of indels, and is sensitive to the per-base quality scores, giving greater weight to more accurate bases. The reference-to-template component of the model penalizes frame-destroying indels. A local search algorithm proceeds in stages to find the best consensus sequence for both objectives.Using Pacific Biosciences SMRT sequences from an HIV-1 env clone, NL4-3, we compare our approach to other consensus and frame correction methods. RIFRAF consistently finds a consensus sequence that is more accurate and in-frame, especially with small numbers of reads. It was able to perfectly reconstruct over 80% of consensus sequences from as few as three reads, whereas the best alternative required twice as many. RIFRAF is able to achieve these results and keep the consensus in-frame even with a distantly related reference sequence. Moreover, unlike other frame correction methods, RIFRAF can detect and keep true indels while removing erroneous ones.RIFRAF is implemented in Julia, and source code is publicly available at https://github.com/MurrellGroup/Rifraf.jl.Supplementary data are available at Bioinformatics online.


July 7, 2019

The draft genome of the lichen-forming fungus Lasallia hispanica (Frey) Sancho & A. Crespo

Lasallia hispanica (Frey) Sancho & A. Crespo is one of three Lasallia species occurring in central-western Europe. It is an orophytic, photophilous Mediterranean endemic which is sympatric with the closely related, widely distributed, highly clonal sister taxon L. pustulata in the supra- and oro-Mediterranean belts. We sequenced the genome of L. hispanica from a multispore isolate. The total genome length is 41·2 Mb, including 8488 gene models. We present the annotation of a variety of genes that are involved in protein secretion, mating processes and secondary metabolism, and we report transposable elements. Additionally, we compared the genome of L. hispanica to the closely related, yet ecologically distant, L. pustulata and found high synteny in gene content and order. The newly assembled and annotated L. hispanica genome represents a useful resource for future investigations into niche differentiation, speciation and microevolution in L. hispanica and other members of the genus.


July 7, 2019

ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.

The long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-based mapping approach. The kmer mapping strategy has several advantages over read alignment methods, including better usability and faster processing, as it precludes the need for input sequence formatting and draft sequence assembly indexing. The reliance on kmers instead of read alignments for pairing sequences relaxes the workflow requirements, and drastically reduces the run time.Here, we show how linked reads, when used in conjunction with Hi-C data for scaffolding, improve a draft human genome assembly of PacBio long-read data five-fold (baseline vs. ARKS NG50?=?4.6 vs. 23.1 Mbp, respectively). We also demonstrate how the method provides further improvements of a megabase-scale Supernova human genome assembly (NG50?=?14.74 Mbp vs. 25.94 Mbp before and after ARKS), which itself exclusively uses linked read data for assembly, with an execution speed six to nine times faster than competitive linked read scaffolders (~?10.5 h compared to 75.7 h, on average). Following ARKS scaffolding of a human genome 10xG Supernova assembly (of cell line NA12878), fewer than 9 scaffolds cover each chromosome, except the largest (chromosome 1, n?=?13).ARKS uses a kmer mapping strategy instead of linked read alignments to record and associate the barcode information needed to order and orient draft assembly sequences. The simplified workflow, when compared to that of our initial implementation, ARCS, markedly improves run time performances on experimental human genome datasets. Furthermore, the novel distance estimator in ARKS utilizes barcoding information from linked reads to estimate gap sizes. It accomplishes this by modeling the relationship between known distances of a region within contigs and calculating associated Jaccard indices. ARKS has the potential to provide correct, chromosome-scale genome assemblies, promptly. We expect ARKS to have broad utility in helping refine draft genomes.


July 7, 2019

Analysis of resistance genes of clinical Pannonibacter phragmitetus strain 31801 by complete genome sequencing.

To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of ß-lactamase [NPS ß-lactamase (EC 3.5.2.6), ß-lactamase class C, and a metal-dependent hydrolase of ß-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.


July 7, 2019

Comparative analysis of core genome MLST and SNP typing within a European Salmonella serovar Enteritidis outbreak.

Multi-country outbreaks of foodborne bacterial disease present challenges in their detection, tracking, and notification. As food is increasingly distributed across borders, such outbreaks are becoming more common. This increases the need for high-resolution, accessible, and replicable isolate typing schemes. Here we evaluate a core genome multilocus typing (cgMLST) scheme for the high-resolution reproducible typing of Salmonella enterica (S. enterica) isolates, by its application to a large European outbreak of S. enterica serovar Enteritidis. This outbreak had been extensively characterised using single nucleotide polymorphism (SNP)-based approaches. The cgMLST analysis was congruent with the original SNP-based analysis, the epidemiological data, and whole genome MLST (wgMLST) analysis. Combination of the cgMLST and epidemiological data confirmed that the genetic diversity among the isolates predated the outbreak, and was likely present at the infection source. There was consequently no link between country of isolation and genetic diversity, but the cgMLST clusters were congruent with date of isolation. Furthermore, comparison with publicly available Enteritidis isolate data demonstrated that the cgMLST scheme presented is highly scalable, enabling outbreaks to be contextualised within the Salmonella genus. The cgMLST scheme is therefore shown to be a standardised and scalable typing method, which allows Salmonella outbreaks to be analysed and compared across laboratories and jurisdictions. Copyright © 2018. Published by Elsevier B.V.


July 7, 2019

Complete genome sequence of Acinetobacter radioresistens strain LH6, a multidrug-resistant bacteriophage-propagating strain.

Antimicrobial resistance is a major problem worldwide. Understanding the interplay between drug-resistant pathogens, such as Acinetobacter baumannii and related species, potentially acting as environmental reservoirs is critical for preventing the spread of resistance determinants. Here we report the complete genome sequence of a multidrug-resistant bacteriophage-propagating strain of Acinetobacter radioresistens.


July 7, 2019

Low-level antimicrobials in the medicinal leech select for resistant pathogens that spread to patients.

Fluoroquinolones (FQs) and ciprofloxacin (Cp) are important antimicrobials that pollute the environment in trace amounts. Although Cp has been recommended as prophylaxis for patients undergoing leech therapy to prevent infections by the leech gut symbiont Aeromonas, a puzzling rise in Cp-resistant (Cpr) Aeromonas infections has been reported. We report on the effects of subtherapeutic FQ concentrations on bacteria in an environmental reservoir, the medicinal leech, and describe the presence of multiple antibiotic resistance mutations and a gain-of-function resistance gene. We link the rise of CprAeromonas isolates to exposure of the leech microbiota to very low levels of Cp (0.01 to 0.04 µg/ml), <1/100 of the clinical resistance breakpoint for Aeromonas Using competition experiments and comparative genomics of 37 strains, we determined the mechanisms of resistance in clinical and leech-derived Aeromonas isolates, traced their origin, and determined that the presence of merely 0.01 µg/ml Cp provides a strong competitive advantage for Cpr strains. Deep-sequencing the Cpr-conferring region of gyrA enabled tracing of the mutation-harboring Aeromonas population in archived gut samples, and an increase in the frequency of the Cpr-conferring mutation in 2011 coincides with the initial reports of CprAeromonas infections in patients receiving leech therapy.IMPORTANCE The role of subtherapeutic antimicrobial contamination in selecting for resistant strains has received increasing attention and is an important clinical matter. This study describes the relationship of resistant bacteria from the medicinal leech, Hirudo verbana, with patient infections following leech therapy. While our results highlight the need for alternative antibiotic therapies, the rise of Cpr bacteria demonstrates the importance of restricting the exposure of animals to antibiotics approved for veterinary use. The shift to a more resistant community and the dispersion of Cpr-conferring mechanisms via mobile elements occurred in a natural setting due to the presence of very low levels of fluoroquinolones, revealing the challenges of controlling the spread of antibiotic-resistant bacteria and highlighting the importance of a holistic approach in the management of antibiotic use. Copyright © 2018 Beka et al.


July 7, 2019

High- quality draft genome sequences of eight bacteria isolated from fungus gardens grown by Trachymyrmex septentrionalis ants.

For their food source, Trachymyrmex septentrionalis ants raise symbiotic fungus gardens that contain bacteria whose functions are poorly understood. Here, we report the genome sequences of eight bacteria isolated from these fungus gardens to better describe the ecology of these strains and their potential to produce secondary metabolites in this niche.


July 7, 2019

Culture- and metagenomics-enabled analyses of the Methanosphaera genus reveals their monophyletic origin and differentiation according to genome size.

The genus Methanosphaera is a well-recognized but poorly characterized member of the mammalian gut microbiome, and distinctive from Methanobrevibacter smithii for its ability to induce a pro-inflammatory response in humans. Here we have used a combination of culture- and metagenomics-based approaches to expand the representation and information for the genus, which has supported the examination of their phylogeny and physiological capacity. Novel isolates of the genus Methanosphaera were recovered from bovine rumen digesta and human stool, with the bovine isolate remarkable for its large genome size relative to other Methanosphaera isolates from monogastric hosts. To substantiate this observation, we then recovered seven high-quality Methanosphaera-affiliated population genomes from ruminant and human gut metagenomic datasets. Our analyses confirm a monophyletic origin of Methanosphaera spp. and that the colonization of monogastric and ruminant hosts favors representatives of the genus with different genome sizes, reflecting differences in the genome content needed to persist in these different habitats.


July 7, 2019

Characterization and genome analysis of a phthalate esters-degrading strain Sphingobium yanoikuyae SHJ.

A bacterium capable of utilizing dimethyl phthalate (DMP), diethyl phthalate (DEP), di-n-butyl phthalate (DBP), and diisobuthyl phthalate (DIBP) as the sole carbon and energy source was isolated from shallow aquifer sediments. The strain was identified as Sphingobium yanoikuyae SHJ based on morphological characteristics, 16S rDNA gene phylogeny, and whole genome average nucleotide identity (ANI). The degradation half-life of DBP with substrate concentration of 8.5 and 50.0 mg/L by strain SHJ was 99.7 and 101.4 hours, respectively. The optimum degradation rate of DBP by SHJ was observed at 30°C and weak alkaline (pH 7.5). Genome sequence of the strain SHJ showed a circular chromosome and additional two circular plasmids with whole genome size of 5,669,383 bp and GC content of 64.23%. Functional annotation of SHJ revealed a total of 5,402 genes, with 5,183 protein-encoding genes, 143 pseudogenes, and 76 noncoding RNA genes. Based on genome annotation, 44 genes were identified to be involved in PAEs hydrolysis potentially. Besides, a region with size of about 6.9 kb comprised of seven ORFs, which is located on the smaller plasmid pSES189, was presumed to be responsible for the biodegradation of phthalate. These results provide insights into the genetic basis of DBP biodegradation in this strain.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.