Menu
July 7, 2019

Elucidation of quantitative structural diversity of remarkable rearrangement regions, shufflons, in IncI2 plasmids.

A multiple DNA inversion system, the shufflon, exists in incompatibility (Inc) I1 and I2 plasmids. The shufflon generates variants of the PilV protein, a minor component of the thin pilus. The shufflon is one of the most difficult regions for de novo genome assembly because of its structural diversity even in an isolated bacterial clone. We determined complete genome sequences, including those of IncI2 plasmids carrying mcr-1, of three Escherichia coli strains using single-molecule, real-time (SMRT) sequencing and Illumina sequencing. The sequences assembled using only SMRT sequencing contained misassembled regions in the shufflon. A hybrid analysis using SMRT and Illumina sequencing resolved the misassembled region and revealed that the three IncI2 plasmids, excluding the shufflon region, were highly conserved. Moreover, the abundance ratio of whole-shufflon structures could be determined by quantitative structural variation analysis of the SMRT data, suggesting that a remarkable heterogeneity of whole-shufflon structural variations exists in IncI2 plasmids. These findings indicate that remarkable rearrangement regions should be validated using both long-read and short-read sequencing data and that the structural variation of PilV in the shufflon might be closely related to phenotypic heterogeneity of plasmid-mediated transconjugation involved in horizontal gene transfer even in bacterial clonal populations.


July 7, 2019

De novo genome and transcriptome assembly of the Canadian beaver (Castor canadensis).

The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon-gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology. Copyright © 2017 Lok et al.


July 7, 2019

Isolation and genomic characterization of a Dehalococcoides strain suggests genomic rearrangement during culture.

We have developed and characterized a bacterial consortium that reductively dechlorinates trichloroethene to ethene. Quantitative PCR analysis for the 16S rRNA and reductive dehalogenase genes showed that the consortium is highly enriched with Dehalococcoides spp. that have two vinyl chloride reductive dehalogenase genes, bvcA and vcrA, and a trichloroethene reductive dehalogenase gene, tceA. The metagenome analysis of the consortium by the next generation sequencer SOLiD 3 Plus suggests that a Dehalococcoides sp. that is highly homologous to D. mccartyi 195 and equipped with vcrA and tceA exists in the consortium. We isolated this Dehalococcoides sp. and designated it as D. mccartyi UCH-ATV1. As the growth of D. mccartyi UCH-ATV1 is too slow under isolated conditions, we constructed a consortium by mixing D. mccartyi UCH-ATV1 with several other bacteria and performed metagenomic sequencing using the single molecule DNA sequencer PacBio RS II. We successfully determined the complete genome sequence of D. mccartyi UCH-ATV1. The strain is equipped with vcrA and tceA, but lacks bvcA. Comparison with tag sequences of SOLiD 3 Plus from the original consortium shows a few differences between the sequences. This suggests that a genome rearrangement of Dehalococcoides sp. occurred during culture.


July 7, 2019

Genome mining and predictive functional profiling of acidophilic rhizobacterium Pseudomonas fluorescens Pt14.

Pseudomonas fluorescens Pt14 is a non-pathogenic and acidophilic bacterium isolated from acidic soil (pH 4.65). Genome sequencing of strain Pt14 was performed using Single Molecule Real Time (SMRT) sequencing to get insights into unique existence of this strain in acidic environment. Complete genome sequence of this strain revealed a chromosome of 5,841,722 bp having 5354 CDSs and 88 RNAs. Phylogenomic reconstruction based on 16S rRNA gene, Average Nucleotide Identity (ANI) values and marker proteins revealed that strain Pt14 shared a common clade with P. fluorescens strain A506 and strain SS101. ANI value of strain Pt14 in relation to strain A506 was found 99.23% demonstrating a very close sub-species association at genome level. Further, orthology determination among these three phylogenetic neighbors revealed 4726 core proteins. Functional analysis elucidated significantly higher abundance of sulphur metabolism (>1×) which could be one of the reasons for the survival of strain Pt14 under acidic conditions (pH 4.65). Acidophilic bacteria have capability to oxidize sulphur into sulphuric acid which in turn can make the soil acidic and genome-wide analysis of P. fluorescens Pt14 demonstrated that this strain contributes towards making the soil acidic.


July 7, 2019

Identification and characterization of a biosynthetic gene cluster for tryptophan dimers in deep sea-derived Streptomyces sp. SCSIO 03032.

Tryptophan dimers (TDs) are an important class of natural products with diverse bioactivities and share conserved biosynthetic pathways. We report the identification of a partial gene cluster (spm) responsible for the biosynthesis of a class of unusual TDs with non-planar skeletons including spiroindimicins (SPMs), indimicins (IDMs), and lynamicins (LNMs) from the deep-sea derived Streptomyces sp. SCSIO 03032. Bioinformatics analysis, targeted gene disruptions, and heterologous expression studies confirmed the involvement of the spm gene cluster in the biosynthesis of SPM/IDM/LNMs, and revealed the indispensable roles for the halogenase/reductase pair SpmHF, the amino acid oxidase SpmO, and the chromopyrrolic acid (CPA) synthase SpmD, as well as the positive regulator SpmR and the putative transporter SpmA. However, the spm gene cluster was unable to confer a heterologous host the ability to produce SPM/IDM/LNMs. In addition, the P450 enzyme SpmP and the monooxygenase SpmX2 were found to be non-relevant to the biosynthesis of SPM/IDM/LNMs. Sequence alignment and structure modeling suggested the lack of key conserved amino acid residues in the substrate-binding pocket of SpmP. Furthermore, feeding experiments in the non-producing ?spmO mutant revealed several biosynthetic precursors en route to SPMs, indicating that key enzymes responsible for the biosynthesis of SPMs should be encoded by genes outside of the identified spm gene cluster. Finally, the biosynthetic pathways of SPM/IDM/LNMs are proposed to lay a basis for further insights into their intriguing biosynthetic machinery.


July 7, 2019

De novo yeast genome assemblies from MinION, PacBio and MiSeq platforms.

Long-read sequencing technologies such as Pacific Biosciences and Oxford Nanopore MinION are capable of producing long sequencing reads with average fragment lengths of over 10,000 base-pairs and maximum lengths reaching 100,000 base- pairs. Compared with short reads, the assemblies obtained from long-read sequencing platforms have much higher contig continuity and genome completeness as long fragments are able to extend paths into problematic or repetitive regions. Many successful assembly applications of the Pacific Biosciences technology have been reported ranging from small bacterial genomes to large plant and animal genomes. Recently, genome assemblies using Oxford Nanopore MinION data have attracted much attention due to the portability and low cost of this novel sequencing instrument. In this paper, we re-sequenced a well characterized genome, the Saccharomyces cerevisiae S288C strain using three different platforms: MinION, PacBio and MiSeq. We present a comprehensive metric comparison of assemblies generated by various pipelines and discuss how the platform associated data characteristics affect the assembly quality. With a given read depth of 31X, the assemblies from both Pacific Biosciences and Oxford Nanopore MinION show excellent continuity and completeness for the 16 nuclear chromosomes, but not for the mitochondrial genome, whose reconstruction still represents a significant challenge.


July 7, 2019

Determination of nucleopolyhedrovirus’ taxonomic position

To date , over 78 genomes of nucleopolyhedroviruses (NPVs) have been sequenced and deposited in NCBI. How to define a new virus from the infected larvae in the field is usually the first question. Two NPV strains, which were isolated from casuarina moth (L. xylina) and golden birdwing larvae (Troides aeacus), respectively, displayed the same question. Due to the identity of polyhedrin (polh) sequences of these two isolates to that of Lymantria dispar MNPV and Bombyx mori NPV, they are named LdMNPV-like virus and TraeNPV, provisionally. To further clarify the relationships of LdMNPV-like virus and TraeNPV to closely related NPVs, Kimura 2-parameter (K-2-P) analysis was performed. Apparently, the results of K-2-P analysis that showed LdMNPV-like virus is an LdMNPV isolate, while TraeNPV had an ambiguous relationship to BmNPV. Otherwise, MaviNPV, which is a mini-AcMNPV, also exhibited a different story by K-2-P analysis. Since K-2-P analysis could not cover all species determination issues, therefore, TraeNPV needs to be sequenced for defining its taxonomic position. For this purpose, different genomic sequencing technologies and bioinformatic analysis approaches will be discussed. We anticipated that these applications will help to exam nucleotide information of unknown species and give an insight and facilitate to this issue.


July 7, 2019

Non-toxin-producing Bacillus cereus strains belonging to the B. anthracis clade isolated from the International Space Station.

In an ongoing Microbial Observatory investigation of the International Space Station (ISS), 11 Bacillus strains (2 from the Kibo Japanese experimental module, 4 from the U.S. segment, and 5 from the Russian module) were isolated and their whole genomes were sequenced. A comparative analysis of the 16S rRNA gene sequences of these isolates showed the highest similarity (>99%) to the Bacillus anthracis-B. cereus-B. thuringiensis group. The fatty acid composition, polar lipid profile, peptidoglycan type, and matrix-assisted laser desorption ionization-time of flight profiles were consistent with the B. cereus sensu lato group. The phenotypic traits such as motile rods, enterotoxin production, lack of capsule, and resistance to gamma phage/penicillin observed in ISS isolates were not characteristics of B. anthracis. Whole-genome sequence characterizations showed that ISS strains had the plcR non-B. anthracis ancestral “C” allele and lacked anthrax toxin-encoding plasmids pXO1 and pXO2, excluding their identification as B. anthracis. The genetic identities of all 11 ISS isolates characterized via gyrB analyses arbitrarily identified them as members of the B. cereus group, but traditional DNA-DNA hybridization (DDH) showed that the ISS isolates are similar to B. anthracis (88% to 90%) but distant from the B. cereus (42%) and B. thuringiensis (48%) type strains. The DDH results were supported by average nucleotide identity (>98.5%) and digital DDH (>86%) analyses. However, the collective phenotypic traits and genomic evidence were the reasons to exclude the ISS isolates from B. anthracis. Nevertheless, multilocus sequence typing and whole-genome single nucleotide polymorphism analyses placed these isolates in a clade that is distinct from previously described members of the B. cereus sensu lato group but closely related to B. anthracis. IMPORTANCE The International Space Station Microbial Observatory (Microbial Tracking-1) study is generating a microbial census of the space station’s surfaces and atmosphere by using advanced molecular microbial community analysis techniques supported by traditional culture-based methods and modern bioinformatic computational modeling. This approach will lead to long-term, multigenerational studies of microbial population dynamics in a closed environment and address key questions, including whether microgravity influences the evolution and genetic modification of microorganisms. The spore-forming Bacillus cereus sensu lato group consists of pathogenic (B. anthracis), food poisoning (B. cereus), and biotechnologically useful (B. thuringiensis) microorganisms; their presence in a closed system such as the ISS might be a concern for the health of crew members. A detailed characterization of these potential pathogens would lead to the development of suitable countermeasures that are needed for long-term future missions and a better understanding of microorganisms associated with space missions.


July 7, 2019

Genomic epidemiology of NDM-1-encoding plasmids in Latin American clinical isolates reveals insights into the evolution of multidrug resistance

Bacteria that produce the broad-spectrum Carbapenem antibiotic New Delhi Metallo-ß-lactamase (NDM) place a burden on health care systems worldwide, due to the limited treatment options for infections caused by them and the rapid global spread of this antibiotic resistance mechanism. Although it is believed that the associated resistance gene blaNDM-1 originated in Acinetobacter spp., the role of Enterobacteriaceae in its dissemination remains unclear. In this study, we used whole genome sequencing to investigate the dissemination dynamics of blaNDM-1-positive plasmids in a set of 21 clinical NDM-1-positive isolates from Colombia and Mexico (Providencia rettgeri, Klebsiella pneumoniae, and Acinetobacter baumannii) as well as six representative NDM-1-positive Escherichia coli transconjugants. Additionally, the plasmids from three representative P. rettgeri isolates were sequenced by PacBio sequencing and finished. Our results demonstrate the presence of previously reported plasmids from K. pneumoniae and A. baumannii in different genetic backgrounds and geographically distant locations in Colombia. Three new previously unclassified plasmids were also identified in P. rettgeri from Colombia and Mexico, plus an interesting genetic link between NDM-1-positive P. rettgeri from distant geographic locations (Canada, Mexico, Colombia, and Israel) without any reported epidemiological links was discovered. Finally, we detected a relationship between plasmids present in P. rettgeri and plasmids from A. baumannii and K. pneumoniae. Overall, our findings suggest a Russian doll model for the dissemination of blaNDM-1 in Latin America, with P. rettgeri playing a central role in this process, and reveal new insights into the evolution and dissemination of plasmids carrying such antibiotic resistance genes.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Discovering and sequencing new plant viral genomes by next-generation sequencing: description of a practical pipeline

Small-scale sequencing has improved substantially in recent decades, culminating in the development of next-generation sequencing (NGS) technologies. Modern NGS methods have helped the discovery of many new plant viruses. Nevertheless, there is still a need to establish solid assembly pipelines targeting small genomes characterised by low identities to known viral sequences. Here, we describe and discuss the fundamental steps required for discovering and sequencing new plant viral genomes by NGS. A practical pipeline and standard alternative tools used in NGS analysis are presented.


July 7, 2019

Genome assembly of Chryseobacterium sp. strain IHBB 10212 from glacier top-surface soil in the Indian trans-Himalayas with potential for hydrolytic enzymes

The cold-active esterases are gaining importance due to their catalytic activities finding applications in chemical industry, food processes and detergent industry as additives, and organic synthesis of unstable compounds as catalysts. In the present study, the complete genome sequence of 4,843,645 bp with an average 34.08% G + C content and 4260 protein-coding genes are reported for the low temperature-active esterase-producing novel strain of Chrysobacterium isolated from the top-surface soil of a glacier in the cold deserts of the Indian trans-Himalayas. The genome contained two plasmids of 16,553 and 11,450 bp with 40.54 and 40.37% G + C contents, respectively. Several genes encoding the hydrolysis of ester linkages of triglycerides into fatty acids and glycerol were predicted in the genome. The annotation also predicted the genes encoding proteases, lipases, amylases, ß-glucosidases, endoglucanases and xylanases involved in biotechnological processes. The complete genome sequence of Chryseobacterium sp. strain IHBB 10212 and two plasmids have been deposited vide accession numbers CP015199, CP015200 and CP015201 at DDBJ/EMBL/GenBank.


July 7, 2019

Comparative genomic and phylogenetic analysis of a toxigenic clinical isolate of Corynebacterium diphtheriae strain B-D-16-78 from Malaysia.

In this study, we report the comparative genomics and phylogenetic analysis of Corynebacterium diphtheriae strain B-D-16-78 that was isolated from a clinical specimen in 2016. The complete genome of C. diphtheriae strain B-D-16-78 was sequenced using PacBio Single Molecule, Real-Time sequencing technology and consists of a 2,474,151-bp circular chromosome with an average GC content of 53.56%. The core genome of C. diphtheriae was also deduced from a total of 74 strains with complete or draft genome sequences and the core genome-based phylogenetic analysis revealed close genetic relationship among strains that shared the same MLST allelic profile. In the context of CRISPR-Cas system, which confers adaptive immunity against re-invading DNA, 73 out of 86 spacer sequences were found to be unique to Malaysian strains which harboured only type-II-C and/or type-I-E-a systems. A total of 48 tox genes which code for the diphtheria toxin were retrieved from the 74 genomes and with the exception of one truncated gene, only nucleotide substitutions were detected when compared to the tox gene sequence of PW8. More than half were synonymous substitution and only two were nonsynonymous substitutions whereby H24Y was predicted to have a damaging effect on the protein function whilst T262V was predicted to be tolerated. Both toxigenic and non-toxigenic toxin-gene bearing strains have been isolated in Malaysia but the repeated isolation of toxigenic strains with the same MLST profile suggests the possibility of some of these strains may be circulating in the population. Hence, efforts to increase herd immunity should be continued and supported by an effective monitoring and surveillance system to track, manage and control outbreak of cases. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Novel chaperonins are prevalent in the virioplankton and demonstrate links to viral biology and ecology.

Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70?kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells.


July 7, 2019

Complete genome sequence of Leuconostoc suionicum DSM 20241(T) provides insights into its functional and metabolic features.

The genome of Leuconostoc suionicum DSM 20241(T) (=ATCC 9135(T) = LMG 8159(T) = NCIMB 6992(T)) was completely sequenced and its fermentative metabolic pathways were reconstructed to investigate the fermentative properties and metabolites of strain DSM 20241(T) during fermentation. The genome of L. suionicum DSM 20241(T) consists of a circular chromosome (2026.8 Kb) and a circular plasmid (21.9 Kb) with 37.58% G + C content, encoding 997 proteins, 12 rRNAs, and 72 tRNAs. Analysis of the metabolic pathways of L. suionicum DSM 20241(T) revealed that strain DSM 20241(T) performs heterolactic acid fermentation and can metabolize diverse organic compounds including glucose, fructose, galactose, cellobiose, mannose, sucrose, trehalose, arbutin, salcin, xylose, arabinose and ribose.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.