Menu
July 7, 2019

Novel chaperonins are prevalent in the virioplankton and demonstrate links to viral biology and ecology.

Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70?kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells.


July 7, 2019

Adaptive evolution of a hyperthermophilic archaeon pinpoints a formate transporter as a critical factor for the growth enhancement on formate.

Previously, we reported that the hyperthermophilic archaeon Thermococcus onnurineus NA1 could grow on formate and produce H2. Formate conversion to hydrogen was mediated by a formate-hydrogen lyase complex and was indeed a part of chemiosmotic coupling to ATP generation. In this study, we employed an adaptation approach to enhance the cell growth on formate and investigated molecular changes. As serial transfer continued on formate-containing medium at the serum vial, cell growth, H2 production and formate consumption increased remarkably. The 156 times transferred-strain, WTF-156T, was demonstrated to enhance H2 production using formate in a bioreactor. The whole-genome sequencing of the WTF-156T strain revealed eleven mutations. While no mutation was found among the genes encoding formate hydrogen lyase, a point mutation (G154A) was identified in a formate transporter (TON_1573). The TON_1573 (A52T) mutation, when introduced into the parent strain, conferred increase in formate consumption and H2 production. Another adaptive passage, carried out by culturing repeatedly in a bioreactor, resulted in a strain, which has a mutation in TON_1573 (C155A) causing amino acid change, A52E. These results implicate that substitution of A52 residue of a formate transporter might be a critical factor to ensure the increase in formate uptake and cell growth.


July 7, 2019

Complete genome sequences of three Xanthomonas citri strains from Texas.

The complete genome sequences of three Xanthomonas citri strains isolated from lime trees in Texas were found to belong to the A(w) group. All carried nearly identical large plasmids with similarity to those of a citrus canker strain from India and to xanthomonads from Africa and Colombia. All three strains harbored unusual pthA homologs. Copyright © 2017 Munoz Bodnar et al.


July 7, 2019

Genomic insights into the virulence and salt tolerance of Staphylococcus equorum.

To shed light on the genetic background behind the virulence and salt tolerance of Staphylococcus equorum, we performed comparative genome analysis of six S. equorum strains. Data on four previously published genome sequences were obtained from the NCBI database, while those on strain KM1031 displaying resistance to multiple antibiotics and strain C2014 causing haemolysis were determined in this study. Examination of the pan-genome of five of the six S. equorum strains showed that the conserved core genome retained the genes for general physiological processes and survival of the species. In this comparative genomic analysis, the factors that distinguish the strains from each other, including acquired genomic factors in mobile elements, were identified. Additionally, the high salt tolerance of strains enabling growth at a NaCl concentration of 25% (w/v) was attributed to the genes encoding potassium voltage-gated channels. Among the six strains, KS1039 does not possess any of the functional virulence determinants expressed in the other strains.


July 7, 2019

Whole genome sequence of the heterozygous clinical isolate Candida krusei 81-B-5.

Candida krusei is a diploid, heterozygous yeast that is an opportunistic fungal pathogen in immunocompromised patients. This species also is utilized for fermenting cocoa beans during chocolate production. One major concern in the clinical setting is the innate resistance of this species to the most commonly used antifungal drug fluconazole. Here we report a high-quality genome sequence and assembly for the first clinical isolate of C. krusei, strain 81-B-5, into 11 scaffolds generated with PacBio sequencing technology. Gene annotation and comparative analysis revealed a unique profile of transporters that could play a role in drug resistance or adaptation to different environments. In addition, we show that while 82% of the genome is highly heterozygous, a 2.0 Mb region of the largest scaffold has undergone loss of heterozygosity. This genome will serve as a reference for further genetic studies of this pathogen. Copyright © 2017 Author et al.


July 7, 2019

ConcatSeq: A method for increasing throughput of single molecule sequencing by concatenating short DNA fragments.

Single molecule sequencing (SMS) platforms enable base sequences to be read directly from individual strands of DNA in real-time. Though capable of long read lengths, SMS platforms currently suffer from low throughput compared to competing short-read sequencing technologies. Here, we present a novel strategy for sequencing library preparation, dubbed ConcatSeq, which increases the throughput of SMS platforms by generating long concatenated templates from pools of short DNA molecules. We demonstrate adaptation of this technique to two target enrichment workflows, commonly used for oncology applications, and feasibility using PacBio single molecule real-time (SMRT) technology. Our approach is capable of increasing the sequencing throughput of the PacBio RSII platform by more than five-fold, while maintaining the ability to correctly call allele frequencies of known single nucleotide variants. ConcatSeq provides a versatile new sample preparation tool for long-read sequencing technologies.


July 7, 2019

Plasmid composition in Aeromonas salmonicida subsp. salmonicida 01-B526 unravels unsuspected type three secretion system loss patterns.

Aeromonas salmonicida subsp. salmonicida is a ubiquitous psychrophilic waterborne bacterium and a fish pathogen. The numerous mobile elements, especially insertion sequences (IS), in its genome promote rearrangements that impact its phenotype. One of the main virulence factors of this bacterium, its type three secretion system (TTSS), is affected by these rearrangements. In Aeromonas salmonicida subsp. salmonicida most of the TTSS genes are encoded in a single locus on a large plasmid called pAsa5, and may be lost when the bacterium is cultivated at a higher temperature (25 °C), producing non-virulent mutants. In a previous study, pAsa5-rearranged strains that lacked the TTSS locus on pAsa5 were produced using parental strains, including 01-B526. Some of the generated deletions were explained by homologous recombination between ISs found on pAsa5, whereas the others remained unresolved. To investigate those rearrangements, short- and long-read high-throughput sequencing technologies were used on the A. salmonicida subsp. salmonicida 01-B526 whole genome.Whole genome sequencing of the 01-B526 strain revealed that its pAsa5 has an additional IS copy, an ISAS5, compared to the reference strain (A449) sequence, which allowed for a previously unknown rearrangement to occur. It also appeared that 01-B526 bears a second large plasmid, named pAsa9, which shares 40 kbp of highly similar sequences with pAsa5. Following these discoveries, previously unexplained deletions were elucidated by genotyping. Furthermore, in one of the derived strains a fusion of pAsa5 and pAsa9, involving the newly discovered ISAS5 copy, was observed.The loss of TTSS and hence virulence is explained by one consistent mechanism: IS-driven homologous recombination. The similarities between pAsa9 and pAsa5 also provide another example of genetic diversity driven by ISs.


July 7, 2019

Discovery and genotyping of novel sequence insertions in many sequenced individuals

Motivation: Despite recent advances in algorithms design to characterize structural variation using high-throughput short read sequencing (HTS) data, characterization of novel sequence insertions longer than the average read length remains a challenging task. This is mainly due to both computational difficulties and the complexities imposed by genomic repeats in generating reliable assemblies to accurately detect both the sequence content and the exact location of such insertions. Additionally, de novo genome assembly algorithms typically require a very high depth of coverage, which may be a limiting factor for most genome studies. Therefore, characterization of novel sequence insertions is not a routine part of most sequencing projects. There are only a handful of algorithms that are specifically developed for novel sequence insertion discovery that can bypass the need for the whole genome de novo assembly. Still, most such algorithms rely on high depth of coverage, and to our knowledge there is only one method (PopIns) that can use multi-sample data to “collectively” obtain a very high coverage dataset to accurately find insertions common in a given population. Result: Here, we present Pamir, a new algorithm to efficiently and accurately discover and genotype novel sequence insertions using either single or multiple genome sequencing datasets. Pamir is able to detect breakpoint locations of the insertions and calculate their zygosity (i.e. heterozygous versus homozygous) by analyzing multiple sequence signatures, matching one-end-anchored sequences to small-scale de novo assemblies of unmapped reads, and conducting strand-aware local assembly. We test the efficacy of Pamir on both simulated and real data, and demonstrate its potential use in accurate and routine identification of novel sequence insertions in genome projects. Availability and implementation: Pamir is available at https://github.com/vpc-ccg/pamir. Contact:fhach@sfu.ca, prostatecentre.com or calkan@cs.bilkent.edu.tr Supplementary information:Supplementary data are available at Bioinformatics online.


July 7, 2019

The third restriction-modification system from Thermus aquaticus YT-1: solving the riddle of two TaqII specificities.

Two restriction-modification systems have been previously discovered in Thermus aquaticus YT-1. TaqI is a 263-amino acid (aa) Type IIP restriction enzyme that recognizes and cleaves within the symmetric sequence 5′-TCGA-3′. TaqII, in contrast, is a 1105-aa Type IIC restriction-and-modification enzyme, one of a family of Thermus homologs. TaqII was originally reported to recognize two different asymmetric sequences: 5′-GACCGA-3′ and 5′-CACCCA-3′. We previously cloned the taqIIRM gene, purified the recombinant protein from Escherichia coli, and showed that TaqII recognizes the 5′-GACCGA-3′ sequence only. Here, we report the discovery, isolation, and characterization of TaqIII, the third R-M system from T. aquaticus YT-1. TaqIII is a 1101-aa Type IIC/IIL enzyme and recognizes the 5′-CACCCA-3′ sequence previously attributed to TaqII. The cleavage site is 11/9 nucleotides downstream of the A residue. The enzyme exhibits striking biochemical similarity to TaqII. The 93% identity between their aa sequences suggests that they have a common evolutionary origin. The genes are located on two separate plasmids, and are probably paralogs or pseudoparalogs. Putative positions and aa that specify DNA recognition were identified and recognition motifs for 6 uncharacterized Thermus-family enzymes were predicted.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

Unravelling the complete genome of Archangium gephyra DSM 2261T and evolutionary insights into myxobacterial chitinases.

Family Cystobacteraceae is a group of eubacteria within order Myxococcales and class Deltaproteobacteria that includes more than 20 species belonging to 6 genera, that is, Angiococcus, Archangium, Cystobacter, Hyalangium, Melittangium, and Stigmatella. Earlier these members have been classified based on chitin degrading efficiency such as Cystobacter fuscus and Stigmatella aurantiaca, which are efficient chitin degraders, C. violaceus a partial chitin degrader and Archangium gephyra a chitin nondegrader. Here we report the 12.5 Mbp complete genome of A. gephyra DSM 2261T and compare it with four available genomes within the family Cystobacteraceae. Phylogeny and DNA-DNA hybridization studies reveal that A. gephyra is closest to Angiococcus disciformis, C. violaceus and C. ferrugineus, which are partial chitin degraders of the family Cystobacteraceae. Homology studies reveal the conservation of approximately half of the proteins in these genomes, with about 15% unique proteins in each genome. The total carbohydrate-active enzymes (CAZome) analysis reveals the presence of one GH18 chitinase in the A. gephyra genome whereas eight copies are present in C. fuscus and S. aurantiaca. Evolutionary studies of myxobacterial GH18 chitinases reveal that most of them are likely related to Terrabacteria and Proteobacteria whereas the Archangium GH18 homolog shares maximum similarity with those of chitin nondegrading Acidobacteria.© The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Complete genome sequence of Leuconostoc suionicum DSM 20241(T) provides insights into its functional and metabolic features.

The genome of Leuconostoc suionicum DSM 20241(T) (=ATCC 9135(T) = LMG 8159(T) = NCIMB 6992(T)) was completely sequenced and its fermentative metabolic pathways were reconstructed to investigate the fermentative properties and metabolites of strain DSM 20241(T) during fermentation. The genome of L. suionicum DSM 20241(T) consists of a circular chromosome (2026.8 Kb) and a circular plasmid (21.9 Kb) with 37.58% G + C content, encoding 997 proteins, 12 rRNAs, and 72 tRNAs. Analysis of the metabolic pathways of L. suionicum DSM 20241(T) revealed that strain DSM 20241(T) performs heterolactic acid fermentation and can metabolize diverse organic compounds including glucose, fructose, galactose, cellobiose, mannose, sucrose, trehalose, arbutin, salcin, xylose, arabinose and ribose.


July 7, 2019

Evidence for contemporary switching of the O-antigen gene cluster between Shiga toxin-producing Escherichia coli strains colonizing cattle.

Shiga toxin-producing Escherichia coli (STEC) comprise a group of zoonotic enteric pathogens with ruminants, especially cattle, as the main reservoir. O-antigens are instrumental for host colonization and bacterial niche adaptation. They are highly immunogenic and, therefore, targeted by the adaptive immune system. The O-antigen is one of the most diverse bacterial cell constituents and variation not only exists between different bacterial species, but also between individual isolates/strains within a single species. We recently identified STEC persistently infecting cattle and belonging to the different serotypes O156:H25 (n = 21) and O182:H25 (n = 15) that were of the MLST sequence types ST300 or ST688. These STs differ by a single nucleotide in purA only. Fitness-, virulence-associated genome regions, and CRISPR/CAS (clustered regularly interspaced short palindromic repeats/CRISPR associated sequence) arrays of these STEC O156:H25 and O182:H25 isolates were highly similar, and identical genomic integration sites for the stx converting bacteriophages and the core LEE, identical Shiga toxin converting bacteriophage genes for stx1a, identical complete LEE loci, and identical sets of chemotaxis and flagellar genes were identified. In contrast to this genomic similarity, the nucleotide sequences of the O-antigen gene cluster (O-AGC) regions between galF and gnd and very few flanking genes differed fundamentally and were specific for the respective serotype. Sporadic aEPEC O156:H8 isolates (n = 5) were isolated in temporal and spatial proximity. While the O-AGC and the corresponding 5′ and 3′ flanking regions of these aEPEC isolates were identical to the respective region in the STEC O156:H25 isolates, the core genome, the virulence associated genome regions and the CRISPR/CAS elements differed profoundly. Our cumulative epidemiological and molecular data suggests a recent switch of the O-AGC between isolates with O156:H8 strains having served as DNA donors. Such O-antigen switches can affect the evaluation of a strain’s pathogenic and virulence potential, suggesting that NGS methods might lead to a more reliable risk assessment.


July 7, 2019

Whole-genome sequence of Photobacterium damselae subsp. piscicida strain 91-197, isolated from hybrid striped bass (Morone sp.) in the United States.

Photobacterium damselae subsp. piscicida is a causative bacterium of fish pasteurellosis, which has caused serious economic damage to aquaculture farms worldwide. Here, the whole-genome sequence of P. damselae subsp. piscicida 91-197, isolated in the United States, suggests that this genome consists of two chromosomes and two plasmids. Copyright © 2017 Teru et al.


July 7, 2019

A novel inversion in the chloroplast genome of marama (Tylosema esculentum).

Tylosema esculentum (marama bean) is being developed as a possible crop for resource-poor farmers in arid regions of Southern Africa. As part of the molecular characterization of this species, the chloroplast genome has been assembled from next-generation sequencing using both Illumina and Pac-Bio data. The genome is of typical organization with a large single-copy region and a small single-copy region separated by a pair of inverted repeats and covers 161537 bp. It contains a unique inversion not present in any other legumes, even in the closest relatives for which the complete chloroplast genome is available, and two complete copies of the ycf1 gene. These data extend the range of variability of legume chloroplast genomes. The sequencing of multiple individuals has identified two different chloroplast genomes which were geographically separated. The current sampling is limited so that the extent of the intraspecific variation is still to be determined, leaving open the question of legume chloroplast genomes adapted to particular arid environments.© The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.