July 7, 2019

Discovery and genotyping of novel sequence insertions in many sequenced individuals

Motivation: Despite recent advances in algorithm design for characterizing structural variation using high-throughput short-read sequencing (HTS) data, characterization of novel sequence insertions longer than the average read length remains a challenging task. This is mainly due to both computational difficulties and the complexities imposed by genomic repeats in generating reliable assemblies to accurately detect both the sequence content and the exact location of such insertions. Additionally, de novo genome assembly algorithms typically require a very high depth of coverage, which may be a limiting factor for most genome studies. Therefore, characterization of novel sequence insertions is not a routine part of most sequencing projects. There are only a handful of algorithms that are specifically developed for novel sequence insertion discovery and that can bypass the need for whole-genome de novo assembly. Still, most such algorithms rely on high depth of coverage, and to our knowledge there is only one method (PopIns) that can use multi-sample data to “collectively” obtain a very high coverage dataset to accurately find insertions common in a given population.
Results: Here, we present Pamir, a new algorithm to efficiently and accurately discover and genotype novel sequence insertions using either single or multiple genome sequencing datasets. Pamir is able to detect breakpoint locations of the insertions and calculate their zygosity (i.e. heterozygous versus homozygous) by analyzing multiple sequence signatures, matching one-end-anchored sequences to small-scale de novo assemblies of unmapped reads, and conducting strand-aware local assembly. We test the efficacy of Pamir on both simulated and real data, and demonstrate its potential use in accurate and routine identification of novel sequence insertions in genome projects.
Availability and implementation: Pamir is available at https://github.com/vpc-ccg/pamir.
Contact: fhach@sfu.ca, prostatecentre.com or calkan@cs.bilkent.edu.tr
Supplementary information: Supplementary data are available at Bioinformatics online.


July 7, 2019

CLOVE: classification of genomic fusions into structural variation events.

A precise understanding of structural variants (SVs) in DNA is important in the study of cancer and population diversity. Many methods have been designed to identify SVs from DNA sequencing data. However, the problem remains challenging because existing approaches suffer from low sensitivity, precision, and positional accuracy. Furthermore, many existing tools only identify breakpoints, and so do not collect related breakpoints and classify them as a particular type of SV. Due to the rapidly increasing usage of high-throughput sequencing technologies in this area, there is an urgent need for algorithms that can accurately classify complex genomic rearrangements (involving more than one breakpoint or fusion). We present CLOVE, an algorithm for integrating the results of multiple breakpoint or SV callers and classifying the results as a particular SV. CLOVE is based on a graph data structure that is created from the breakpoint information. The algorithm looks for patterns in the graph that are characteristic of more complex rearrangement types. CLOVE is able to integrate the results of multiple callers, producing a consensus call. We demonstrate using simulated and real data that re-classified SV calls produced by CLOVE improve on the raw call set of existing SV algorithms, particularly in terms of accuracy. CLOVE is freely available from http://www.github.com/PapenfussLab.
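The idea of collecting related breakpoints and recognising a pattern among them can be illustrated with a toy example. The sketch below is not CLOVE's implementation and does not build its graph; it simply pairs fusions whose coordinates agree within a tolerance. The `Fusion` record, the orientation encoding ("+-", "-+", "++", "--"), the `tolerance` default, and the rule that a "++" fusion plus a matching "--" fusion indicate an inversion are all assumptions made for illustration.

```python
from typing import NamedTuple


class Fusion(NamedTuple):
    """One breakpoint pair reported by a caller (hypothetical record)."""
    chrom: str
    start: int
    end: int
    orientation: str  # assumed encoding: "+-", "-+", "++" or "--"


def classify(fusions, tolerance=10):
    """Group related fusions and label each group with an SV type.

    Pass 1 pairs a "++" fusion with a "--" fusion at nearly the same
    coordinates and reports the pair as one inversion. Pass 2 labels
    every fusion left unpaired from its orientation alone.
    """
    calls = []
    used = set()

    # Pass 1: two fusions of opposite orientation form one inversion.
    for i, a in enumerate(fusions):
        if a.orientation != "++" or i in used:
            continue
        for j, b in enumerate(fusions):
            if (j not in used and b.orientation == "--"
                    and b.chrom == a.chrom
                    and abs(b.start - a.start) <= tolerance
                    and abs(b.end - a.end) <= tolerance):
                used.update((i, j))
                calls.append(("INV", a.chrom, a.start, a.end))
                break

    # Pass 2: single fusions; an unpaired "++" or "--" stays unresolved.
    simple = {"+-": "DEL", "-+": "DUP"}
    for i, f in enumerate(fusions):
        if i not in used:
            label = simple.get(f.orientation, "UNRESOLVED")
            calls.append((label, f.chrom, f.start, f.end))
    return calls


example = [
    Fusion("chr1", 100, 500, "+-"),
    Fusion("chr2", 1005, 2003, "--"),
    Fusion("chr2", 1000, 2000, "++"),
    Fusion("chr3", 50, 80, "++"),
]
calls = classify(example)
```

In this toy input the two chr2 fusions are merged into a single inversion call, while the lone chr3 fusion is left unresolved rather than being reported as an event on its own.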


July 7, 2019

Whole genome sequencing and analysis of Campylobacter coli YH502 from retail chicken reveals a plasmid-borne type VI secretion system.

Campylobacter is a major cause of foodborne illnesses worldwide. Campylobacter infections, commonly caused by ingestion of undercooked poultry and meat products, can lead to gastroenteritis and chronic reactive arthritis in humans. Whole genome sequencing (WGS) is a powerful technology that provides comprehensive genetic information about bacteria and is increasingly being applied to study foodborne pathogens: e.g., evolution, epidemiology/outbreak investigation, and detection. Herein we report the complete genome sequence of Campylobacter coli strain YH502 isolated from retail chicken in the United States. WGS, de novo assembly, and annotation of the genome revealed a chromosome of 1,718,974 bp and a mega-plasmid (pCOS502) of 125,964 bp. GC content of the genome was 31.2% with 1931 coding sequences and 53 non-coding RNAs. Multiple virulence factors including a plasmid-borne type VI secretion system (T6SS) and antimicrobial resistance genes (beta-lactams, fluoroquinolones, and aminoglycosides) were found. The presence of T6SS in a mobile genetic element (plasmid) suggests plausible horizontal transfer of these virulence genes to other organisms. The C. coli YH502 genome also harbors CRISPR sequences and associated proteins. Phylogenetic analysis based on average nucleotide identity and single nucleotide polymorphisms identified closely related C. coli genomes available in the NCBI database. Taken together, the analyzed genomic data of this potentially virulent strain of C. coli will facilitate further understanding of this important foodborne pathogen, most likely leading to better control strategies. The chromosome and plasmid sequences of C. coli YH502 have been deposited in GenBank under the accession numbers CP018900.1 and CP018901.1, respectively.


July 7, 2019

Analysis of the genome and mobilome of a dissimilatory arsenate reducing Aeromonas sp. O23A reveals multiple mechanisms for heavy metal resistance and metabolism.

Aeromonas spp. are among the most ubiquitous microorganisms, as they have been isolated from different environmental niches including waters, soil, as well as wounds and digestive tracts of poikilothermic animals and humans. Although much attention has been paid to the pathogenicity of Aeromonads, the role of these bacteria in environmentally important processes, such as transformation of heavy metals, remains to be discovered. Therefore, the aim of this study was a detailed genomic characterization of Aeromonas sp. O23A, the first representative of this genus capable of dissimilatory arsenate reduction. The strain was isolated from microbial mats from the Zloty Stok mine (SW Poland), an environment strongly contaminated with arsenic. Previous physiological studies indicated that O23A may be involved in both mobilization and immobilization of this metalloid in the environment. To discover the molecular basis of the mechanisms behind the observed abilities, the genome of O23A (~5.0 Mbp) was sequenced and annotated, and genes for arsenic respiration, heavy metal resistance (hmr) and other phenotypic traits, including siderophore production, were identified. The functionality of the indicated gene modules was assessed in a series of minimal inhibitory concentration analyses for various metals and metalloids, as well as mineral dissolution experiments. Interestingly, comparative analyses revealed that O23A is related to a fish pathogen Aeromonas salmonicida subsp. salmonicida A449 which, however, does not carry genes for arsenic respiration. This indicates that the dissimilatory arsenate reduction ability may have been lost during genome reduction in pathogenic strains, or acquired through horizontal gene transfer. Therefore, particular emphasis was placed upon the mobilome of O23A, consisting of four plasmids, a phage, and numerous transposable elements, which may play a role in the dissemination of hmr and arsenic metabolism genes in the environment. 
The obtained results indicate that Aeromonas sp. O23A is well-adapted to the extreme environmental conditions occurring in the Zloty Stok mine. The analysis of genome encoded traits allowed for a better understanding of the mechanisms of adaptation of the strain, also with respect to its presumable role in colonization and remediation of arsenic-contaminated waters, which may never have been discovered based on physiological analyses alone.


July 7, 2019

Conjugative ESBL plasmids differ in their potential to rescue susceptible bacteria via horizontal gene transfer in lethal antibiotic concentrations.

Emergence (and proliferation) of resistant pathogens under strong antibiotic selection is an evolutionary process in which bacteria overcome otherwise growth-inhibiting or lethal concentrations of antimicrobial substances. In this study, we set out to investigate a largely unexplored mechanism, namely evolutionary rescue (that is, adaptive evolutionary change that restores positive growth to a declining population and prevents extinction) via horizontal gene transfer, by which new resistant bacteria may emerge both in and out of clinical environments.


July 7, 2019

Adaptation of genetically monomorphic bacteria: evolution of copper resistance through multiple horizontal gene transfers of complex and versatile mobile genetic elements.

Copper-based antimicrobial compounds are widely used to control plant bacterial pathogens. Pathogens have adapted in response to this selective pressure. Xanthomonas citri pv. citri, a major citrus pathogen causing Asiatic citrus canker, was first reported to carry plasmid-encoded copper resistance in Argentina. This phenotype was conferred by the copLAB gene system. The emergence of resistant strains has since been reported in Réunion and Martinique. Using microsatellite-based genotyping and copLAB PCR, we demonstrated that the genetic structure of the copper-resistant strains from these three regions was made up of two distant clusters and varied for the detection of copLAB amplicons. In order to investigate this pattern more closely, we sequenced six copper-resistant X. citri pv. citri strains from Argentina, Martinique and Réunion, together with reference copper-resistant Xanthomonas and Stenotrophomonas strains using long-read sequencing technology. Genes involved in copper resistance were found to be strain dependent with the novel identification in X. citri pv. citri of copABCD and a cus heavy metal efflux resistance-nodulation-division system. The genes providing the adaptive trait were part of a mobile genetic element similar to Tn3-like transposons and included in a conjugative plasmid. This indicates the system’s great versatility. The mining of all available bacterial genomes suggested that, within the bacterial community, the spread of copper resistance associated with mobile elements and their plasmid environments was primarily restricted to the Xanthomonadaceae family. © 2017 John Wiley & Sons Ltd.


July 7, 2019

Complete genome analysis of Thermus parvatiensis and comparative genomics of Thermus spp. provide insights into genetic variability and evolution of natural competence as strategic survival attributes.

Thermophilic environments represent an interesting niche. Among thermophiles, the genus Thermus is among the most studied genera. In this study, we have sequenced the genome of Thermus parvatiensis strain RL, a thermophile isolated from Himalayan hot water springs (temperature >96°C), using the PacBio RSII SMRT technique. The small genome (2.01 Mbp) comprises a chromosome (1.87 Mbp) and a plasmid (143 Kbp), designated in this study as pTP143. Annotation revealed a high number of repair genes and a compact genome, but a highly plastic plasmid containing transposases, integrases, mobile elements and hypothetical proteins (44%). We performed a comparative genomic study of the group Thermus with the aim of analysing the phylogenetic relatedness as well as niche-specific attributes prevalent among the group. We compared the reference genome RL with 16 Thermus genomes to assess their phylogenetic relationships based on 16S rRNA gene sequences, average nucleotide identity (ANI), conserved marker genes (31 and 400), pan genome and tetranucleotide frequency. The core genome of the analyzed genomes contained 1,177 core genes and many singleton genes were detected in individual genomes, reflecting a conserved core but adaptive pan repertoire. We demonstrated the presence of metagenomic islands (chromosome: 5, plasmid: 5) by recruiting raw metagenomic data (from the same niche) against the genomic replicons of T. parvatiensis. We also dissected the CRISPR loci across all genomes and found widespread presence of this system across Thermus genomes. Additionally, we performed a comparative analysis of competence loci across Thermus genomes and found evidence for recent horizontal acquisition of the locus and continued dispersal among members, reflecting that natural competence is a beneficial survival trait among Thermus members and that its acquisition depicts unending evolution in order to accomplish optimal fitness.
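Tetranucleotide frequency, one of the comparison measures listed above, is straightforward to compute. The following is a minimal sketch and not the pipeline used in the study: the function name is hypothetical, only the forward strand is counted (published analyses often also count the reverse complement and normalise against expected frequencies), and windows containing characters other than A, C, G or T are ignored.

```python
from collections import Counter
from itertools import product


def tetranucleotide_frequencies(seq):
    """Return the relative frequency of each of the 256 possible 4-mers.

    Overlapping windows on the forward strand are counted. Windows with
    ambiguous bases (anything other than A, C, G, T) are skipped.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    kmers = ["".join(p) for p in product("ACGT", repeat=4)]
    total = sum(counts[k] for k in kmers)
    if total == 0:
        return {k: 0.0 for k in kmers}
    return {k: counts[k] / total for k in kmers}


freqs = tetranucleotide_frequencies("ACGTACGT")
```

Two genomes can then be compared by treating their 256-element frequency vectors as profiles, for example with a correlation coefficient.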


July 7, 2019

Characterization of a PVL-negative community-acquired methicillin-resistant Staphylococcus aureus strain of sequence type 88 in China.

Sequence type 88 community-acquired methicillin-resistant Staphylococcus aureus (CA-MRSA) strain SR434, isolated from an outpatient with skin and soft tissue infection, was subjected to whole genome sequencing, antimicrobial susceptibility testing, a mouse skin infection model and hemolysis analysis to identify its virulence and resistance determinants. MRSA strain SR434 is resistant to clindamycin, erythromycin and fosfomycin. Four plasmids with resistance genes were identified in this strain, including a 20,658 bp blaZ-carrying plasmid, a 2473 bp ermC-carrying plasmid, a 2622 bp fosB7-carrying plasmid (86% identity with a plasmid in an ST2590 MRSA strain) and a 4817 bp lnuA-carrying plasmid (99% identity with pLNU4 from bovine coagulase-negative staphylococci). This strain contains staphylococcal cassette chromosome mec type IV and does not contain the arginine catabolic mobile element or Panton-Valentine leukocidin (PVL). SR434 harbors genomic islands νSaα, νSaβ, νSaγ and ΦSa3 and the pathogenicity island νSa2, which carries genes encoding toxic shock syndrome toxin 1, superantigen enterotoxin C and superantigen enterotoxin L. Mouse skin infection model results showed that SR434 had a virulence potential for causing invasive skin infection similar to that of a PVL-negative epidemic Korean clone, HL1 (ST72). CA-MRSA strains of the ST88 lineage may be of great concern because of their high virulence. PVL makes a limited contribution to the virulence phenotype in this lineage. Copyright © 2017 Elsevier GmbH. All rights reserved.


July 7, 2019

IS26-mediated formation of a virulence and resistance plasmid in Salmonella Enteritidis.

To characterize a novel virulence-resistance plasmid pSE380T carried by a Salmonella enterica serotype Enteritidis clinical strain SE380. The plasmid pSE380T was conjugated to Escherichia coli strain J53 and sequenced by PacBio RSII, followed by subsequent annotation and genetic analysis. Sequence analysis of this plasmid revealed that the entire Salmonella Enteritidis-specific virulence plasmid, pSEN, had been incorporated into an IncHI2 MDR plasmid, which comprises the cephalosporin and fosfomycin resistance determinants blaCTX-M-14 and fosA3. Based on BLAST analysis and scrutiny of insertion footprints, the insertion event was found to involve a replicative transposition process mediated by IS26, an IS element frequently detected in various resistance plasmids. The resulting pSE380T plasmid also comprises backbone elements of IncHI2 and IncFIA plasmids, producing a rare fusion product that simultaneously encodes functional features of both, i.e. virulence, resistance and high transmissibility. This is a novel hybrid plasmid mediating MDR and virulence from a clinical Salmonella Enteritidis strain. This plasmid is likely to be transmissible amongst various serotypes of Salmonella and other Enterobacteriaceae species, rendering a wide range of bacterial pathogens resistant to cephalosporins and fosfomycin, and further enhancing their virulence potential. It will be important to monitor the spread and further evolution of this plasmid among the Enterobacteriaceae strains. © The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

The blaOXA-23-associated transposons in the genome of Acinetobacter spp. represent an epidemiological situation of the species encountering carbapenems.

High rates of carbapenem resistance in the human pathogen Acinetobacter baumannii threaten public health and need to be scrutinized. A total of 356 A. baumannii and 50 non-baumannii Acinetobacter spp. (NBA) strains collected in 2013 throughout South Korea were studied. The type of blaOXA-23 transposon was determined by PCR mapping and molecular epidemiology was assessed by MLST. Twelve representative strains and two comparative A. baumannii were entirely sequenced by single-molecule real-time sequencing. The carbapenem resistance rate was 88% in A. baumannii, mainly due to blaOXA-23, with five exceptional cases associated with ISAba1-blaOXA-51-like. The blaOXA-23 gene in A. baumannii was carried either by Tn2006 (44%) or Tn2009 (54%), with a few exceptions carried by Tn2008 (1.6%). Of the NBA strains, 14% were resistant to carbapenems, two with blaOXA-58 and five with blaOXA-23 associated with Tn2006. The Tn2006-possessing strains belonged to various STs, whereas Tn2008- and Tn2009-possessing strains were limited to ST208 and ST191, respectively. The three transposons were often multiplied in the chromosome, and the gene copy number and the carbapenem MICs presented linear relationships either very strongly for Tn2008 or moderately for Tn2006 and Tn2009. The dissemination of Tn2006 was facilitated by its capability for intercellular transfer and that of Tn2009 was attributable to successful dissemination of the ST191 bacterial host carrying the transposon. Tn2008 was infrequent because of its insufficient ability to undergo intercellular transfer and the scarce bacterial host A. baumannii ST208. Gene amplification is an adaptive mechanism for bacteria that encounter antimicrobial drugs. © The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

One year genome evolution of Lausannevirus in allopatric versus sympatric conditions.

Amoeba-resisting microorganisms have attracted great interest over the last decade. Among them, some large DNA viruses present huge genomes up to 2.5 Mb long, exceeding the size of small bacterial genomes. The rate of genome evolution in terms of mutation, deletion, and gene acquisition in these genomes is still unknown. Given the suspected high plasticity of viral genomes, the microevolution of the 346 kb genome of Lausannevirus, a member of Megavirales, was studied. Hence, Lausannevirus was co-cultured within the amoeba Acanthamoeba castellanii over one year. Despite a low number of mutations, the virus showed a genome reduction of 3.7% after 12 months. Lausannevirus genome evolution in sympatric conditions was investigated by its co-culture with Estrella lausannensis, an obligate intracellular bacterium, in the amoeba A. castellanii for one year. Cultures were split every 3 months. Genome sequencing revealed that in these conditions both Lausannevirus and E. lausannensis show stable genomes, presenting no major rearrangement. In fact, after one year they acquired from 2 to 7 and from 4 to 10 mutations per culture for Lausannevirus and E. lausannensis, respectively. Interestingly, different mutations in the endonuclease-encoding genes of Lausannevirus were observed in different subcultures, highlighting the importance of this gene product in the replication of Lausannevirus. Conversely, mutations in E. lausannensis were mainly located in a gene encoding a phosphoenolpyruvate-protein phosphotransferase (PtsI), implicated in sugar metabolism. Moreover, in our conditions and with our analyses we detected no horizontal gene transfer during one year of co-culture. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Whole genome sequence of two Rathayibacter toxicus strains reveals a tunicamycin biosynthetic cluster similar to Streptomyces chartreusis.

Rathayibacter toxicus is a forage grass associated Gram-positive bacterium of major concern to food safety and agriculture. This species is listed by USDA-APHIS as a plant pathogen select agent because it produces a tunicamycin-like toxin that is lethal to livestock and may be vectored by nematode species native to the U.S. The complete genomes of two strains of R. toxicus, including the type strain FH-79, were sequenced and analyzed in comparison with all available, complete R. toxicus genomes. Genome sizes ranged from 2,343,780 to 2,394,755 nucleotides, with 2079 to 2137 predicted open reading frames; all four strains showed remarkable synteny over nearly the entire genome, with only a small transposed region. A cluster of genes with similarity to the tunicamycin biosynthetic cluster from Streptomyces chartreusis was identified. The tunicamycin gene cluster (TGC) in R. toxicus contained 14 genes in two transcriptional units, with all of the functional elements for tunicamycin biosynthesis present. The TGC had a significantly lower GC content (52%) than the rest of the genome (61.5%), suggesting that the TGC may have originated from a horizontal transfer event. Further analysis indicated numerous remnants of other potential horizontal transfer events are present in the genome. In addition to the TGC, genes potentially associated with carotenoid and exopolysaccharide production, bacteriocins and secondary metabolites were identified. A CRISPR array is evident. There were relatively few plant-associated cell-wall hydrolyzing enzymes, but there were numerous secreted serine proteases that share sequence homology to the pathogenicity-associated protein Pat-1 of Clavibacter michiganensis. Overall, the genome provides clear insight into the possible mechanisms for toxin production in R. toxicus, providing a basis for future genetic approaches.
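The GC-content argument above (a gene cluster whose GC content is well below the genome average may have been acquired horizontally) rests on a simple calculation. The sketch below is illustrative only and is not the analysis performed in the study: the function names are hypothetical, ambiguous bases are excluded from the denominator by assumption, and no statistical test is applied to the difference.

```python
def gc_content(seq):
    """Percentage of G and C among the unambiguous bases of a sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    return 100.0 * gc / acgt if acgt else 0.0


def gc_shift(genome, start, end):
    """GC content of genome[start:end] minus GC content of the whole genome.

    Coordinates are 0-based and end-exclusive. A clearly negative value
    marks a region that is AT-rich relative to its genomic background.
    """
    return gc_content(genome[start:end]) - gc_content(genome)


# Toy sequence: a GC-rich stretch followed by an AT-rich stretch.
toy_genome = "GCGCGCGC" + "ATATATGC"
shift = gc_shift(toy_genome, 8, 16)
```

Applied to the figures reported above, the tunicamycin gene cluster (52%) sits 9.5 percentage points below the rest of the genome (61.5%).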


July 7, 2019

Complete genome sequence of the nematicidal Bacillus thuringiensis MYBT18246.

Bacillus thuringiensis is a rod-shaped, facultative anaerobic, spore-forming bacterium of the genus Bacillus. The defining feature of the species is the ability to produce parasporal crystal inclusion bodies, consisting of δ-endotoxins, encoded by cry genes. Here we present the complete annotated genome sequence of the nematicidal B. thuringiensis strain MYBT18246. The genome comprises one 5,867,749 bp chromosome and 11 plasmids which vary in size from 6330 bp to 150,790 bp. The chromosome contains 6092 protein-coding and 150 RNA genes, including 36 rRNA genes. The plasmids encode 997 proteins and 4 tRNAs. Analysis of the genome revealed a large number of mobile elements involved in genome plasticity including 11 plasmids and 16 chromosomal prophages. Three different nematicidal toxin genes were identified and classified according to the Cry toxin naming committee as cry13Aa2, cry13Ba1, and cry13Ab1. Strikingly, these genes are located on the chromosome in close proximity to three separate prophages. Moreover, four putative toxin genes of different toxin classes were identified on the plasmids p120510 (Vip-like toxin), p120416 (Cry-like toxin) and p109822 (two Bin-like toxins). A comparative genome analysis of B. thuringiensis MYBT18246 with three closely related B. thuringiensis strains enabled determination of the pan-genome of B. thuringiensis MYBT18246, revealing a large number of singletons, mostly represented by phage genes, morons and cryptic genes.


July 7, 2019

Restriction-modification mediated barriers to exogenous DNA uptake and incorporation employed by Prevotella intermedia.

Prevotella intermedia, a major periodontal pathogen, is increasingly implicated in human respiratory tract and cystic fibrosis lung infections. Nevertheless, the specific mechanisms employed by this pathogen remain only partially characterized and poorly understood, largely due to its total lack of genetic accessibility. Here, using Single Molecule, Real-Time (SMRT) genome and methylome sequencing, bisulfite sequencing, in addition to cloning and restriction analysis, we define the specific genetic barriers to exogenous DNA present in two of the most widespread laboratory strains, P. intermedia ATCC 25611 and P. intermedia Strain 17. We identified and characterized multiple restriction-modification (R-M) systems, some of which are considerably divergent between the two strains. We propose that these R-M systems are the root cause of the P. intermedia transformation barrier. Additionally, we note the presence of conserved Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) systems in both strains, which could provide a further barrier to exogenous DNA uptake and incorporation. This work will provide a valuable resource during the development of a genetic system for P. intermedia, which will be required for fundamental investigation of this organism’s physiology, metabolism, and pathogenesis in human disease.

