Menu
July 7, 2019

Phase-variable methylation and epigenetic regulation by type I restriction-modification systems.

Epigenetic modifications in bacteria, such as DNA methylation, have been shown to affect gene regulation, thereby generating cells that are isogenic but with distinctly different phenotypes. Restriction-modification (RM) systems contain prototypic methylases that are responsible for much of bacterial DNA methylation. This review focuses on a distinctive group of type I RM loci that , through phase variation, can modify their methylation target specificity and can thereby switch bacteria between alternative patterns of DNA methylation. Phase variation occurs at the level of the target recognition domains of the hsdS (specificity) gene via reversible recombination processes acting upon multiple hsdS alleles. We describe the global distribution of such loci throughout the prokaryotic kingdom and highlight the differences in loci structure across the various bacterial species. Although RM systems are often considered simply as an evolutionary response to bacteriophages, these multi-hsdS type I systems have also shown the capacity to change bacterial phenotypes. The ability of these RM systems to allow bacteria to reversibly switch between different physiological states, combined with the existence of such loci across many species of medical and industrial importance, highlights the potential of phase-variable DNA methylation to act as a global regulatory mechanism in bacteria.© FEMS 2017.


July 7, 2019

The complete genome sequence of the fish pathogen Tenacibaculum maritimum provides insights into virulence mechanisms.

Tenacibaculum maritimum is a devastating bacterial pathogen of wild and farmed marine fish with a broad host range and a worldwide distribution. We report here the complete genome sequence of the T. maritimum type strain NCIMB 2154(T). The genome consists of a 3,435,971-base pair circular chromosome with 2,866 predicted protein-coding genes. Genes encoding the biosynthesis of exopolysaccharides, the type IX secretion system, iron uptake systems, adhesins, hemolysins, proteases, and glycoside hydrolases were identified. They are likely involved in the virulence process including immune escape, invasion, colonization, destruction of host tissues, and nutrient scavenging. Among the predicted virulence factors, type IX secretion-mediated and cell-surface exposed proteins were identified including an atypical sialidase, a sphingomyelinase and a chondroitin AC lyase which activities were demonstrated in vitro.


July 7, 2019

Genome sequencing and comparative genomics of enterohemorrhagic Escherichia coli O145:H25 and O145:H28 reveal distinct evolutionary paths and marked variations in traits associated with virulence & colonization.

Enterohemorrhagic Escherichia coli (EHEC) O145 are among the top non-O157 serogroups associated with severe human disease worldwide. Two serotypes, O145:H25 and O145:H28 have been isolated from human patients but little information is available regarding the virulence repertoire, origin and evolutionary relatedness of O145:H25. Hence, we sequenced the complete genome of two O145:H25 strains associated with hemolytic uremic syndrome (HUS) and compared the genomes with those of previously sequenced O145:H28 and other EHEC strains.The genomes of the two O145:H25 strains were 5.3 Mbp in size; slightly smaller than those of O145:H28 and other EHEC strains. Both strains contained three nearly identical plasmids and several prophages and integrative elements, many of which differed significantly in size, gene content and organization as compared to those present in O145:H28 and other EHECs. Furthermore, notable variations were observed in several fimbrial gene cluster and intimin types possessed by O145:H25 and O145:H28 indicating potential adaptation to distinct areas of host colonization. Comparative genomics further revealed that O145:H25 are genetically more similar to other non-O157 EHEC strains than to O145:H28.Phylogenetic analysis accompanied by comparative genomics revealed that O145:H25 and O145:H28 evolved from two separate clonal lineages and that horizontal gene transfer and gene loss played a major role in the divergence of these EHEC serotypes. The data provide further evidence that ruminants might be a possible reservoir for O145:H25 but that they might be impaired in their ability to establish a persistent colonization as compared to other EHEC strains.


July 7, 2019

Complete genome sequence of a commensal bacterium, Hafnia alvei CBA7124, isolated from human feces.

Members of the genus Hafnia have been isolated from the feces of mammals, birds, reptiles, and fish, as well as from soil, water, sewage, and foods. Hafnia alvei is an opportunistic pathogen that has been implicated in intestinal and extraintestinal infections in humans. However, its pathogenicity is still unclear. In this study, we isolated H. alvei from human feces and performed sequencing as well as comparative genomic analysis to better understand its pathogenicity.The genome of H. alvei CBA7124 comprised a single circular chromosome with 4,585,298 bp and a GC content of 48.8%. The genome contained 25 rRNA genes (9 5S rRNA genes, 8 16S rRNA genes, and 8 23S rRNA genes), 88 tRNA genes, and 4043 protein-coding genes. Using comparative genomic analysis, the genome of this strain was found to have 72 strain-specific singletons. The genome also contained genes for antibiotic and antimicrobial resistance, as well as toxin-antitoxin systems.We revealed the complete genome sequence of the opportunistic gut pathogen, H. alvei CBA7124. We also performed comparative genomic analysis of the sequences in the genome of H. alvei CBA7124, and found that it contained strain-specific singletons, antibiotic resistance genes, and toxin-antitoxin systems. These results could improve our understanding of the pathogenicity and the mechanism behind the antibiotic resistance of H. alvei strains.


July 7, 2019

Genome sequencing and comparative genomics reveal the potential pathogenic mechanism of Cercospora sojina Hara on soybean.

Frogeye leaf spot, caused by Cercospora sojina Hara, is a common disease of soybean in most soybean-growing countries of the world. In this study, we report a high-quality genome sequence of C. sojina by Single Molecule Real-Time sequencing method. The 40.8-Mb genome encodes 11,655 predicated genes, and 8,474 genes are revealed by RNA sequencing. Cercospora sojina genome contains large numbers of gene clusters that are involved in synthesis of secondary metabolites, including mycotoxins and pigments. However, much less carbohydrate-binding module protein encoding genes are identified in C. sojina genome, when compared with other phytopathogenic fungi. Bioinformatics analysis reveals that C. sojina harbours about 752 secreted proteins, and 233 of them are effectors. During early infection, the genes for metabolite biosynthesis and effectors are significantly enriched, suggesting that they may play essential roles in pathogenicity. We further identify 13 effectors that can inhibit BAX-induced cell death. Taken together, our results provide insights into the infection mechanisms of C. sojina on soybean.© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Heterogeneity of the Epstein-Barr virus major internal repeat reveals evolutionary mechanisms of EBV and a functional defect in the prototype EBV strain B95-8.

Epstein-Barr virus (EBV) is a ubiquitous pathogen of humans that can cause several types of lymphoma and carcinoma. Like other herpesviruses, EBV has diversified both through co-evolution with its host, and genetic exchange between virus strains. Sequence analysis of the EBV genome is unusually challenging, because of the large number and length of repeat regions within the virus. Here we describe the sequence assembly and analysis of the large internal repeat of EBV (IR1 or BamW repeats) from over 70 strains.Diversity of the latency protein EBNA-LP resides predominantly within the exons downstream of IR1. The integrity of the putative BWRF1 ORF is retained in over 80% of strains, and deletions truncating IR1 always spare BWRF1. Conserved regions include the IR1 latency promoter (Wp), and one zone upstream of and two within BWRF1.IR1 is heterogeneous in 70% of strains, and this heterogeneity arises from sequence exchange between strains as well as spontaneous mutation, with inter-strain recombination more common in tumour-derived viruses. This genetic exchange often incorporates regions of <1kb, and allelic gene conversion changes the frequency of small regions within the repeat, but not close to the flanks. These observations suggest that IR1 - and by extension EBV - diversifies through both recombination and breakpoint repair, while concerted evolution of IR1 is driven by gene conversion of small regions. Finally, the prototype EBV strain B95-8 contains four non-consensus variants within a single IR1 repeat unit, including a STOP codon in EBNA-LP. Repairing IR1 improves EBNA-LP levels and the quality of transformation by the B95-8 BAC.IMPORTANCE Epstein-Barr virus (EBV) infects the majority of the world population, but only causes illness in a small minority. Nevertheless, over 1% of cancers worldwide are attributable to EBV. Recent sequencing projects investigating virus diversity, to see if different strains have different disease impacts, have excluded regions of repeating sequence, as they are more technically challenging. Here we analyse the sequence of the largest repeat in EBV (IR1). We first characterised the variations in protein sequences encoded across IR1. In studying variations within the repeat of each strain, we identified a mutation in the main laboratory strain of EBV that impairs virus function, and suggest that tumour-associated viruses may be more likely to contain DNA mixed from two strains. Patterns of this mixing suggest that sequences can spread between strains (and also within the repeat) by copying sequence from another strain (or repeat unit) to repair DNA damage. Copyright © 2017 Ba abdullah et al.


July 7, 2019

Parallel evolution of group B Streptococcus hypervirulent clonal complex 17 unveils new pathoadaptive mutations.

Group B Streptococcus (GBS) is a commensal of the gastrointestinal and genitourinary tracts, while a prevailing cause of neonatal disease worldwide. Of the various clonal complexes (CCs), CC17 is overrepresented in GBS-infected newborns for reasons that are still largely unknown. Here, we report a comprehensive genomic analysis of 626 CC17 isolates collected worldwide, identifying the genetic traits behind their successful adaptation to humans and the underlying differences between carriage and clinical strains. Comparative analysis with 923 GBS genomes belonging to CC1, CC19, and CC23 revealed that the evolution of CC17 is distinct from that of other human-adapted lineages and recurrently targets functions related to nucleotide and amino acid metabolism, cell adhesion, regulation, and immune evasion. We show that the most distinctive features of disease-specific CC17 isolates were frequent mutations in the virulence-associated CovS and Stk1 kinases, underscoring the crucial role of the entire CovRS regulatory pathway in modulating the pathogenicity of GBS. Importantly, parallel and convergent evolution of major components of the bacterial cell envelope, such as the capsule biosynthesis operon, the pilus, and Rib, reflects adaptation to host immune pressures and should be taken into account in the ongoing development of a GBS vaccine. The presence of recurrent targets of evolution not previously implicated in virulence also opens the way for uncovering new functions involved in host colonization and GBS pathogenesis. IMPORTANCE The incidence of group B Streptococcus (GBS) neonatal disease continues to be a significant cause of concern worldwide. Strains belonging to clonal complex 17 (CC17) are the most frequently responsible for GBS infections in neonates, especially among late-onset disease cases. Therefore, we undertook the largest genomic study of GBS CC17 strains to date to decipher the genetic bases of their remarkable colonization and infection ability. We show that crucial functions involved in different steps of the colonization or infection process of GBS are distinctly mutated during the adaptation of CC17 to the human host. In particular, our results implicate the CovRS two-component regulator of virulence in the differentiation between carriage- and disease-associated isolates. Not only does this work raise important implications for the ongoing development of a vaccine against GBS but might also drive the discovery of key functions for GBS adaptation and pathogenesis that have been overlooked until now. Author Video: An author video summary of this article is available.


July 7, 2019

Bacteriophages are the major drivers of Shigella flexneri serotype 1c genome plasticity: a complete genome analysis.

Shigella flexneri is the primary cause of bacillary dysentery in the developing countries. S. flexneri serotype 1c is a novel serotype, which is found to be endemic in many developing countries, but little is known about its genomic architecture and virulence signatures. We have sequenced for the first time, the complete genome of S. flexneri serotype 1c strain Y394, to provide insights into its diversity and evolution.We generated a high-quality reference genome of S. flexneri serotype 1c using the hybrid methods of long-read single-molecule real-time (SMRT) sequencing technology and short-read MiSeq (Illumina) sequencing technology. The Y394 chromosome is 4.58 Mb in size and shares the basic genomic features with other S. flexneri complete genomes. However, it possesses unique and highly modified O-antigen structure comprising of three distinct O-antigen modifying gene clusters that potentially came from three different bacteriophages. It also possesses a large number of hypothetical unique genes compared to other S. flexneri genomes.Despite a high level of structural and functional similarities of Y394 genome with other S. flexneri genomes, there are marked differences in the pathogenic islands. The diversity in the pathogenic islands suggests that these bacterial pathogens are well adapted to respond to the selection pressures during their evolution, which might contribute to the differences in their virulence potential.


July 7, 2019

Population structure and local adaptation of MAC lung disease agent Mycobacterium avium subsp. hominissuis.

Mycobacterium avium subsp. hominissuis (MAH) is one of the most common nontuberculous mycobacterial species responsible for chronic lung disease in humans. Despite increasing worldwide incidence, little is known about the genetic mechanisms behind the population evolution of MAH. To elucidate the local adaptation mechanisms of MAH, we assessed genetic population structure, the mutual homologous recombination, and gene content for 36 global MAH isolates, including 12 Japanese isolates sequenced in the present study. We identified five major MAH lineages and found that extensive mutual homologous recombination occurs among them. Two lineages (MahEastAsia1 and MahEastAsia2) were predominant in the Japanese isolates. We identified alleles unique to these two East Asian lineages in the loci responsible for trehalose biosynthesis (treS and mak) and in one mammalian cell entry operon, which presumably originated from as yet undiscovered mycobacterial lineages. Several genes and alleles unique to East Asian strains were located in the fragments introduced via recombination between East Asian lineages, suggesting implication of recombination in local adaptation. These patterns of MAH genomes are consistent with the signature of distribution conjugative transfer, a mode of sexual reproduction reported for other mycobacterial species.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Resistance to ceftazidime-avibactam is due to tranposition of KPC in a porin-deficient strain of Klebsiella pneumoniae with increased efflux activity.

Ceftazidime-avibactam is an antibiotic with activity against serine beta-lactamases, including Klebsiella pneumoniae carbapenemase (KPC). Recently, reports have emerged of KPC-producing isolates resistant to this antibiotic, including a report of a wild-type KPC-3 producing sequence type 258 Klebsiella pneumoniae that was resistant to ceftazidime-avibactam. We describe a detailed analysis of this isolate, in the context of two other closely related KPC-3 producing isolates, recovered from the same patient. Both isolates encoded a nonfunctional OmpK35, whereas we demonstrate that a novel T333N mutation in OmpK36, present in the ceftazidime-avibactam resistant isolate, reduced the activity of this porin and impacted ceftazidime-avibactam susceptibility. In addition, we demonstrate that the increased expression of blaKPC-3 and blaSHV-12 observed in the ceftazidime-avibactam-resistant isolate was due to transposition of the Tn4401 transposon harboring blaKPC-3 into a second plasmid, pIncX3, which also harbored blaSHV-12, ultimately resulting in a higher copy number of blaKPC-3 in the resistant isolate. pIncX3 plasmid from the ceftazidime-avibactam resistant isolate, conjugated into a OmpK35/36-deficient K. pneumoniae background that harbored a mutation to the ramR regulator of the acrAB efflux operon recreated the ceftazidime-avibactam-resistant MIC of 32 µg/ml, confirming that this constellation of mutations is responsible for the resistance phenotype. Copyright © 2017 American Society for Microbiology.


July 7, 2019

Parallel evolution of two clades of a major Atlantic endemic Vibrio parahaemolyticus pathogen lineage by independent acquisition of related pathogenicity islands.

Shellfish-transmitted Vibrio parahaemolyticus infections have recently increased from locations with historically low disease incidence, such as the Northeast United States (US). This change coincided with a bacterial population shift towards human pathogenic variants occurring in part through the introduction of several Pacific native lineages (ST36, ST43 and ST636) to near-shore areas off the Atlantic coast of the Northeast US. Concomitantly, ST631 emerged as a major endemic pathogen. Phylogenetic trees of clinical and environmental isolates indicated that two clades diverged from a common ST631 ancestor, and in each of these clades, a human pathogenic variant evolved independently through acquisition of distinct Vibrio pathogenicity islands (VPaI). These VPaI differ from each other and bear little resemblance to hemolysin-containing VPaI from isolates of the pandemic clonal complex. Clade I ST631 isolates either harbored no hemolysins, or contained a chromosome I-inserted island we call VPaIß that encodes a type three secretion system (T3SS2ß) typical of Trh hemolysin-producers. The more clinically prevalent and clonal ST631 clade II had an island we call VPaI? that encodes both tdh and trh and that was inserted in chromosome II. VPaI? was derived from VPaIß but with some additional acquired elements in common with VPaI carried by pandemic isolates, exemplifying the mosaic nature of pathogenicity islands. Genomics comparisons and amplicon assays identified VPaI?-type islands containing tdh inserted adjacent to the ure cluster in the three introduced Pacific and most other emergent lineages. that collectively cause 67% of Northeast US infections as of 2016.IMPORTANCE The availability of three different hemolysin genotypes in the ST631 lineage provided a unique opportunity to employ genome comparisons to further our understanding of the processes underlying pathogen evolution. The fact that two different pathogenic clades arose in parallel from the same potentially benign lineage by independent VPaI acquisition is surprising considering the historically low prevalence of community members harboring VPaI in waters along the Northeast US coast that could serve as the source of this material. This illustrates a possible predisposition of some lineages to not only acquire foreign DNA but also to become human pathogens. Whereas the underlying cause for the expansion of V. parahaemolyticus lineages harboring VPaI? along the US Atlantic coast and spread of this element to multiple lineages that underlies disease emergence is not known, this work underscores the need to define the environment factors that favor bacteria harboring VPaI in locations of emergent disease. Copyright © 2017 American Society for Microbiology.


July 7, 2019

SVachra: a tool to identify genomic structural variation in mate pair sequencing data containing inward and outward facing reads.

Characterization of genomic structural variation (SV) is essential to expanding the research and clinical applications of genome sequencing. Reliance upon short DNA fragment paired end sequencing has yielded a wealth of single nucleotide variants and internal sequencing read insertions-deletions, at the cost of limited SV detection. Multi-kilobase DNA fragment mate pair sequencing has supplemented the void in SV detection, but introduced new analytic challenges requiring SV detection tools specifically designed for mate pair sequencing data. Here, we introduce SVachra – Structural Variation Assessment of CHRomosomal Aberrations, a breakpoint calling program that identifies large insertions-deletions, inversions, inter- and intra-chromosomal translocations utilizing both inward and outward facing read types generated by mate pair sequencing.We demonstrate SVachra’s utility by executing the program on large-insert (Illumina Nextera) mate pair sequencing data from the personal genome of a single subject (HS1011). An additional data set of long-read (Pacific BioSciences RSII) was also generated to validate SV calls from SVachra and other comparison SV calling programs. SVachra exhibited the highest validation rate and reported the widest distribution of SV types and size ranges when compared to other SV callers.SVachra is a highly specific breakpoint calling program that exhibits a more unbiased SV detection methodology than other callers.


July 7, 2019

Convergent evolution of Y chromosome gene content in flies.

Sex-chromosomes have formed repeatedly across Diptera from ordinary autosomes, and X-chromosomes mostly conserve their ancestral genes. Y-chromosomes are characterized by abundant gene-loss and an accumulation of repetitive DNA, yet the nature of the gene repertoire of fly Y-chromosomes is largely unknown. Here we trace gene-content evolution of Y-chromosomes across 22 Diptera species, using a subtraction pipeline that infers Y genes from male and female genome, and transcriptome data. Few genes remain on old Y-chromosomes, but the number of inferred Y-genes varies substantially between species. Young Y-chromosomes still show clear evidence of their autosomal origins, but most genes on old Y-chromosomes are not simply remnants of genes originally present on the proto-sex-chromosome that escaped degeneration, but instead were recruited secondarily from autosomes. Despite almost no overlap in Y-linked gene content in different species with independently formed sex-chromosomes, we find that Y-linked genes have evolved convergent gene functions associated with testis expression. Thus, male-specific selection appears as a dominant force shaping gene-content evolution of Y-chromosomes across fly species.While X-chromosome gene content tends to be conserved, Y-chromosome evolution is dynamic and difficult to reconstruct. Here, Mahajan and Bachtrog use a subtraction pipeline to identify Y-linked genes in 22 Diptera species, revealing patterns of Y-chromosome gene-content evolution.


July 7, 2019

LOGAN: A framework for LOssless Graph-based ANalysis of high throughput sequence data

Recent massive growth in the production of sequencing data necessitates matching improvements in bioinformatics tools to effectively utilize it. Existing tools suffer from limitations in both scalability and applicability which are inherent to their underlying algorithms and data structures. We identify the key requirements for the ideal data structure for sequence analyses: it should be informationally lossless, locally updatable, and memory efficient; requirements which are not met by data structures underlying the major assembly strategies Overlap Layout Consensus and De Bruijn Graphs. We therefore propose a new data structure, the LOGAN graph, which is based on a memory efficient Sparse De Bruijn Graph with routing information. Innovations in storing routing information and careful implementation allow sequence datasets for Escherichia coli (4.6Mbp, 117x coverage), Arabidopsis thaliana (135Mbp, 17.5x coverage) and Solanum pennellii (1.2Gbp, 47x coverage) to be loaded into memory on a desktop computer in seconds, minutes, and hours respectively. Memory consumption is competitive with state of the art alternatives, while losslessly representing the reads in an indexed and updatable form. Both Second and Third Generation Sequencing reads are supported. Thus, the LOGAN graph is positioned to be the backbone for major breakthroughs in sequence analysis such as integrated hybrid assembly, assembly of exceptionally large and repetitive genomes, as well as assembly and representation of pan-genomes.


July 7, 2019

The Mobile Element Locator Tool (MELT): population-scale mobile element discovery and biology.

Mobile element insertions (MEIs) represent ~25% of all structural variants in human genomes. Moreover, when they disrupt genes, MEIs can influence human traits and diseases. Therefore, MEIs should be fully discovered along with other forms of genetic variation in whole genome sequencing (WGS) projects involving population genetics, human diseases, and clinical genomics. Here, we describe the Mobile Element Locator Tool (MELT), which was developed as part of the 1000 Genomes Project to perform MEI discovery on a population scale. Using both Illumina WGS data and simulations, we demonstrate that MELT outperforms existing MEI discovery tools in terms of speed, scalability, specificity, and sensitivity, while also detecting a broader spectrum of MEI-associated features. Several run modes were developed to perform MEI discovery on local and cloud systems. In addition to using MELT to discover MEIs in modern humans as part of the 1000 Genomes Project, we also used it to discover MEIs in chimpanzees and ancient (Neanderthal and Denisovan) hominids. We detected diverse patterns of MEI stratification across these populations that likely were caused by (1) diverse rates of MEI production from source elements, (2) diverse patterns of MEI inheritance, and (3) the introgression of ancient MEIs into modern human genomes. Overall, our study provides the most comprehensive map of MEIs to date spanning chimpanzees, ancient hominids, and modern humans and reveals new aspects of MEI biology in these lineages. We also demonstrate that MELT is a robust platform for MEI discovery and analysis in a variety of experimental settings.© 2017 Gardner et al.; Published by Cold Spring Harbor Laboratory Press.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.