Menu
July 7, 2019

Benchmark datasets for phylogenomic pipeline validation, applications for foodborne pathogen surveillance.

As next generation sequence technology has advanced, there have been parallel advances in genome-scale analysis programs for determining evolutionary relationships as proxies for epidemiological relationship in public health. Most new programs skip traditional steps of ortholog determination and multi-gene alignment, instead identifying variants across a set of genomes, then summarizing results in a matrix of single-nucleotide polymorphisms or alleles for standard phylogenetic analysis. However, public health authorities need to document the performance of these methods with appropriate and comprehensive datasets so they can be validated for specific purposes, e.g., outbreak surveillance. Here we propose a set of benchmark datasets to be used for comparison and validation of phylogenomic pipelines.We identified four well-documented foodborne pathogen events in which the epidemiology was concordant with routine phylogenomic analyses (reference-based SNP and wgMLST approaches). These are ideal benchmark datasets, as the trees, WGS data, and epidemiological data for each are all in agreement. We have placed these sequence data, sample metadata, and “known” phylogenetic trees in publicly-accessible databases and developed a standard descriptive spreadsheet format describing each dataset. To facilitate easy downloading of these benchmarks, we developed an automated script that uses the standard descriptive spreadsheet format.Our “outbreak” benchmark datasets represent the four major foodborne bacterial pathogens (Listeria monocytogenes, Salmonella enterica, Escherichia coli, and Campylobacter jejuni) and one simulated dataset where the “known tree” can be accurately called the “true tree”. The downloading script and associated table files are available on GitHub: https://github.com/WGS-standards-and-analysis/datasets.These five benchmark datasets will help standardize comparison of current and future phylogenomic pipelines, and facilitate important cross-institutional collaborations. Our work is part of a global effort to provide collaborative infrastructure for sequence data and analytic tools-we welcome additional benchmark datasets in our recommended format, and, if relevant, we will add these on our GitHub site. Together, these datasets, dataset format, and the underlying GitHub infrastructure present a recommended path for worldwide standardization of phylogenomic pipelines.


July 7, 2019

One year genome evolution of Lausannevirus in allopatric versus sympatric conditions.

Amoeba-resisting microorganisms raised a great interest during the last decade. Among them, some large DNA viruses present huge genomes up to 2.5?Mb long, exceeding the size of small bacterial genomes. The rate of genome evolution in terms of mutation, deletion, and gene acquisition in these genomes is yet unknown. Given the suspected high plasticity of viral genomes, the microevolution of the 346?kb genome of Lausannevirus, a member of Megavirales, was studied. Hence, Lausannevirus was co-cultured within the amoeba Acanthamoeba castellanii over one year. Despite a low number of mutations, the virus showed a genome reduction of 3.7% after 12?months. Lausannevirus genome evolution in sympatric conditions was investigated by its co-culture with Estrella lausannensis, an obligate intracellular bacterium, in the amoeba A. castellanii during one year. Cultures were split every 3?months. Genome sequencing revealed that in these conditions both, Lausannevirus and E. lausannensis, show stable genome, presenting no major rearrangement. In fact, after one year they acquired from 2 to 7 and from 4 to 10 mutations per culture for Lausannevirus and E. lausannensis, respectively. Interestingly, different mutations in the endonuclease encoding genes of Lausannevirus were observed in different subcultures, highlighting the importance of this gene product in the replication of Lausannevirus. Conversely, mutations in E. lausannensis were mainly located in a gene encoding for a phosphoenolpyruvate-protein phosphotransferase (PtsI), implicated in sugar metabolism. Moreover, in our conditions and with our analyses we detected no horizontal gene transfer during one year of co-culture.© The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Improved PKS gene expression with strong endogenous promoter resulted in geldanamycin yield increase.

The type I polyketide geldanamycin is a potent anti-tumor reagent. Its biosynthesis includes three steps: the biosynthesis of precursors, such as 3-amino-5-hydroxybenzoic acid (AHBA), the polyketide synthase (PKS) chain extension, and the post-PKS modifications. According to the genomic and transcriptomic analysis, the PKS chain extension was deduced to be the rate-limiting step for geldanamycin production in Streptomyces hygroscopicus XM201. In order to improve the expression of PKS genes, a strong endogenous promoter 5063p was obtained based on the transcriptomic analysis and XylE enzymatic assay. By replacing the native PKS promoter gdmA1p with 5063p, the expression of the PKS genes during geldanamycin fermentation was increased by 4-141-folds, and the geldanamycin yield was increased by 39%. Interestingly, AHBA feeding experiment showed that the supply of AHBA in turn become a new rate-limiting factor for geldanamycin production. Further combined overexpression of the 6-gene AHBA biosynthetic cassette and PKS genes increased the yield of geldanamycin by 88%, from 773?mg?L(-1) of the wild-type to 1450?mg?L(-1) in the derived strain. Our results suggested that improved expression of all PKS genes in a particular biosynthetic gene cluster is important for the yield increase of the corresponding polyketide natural product.© 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


July 7, 2019

Novel pelagic iron-oxidizing Zetaproteobacteria from the Chesapeake Bay oxic-anoxic transition zone.

Chemolithotrophic iron-oxidizing bacteria (FeOB) could theoretically inhabit any environment where Fe(II) and O2 (or nitrate) coexist. Until recently, marine Fe-oxidizing Zetaproteobacteria had primarily been observed in benthic and subsurface settings, but not redox-stratified water columns. This may be due to the challenges that a pelagic lifestyle would pose for Zetaproteobacteria, given low Fe(II) concentrations in modern marine waters and the possibility that Fe oxyhydroxide biominerals could cause cells to sink. However, we recently cultivated Zetaproteobacteria from the Chesapeake Bay oxic-anoxic transition zone, suggesting that they can survive and contribute to biogeochemical cycling in a stratified estuary. Here we describe the isolation, characterization, and genomes of two new species, Mariprofundus aestuarium CP-5 and Mariprofundus ferrinatatus CP-8, which are the first Zetaproteobacteria isolates from a pelagic environment. We looked for adaptations enabling strains CP-5 and CP-8 to overcome the challenges of living in a low Fe redoxcline with frequent O2 fluctuations due to tidal mixing. We found that the CP strains produce distinctive dreadlock-like Fe oxyhydroxide structures that are easily shed, which would help cells maintain suspension in the water column. These oxides are by-products of Fe(II) oxidation, likely catalyzed by the putative Fe(II) oxidase encoded by the cyc2 gene, present in both CP-5 and CP-8 genomes; the consistent presence of cyc2 in all microaerophilic FeOB and other FeOB genomes supports its putative role in Fe(II) oxidation. The CP strains also have two gene clusters associated with biofilm formation (Wsp system and the Widespread Colonization Island) that are absent or rare in other Zetaproteobacteria. We propose that biofilm formation enables the CP strains to attach to FeS particles and form flocs, an advantageous strategy for scavenging Fe(II) and developing low [O2] microenvironments within more oxygenated waters. However, the CP strains appear to be adapted to somewhat higher concentrations of O2, as indicated by the presence of genes encoding aa3-type cytochrome c oxidases, but not the cbb3-type found in all other Zetaproteobacteria isolate genomes. Overall, our results reveal adaptations for life in a physically dynamic, low Fe(II) water column, suggesting that niche-specific strategies can enable Zetaproteobacteria to live in any environment with Fe(II).


July 7, 2019

Insights into the red algae and eukaryotic evolution from the genome of Porphyra umbilicalis (Bangiophyceae, Rhodophyta).

Porphyra umbilicalis (laver) belongs to an ancient group of red algae (Bangiophyceae), is harvested for human food, and thrives in the harsh conditions of the upper intertidal zone. Here we present the 87.7-Mbp haploid Porphyra genome (65.8% G + C content, 13,125 gene loci) and elucidate traits that inform our understanding of the biology of red algae as one of the few multicellular eukaryotic lineages. Novel features of the Porphyra genome shared by other red algae relate to the cytoskeleton, calcium signaling, the cell cycle, and stress-tolerance mechanisms including photoprotection. Cytoskeletal motor proteins in Porphyra are restricted to a small set of kinesins that appear to be the only universal cytoskeletal motors within the red algae. Dynein motors are absent, and most red algae, including Porphyra, lack myosin. This surprisingly minimal cytoskeleton offers a potential explanation for why red algal cells and multicellular structures are more limited in size than in most multicellular lineages. Additional discoveries further relating to the stress tolerance of bangiophytes include ancestral enzymes for sulfation of the hydrophilic galactan-rich cell wall, evidence for mannan synthesis that originated before the divergence of green and red algae, and a high capacity for nutrient uptake. Our analyses provide a comprehensive understanding of the red algae, which are both commercially important and have played a major role in the evolution of other algal groups through secondary endosymbioses.


July 7, 2019

Dynamics and impact of homologous recombination on the evolution of Legionella pneumophila.

Legionella pneumophila is an environmental bacterium and the causative agent of Legionnaires’ disease. Previous genomic studies have shown that recombination accounts for a high proportion (>96%) of diversity within several major disease-associated sequence types (STs) of L. pneumophila. This suggests that recombination represents a potentially important force shaping adaptation and virulence. Despite this, little is known about the biological effects of recombination in L. pneumophila, particularly with regards to homologous recombination (whereby genes are replaced with alternative allelic variants). Using newly available population genomic data, we have disentangled events arising from homologous and non-homologous recombination in six major disease-associated STs of L. pneumophila (subsp. pneumophila), and subsequently performed a detailed characterisation of the dynamics and impact of homologous recombination. We identified genomic “hotspots” of homologous recombination that include regions containing outer membrane proteins, the lipopolysaccharide (LPS) region and Dot/Icm effectors, which provide interesting clues to the selection pressures faced by L. pneumophila. Inference of the origin of the recombined regions showed that isolates have most frequently imported DNA from isolates belonging to their own clade, but also occasionally from other major clades of the same subspecies. This supports the hypothesis that the possibility for horizontal exchange of new adaptations between major clades of the subspecies may have been a critical factor in the recent emergence of several clinically important STs from diverse genomic backgrounds. However, acquisition of recombined regions from another subspecies, L. pneumophila subsp. fraseri, was rarely observed, suggesting the existence of a recombination barrier and/or the possibility of ongoing speciation between the two subspecies. Finally, we suggest that multi-fragment recombination may occur in L. pneumophila, whereby multiple non-contiguous segments that originate from the same molecule of donor DNA are imported into a recipient genome during a single episode of recombination.


July 7, 2019

Genome diversity and evolution in the budding yeasts (Saccharomycotina).

Considerable progress in our understanding of yeast genomes and their evolution has been made over the last decade with the sequencing, analysis, and comparisons of numerous species, strains, or isolates of diverse origins. The role played by yeasts in natural environments as well as in artificial manufactures, combined with the importance of some species as model experimental systems sustained this effort. At the same time, their enormous evolutionary diversity (there are yeast species in every subphylum of Dikarya) sparked curiosity but necessitated further efforts to obtain appropriate reference genomes. Today, yeast genomes have been very informative about basic mechanisms of evolution, speciation, hybridization, domestication, as well as about the molecular machineries underlying them. They are also irreplaceable to investigate in detail the complex relationship between genotypes and phenotypes with both theoretical and practical implications. This review examines these questions at two distinct levels offered by the broad evolutionary range of yeasts: inside the best-studied Saccharomyces species complex, and across the entire and diversified subphylum of Saccharomycotina. While obviously revealing evolutionary histories at different scales, data converge to a remarkably coherent picture in which one can estimate the relative importance of intrinsic genome dynamics, including gene birth and loss, vs. horizontal genetic accidents in the making of populations. The facility with which novel yeast genomes can now be studied, combined with the already numerous available reference genomes, offer privileged perspectives to further examine these fundamental biological questions using yeasts both as eukaryotic models and as fungi of practical importance. Copyright © 2017 by the Genetics Society of America.


July 7, 2019

Complete genome sequence of Hymenobacter sedentarius DG5BT, a bacterium resistant to gamma radiation

The ionizing radiation toxicity becomes a primary concern of the world; several exceptional attention was given to the resistance mechanisms of the radiation-resistant bacteria. Hymenobacter sedentarius DG5BT strain isolated from the gamma ray-irradiated soil samples shows resistance against gamma and UV radiation; however, their level of resistance is lower than that of other radiation resistant bacteria. To gain insight of radiation resistance, we carried out the whole genome sequencing of this strain. The genome of DG5BT strain is comprised of 4,868,852 bp (G+C content of 60.96%) including 3,994 protein-coding genes and 55 RNA genes. When compared with other bacteria, there are differences in compositions and copy numbers of several genes involved in DNA repair pathways and defense mechanism against protein damages. In this study, we discuss the implication of such findings concerning other radiation resistant bacteria.


July 7, 2019

Emergence and genomic diversification of a virulent serogroup W: ST-2881 (CC175) Neisseria meningitidis clone in the African meningitis belt

Countries of the African ‘meningitis belt’ are susceptible to meningococcal meningitis outbreaks. While in the past major epidemics have been primarily caused by serogroup A meningococci, W strains are currently responsible for most of the cases. After an epidemic in Mecca in 2000, W:ST-11 strains have caused many outbreaks worldwide. An unrelated W:ST-2881 clone was described for the first time in 2002, with the first meningitis cases caused by these bacteria reported in 2003. Here we describe results of a comparative whole-genome analysis of 74 W:ST-2881 strains isolated within the framework of two longitudinal colonization and disease studies conducted in Ghana and Burkina Faso. Genomic data indicate that the W:ST-2881 clone has emerged from Y:ST-175(CC175) bacteria by capsule switching. The circulating W:ST-2881 populations were composed of a variety of closely related but distinct genomic variants with no systematic differences between colonization and disease isolates. Two distinct and geographically clustered phylogenetic clonal variants were identified in Burkina Faso and a third in Ghana. On the basis of the presence or absence of 17 recombination fragments, the Ghanaian variant could be differentiated into five clusters. All 25 Ghanaian disease isolates clustered together with 23 out of 40 Ghanaian isolates associated with carriage within one cluster, indicating that W:ST-2881 clusters differ in virulence. More than half of the genes affected by horizontal gene transfer encoded proteins of the ‘cell envelope’ and the ‘transport/binding protein’ categories, which indicates that exchange of non-capsular antigens plays an important role in immune evasion.


July 7, 2019

Morphological and genetic analyses of the invasive forest pathogen Phytophthora austrocedri reveal two clonal lineages colonised Britain and Argentina from a common ancestral population.

Phytophthora austrocedri is causing widespread mortality of Austrocedrus chilensis in Argentina and Juniperus communis in Britain. The pathogen has also been isolated from J. horizontalis in Germany. Isolates from Britain, Argentina and Germany are homothallic with no clear differences in the dimensions of sporangia, oogonia or oospores. Argentinian and German isolates grew faster than British isolates across a range of media and had a higher temperature tolerance although most isolates regardless of origin grew best at 15°C and all isolates were killed at 25°C. Argentinian and British isolates caused lesions on both hosts when inoculated onto A. chilensis and J. communis; however the Argentinian isolate caused longer lesions on A. chilensis than on J. communis and vice versa for the British isolate. Genetic analyses of nuclear and mitochondrial loci showed that all British isolates are identical. Argentinian isolates and the German isolate are also identical but differ from the British isolates. Single nucleotide polymorphisms are shared between the British and Argentinian isolates. It is concluded that British isolates and Argentinian isolates conform to two distinct clonal lineages of P. austrocedri founded from the same as-yet unidentified source population. These lineages should be recognised and treated as separate risks by international plant health legislation.


July 7, 2019

Neisseria lactamica Y92-1009 complete genome sequence.

We present the high quality, complete genome assembly of Neisseria lactamica Y92-1009 used to manufacture an outer membrane vesicle (OMV)-based vaccine, and a member of the Neisseria genus. The strain is available on request from the Public Health England Meningococcal Reference Unit. This Gram negative, dipplococcoid bacterium is an organism of worldwide clinical interest because human nasopharyngeal carriage is related inversely to the incidence of meningococcal disease, caused by Neisseria meningitidis. The organism sequenced was isolated during a school carriage survey in Northern Ireland in 1992 and has been the subject of a variety of laboratory and clinical studies. Four SMRT cells on a RSII machine by Pacific Biosystems were used to produce a complete, closed genome assembly. Sequence data were obtained for a total of 30,180,391 bases from 2621 reads and assembled using the HGAP algorithm. The assembly was corrected using short reads obtained from an Illumina HiSeq 2000instrument. This resulted in a 2,146,723 bp assembly with approximately 460 fold mean coverage depth and a GC ratio of 52.3%.


July 7, 2019

A case of decreased susceptibility to ceftriaxone in Neisseria gonorrhoeae in the absence of a mosaic penicillin-binding protein 2 (penA) allele.

We report a case of Neisseria gonorrhoeae with a non-mosaic penA allele that exhibited decreased susceptibility to extended-spectrum cephalosporins, including a ceftriaxone minimum inhibitory concentration of 0.5 µg/mL. An analysis of resistance determinants suggested that the observed phenotype might have resulted from the combined effects of mutations in multiple genes.


July 7, 2019

Improved annotation of the insect vector of citrus greening disease: biocuration by a diverse genomics community.

The Asian citrus psyllid (Diaphorina citri Kuwayama) is the insect vector of the bacterium Candidatus Liberibacter asiaticus (CLas), the pathogen associated with citrus Huanglongbing (HLB, citrus greening). HLB threatens citrus production worldwide. Suppression or reduction of the insect vector using chemical insecticides has been the primary method to inhibit the spread of citrus greening disease. Accurate structural and functional annotation of the Asian citrus psyllid genome, as well as a clear understanding of the interactions between the insect and CLas, are required for development of new molecular-based HLB control methods. A draft assembly of the D. citri genome has been generated and annotated with automated pipelines. However, knowledge transfer from well-curated reference genomes such as that of Drosophila melanogaster to newly sequenced ones is challenging due to the complexity and diversity of insect genomes. To identify and improve gene models as potential targets for pest control, we manually curated several gene families with a focus on genes that have key functional roles in D. citri biology and CLas interactions. This community effort produced 530 manually curated gene models across developmental, physiological, RNAi regulatory and immunity-related pathways. As previously shown in the pea aphid, RNAi machinery genes putatively involved in the microRNA pathway have been specifically duplicated. A comprehensive transcriptome enabled us to identify a number of gene families that are either missing or misassembled in the draft genome. In order to develop biocuration as a training experience, we included undergraduate and graduate students from multiple institutions, as well as experienced annotators from the insect genomics research community. The resulting gene set (OGS v1.0) combines both automatically predicted and manually curated gene models.


July 7, 2019

Complete genome sequence of the sand-sediment actinobacterium Nocardioides dokdonensis FR1436(T).

Nocardioides dokdonensis, belonging to the class Actinobacteria, was first isolated from sand sediment of a beach in Dokdo, Korea, in 2005. In this study, we determined the genome sequence of FR1436, the type strain of N. dokdonensis, and analyzed its gene contents. The genome sequence is the second complete one in the genus Nocardioides after that of Nocardioides sp. JS614. It is composed of a 4,376,707-bp chromosome with a G + C content of 72.26%. From the genome sequence, 4,104 CDSs, three rRNA operons, 51 tRNAs, and one tmRNA were predicted, and 71.38% of the genes were assigned putative functions. Through the sequence analysis, dozens of genes involved in steroid metabolism, especially its degradation, were detected. Most of the identified genes were located in large gene clusters, which showed high similarities with the gene clusters in Pimelobacter simplex VKM Ac-2033D. Genomic features of N. dokdonensis associated with steroid catabolism indicate that it could be used for research and application of steroids in science and industry.


July 7, 2019

Novel urease-negative Helicobacter sp. ‘H. enhydrae sp. nov.’ isolated from inflamed gastric tissue of southern sea otters.

A total of 31 sea otters Enhydra lutris nereis found dead or moribund (and then euthanized) were necropsied in California, USA. Stomach biopsies were collected and transected with equal portions frozen or placed in formalin and analyzed histologically and screened for Helicobacter spp. in gastric tissue. Helicobacter spp. were isolated from 9 sea otters (29%); 58% (18 of 31) animals were positive for helicobacter by PCR. The Helicobacter sp. was catalase- and oxidase-positive and urease-negative. By electron microscopy, the Helicobacter sp. had lateral and polar sheathed flagella and had a slightly curved rod morphology. 16S and 23S rRNA sequence analyses of all ‘H. enhydrae’ isolates had similar sequences, which clustered as a novel Helicobacter sp. closely related to H. mustelae (96-97%). The genome sequence of isolate MIT 01-6242 was assembled into a single ~1.6 Mb long contig with a 40.8% G+C content. The annotated genome contained 1699 protein-coding sequences and 43 RNAs, including 65 genes homologous to known Helicobacter spp. and Campylobacter spp. virulence factors. Histological changes in the gastric tissues extended from mild cystic degeneration of gastric glands to severe mucosal erosions and ulcers. Silver stains of infected tissues demonstrated slightly curved bacterial rods at the periphery of the gastric ulcers and on the epithelial surface of glands. The underlying mucosa and submucosa were infiltrated by low numbers of neutrophils, macrophages, and lymphocytes, with occasional lymphoid aggregates and well-defined lymphoid follicles. This is the second novel Helicobacter sp., which we have named ‘H. enhydrae’, isolated from inflamed stomachs of mustelids, the first being H. mustelae from a ferret.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.