Menu
July 7, 2019

Interpreting whole-genome sequence analyses of foodborne bacteria for regulatory applications and outbreak investigations.

Whole-genome sequence (WGS) analysis has revolutionized the food safety industry by enabling high-resolution typing of foodborne bacteria. Higher resolving power allows investigators to identify origins of contamination during illness outbreaks and regulatory activities quickly and accurately. Government agencies and industry stakeholders worldwide are now analyzing WGS data routinely. Although researchers have published many studies that assess the efficacy of WGS data analysis for source attribution, guidance for interpreting WGS analyses is lacking. Here, we provide the framework for interpreting WGS analyses used by the Food and Drug Administration’s Center for Food Safety and Applied Nutrition (CFSAN). We based this framework on the experiences of CFSAN investigators, collaborations and interactions with government and industry partners, and evaluation of the published literature. A fundamental question for investigators is whether two or more bacteria arose from the same source of contamination. Analysts often count the numbers of nucleotide differences [single-nucleotide polymorphisms (SNPs)] between two or more genome sequences to measure genetic distances. However, using SNP thresholds alone to assess whether bacteria originated from the same source can be misleading. Bacteria that are isolated from food, environmental, or clinical samples are representatives of bacterial populations. These populations are subject to evolutionary forces that can change genome sequences. Therefore, interpreting WGS analyses of foodborne bacteria requires a more sophisticated approach. Here, we present a framework for interpreting WGS analyses that combines SNP counts with phylogenetic tree topologies and bootstrap support. We also clarify the roles of WGS, epidemiological, traceback, and other evidence in forming the conclusions of investigations. Finally, we present examples that illustrate the application of this framework to real-world situations.


July 7, 2019

Complete genome sequence of industrial biocontrol strain Paenibacillus polymyxa HY96-2 and further analysis of Its biocontrol mechanism.

Paenibacillus polymyxa (formerly known as Bacillus polymyxa) has been extensively studied for agricultural applications as a plant-growth-promoting rhizobacterium and is also an important biocontrol agent. Our team has developed the P. polymyxa strain HY96-2 from the tomato rhizosphere as the first microbial biopesticide based on P. polymyxa for controlling plant diseases around the world, leading to the commercialization of this microbial biopesticide in China. However, further research is essential for understanding its precise biocontrol mechanisms. In this paper, we report the complete genome sequence of HY96-2 and the results of a comparative genomic analysis between different P. polymyxa strains. The complete genome size of HY96-2 was found to be 5.75 Mb and 5207 coding sequences were predicted. HY96-2 was compared with seven other P. polymyxa strains for which complete genome sequences have been published, using phylogenetic tree, pan-genome, and nucleic acid co-linearity analysis. In addition, the genes and gene clusters involved in biofilm formation, antibiotic synthesis, and systemic resistance inducer production were compared between strain HY96-2 and two other strains, namely, SC2 and E681. The results revealed that all three of the P. polymyxa strains have the ability to control plant diseases via the mechanisms of colonization (biofilm formation), antagonism (antibiotic production), and induced resistance (systemic resistance inducer production). However, the variation of the corresponding genes or gene clusters between the three strains may lead to different antimicrobial spectra and biocontrol efficacies. Two possible pathways of biofilm formation in P. polymyxa were reported for the first time after searching the KEGG database. This study provides a scientific basis for the further optimization of the field applications and quality standards of industrial microbial biopesticides based on HY96-2. It may also serve as a reference for studying the differences in antimicrobial spectra and biocontrol capability between different biocontrol agents.


July 7, 2019

Characterization and genome analysis of a phthalate esters-degrading strain Sphingobium yanoikuyae SHJ.

A bacterium capable of utilizing dimethyl phthalate (DMP), diethyl phthalate (DEP), di-n-butyl phthalate (DBP), and diisobuthyl phthalate (DIBP) as the sole carbon and energy source was isolated from shallow aquifer sediments. The strain was identified as Sphingobium yanoikuyae SHJ based on morphological characteristics, 16S rDNA gene phylogeny, and whole genome average nucleotide identity (ANI). The degradation half-life of DBP with substrate concentration of 8.5 and 50.0 mg/L by strain SHJ was 99.7 and 101.4 hours, respectively. The optimum degradation rate of DBP by SHJ was observed at 30°C and weak alkaline (pH 7.5). Genome sequence of the strain SHJ showed a circular chromosome and additional two circular plasmids with whole genome size of 5,669,383 bp and GC content of 64.23%. Functional annotation of SHJ revealed a total of 5,402 genes, with 5,183 protein-encoding genes, 143 pseudogenes, and 76 noncoding RNA genes. Based on genome annotation, 44 genes were identified to be involved in PAEs hydrolysis potentially. Besides, a region with size of about 6.9 kb comprised of seven ORFs, which is located on the smaller plasmid pSES189, was presumed to be responsible for the biodegradation of phthalate. These results provide insights into the genetic basis of DBP biodegradation in this strain.


July 7, 2019

Genomic characterization of methylotrophy of Oharaeibacter diazotrophicus strain SM30T.

Oharaeibacter diazotrophicus strain SM30T, isolated from rice rhizosphere, is an aerobic, facultative lanthanide (Ln3+)-utilizing methylotroph and diazotroph that belongs to the Methylocystaceae family. In this research, the complete genome sequence of strain SM30T was determined, and its methylotrophy modules were characterized. The genome consists of one chromosome and two plasmids, comprising a total of 5,004,097 bp, and the GC content was 71.6 mol%. A total of 4497 CDSs, 67 tRNA, and 9 rRNA were encoded. Typical alpha-proteobacterial methylotrophy genes were found: pyrroloquinoline quinone (PQQ)-dependent methanol dehydrogenase (MDH) (mxaF and xoxF1-4), methylotrophy regulatory proteins (mxbDM and mxcQE), PQQ synthesis, H4F pathway, H4MPT pathway, formate oxidation, serine cycle, and ethylmalonyl-CoA pathway. SDS-PAGE and subsequent LC-MS analysis, and qPCR analysis revealed that MxaF and XoxF1 were the dominant MDH in the absence or presence of lanthanum (La3+), respectively. The growth of MDH gene-deletion mutants on alcohols and qPCR results indicated that mxaF and xoxF1 are also involved in ethanol and propanol oxidation, xoxF2 participates in methanol oxidation in the presence of La3+, while xoxF3 was associated with methanol and ethanol oxidation in the absence of La3+, implying that XoxF3 is a calcium (Ca2+)-binding XoxF. Four Ln3+ such as La3+, cerium (Ce3+), praseodymium (Pr3+), and neodymium (Nd3+) served as cofactors for XoxF1 by supporting ?mxaF growth on methanol. Some heavier lanthanides inhibited growth of SM30 on methanol. This study contributes to the understanding of the function of various XoxF-type MDHs and their roles in methylotrophs. Copyright © 2018 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.


July 7, 2019

The complete genome sequence of Rhodobaca barguzinensis alga05 (DSM 19920) documents its adaptation for life in soda lakes.

Soda lakes, with their high salinity and high pH, pose a very challenging environment for life. Microorganisms living in these harsh conditions have had to adapt their physiology and gene inventory. Therefore, we analyzed the complete genome of the haloalkaliphilic photoheterotrophic bacterium Rhodobaca barguzinensis strain alga05. It consists of a 3,899,419 bp circular chromosome with 3624 predicted coding sequences. In contrast to most of Rhodobacterales, this strain lacks any extrachromosomal elements. To identify the genes responsible for adaptation to high pH, we compared the gene inventory in the alga05 genome with genomes of 17 reference strains belonging to order Rhodobacterales. We found that all haloalkaliphilic strains contain the mrpB gene coding for the B subunit of the MRP Na+/H+ antiporter, while this gene is absent in all non-alkaliphilic strains, which indicates its importance for adaptation to high pH. Further analysis showed that alga05 requires organic carbon sources for growth, but it also contains genes encoding the ethylmalonyl-CoA pathway for CO2 fixation. Remarkable is the genetic potential to utilize organophosphorus compounds as a source of phosphorus. In summary, its genetic inventory indicates a large flexibility of the alga05 metabolism, which is advantageous in rapidly changing environmental conditions in soda lakes.


July 7, 2019

Complete genome sequence of Clostridium kluyveri JZZ applied in Chinese strong-flavor liquor production.

Chinese strong-flavor liquor (CSFL), accounting for more than 70% of both Chinese liquor production and sales, was produced by complex fermentation with pit mud. Clostridium kluyveri, an important species coexisted with other microorganisms in fermentation pit mud (FPM), could produce caproic acid, which was subsequently converted to the key CSFL flavor substance ethyl caproate. In this study, we present the first complete genome sequence of C. kluyveri isolated from FPM. Clostridium kluyveri JZZ contains one circular chromosome and one circular plasmid with length of 4,454,353 and 58,581 bp, respectively. 4158 protein-coding genes were predicted and 2792 genes could be assigned with COG categories. It possesses the pathway predicted for biosynthesis of caproic acid with ethanol. Compared to other two C. kluyveri genomes, JZZ consists of longer chromosome with multiple gene rearrangements, and contains more genes involved in defense mechanisms, as well as DNA replication, recombination, and repair. Meanwhile, JZZ contains fewer genes involved in secondary metabolites biosynthesis, transport, and catabolism, including genes encoding Polyketide Synthases/Non-ribosomal Peptide Synthetases. Additionally, JZZ possesses 960 unique genes with relatively aggregating in defense mechanisms and transcription. Our study will be available for further research about C. kluyveri isolated from FPM, and will also facilitate the genetic engineering to increase biofuel production and improve fragrance flavor of CSFL.


July 7, 2019

Complete genome sequence of Rhodothermaceae bacterium RA with cellulolytic and xylanolytic activities.

Rhodothermaceae bacterium RA is a halo-thermophile isolated from a saline hot spring. Previously, the genome of this bacterium was sequenced using a HiSeq 2500 platform culminating in 91 contigs. In this report, we report on the resequencing of its complete genome using a PacBio RSII platform. The genome has a GC content of 68.3%, is 4,653,222 bp in size, and encodes 3711 genes. We are interested in understanding the carbohydrate metabolic pathway, in particular the lignocellulosic biomass degradation pathway. Strain RA harbors 57 glycosyl hydrolase (GH) genes that are affiliated with 30 families. The bacterium consists of cellulose-acting (GH 3, 5, 9, and 44) and hemicellulose-acting enzymes (GH 3, 10, and 43). A crude cell-free extract of the bacterium exhibited endoglucanase, xylanase, ß-glucosidase, and ß-xylosidase activities. The complete genome information coupled with biochemical assays confirms that strain RA is able to degrade cellulose and xylan. Therefore, strain RA is another excellent member of family Rhodothermaceae as a repository of novel and thermostable cellulolytic and hemicellulolytic enzymes.


July 7, 2019

Complete genome sequence of the halophile bacterium Kushneria konosiri X49T, isolated from salt-fermented Konosirus punctatus

Kushneria konosiri X49T is a member of the Halomonadaceae family within the order Oceanospirillales and can be isolated from salt-fermented larval gizzard shad. The genome of K. konosiri X49T reported here provides a genetic basis for its halophilic character. Diverse genes were involved in salt-in and -out strategies enabling adaptation of X49T to hypersaline environments. Due to resistance to high salt concentrations, genome research of K. konosiri X49T will contribute to the improvement of environmental and biotechnological usage by enhancing understanding of the osmotic equilibrium in the cytoplasm. Its genome consists of 3,584,631 bp, with an average Gthinspace+thinspaceC content of 59.1%, and 3261 coding sequences, 12 rRNAs, 66 tRNAs, and 8 miscRNAs.


July 7, 2019

Complete genome sequence of Kocuria rhizophila BT304, isolated from the small intestine of castrated beef cattle.

Members of the species Kocuria rhizophila, belonging to the family Micrococcaceae in the phylum Actinobacteria, have been isolated from a wide variety of natural sources, such as soil, freshwater, fish gut, and clinical specimens. K. rhizophila is important from an industrial viewpoint, because the bacterium grows rapidly with high cell density and exhibits robustness at various growth conditions. However, the bacterium is an opportunistic pathogen involved in human infections. Here, we sequenced and analyzed the genome of the K. rhizophila strain BT304, isolated from the small intestine of adult castrated beef cattle.The genome of K. rhizophila BT304 consisted of a single circular chromosome of 2,763,150 bp with a GC content of 71.2%. The genome contained 2359 coding sequences, 51 tRNA genes, and 9 rRNA genes. Sequence annotations with the RAST server revealed many genes related to amino acid, carbohydrate, and protein metabolism. Moreover, the genome contained genes related to branched chain amino acid biosynthesis and degradation. Analysis of the OrthoANI values revealed that the genome has high similarity (>?97.8%) with other K. rhizophila strains, such as DC2201, FDAARGOS 302, and G2. Comparative genomic analysis further revealed that the antibiotic properties of K. rhizophila vary among the strains.The relatively small number of virulence-related genes and the great potential in production of host available nutrients suggest potential application of the BT304 strain as a probiotic in breeding beef cattle.


July 7, 2019

Complete genome sequence of the Robinia pseudoacacia L. symbiont Mesorhizobium amorphae CCNWGS0123.

Mesorhizobium amorphae CCNWGS0123 was isolated in 2006, from effective nodules of Robinia pseudoacacia L. grown in lead-zinc mine tailing site, in Gansu Province, China. M. amorphae CCNWGS0123 is an aerobic, Gram-negative, non-spore-forming rod strain. This paper characterized M. amorphae CCNWGS0123 and presents its complete genome sequence information and genome annotation. The 7,374,589 bp long genome which encodes 7136 protein-coding genes and 63 RNA coding genes, contains one chromosome and four plasmids. Moreover, a chromosome with no gaps was assembled.


July 7, 2019

STRetch: detecting and discovering pathogenic short tandem repeat expansions.

Short tandem repeat (STR) expansions have been identified as the causal DNA mutation in dozens of Mendelian diseases. Most existing tools for detecting STR variation with short reads do so within the read length and so are unable to detect the majority of pathogenic expansions. Here we present STRetch, a new genome-wide method to scan for STR expansions at all loci across the human genome. We demonstrate the use of STRetch for detecting STR expansions using short-read whole-genome sequencing data at known pathogenic loci as well as novel STR loci. STRetch is open source software, available from github.com/Oshlack/STRetch .


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.