Menu
September 22, 2019

Integrating long-range connectivity information into de Bruijn graphs.

The de Bruijn graph is a simple and efficient data structure that is used in many areas of sequence analysis including genome assembly, read error correction and variant calling. The data structure has a single parameter k, is straightforward to implement and is tractable for large genomes with high sequencing depth. It also enables representation of multiple samples simultaneously to facilitate comparison. However, unlike the string graph, a de Bruijn graph does not retain long range information that is inherent in the read data. For this reason, applications that rely on de Bruijn graphs can produce sub-optimal results given their input data.We present a novel assembly graph data structure: the Linked de Bruijn Graph (LdBG). Constructed by adding annotations on top of a de Bruijn graph, it stores long range connectivity information through the graph. We show that with error-free data it is possible to losslessly store and recover sequence from a Linked de Bruijn graph. With assembly simulations we demonstrate that the LdBG data structure outperforms both our de Bruijn graph and the String Graph Assembler (SGA). Finally we apply the LdBG to Klebsiella pneumoniae short read data to make large (12 kbp) variant calls, which we validate using PacBio sequencing data, and to characterize the genomic context of drug-resistance genes.Linked de Bruijn Graphs and associated algorithms are implemented as part of McCortex, which is available under the MIT license at https://github.com/mcveanlab/mccortex.Supplementary data are available at Bioinformatics online.


September 22, 2019

Periodic variation of mutation rates in bacterial genomes associated with replication timing

The causes and consequences of spatiotemporal variation in mutation rates remain to be explored in nearly all organisms. Here we examine relationships between local mutation rates and replication timing in three bacterial species whose genomes have multiple chromosomes: Vibrio fischeri, Vibrio cholerae, and Burkholderia cenocepacia Following five mutation accumulation experiments with these bacteria conducted in the near absence of natural selection, the genomes of clones from each lineage were sequenced and analyzed to identify variation in mutation rates and spectra. In lineages lacking mismatch repair, base substitution mutation rates vary in a mirrored wave-like pattern on opposing replichores of the large chromosomes of V. fischeri and V. cholerae, where concurrently replicated regions experience similar base substitution mutation rates. The base substitution mutation rates on the small chromosome are less variable in both species but occur at similar rates to those in the concurrently replicated regions of the large chromosome. Neither nucleotide composition nor frequency of nucleotide motifs differed among regions experiencing high and low base substitution rates, which along with the inferred ~800-kb wave period suggests that the source of the periodicity is not sequence specific but rather a systematic process related to the cell cycle. These results support the notion that base substitution mutation rates are likely to vary systematically across many bacterial genomes, which exposes certain genes to elevated deleterious mutational load.IMPORTANCE That mutation rates vary within bacterial genomes is well known, but the detailed study of these biases has been made possible only recently with contemporary sequencing methods. We applied these methods to understand how bacterial genomes with multiple chromosomes, like those of Vibrio and Burkholderia, might experience heterogeneous mutation rates because of their unusual replication and the greater genetic diversity found on smaller chromosomes. This study captured thousands of mutations and revealed wave-like rate variation that is synchronized with replication timing and not explained by sequence context. The scale of this rate variation over hundreds of kilobases of DNA strongly suggests that a temporally regulated cellular process may generate wave-like variation in mutation risk. These findings add to our understanding of how mutation risk is distributed across bacterial and likely also eukaryotic genomes, owing to their highly conserved replication and repair machinery. Copyright © 2018 Dillon et al.


September 22, 2019

A reference genome of the Chinese hamster based on a hybrid assembly strategy.

Accurate and complete genome sequences are essential in biotechnology to facilitate genome-based cell engineering efforts. The current genome assemblies for Cricetulus griseus, the Chinese hamster, are fragmented and replete with gap sequences and misassemblies, consistent with most short-read-based assemblies. Here, we completely resequenced C. griseus using single molecule real time sequencing and merged this with Illumina-based assemblies. This generated a more contiguous and complete genome assembly than either technology alone, reducing the number of scaffolds by >28-fold, with 90% of the sequence in the 122 longest scaffolds. Most genes are now found in single scaffolds, including up- and downstream regulatory elements, enabling improved study of noncoding regions. With >95% of the gap sequence filled, important Chinese hamster ovary cell mutations have been detected in draft assembly gaps. This new assembly will be an invaluable resource for continued basic and pharmaceutical research.© 2018 The Authors. Biotechnology and Bioengineering Published by Wiley Periodicals, Inc.


September 22, 2019

Tracing genomic divergence of Vibrio bacteria in the Harveyi clade.

The mechanism of bacterial speciation remains a topic of tremendous interest. To understand the ecological and evolutionary mechanisms of speciation in Vibrio bacteria, we analyzed the genomic dissimilarities between three closely related species in the so-called Harveyi clade of the genus Vibrio, V. campbellii, V. jasicida, and V. hyugaensis The analysis focused on strains isolated from diverse geographic locations over a long period of time. The results of phylogenetic analyses and calculations of average nucleotide identity (ANI) supported the classification of V. jasicida and V. hyugaensis into two species. These analyses also identified two well-supported clades in V. campbellii; however, strains from both clades were classified as members of the same species. Comparative analyses of the complete genome sequences of representative strains from the three species identified higher syntenic coverage between genomes of V. jasicida and V. hyugaensis than that between the genomes from the two V. campbellii clades. The results from comparative analyses of gene content between bacteria from the three species did not support the hypothesis that gene gain and/or loss contributed to their speciation. We also did not find support for the hypothesis that ecological diversification toward associations with marine animals contributed to the speciation of V. jasicida and V. hyugaensis Overall, based on the results obtained in this study, we propose that speciation in Harveyi clade species is a result of stochastic diversification of local populations, which was influenced by multiple evolutionary processes, followed by extinction events.IMPORTANCE To investigate the mechanisms underlying speciation in the genus Vibrio, we provided a well-assembled reference of genomes and performed systematic genomic comparisons among three evolutionarily closely related species. We resolved taxonomic ambiguities and identified genomic features separating the three species. Based on the study results, we propose a hypothesis explaining how species in the Harveyi clade of Vibrio bacteria diversified. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Analysis of the draft genome of the red seaweed Gracilariopsis chorda provides insights into genome size evolution in Rhodophyta.

Red algae (Rhodophyta) underwent two phases of large-scale genome reduction during their early evolution. The red seaweeds did not attain genome sizes or gene inventories typical of other multicellular eukaryotes. We generated a high-quality 92.1 Mb draft genome assembly from the red seaweed Gracilariopsis chorda, including methylation and small (s)RNA data. We analyzed these and other Archaeplastida genomes to address three questions: 1) What is the role of repeats and transposable elements (TEs) in explaining Rhodophyta genome size variation, 2) what is the history of genome duplication and gene family expansion/reduction in these taxa, and 3) is there evidence for TE suppression in red algae? We find that the number of predicted genes in red algae is relatively small (4,803-13,125 genes), particularly when compared with land plants, with no evidence of polyploidization. Genome size variation is primarily explained by TE expansion with the red seaweeds having the largest genomes. Long terminal repeat elements and DNA repeats are the major contributors to genome size growth. About 8.3% of the G. chorda genome undergoes cytosine methylation among gene bodies, promoters, and TEs, and 71.5% of TEs contain methylated-DNA with 57% of these regions associated with sRNAs. These latter results suggest a role for TE-associated sRNAs in RNA-dependent DNA methylation to facilitate silencing. We postulate that the evolution of genome size in red algae is the result of the combined action of TE spread and the concomitant emergence of its epigenetic suppression, together with other important factors such as changes in population size.


September 22, 2019

Genomes of ubiquitous marine and hypersaline Hydrogenovibrio, Thiomicrorhabdus and Thiomicrospira spp. encode a diversity of mechanisms to sustain chemolithoautotrophy in heterogeneous environments.

Chemolithoautotrophic bacteria from the genera Hydrogenovibrio, Thiomicrorhabdus and Thiomicrospira are common, sometimes dominant, isolates from sulfidic habitats including hydrothermal vents, soda and salt lakes and marine sediments. Their genome sequences confirm their membership in a deeply branching clade of the Gammaproteobacteria. Several adaptations to heterogeneous habitats are apparent. Their genomes include large numbers of genes for sensing and responding to their environment (EAL- and GGDEF-domain proteins and methyl-accepting chemotaxis proteins) despite their small sizes (2.1-3.1 Mbp). An array of sulfur-oxidizing complexes are encoded, likely to facilitate these organisms’ use of multiple forms of reduced sulfur as electron donors. Hydrogenase genes are present in some taxa, including group 1d and 2b hydrogenases in Hydrogenovibrio marinus and H. thermophilus MA2-6, acquired via horizontal gene transfer. In addition to high-affinity cbb3 cytochrome c oxidase, some also encode cytochrome bd-type quinol oxidase or ba3 -type cytochrome c oxidase, which could facilitate growth under different oxygen tensions, or maintain redox balance. Carboxysome operons are present in most, with genes downstream encoding transporters from four evolutionarily distinct families, which may act with the carboxysomes to form CO2 concentrating mechanisms. These adaptations to habitat variability likely contribute to the cosmopolitan distribution of these organisms.© 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019

Genetic and biochemical characterization of 5-hydroxypicolinic acid metabolism in Alcaligenes faecalis JQ135.

5-Hydroxypicolinic acid (5HPA), a natural pyridine derivative, is microbially degraded in the environment. However, the physiological, biochemical, and genetic foundations of the 5HPA metabolism remain unknown. In this study, an operon (hpa), responsible for 5HPA degradation, was cloned from Alcaligenes faecalis JQ135. HpaM was a monocomponent FAD-dependent monooxygenase and shared low identity (only 28-31%) with reported monooxygenases. HpaM catalyzed the ortho decarboxylative hydroxylation of 5HPA, generating 2,5-dihydroxypyridine (2,5DHP). The monooxygenase activity of HpaM was FAD and NADH-dependent. The apparent Km values of HpaM for 5HPA and NADH were 45.4 µM and 37.8 µM, respectively. The genes hpaX, hpaD, and hpaF were found to encode 2,5DHP dioxygenase, N-formylmaleamic acid deformylase, and maleamate amidohydrolase, respectively; however, the three genes were not essential for 5HPA degradation in A. faecalis JQ135. Furthermore, the gene maiA, which encodes a maleic acid cis-trans isomerase, was essential for the metabolism of 5HPA, nicotinic acid, and picolinic acid in A. faecalis JQ135, indicating that it might be a key gene in the metabolism of pyridine derivatives. The genes and proteins identified in this study showed a novel degradation mechanism of pyridine derivatives.Importance Unlike the benzene ring, the uneven distribution of the electron density of pyridine ring influences the positional reactivity and the interaction with enzymes, e.g., the ortho and para oxidation are more difficult than the meta oxidations. Hydroxylation is an important oxidation process for the pyridine derivative metabolism. In previous reports, the ortho hydroxylation of pyridine derivatives were catalyzed by multicomponent molybdenum-containing monooxygenases, while the meta hydroxylations were catalyzed by monocomponent FAD-dependent monooxygenases. This study identified the new monocomponent FAD-dependent monooxygenase HpaM that catalyzed the ortho decarboxylative hydroxylation of 5HPA. In addition, we found that the maiA coding for maleic acid cis-trans isomerase was pivotal for the metabolism of 5HPA, nicotinic acid, and picolinic acid in A. faecalis JQ135. This study provides novel insights into the microbial metabolism of pyridine derivatives. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Identification and characterization of conjugative plasmids that encode ciprofloxacin resistance in Salmonella.

This study aimed to characterize novel conjugative plasmids that encode transferrable ciprofloxacin resistance in Salmonella In this study, 157 non-duplicated Salmonella isolates were recovered from food products, 55 out of which were found to be resistant to ciprofloxacin. Interestingly, 37 out of the 55 (67%) CipRSalmonella isolates did not harbor any mutations in the Quinolone resistance determine regions (QRDR). Interestingly, six Salmonella isolates were shown to carry two novel types of conjugative plasmids that could transfer ciprofloxacin resistance phenotype to E. coli J53 (AziR). The first type belonged to the ~110kb IncFIB type conjugative plasmid carrying qnrB-bearing and aac(6′)-Ib-cr-bearing mobile elements. Transfer of the plasmid between E. coli or Salmonella could confer CIP MIC to 1 to 2µg/ml. The second type of conjugative plasmid belonged to ~240kb IncH1/IncF plasmids carrying a single PMQR gene, qnrS Importantly, this type of conjugative ciprofloxacin resistance plasmids could be detected in clinical isolates of Salmonella Dissemination of these conjugative plasmids that confer ciprofloxaicn resistance poses serious public health impact and Salmonella infection control. Copyright © 2018 American Society for Microbiology.


September 22, 2019

The plant growth-promoting rhizobacterium Variovorax boronicumulans CGMCC 4969 regulates the level of indole-3-acetic acid synthesized from indole-3-acetonitrile.

Variovorax is a metabolically diverse genus of plant growth-promoting rhizobacteria (PGPR) that engages in mutually beneficial interactions between plants and microbes. Unlike most PGPR, Variovorax cannot synthesize the phytohormone indole-3-acetic acid (IAA) via tryptophan. However, we found that V. boronicumulans strain CGMCC 4969 could produce IAA using indole-3-acetonitrile (IAN) as the precursor. Thus, in the present study, the IAA synthesis mechanism of V. boronicumulans CGMCC 4969 was investigated. V. boronicumulans CGMCC 4969 metabolized IAN to IAA through both a nitrilase-dependent pathway and a nitrile hydratase (NHase) and amidase-dependent pathway. Cobalt enhanced the metabolic flux via the NHase/amidase, by which IAN was rapidly converted to indole-3-acetamide (IAM) and in turn to IAA. IAN stimulated the metabolic flux via the nitrilase, by which IAN was rapidly converted to IAA. Subsequently, the IAA was degraded. V. boronicumulans CGMCC 4969 could use IAN as the sole carbon and nitrogen source for growth. Genome sequencing confirmed the IAA synthesis pathways. Gene cloning and overexpression in Escherichia coli indicated that NitA has the nitrilase activity, and IamA has the amidase activity to respectively transform IAN and IAM to IAA. Interestingly, NitA showed a close genetic relationship with the nitrilase of the phytopathogen Pseudomonas syringae Quantitative PCR analysis indicated that the NHase/amidase system is constitutively expressed, whereas the nitrilase is inducible. The present study helps our understanding of the versatile functions of Variovorax nitrile-converting enzymes that mediate IAA synthesis and the interactions between plants and these bacteria.IMPORTANCE We demonstrated that Variovorax boronicumulans CGMCC 4969 has two enzymatic systems-nitrilase and nitrile hydratase/amidase-that convert indole-3-acetonitrile (IAN) to the important plant hormone indole-3-acetic acid (IAA). The two IAA synthesis systems have very different regulatory mechanisms, affecting the IAA synthesis rate and duration. The nitrilase was induced by IAN, which was rapidly converted to IAA; subsequently IAA was rapidly consumed for cell growth. The NHase and amidase system was constitutively expressed and slowly but continuously synthesized IAA. In addition to synthesizing IAA from IAN, CGMCC 4969 has a rapid IAA degradation system, which would be helpful for a host plant to eliminate redundant IAA. This study indicates that the plant growth-promoting rhizobacterium V. boronicumulans CGMCC 4969 has the potential to be used by host plants to regulate the IAA level. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Transcriptome analysis of Neisseria gonorrhoeae during natural infection reveals differential expression of antibiotic resistance determinants between men and women.

Neisseria gonorrhoeae is a bacterial pathogen responsible for the sexually transmitted infection gonorrhea. Emergence of antimicrobial resistance (AMR) of N. gonorrhoeae worldwide has resulted in limited therapeutic choices for this infection. Men who seek treatment often have symptomatic urethritis; in contrast, gonococcal cervicitis in women is usually minimally symptomatic, but may progress to pelvic inflammatory disease. Previously, we reported the first analysis of gonococcal transcriptome expression determined in secretions from women with cervical infection. Here, we defined gonococcal global transcriptional responses in urethral specimens from men with symptomatic urethritis and compared these with transcriptional responses in specimens obtained from women with cervical infections and in vitro-grown N. gonorrhoeae isolates. This is the first comprehensive comparison of gonococcal gene expression in infected men and women. RNA sequencing analysis revealed that 9.4% of gonococcal genes showed increased expression exclusively in men and included genes involved in host immune cell interactions, while 4.3% showed increased expression exclusively in women and included phage-associated genes. Infected men and women displayed comparable antibiotic-resistant genotypes and in vitro phenotypes, but a 4-fold higher expression of the Mtr efflux pump-related genes was observed in men. These results suggest that expression of AMR genes is programed genotypically and also driven by sex-specific environments. Collectively, our results indicate that distinct N. gonorrhoeae gene expression signatures are detected during genital infection in men and women. We propose that therapeutic strategies could target sex-specific differences in expression of antibiotic resistance genes.IMPORTANCE Recent emergence of antimicrobial resistance of Neisseria gonorrhoeae worldwide has resulted in limited therapeutic choices for treatment of infections caused by this organism. We performed global transcriptomic analysis of N. gonorrhoeae in subjects with gonorrhea who attended a Nanjing, China, sexually transmitted infection (STI) clinic, where antimicrobial resistance of N. gonorrhoeae is high and increasing. We found that N. gonorrhoeae transcriptional responses to infection differed in genital specimens taken from men and women, particularly antibiotic resistance gene expression, which was increased in men. These sex-specific findings may provide a new approach to guide therapeutic interventions and preventive measures that are also sex specific while providing additional insight to address antimicrobial resistance of N. gonorrhoeae. Copyright © 2018 Nudel et al.


September 22, 2019

Analysis of the Gli-D2 locus identifies a genetic target for simultaneously improving the breadmaking and health-related traits of common wheat.

Gliadins are a major component of wheat seed proteins. However, the complex homoeologous Gli-2 loci (Gli-A2, -B2 and -D2) that encode the a-gliadins in commercial wheat are still poorly understood. Here we analyzed the Gli-D2 locus of Xiaoyan 81 (Xy81), a winter wheat cultivar. A total of 421.091 kb of the Gli-D2 sequence was assembled from sequencing multiple bacterial artificial clones, and 10 a-gliadin genes were annotated. Comparative genomic analysis showed that Xy81 carried only eight of the a-gliadin genes of the D genome donor Aegilops tauschii, with two of them each experiencing a tandem duplication. A mutant line lacking Gli-D2 (DLGliD2) consistently exhibited better breadmaking quality and dough functionalities than its progenitor Xy81, but without penalties in other agronomic traits. It also had an elevated lysine content in the grains. Transcriptome analysis verified the lack of Gli-D2 a-gliadin gene expression in DLGliD2. Furthermore, the transcript and protein levels of protein disulfide isomerase were both upregulated in DLGliD2 grains. Consistent with this finding, DLGliD2 had increased disulfide content in the flour. Our work sheds light on the structure and function of Gli-D2 in commercial wheat, and suggests that the removal of Gli-D2 and the gliadins specified by it is likely to be useful for simultaneously enhancing the end-use and health-related traits of common wheat. Because gliadins and homologous proteins are widely present in grass species, the strategy and information reported here may be broadly useful for improving the quality traits of diverse cereal crops.© 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.


September 22, 2019

Extensive genomic diversity among Mycobacterium marinum strains revealed by whole genome sequencing.

Mycobacterium marinum is the causative agent for the tuberculosis-like disease mycobacteriosis in fish and skin lesions in humans. Ubiquitous in its geographical distribution, M. marinum is known to occupy diverse fish as hosts. However, information about its genomic diversity is limited. Here, we provide the genome sequences for 15 M. marinum strains isolated from infected humans and fish. Comparative genomic analysis of these and four available genomes of the M. marinum strains M, E11, MB2 and Europe reveal high genomic diversity among the strains, leading to the conclusion that M. marinum should be divided into two different clusters, the “M”- and the “Aronson”-type. We suggest that these two clusters should be considered to represent two M. marinum subspecies. Our data also show that the M. marinum pan-genome for both groups is open and expanding and we provide data showing high number of mutational hotspots in M. marinum relative to other mycobacteria such as Mycobacterium tuberculosis. This high genomic diversity might be related to the ability of M. marinum to occupy different ecological niches.


September 22, 2019

Characterization of LE3 and LE4, the only lytic phages known to infect the spirochete Leptospira.

Leptospira is a phylogenetically unique group of bacteria, and includes the causative agents of leptospirosis, the most globally prevalent zoonosis. Bacteriophages in Leptospira are largely unexplored. To date, a genomic sequence is available for only one temperate leptophage called LE1. Here, we sequenced and analysed the first genomes of the lytic phages LE3 and LE4 that can infect the saprophyte Leptospira biflexa using the lipopolysaccharide O-antigen as receptor. Bioinformatics analysis showed that the 48-kb LE3 and LE4 genomes are similar and contain 62% genes whose function cannot be predicted. Mass spectrometry led to the identification of 21 and 23 phage proteins in LE3 and LE4, respectively. However we did not identify significant similarities with other phage genomes. A search for prophages close to LE4 in the Leptospira genomes allowed for the identification of a related plasmid in L. interrogans and a prophage-like region in the draft genome of a clinical isolate of L. mayottensis. Long-read whole genome sequencing of the L. mayottensis revealed that the genome contained a LE4 phage-like circular plasmid. Further isolation and genomic comparison of leptophages should reveal their role in the genetic evolution of Leptospira.


September 22, 2019

Linking genotype and phenotype in an economically viable propionic acid biosynthesis process

Propionic acid (PA) is used as a food preservative and increasingly, as a precursor for the synthesis of monomers. PA is produced mainly through hydrocarboxylation of ethylene, also known as the `oxo-process’; however, Propionibacterium species are promising biological PA producers natively producing PA as their main fermentation product. However, for fermentation to be competitive, a PA yield of at least 0.6 g/g is required.


September 22, 2019

Citrobacter freundii fitness during bloodstream infection.

Sepsis resulting from microbial colonization of the bloodstream is a serious health concern associated with high mortality rates. The objective of this study was to define the physiologic requirements of Citrobacter freundii in the bloodstream as a model for bacteremia caused by opportunistic Gram-negative pathogens. A genetic screen in a murine host identified 177 genes that contributed significantly to fitness, the majority of which were broadly classified as having metabolic or cellular maintenance functions. Among the pathways examined, the Tat protein secretion system conferred the single largest fitness contribution during competition infections and a putative Tat-secreted protein, SufI, was also identified as a fitness factor. Additional work was focused on identifying relevant metabolic pathways for bacteria in the bloodstream environment. Mutations that eliminated the use of glucose or mannitol as carbon sources in vitro resulted in loss of fitness in the murine model and similar results were obtained upon disruption of the cysteine biosynthetic pathway. Finally, the conservation of identified fitness factors was compared within a cohort of Citrobacter bloodstream isolates and between Citrobacter and Serratia marcescens, the results of which suggest the presence of conserved strategies for bacterial survival and replication in the bloodstream environment.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.