September 22, 2019

Comparative genomics of degradative Novosphingobium strains with special reference to the microcystin-degrading Novosphingobium sp. THN1

Bacteria in the genus Novosphingobium associated with biodegradation of substrates are prevalent in environments such as lakes, soil, sea, wood and sediments. To better understand the characteristics linked to their wide distribution and metabolic versatility, we report the whole genome sequence of Novosphingobium sp. THN1, a microcystin-degrading strain previously isolated by Jiang et al. (2011) from cyanobacteria-blooming water samples from Lake Taihu, China. We performed a genomic comparison analysis of Novosphingobium sp. THN1 with 21 other degradative Novosphingobium strains downloaded from GenBank. Phylogenetic trees were constructed using 16S rRNA genes, core genes, protein-coding sequences, and average nucleotide identity of whole genomes. Orthologous protein analysis showed that the 22 genomes contained 674 core genes and each strain contained a high proportion of distributed genes that are shared by a subset of strains. Inspection of their genomic plasticity revealed a high number of insertion sequence elements and genomic islands that were distributed on both chromosomes and plasmids. We also compared the predicted functional profiles of the Novosphingobium protein-coding genes. The flexible genes and all protein-coding genes produced the same heatmap clusters. The COG annotations were used to generate a dendrogram correlated with the compounds degraded. Furthermore, the metabolic profiles predicted from KEGG pathways showed that the majority of genes involved in central carbon metabolism, nitrogen, phosphate, sulfate metabolism, energy metabolism and cell mobility (above 62.5%) are located on chromosomes. In contrast, a substantial share of genes involved in degradation pathways (21–50%) are located on plasmids. The abundance and distribution of aromatics-degradative mono- and dioxygenases varied among the 22 Novosphingobium strains. Comparative analysis of the microcystin-degrading mlr gene cluster provided evidence for horizontal acquisition of this cluster. The Novosphingobium sp. THN1 genome sequence contained all the functional genes crucial for microcystin degradation and the mlr gene cluster shared high sequence similarity (≥85%) with the sequences of other microcystin-degrading genera isolated from cyanobacteria-blooming water. Our results indicate that Novosphingobium species have high genomic and functional plasticity, rearranging their genomes according to environment variations and shaping their metabolic profiles by the substrates they are exposed to, to better adapt to their environments.
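The split into core genes (present in every genome) and distributed genes (shared by a subset of strains) can be illustrated with a minimal sketch. It assumes ortholog groups have already been assigned per strain. The function name `partition_pangenome`, the strain labels and the group identifiers are hypothetical and are not taken from the study.

```python
from collections import Counter


def partition_pangenome(presence):
    """Split ortholog groups into core, distributed and strain-unique sets.

    presence: dict mapping a strain name to the set of ortholog-group ids
    found in that strain (hypothetical input format for illustration).
    """
    n_strains = len(presence)
    # Count in how many strains each ortholog group occurs.
    counts = Counter(g for groups in presence.values() for g in groups)
    core = {g for g, c in counts.items() if c == n_strains}
    unique = {g for g, c in counts.items() if c == 1 and n_strains > 1}
    distributed = set(counts) - core - unique
    return core, distributed, unique


# Toy data: three made-up strains, five made-up ortholog groups.
toy = {
    "strainA": {"og1", "og2", "og3"},
    "strainB": {"og1", "og2", "og4"},
    "strainC": {"og1", "og3", "og5"},
}
core, distributed, unique = partition_pangenome(toy)
print(sorted(core), sorted(distributed), sorted(unique))
```

With real data the same partition would be computed over all 22 genomes, where the abstract reports 674 core genes.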


September 22, 2019

Identification of the KPC plasmid pCT-KPC334: New insights on the evolutionary pathway of epidemic plasmids harboring fosA3-blaKPC-2 genes.

A novel, non-conjugative plasmid pKP1034 isolated from a fosfomycin-resistant, carbapenemase-producing Klebsiella pneumoniae strain KP1034 was recently reported to carry fosA3, blaKPC-2, blaCTX-M-65, blaSHV-12 and rmtB genes, and was hypothesized to have evolved from several recombination events of two closely related plasmids, pHN7A8 and pKPC-LK30 [1]. In this study, a plasmid pCT-KPC334 carrying fosA3, blaKPC-2, blaCTX-M-65, blaSHV-12, blaTEM-1, and rmtB genes was identified, providing evidence for the evolutionary pathway of plasmids harboring fosA3-blaKPC-2 genes.


September 22, 2019

Extraordinary genome instability and widespread chromosome rearrangements during vegetative growth

The haploid genome of the pathogenic fungus Zymoseptoria tritici is contained on “core” and “accessory” chromosomes. While 13 core chromosomes are found in all strains, as many as eight accessory chromosomes show presence/absence variation and rearrangements among field isolates. The factors influencing these presence/absence polymorphisms are so far unknown. We investigated chromosome stability using experimental evolution, karyotyping, and genome sequencing. We report extremely high and variable rates of accessory chromosome loss during mitotic propagation in vitro and in planta. Spontaneous chromosome loss was observed in 2 to >50% of cells during 4 weeks of incubation. Similar rates of chromosome loss in the closely related Zymoseptoria ardabiliae suggest that this extreme chromosome dynamic is a conserved phenomenon in the genus. Elevating the incubation temperature greatly increases instability of accessory and even core chromosomes, causing severe rearrangements involving telomere fusion and chromosome breakage. Chromosome losses do not affect the fitness of Zymoseptoria tritici in vitro, but some lead to increased virulence, suggesting an adaptive role of this extraordinary chromosome instability. Copyright © 2018 by the Genetics Society of America.


September 22, 2019

Generic accelerated sequence alignment in SeqAn using vectorization and multi-threading.

Pairwise sequence alignment is undoubtedly a central tool in many bioinformatics analyses. In this paper, we present a generically accelerated module for pairwise sequence alignments applicable for a broad range of applications. In our module, we unified the standard dynamic programming kernel used for pairwise sequence alignments and extended it with a generalized inter-sequence vectorization layout, such that many alignments can be computed simultaneously by exploiting SIMD (single instruction multiple data) instructions of modern processors. We then extended the module by adding two layers of thread-level parallelization, where we (a) distribute many independent alignments on multiple threads and (b) inherently parallelize a single alignment computation using a work stealing approach producing a dynamic wavefront progressing along the minor diagonal. We evaluated our alignment vectorization and parallelization on different processors, including the newest Intel® Xeon® (Skylake) and Intel® Xeon Phi™ (KNL) processors, and use cases. The instruction set AVX512-BW (Byte and Word), available on Skylake processors, can genuinely improve the performance of vectorized alignments. We could run single alignments 1600 times faster on the Xeon Phi™ and 1400 times faster on the Xeon® than executing them with our previous sequential alignment module. The module is programmed in C++ using the SeqAn (Reinert et al., 2017) library and distributed with version 2.4 under the BSD license. We support SSE4, AVX2, AVX512 instructions and included UME::SIMD, a SIMD-instruction wrapper library, to extend our module for further instruction sets. We thoroughly test all alignment components with all major C++ compilers on various platforms. Supplementary data are available at Bioinformatics online.
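For readers unfamiliar with the dynamic programming kernel mentioned above, the following is a minimal scalar sketch of a global alignment score computation with a linear gap penalty. It is not SeqAn code and does not use SIMD or threads. The function name and the scoring values (match 1, mismatch -1, gap -1) are assumptions chosen for illustration.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Return the optimal global alignment score of strings a and b.

    Plain scalar dynamic programming with a linear gap penalty, keeping
    only two rows of the DP matrix in memory.
    """
    # Row 0: aligning the empty prefix of a against prefixes of b.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            curr[j] = max(diag, up, left)
        prev = curr
    return prev[-1]


print(global_alignment_score("ACGT", "AGT"))
```

Each cell depends only on its left, upper and upper-left neighbours, so cells on the same minor diagonal are independent of one another. This is the property that makes a wavefront parallelization of a single alignment possible, and it is also why many independent alignments can be packed into SIMD lanes.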


September 22, 2019

Development and validation of 58K SNP-array and high-density linkage map in Nile tilapia (O. niloticus).

Despite being the second most important aquaculture species in the world, accounting for 7.4% of global production in 2015, tilapia aquaculture has lacked genomic tools like SNP-arrays and high-density linkage maps to improve selection accuracy and accelerate genetic progress. In this paper, we describe the development of a genotyping array containing more than 58,000 SNPs for Nile tilapia (Oreochromis niloticus). SNPs were identified from whole genome resequencing of 32 individuals from the commercial population of the Genomar strain, and were selected for the SNP-array based on polymorphic information content and physical distribution across the genome using the Orenil1.1 genome assembly as reference sequence. SNP-performance was evaluated by genotyping 4991 individuals, including 689 offspring belonging to 41 full-sib families, which revealed high-quality genotype data for 43,588 SNPs. A preliminary genetic linkage map was constructed using Lepmap2 which in turn was integrated with information from the O_niloticus_UMD1 genome assembly to produce an integrated physical and genetic linkage map comprising 40,186 SNPs distributed across 22 linkage groups (LGs). Around one-third of the LGs showed a different recombination rate between sexes, with the female map being longer than the male map by a factor of 1.2 (1632.9 to 1359.6 cM, respectively), with most LGs displaying a sigmoid recombination profile. Finally, the sex-determining locus was mapped to position 40.53 cM on LG23, in the vicinity of the anti-Müllerian hormone (amh) gene. These new resources have the potential to greatly influence and improve the genetic gain when applying genomic selection and surpass the difficulties of efficient selection for invasively measured traits in Nile tilapia.


September 22, 2019

Genome plasticity of agr-defective Staphylococcus aureus during clinical infection.

Therapy for bacteremia caused by Staphylococcus aureus is often ineffective, even when treatment conditions are optimal according to experimental protocols. Adapted subclones, such as those bearing mutations that attenuate agr-mediated virulence activation, are associated with persistent infection and patient mortality. To identify additional alterations in agr-defective mutants, we sequenced and assembled the complete genomes of clone pairs from colonizing and infected sites of several patients in whom S. aureus demonstrated a within-host loss of agr function. We report that events associated with agr inactivation result in agr-defective blood and nares strain pairs that are enriched in mutations compared to pairs from wild-type controls. The random distribution of mutations between colonizing and infecting strains from the same patient, and between strains from different patients, suggests that much of the genetic complexity of agr-defective strains results from prolonged infection or therapy-induced stress. However, in one of the agr-defective infecting strains, multiple genetic changes resulted in increased virulence in a murine model of bloodstream infection, bypassing the mutation of agr and raising the possibility that some changes were selected. Expression profiling correlated the elevated virulence of this agr-defective mutant to restored expression of the agr-regulated ESAT6-like type VII secretion system, a known virulence factor. Thus, additional mutations outside the agr locus can contribute to diversification and adaptation during infection by S. aureus agr mutants associated with poor patient outcomes. Copyright © 2018 Altman et al.


September 22, 2019

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Coenurosis, caused by the larval coenurus of the tapeworm Taenia multiceps, is a fatal central nervous system disease in both sheep and humans. Though treatment and prevention options are available, the control of coenurosis still presents great challenges. Here, we present a high-quality genome sequence of T. multiceps in which 240 Mb (96%) of the genome has been successfully assembled using PacBio single-molecule real-time (SMRT) and Hi-C data with an N50 length of 44.8 Mb. In total, 49.5 Mb (20.6%) repeat sequences and 13,013 gene models were identified. We found that Taenia spp. have an expansion of transposable elements and recent small-scale gene duplications following the divergence of Taenia from Echinococcus, but not in Echinococcus genomes, and the genes underlying environmental adaptability and dosage effect tend to be over-retained in the T. multiceps genome. Moreover, we identified several genes encoding proteins involved in proglottid formation and interactions with the host central nervous system, which may contribute to the adaptation of T. multiceps to its parasitic life style. Our study not only provides insights into the biology and evolution of T. multiceps, but also identifies a set of species-specific gene targets for developing novel treatment and control tools for coenurosis.
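As background for the N50 figure quoted above, the statistic can be computed as in the following minimal sketch. The function name `n50` and the toy lengths are illustrative and are not data from the T. multiceps assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembled length.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("n50 requires at least one sequence length")


# Toy scaffold lengths (arbitrary units), not taken from the paper.
print(n50([2, 2, 2, 3, 3, 4, 8, 8]))
```

A large N50 relative to the assembly size, as reported here (44.8 Mb for a 240 Mb assembly), indicates that most of the assembled sequence sits in long scaffolds.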


September 22, 2019

Variation graph toolkit improves read mapping by representing genetic variation in the reference.

Reference genomes guide our interpretation of DNA sequence data. However, conventional linear references represent only one version of each locus, ignoring variation in the population. Poor representation of an individual’s genome sequence impacts read mapping and introduces bias. Variation graphs are bidirected DNA sequence graphs that compactly represent genetic variation across a population, including large-scale structural variation such as inversions and duplications. Previous graph genome software implementations have been limited by scalability or topological constraints. Here we present vg, a toolkit of computational methods for creating, manipulating, and using these structures as references at the scale of the human genome. vg provides an efficient approach to mapping reads onto arbitrary variation graphs using generalized compressed suffix arrays, with improved accuracy over alignment to a linear reference, and effectively removing reference bias. These capabilities make using variation graphs as references for DNA sequencing practical at a gigabase scale, or at the topological complexity of de novo assemblies.
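The idea of a bidirected sequence graph can be made concrete with a toy sketch. This is not vg's data model or API. The node ids, sequences, the `(node_id, is_reverse)` step convention and the function names are all assumptions made for illustration. A path is a list of oriented node visits, and a node visited in reverse contributes its reverse complement, which is how an inversion can be represented without duplicating sequence.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string over ACGT."""
    return seq.translate(COMPLEMENT)[::-1]


def spell_path(nodes, edges, path):
    """Spell the DNA sequence of a path through a toy bidirected graph.

    nodes: dict node_id -> sequence
    edges: set of ((from_id, from_is_reverse), (to_id, to_is_reverse))
    path:  list of (node_id, is_reverse) steps
    """
    for step, nxt in zip(path, path[1:]):
        if (step, nxt) not in edges:
            raise ValueError(f"no edge from {step} to {nxt}")
    return "".join(
        reverse_complement(nodes[n]) if rev else nodes[n] for n, rev in path
    )


# Toy graph: node 2 and node 3 are alternative alleles between 1 and 4,
# and node 2 can also be traversed in reverse (an inversion).
nodes = {1: "GATT", 2: "AG", 3: "C", 4: "ACA"}
F, R = False, True
edges = {
    ((1, F), (2, F)), ((2, F), (4, F)),  # reference allele
    ((1, F), (3, F)), ((3, F), (4, F)),  # alternative allele
    ((1, F), (2, R)), ((2, R), (4, F)),  # inverted traversal of node 2
}
reference = spell_path(nodes, edges, [(1, F), (2, F), (4, F)])
alternative = spell_path(nodes, edges, [(1, F), (3, F), (4, F)])
inverted = spell_path(nodes, edges, [(1, F), (2, R), (4, F)])
print(reference, alternative, inverted)
```

A read carrying the alternative allele matches a path in this graph exactly, whereas against the linear reference alone it would show a mismatch. This is the source of the reference bias the abstract refers to.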


September 22, 2019

Complete Genome Sequence of Bacillus sp. SJ-10 (KCCM 90078) Producing 400-kDa Poly-γ-glutamic Acid.

Bacillus sp. SJ-10 (KCCM 90078, JCM 15709) is a halotolerant bacterium isolated from a traditional Korean food, i.e., salt-fermented fish (jeotgal). The bacterium can survive and engage in metabolism at high salt concentrations. Here, we report the complete genome sequence of Bacillus sp. SJ-10, which has a single circular chromosome of 4,041,649 base pairs with a guanine-cytosine content of 46.39%. Bacillus sp. SJ-10 encodes a subunit of poly-γ-glutamic acid (γ-PGA) with a molecular weight of approximately 400 kDa, which contains four γ-PGA synthases (pgsB, pgsC, pgsAA and pgsE) and one γ-PGA-releasing gene (pgsS). This bacterium is also able to produce salt-stable enzymes such as protease, β-glucosidase, and β-1,3-1,4-glucanase. This affords significant insights into strategies employed by halotolerant bacteria to survive at high salt concentrations. The sequence contains information on secondary metabolite biosynthetic gene clusters and, most importantly, enzymes produced by the bacterium may be valuable with respect to food, beverage, detergent, animal feed, and certain commercial contexts.


September 22, 2019

Bias in resistance gene prediction due to repeat masking

Several recently published Brassicaceae genome annotations show strong differences in resistance (R)-gene content. We believe that this is caused by different approaches to repeat masking. Here we show that some of the repeats stored in public databases used for repeat masking carry pieces of predicted R-gene-related domains, and demonstrate that at least some of the variance in R-gene content in recent genome annotations is caused by using these repeats for repeat masking. We also show that other classes of genes are less affected by this phenomenon, and estimate a false positive rate of R genes (0 to 4.6%) that are in reality transposons carrying the R-gene domains. These results may partially explain why there has been a decrease in published novel R genes in recent years, which has implications for plant breeding, especially in the face of pathogens changing as a response to climate change.


September 22, 2019

Cloning of the wheat Yr15 resistance gene sheds light on the plant tandem kinase-pseudokinase family.

Yellow rust, caused by Puccinia striiformis f. sp. tritici (Pst), is a devastating fungal disease threatening much of global wheat production. Race-specific resistance (R)-genes are used to control rust diseases, but the rapid emergence of virulent Pst races has prompted the search for a more durable resistance. Here, we report the cloning of Yr15, a broad-spectrum R-gene derived from wild emmer wheat, which encodes a putative kinase-pseudokinase protein, designated as wheat tandem kinase 1, comprising a unique R-gene structure in wheat. The existence of a similar gene architecture in 92 putative proteins across the plant kingdom, including the barley RPG1 and a candidate for Ug8, suggests that they are members of a distinct family of plant proteins, termed here tandem kinase-pseudokinases (TKPs). The presence of kinase-pseudokinase structure in both plant TKPs and the animal Janus kinases sheds light on the molecular evolution of immune responses across these two kingdoms.


September 22, 2019

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi, displaying highly diverse clinical outcomes. Elucidating and comparing the genome sequences of different strains may therefore improve understanding of the disease. Here, two new T. cruzi strains have been sequenced (Y using Illumina and Bug2148 using PacBio), assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly statistics for the new sequences show a clear improvement over the currently available T. cruzi genomes, including the largest contig assembled in de novo attempts (1.3 Mb in Bug2148) and the highest mean assembly coverage (71X for Y). Our analysis reveals a new genomic expansion and greater complexity for those multi-copy gene families related to infection process and disease development, such as Trans-sialidases, Mucins and Mucin Associated Surface Proteins, among others. On the one hand, we demonstrate that multi-copy gene families are located near telomeric regions of the “chromosome-like” 1.3 Mb contig assembled for Bug2148, where they are likely subject to high evolutionary pressure. On the other hand, we identified several strain-specific single copy genes that might help to understand the differences in infectivity and physiology among strains. In summary, our results indicate that T. cruzi has a complex genomic architecture that may have promoted its evolution.


September 22, 2019

2,3-Butanediol production by the non-pathogenic bacterium Paenibacillus brasilensis.

2,3-Butanediol (2,3-BDO) is of considerable importance in the chemical, plastic, pharmaceutical, cosmetic, and food industries. The main bacterial species producing this compound are considered pathogenic, hindering large-scale productivity. The species Paenibacillus brasilensis is generally recognized as safe (GRAS) and is phylogenetically similar to P. polymyxa, a species widely used for 2,3-BDO production. Here, we demonstrate, for the first time, that P. brasilensis strains produce 2,3-BDO. Total 2,3-BDO concentrations for 15 P. brasilensis strains varied from 5.5 to 7.6 g/l after 8 h incubation at 32 °C in modified YEPD medium containing 20 g/l glucose. Strain PB24 produced 8.2 g/l of 2,3-BDO within a 12-h growth period, representing a yield of 0.43 g/g and a productivity of 0.68 g/l/h. An increase in 2,3-BDO production by strain PB24 was observed using higher concentrations of glucose, reaching 27 g/l of total 2,3-BDO in YEPD containing about 80 g/l glucose within a 72-h growth period. We sequenced the genome of P. brasilensis PB24 and uncovered at least six genes related to the 2,3-BDO pathway at four distinct loci. We also compared gene sequences related to the 2,3-BDO pathway in P. brasilensis PB24 with those of other spore-forming bacteria, and found strong similarity to P. polymyxa, P. terrae, and P. peoriae 2,3-BDO-related genes. Regulatory regions upstream of these genes indicated that they are probably co-regulated. Finally, we propose a production pathway from glucose to 2,3-BDO in P. brasilensis PB24. Although the gene encoding S-2,3-butanediol dehydrogenase (butA) was found in the genome of P. brasilensis PB24, only R,R-2,3- and meso-2,3-butanediol were detected by gas chromatography under the growth conditions tested here. Our findings can serve as a basis for further improvements to the metabolic capabilities of this little-studied Paenibacillus species in relation to production of the high-value chemical 2,3-butanediol.
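The productivity figure reported for strain PB24 follows directly from the titer and the growth period, as the small worked sketch below shows. The function names are hypothetical. The glucose consumption value is only what the reported yield implies and is not stated in the abstract.

```python
def volumetric_productivity(titer_g_per_l, hours):
    """Product concentration divided by cultivation time, in g/l/h."""
    return titer_g_per_l / hours


def implied_substrate_consumed(titer_g_per_l, yield_g_per_g):
    """Substrate consumption (g/l) implied by a titer and a yield."""
    return titer_g_per_l / yield_g_per_g


# Values quoted in the abstract for strain PB24: 8.2 g/l in 12 h, 0.43 g/g.
productivity = volumetric_productivity(8.2, 12)
glucose_used = implied_substrate_consumed(8.2, 0.43)
print(f"productivity: {productivity:.2f} g/l/h")
print(f"implied glucose consumed: {glucose_used:.1f} g/l")
```

The first value reproduces the reported 0.68 g/l/h. The second suggests that roughly 19 g/l of the 20 g/l glucose supplied was consumed, assuming the yield was calculated on consumed glucose.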


September 22, 2019

Antiviral adaptive immunity and tolerance in the mosquito Aedes aegypti

Mosquitoes spread pathogenic arboviruses while themselves tolerating infection. We here characterize an immunity pathway providing long-term antiviral protection and define how this pathway discriminates between self and non-self. Mosquitoes use viral RNAs to create viral-derived cDNAs (vDNAs) central to the antiviral response. vDNA molecules are acquired through a process of reverse-transcription and recombination directed by endogenous retrotransposons. These vDNAs are thought to integrate in the host genome as endogenous viral elements (EVEs). Sequencing of pre-integrated vDNA revealed that the acquisition process exquisitely distinguishes viral from host RNA, providing one layer of self-non-self discrimination. Importantly, we show EVE-derived piRNAs have antiviral activity and are loaded onto Piwi4 to inhibit virus replication. In a second layer of self-non-self discrimination, Piwi4 preferentially loads EVE-derived piRNAs, discriminating against transposon-targeting piRNAs. Our findings define a fundamental virus-specific immunity pathway in mosquitoes that uses EVEs as a potent and specific antiviral transgenerational mechanism.


September 22, 2019

The opium poppy genome and morphinan production.

Morphinan-based painkillers are derived from opium poppy (Papaver somniferum L.). We report a draft of the opium poppy genome, with 2.72 gigabases assembled into 11 chromosomes with contig N50 and scaffold N50 of 1.77 and 204 megabases, respectively. Synteny analysis suggests a whole-genome duplication at ~7.8 million years ago and ancient segmental or whole-genome duplication(s) that occurred before the Papaveraceae-Ranunculaceae divergence 110 million years ago. Syntenic blocks representative of phthalideisoquinoline and morphinan components of a benzylisoquinoline alkaloid cluster of 15 genes provide insight into how this cluster evolved. Paralog analysis identified P450 and oxidoreductase genes that combined to form the STORR gene fusion essential for morphinan biosynthesis in opium poppy. Thus, gene duplication, rearrangement, and fusion events have led to evolution of specialized metabolic products in opium poppy. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.

