September 22, 2019

The genomic basis of color pattern polymorphism in the Harlequin ladybird.

Many animal species comprise discrete phenotypic forms. A common example in natural populations of insects is the occurrence of different color patterns, which has motivated a rich body of ecological and genetic research [1-6]. The occurrence of dark, i.e., melanic, forms displaying discrete color patterns is found across multiple taxa, but the underlying genomic basis remains poorly characterized. In numerous ladybird species (Coccinellidae), the spatial arrangement of black and red patches on adult elytra varies wildly within species, forming strikingly different complex color patterns [7, 8]. In the harlequin ladybird, Harmonia axyridis, more than 200 distinct color forms have been described, which classic genetic studies suggest result from allelic variation at a single, unknown, locus [9, 10]. Here, we combined whole-genome sequencing, population-based genome-wide association studies, gene expression, and functional analyses to establish that the transcription factor Pannier controls melanic pattern polymorphism in H. axyridis. We show that pannier is necessary for the formation of melanic elements on the elytra. Allelic variation in pannier leads to protein expression in distinct domains on the elytra and thus determines the distinct color patterns in H. axyridis. Recombination between pannier alleles may be reduced by a highly divergent sequence of ~170 kb in the cis-regulatory regions of pannier, with a 50 kb inversion between color forms. This most likely helps maintain the distinct alleles found in natural populations. Thus, we propose that highly variable discrete color forms can arise in natural populations through cis-regulatory allelic variation of a single gene. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.


September 22, 2019

Genome analysis of the yeast M14, an industrial brewing yeast strain widely used in China

The lager brewing yeast M14 is the most widely used yeast strain in the high gravity brewing process in China. To investigate the characteristics of this strain, the genome of the yeast M14 was sequenced and the genome annotation information is presented in this study. The current assembly contained 133 scaffolds and its total size was around 23 Mb with a GC content of 38.98%. The brewing yeast M14 is a hybrid Saccharomyces cerevisiae × Saccharomyces uvarum at the genomic level and its genome includes one circular mitochondrial genome originating from S. uvarum. Furthermore, the 9,796 protein-coding genes were annotated and their functions were analyzed using the Swiss-Prot database. Among them, the key genes responsible for typical lager brewing yeast characteristics, such as maltotriose uptake and sulfite production, were annotated and analyzed. Interestingly, nine specific genes present in the brewing yeast M14 were not found in the genome of either S. uvarum CBS 7001 or S. cerevisiae S288C, which are very close to strain M14 in the phylogenetic relationship. These nine genes encoded, respectively, melibiase, DNA replication protein, fructose symporter, hypothetical protein, hypothetical protein M773_09155, LIF1, minor spike protein H, ribosomal protein S27, and mitochondrial chaperones. The genome sequence of the yeast strain M14 provides a new tool to better understand brewing yeast behavior in industrial beer production.


September 22, 2019

Endogenous rRNA sequence variation can regulate stress response gene expression and phenotype.

Prevailing dogma holds that ribosomes are uniform in composition and function. Here, we show that nutrient limitation-induced stress in E. coli changes the relative expression of rDNA operons to alter the rRNA composition within the actively translating ribosome pool. The most upregulated operon encodes the unique 16S rRNA, rrsH, distinguished by conserved sequence variation within the small ribosomal subunit. rrsH-bearing ribosomes affect the expression of functionally coherent gene sets and alter the levels of the RpoS sigma factor, the master regulator of the general stress response. These impacts are associated with phenotypic changes in antibiotic sensitivity, biofilm formation, and cell motility and are regulated by stress response proteins, RelA and RelE, as well as the metabolic enzyme and virulence-associated protein, AdhE. These findings establish that endogenously encoded, naturally occurring rRNA sequence variation can modulate ribosome function, central aspects of gene expression regulation, and cellular physiology. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

How long are long tandem repeats? A challenge for current methods of whole-genome sequence assembly: The case of satellites in Caenorhabditis elegans.

Repetitive genome regions have been difficult to sequence, mainly because of the comparatively small size of the fragments used in assembly. Satellites or tandem repeats are very abundant in nematodes and offer an excellent playground to evaluate different assembly methods. Here, we compare the structure of satellites found in three different assemblies of the Caenorhabditis elegans genome: the original sequence obtained by Sanger sequencing, an assembly based on PacBio technology, and an assembly using Nanopore sequencing reads. In general, satellites were found in equivalent genomic regions, but the new long-read methods (PacBio and Nanopore) tended to result in longer assembled satellites. Important differences exist between the assemblies resulting from the two long-read technologies, such as the sizes of long satellites. Our results also suggest that the lengths of some annotated genes with internal repeats which were assembled using Sanger sequencing are likely to be incorrect.


September 22, 2019

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

Structural features of genomes, including the three-dimensional arrangement of DNA in the nucleus, are increasingly seen as key contributors to the regulation of gene expression. However, studies on how genome structure and nuclear organisation influence transcription have so far been limited to a handful of model species. This narrow focus limits our ability to draw general conclusions about the ways in which three-dimensional structures are encoded, and to integrate information from three-dimensional data to address a broader gamut of biological questions. Here, we generate a complete and gapless genome sequence for the filamentous fungus, Epichloë festucae. We use Hi-C data to examine the three-dimensional organisation of the genome, and RNA-seq data to investigate how Epichloë genome structure contributes to the suite of transcriptional changes needed to maintain symbiotic relationships with the grass host. Our results reveal a genome in which very repeat-rich blocks of DNA with discrete boundaries are interspersed by gene-rich sequences that are almost repeat-free. In contrast to other species reported to date, the three-dimensional structure of the genome is anchored by these repeat blocks, which act to isolate transcription in neighbouring gene-rich regions. Genes that are differentially expressed in planta are enriched near the boundaries of these repeat-rich blocks, suggesting that their three-dimensional orientation partly encodes and regulates the symbiotic relationship formed by this organism.


September 22, 2019

Loss of bacitracin resistance due to a large genomic deletion among Bacillus anthracis strains.

Bacillus anthracis is a Gram-positive endospore-forming bacterial species that causes anthrax in both humans and animals. In Zambia, anthrax cases are frequently reported in both livestock and wildlife, with occasional transmission to humans, causing serious public health problems in the country. To understand the genetic diversity of B. anthracis strains in Zambia, we sequenced and compared the genomic DNA of B. anthracis strains isolated across the country. Single nucleotide polymorphisms clustered these strains into three groups. Genome sequence comparisons revealed a large deletion in strains belonging to one of the groups, possibly due to unequal crossing over between a pair of rRNA operons. The deleted genomic region included genes conferring resistance to bacitracin, and the strains with the deletion were confirmed with loss of bacitracin resistance. Similar deletions between rRNA operons were also observed in a few B. anthracis strains phylogenetically distant from Zambian strains. The structure of bacitracin resistance genes flanked by rRNA operons was conserved only in members of the Bacillus cereus group. The diversity and genomic characteristics of B. anthracis strains determined in this study would help in the development of genetic markers and treatment of anthrax in Zambia. IMPORTANCE Anthrax is caused by Bacillus anthracis, an endospore-forming soil bacterium. The genetic diversity of B. anthracis is known to be low compared with that of Bacillus species. In this study, we performed whole-genome sequencing of Zambian isolates of B. anthracis to understand the genetic diversity between closely related strains. Comparison of genomic sequences revealed that closely related strains were separated into three groups based on single nucleotide polymorphisms distributed throughout the genome. 
A large genomic deletion was detected in the region containing a bacitracin resistance gene cluster flanked by rRNA operons, resulting in the loss of bacitracin resistance. The structure of the deleted region, which was also conserved among species of the Bacillus cereus group, has the potential for both deletion and amplification and thus might be enabling the species to flexibly control the level of bacitracin resistance for adaptive evolution.


September 22, 2019

Genome sequence and metabolic analysis of a fluoranthene-degrading strain Pseudomonas aeruginosa DN1.

Pseudomonas aeruginosa DN1, isolated from petroleum-contaminated soil, showed excellent degradation ability toward diverse polycyclic aromatic hydrocarbons (PAHs). Many studies have been done to improve its degradation ability. However, the molecular mechanisms of PAH degradation in the DN1 strain are unclear. In this study, the whole genome of the DN1 strain was sequenced and analyzed. Its genome contains 6,641,902 bp and encodes 6,684 putative open reading frames (ORFs), making it larger than the genomes of almost all the Pseudomonas strains used for comparison. Results of gene annotation showed that this strain harbored over 100 candidate genes involved in PAH degradation, including those encoding 25 dioxygenases, four ring-hydroxylating dioxygenases, five ring-cleaving dioxygenases, and various catabolic enzymes, transcriptional regulators, and transporters in the degradation pathways. In addition, gene knockout experiments revealed that the disruption of some key PAH degradation genes in the DN1 strain, such as catA, pcaG, pcaH, and rhdA, did not completely inhibit fluoranthene degradation, although the degradation rate was reduced to some extent. Three intermediate metabolites, 9-hydroxyfluorene, 1-acenaphthenone, and 1,8-naphthalic anhydride, were identified as the dominant intermediates in the presence of 50 µg/mL fluoranthene as the sole carbon source according to gas chromatography mass spectrometry analysis. Taken together, the genomic and metabolic analysis indicated that fluoranthene degradation by the DN1 strain was initiated by dioxygenation at the C-1,2, C-2,3, and C-7,8 positions. These results provide new insights into the genomic plasticity and environmental adaptation of the DN1 strain.


September 22, 2019

Characterization and genomic analyses of Pseudomonas aeruginosa podovirus TC6: establishment of genus Pa11virus.

Phages have attracted renewed interest as an alternative to chemical antibiotics. Although the number of phages is 10-fold higher than that of bacteria, the number of genomically characterized phages is far lower than that of bacteria. In this study, phage TC6, a novel lytic virus of Pseudomonas aeruginosa, was isolated and characterized. TC6 consists of an icosahedral head with a diameter of approximately 54 nm and a short tail with a length of about 17 nm, which are characteristics of the family Podoviridae. TC6 can lyse 86 out of 233 clinically isolated P. aeruginosa strains, thus showing potential for application in phage therapy. The linear double-stranded genomic DNA of TC6 consisted of 49,796 base pairs and was predicted to contain 71 protein-coding genes. A total of 11 TC6 structural proteins were identified by mass spectrometry. Comparative analysis revealed that the P. aeruginosa phages TC6, O4, PA11, and IME180 shared high similarity at the DNA sequence and proteome levels, among which PA11 was the first phage discovered and published. Moreover, these phages contain 54 core genes and have very close phylogenetic relationships, which distinguish them from other known phage genera. We therefore proposed that these four phages can be classified as Pa11virus, comprising a new phage genus of Podoviridae that infects Pseudomonas spp. The results of this work advance our understanding of phage biology, classification, and diversity.


September 22, 2019

Therapeutic potential of a new jumbo phage that infects Vibrio coralliilyticus, a widespread coral pathogen.

Biological control using bacteriophages is a promising approach for mitigating the devastating effects of coral diseases. Several phages that infect Vibrio coralliilyticus, a widespread coral pathogen, have been isolated, suggesting that this bacterium is permissive to viral infection and is, therefore, a suitable candidate for treatment by phage therapy. In this study, we combined functional and genomic approaches to evaluate the therapeutic potential of BONAISHI, a novel V. coralliilyticus phage, which was isolated from the coral reef in Van Phong Bay (Vietnam). BONAISHI appears to be strictly lytic for several pathogenic strains of V. coralliilyticus and remains infectious over a broad range of environmental conditions. This candidate has an unusually large dsDNA genome (303 kb), with no genes that encode known toxins or that are implicated in lysogeny control. We identified several proteins involved in host lysis, which may offer an interesting alternative to the use of whole bacteriophages for controlling V. coralliilyticus. A preliminary therapy test showed that adding BONAISHI to an infected culture of Symbiodinium sp. cells reduced the impact of V. coralliilyticus on Symbiodinium sp. photosynthetic activity. This study showed that BONAISHI is able to mitigate V. coralliilyticus infections, making it a good candidate for phage therapy for coral disease.


September 22, 2019

A complete Cannabis chromosome assembly and adaptive admixture for elevated cannabidiol (CBD) content

Cannabis has been cultivated for millennia with distinct cultivars providing either fiber and grain or tetrahydrocannabinol. Recent demand for cannabidiol rather than tetrahydrocannabinol has favored the breeding of admixed cultivars with extremely high cannabidiol content. Despite several draft Cannabis genomes, the genomic structure of cannabinoid synthase loci has remained elusive. A genetic map derived from a tetrahydrocannabinol/cannabidiol segregating population and a complete chromosome assembly from a high-cannabidiol cultivar together resolve the linkage of cannabidiolic and tetrahydrocannabinolic acid synthase gene clusters which are associated with transposable elements. High-cannabidiol cultivars appear to have been generated by integrating hemp-type cannabidiolic acid synthase gene clusters into a background of marijuana-type cannabis. Quantitative trait locus mapping suggests that overall drug potency, however, is associated with other genomic regions needing additional study.


September 22, 2019

SKA: Split Kmer Analysis Toolkit for Bacterial Genomic Epidemiology

Genome sequencing is revolutionising infectious disease epidemiology, providing a huge step forward in sensitivity and specificity over more traditional molecular typing techniques. However, the complexity of genome data often means that its analysis and interpretation requires high-performance compute infrastructure and dedicated bioinformatics support. Furthermore, current methods have limitations that can differ between analyses and are often opaque to the user, and their reliance on multiple external dependencies makes reproducibility difficult. Here I introduce SKA, a toolkit for analysis of genome sequence data from closely-related, small, haploid genomes. SKA uses split kmers to rapidly identify variation between genome sequences, making it possible to analyse hundreds of genomes on a standard home computer. Tests on publicly available simulated and real-life data show that SKA is both faster and more efficient than the gold standard methods used today while retaining similar levels of accuracy for epidemiological purposes. SKA can take raw read data or genome assemblies as input and calculate pairwise distances, create single linkage clusters and align genomes to a reference genome or using a reference-free approach. SKA requires few decisions to be made by the user, which, along with its computational efficiency, allows genome analysis to become accessible to those with only basic bioinformatics training. The limitations of SKA are also far more transparent than for current approaches, and future improvements to mitigate these limitations are possible. Overall, SKA is a powerful addition to the armoury of the genomic epidemiologist. SKA source code is available from Github (https://github.com/simonrharris/SKA).
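The split kmer idea in the abstract above can be illustrated with a small sketch. This is not SKA's implementation: it assumes a split kmer is a pair of flanking kmers separated by one middle base, and that two samples sharing the same flanks but carrying a different middle base differ by a SNP at that position. The function names `split_kmers` and `snp_distance` and the default `k=3` are hypothetical choices for illustration, and the sketch ignores reverse-complement strands and sequencing errors.

```python
def split_kmers(seq, k=3):
    """Map each pair of flanking k-mers to the set of middle bases seen between them.

    Illustrative only: a window of length 2k+1 is split into a left flank,
    one middle base, and a right flank.
    """
    table = {}
    span = 2 * k + 1
    for i in range(len(seq) - span + 1):
        window = seq[i:i + span]
        flanks = (window[:k], window[k + 1:])
        table.setdefault(flanks, set()).add(window[k])
    return table


def snp_distance(seq_a, seq_b, k=3):
    """Return (differing middle bases, shared split k-mers) for two sequences.

    Flank pairs that map to more than one middle base within a sample are
    treated as ambiguous and are not counted as differences.
    """
    a = split_kmers(seq_a, k)
    b = split_kmers(seq_b, k)
    shared = a.keys() & b.keys()
    diffs = sum(
        1
        for key in shared
        if len(a[key]) == 1 and len(b[key]) == 1 and a[key] != b[key]
    )
    return diffs, len(shared)


if __name__ == "__main__":
    # Two toy sequences that differ by a single substitution (C -> A).
    print(snp_distance("ACGTACGGTCA", "ACGTAAGGTCA", k=3))
```

Only windows whose flanks avoid the substituted base remain shared between the two samples, which is why a single isolated SNP shows up as one shared split kmer with a different middle base. Pairwise counts of this kind could then feed a distance matrix or single linkage clustering, as the abstract describes.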


September 22, 2019

Physiological genomics of dietary adaptation in a marine herbivorous fish

Adopting a new diet is a significant evolutionary change and can profoundly affect an animal's physiology, biochemistry, ecology, and its genome. To study this evolutionary transition, we investigated the physiology and genomics of digestion of a derived herbivorous fish, the monkeyface prickleback (Cebidichthys violaceus). We sequenced and assembled its genome and digestive transcriptome and revealed the molecular changes related to important dietary enzymes, finding abundant evidence for adaptation at the molecular level. In this species, two gene families experienced expansion in copy number and adaptive amino acid substitutions. These families, amylase and bile salt activated lipase, are involved in digestion of carbohydrates and lipids, respectively. Both show elevated levels of gene expression and increased enzyme activity. Because carbohydrates are abundant in the prickleback's diet and lipids are rare, these findings suggest that such dietary specialization involves both exploiting abundant resources and scavenging rare ones, especially essential nutrients, like essential fatty acids.


September 22, 2019

Antimicrobial resistance profile of mcr-1 positive clinical isolates of Escherichia coli in China From 2013 to 2016.

Multidrug-resistant (MDR) Escherichia coli has posed a great challenge for public health in recent decades. Polymyxins have been reconsidered as a valuable therapeutic option for the treatment of infections caused by MDR E. coli. A plasmid-encoded colistin resistance gene mcr-1 encoding phosphoethanolamine transferase has been recently described in Enterobacteriaceae. In this study, a total of 123 E. coli isolates obtained from patients with diarrheal diseases in China were used for the genetic analysis of colistin resistance in clinical isolates. Antimicrobial resistance profiles for polymyxin B (PB) and 11 commonly used antimicrobial agents were determined. Among the 123 E. coli isolates, 9 isolates (7.3%) were resistant to PB and PCR screening showed that seven (5.7%) isolates carried the mcr-1 gene. A hybrid sequencing analysis using single-molecule, real-time (SMRT) sequencing and Illumina sequencing was then performed to resolve the genomes of the seven mcr-1 positive isolates. These seven isolates harbored multiple plasmids and are MDR, with six isolates carrying one mcr-1 positive plasmid and one isolate (14EC033) carrying two mcr-1 positive plasmids. These eight mcr-1 positive plasmids belonged to the IncX4, IncI2, and IncP1 types. In addition, the mcr-1 gene was the sole antibiotic resistance gene identified in the mcr-1 positive plasmids, while the rest of the antibiotic resistance genes were mostly clustered into one or two plasmids. Interestingly, one mcr-1 positive isolate (14EC047) was susceptible to PB, and we showed that the activity of MCR-1-mediated colistin resistance was not phenotypically expressed in the 14EC047 host strain. Furthermore, three isolates exhibited resistance to PB but did not carry previously reported mcr-related genes. Multilocus sequence typing (MLST) showed that these mcr-1 positive E. coli isolates belonged to five different STs, and three isolates belonged to ST301 which carried multiple virulence factors related to diarrhea. 
Additionally, the mcr-1 positive isolates were all susceptible to imipenem (IMP), suggesting that IMP could be used to treat infection caused by mcr-1 positive E. coli isolates. Collectively, this study showed a high occurrence of mcr-1 positive plasmids in patients with diarrheal diseases of Guangzhou in China and the abolishment of the MCR-1 mediated colistin resistance in one E. coli isolate.


September 22, 2019

Bacterial virulence against an oceanic bloom-forming phytoplankter is mediated by algal DMSP

Emiliania huxleyi is a bloom-forming microalga that affects the global sulfur cycle by producing large amounts of dimethylsulfoniopropionate (DMSP) and its volatile metabolic product dimethyl sulfide. Top-down regulation of E. huxleyi blooms has been attributed to viruses and grazers; however, the possible involvement of algicidal bacteria in bloom demise has remained elusive. We demonstrate that a Roseobacter strain, Sulfitobacter D7, that we isolated from a North Atlantic E. huxleyi bloom, exhibited algicidal effects against E. huxleyi upon coculturing. Both the alga and the bacterium were found to co-occur during a natural E. huxleyi bloom, therefore establishing this host-pathogen system as an attractive, ecologically relevant model for studying algal-bacterial interactions in the oceans. During interaction, Sulfitobacter D7 consumed and metabolized algal DMSP to produce high amounts of methanethiol, an alternative product of DMSP catabolism. We revealed a unique strain-specific response, in which E. huxleyi strains that exuded higher amounts of DMSP were more susceptible to Sulfitobacter D7 infection. Intriguingly, exogenous application of DMSP enhanced bacterial virulence and induced susceptibility in an algal strain typically resistant to the bacterial pathogen. This enhanced virulence was highly specific to DMSP compared to addition of propionate and glycerol which had no effect on bacterial virulence. We propose a novel function for DMSP, in addition to its central role in mutualistic interactions among marine organisms, as a mediator of bacterial virulence that may regulate E. huxleyi blooms.


September 22, 2019

pYR4 from a Norwegian isolate of Yersinia ruckeri is a putative virulence plasmid encoding both a type IV pilus and a type IV secretion system

Enteric redmouth disease caused by the pathogen Yersinia ruckeri is a significant problem for fish farming around the world. Despite its importance, only a few virulence factors of Y. ruckeri have been identified and studied in detail. Here, we report and analyze the complete DNA sequence of pYR4, a plasmid from a highly pathogenic Norwegian Y. ruckeri isolate, sequenced using PacBio SMRT technology. Like the well-known pYV plasmid of human pathogenic Yersiniae, pYR4 is a member of the IncFII family. Thirty-one percent of the pYR4 sequence is unique compared to other Y. ruckeri plasmids. The unique regions contain, among other genes, a large number of mobile genetic elements and two partitioning systems. The G+C content of pYR4 is higher than that of the Y. ruckeri NVH_3758 genome, indicating its relatively recent horizontal acquisition. pYR4, as well as the related plasmid pYR3, comprises operons that encode type IV pili and a conjugation system (tra). In contrast to other Yersinia plasmids, pYR4 cannot be cured at elevated temperatures. Our study highlights the power of PacBio sequencing technology for identifying mis-assembled segments of genomic sequences. Comparative analysis of pYR4 and other Y. ruckeri plasmids and genomes, which were sequenced by second and third generation sequencing technologies, showed errors in second generation sequencing assemblies. Specifically, in the Y. ruckeri 150 and Y. ruckeri ATCC29473 genome assemblies, we mapped the entire pYR3 plasmid sequence. Placing plasmid sequences on the chromosome can result in erroneous biological conclusions. Thus, PacBio sequencing or similar long-read methods should always be preferred for de novo genome sequencing. As the tra operons of pYR3, although misplaced on the chromosome during the genome assembly process, were demonstrated to have an effect on virulence, and type IV pili are virulence factors in many bacteria, we suggest that pYR4 directly contributes to Y. ruckeri virulence.

