September 22, 2019  |  

Genomic surveillance of Neisseria gonorrhoeae to investigate the distribution and evolution of antimicrobial-resistance determinants and lineages.

The first extensively drug resistant (XDR) Neisseria gonorrhoeae strain with high resistance to the extended-spectrum cephalosporin ceftriaxone was identified in 2009 in Japan, but no other strain with this antimicrobial-resistance profile has been reported since. However, surveillance to date has been based on phenotypic methods and sequence typing, not genome sequencing. Therefore, little is known about the local population structure at the genomic level, and how resistance determinants and lineages are distributed and evolve. We analysed the whole-genome sequence data and the antimicrobial-susceptibility testing results of 204 strains sampled in a region where the first XDR ceftriaxone-resistant N. gonorrhoeae was isolated, complemented with 67 additional genomes from other time frames and locations within Japan. Strains resistant to ceftriaxone were not found, but we discovered a sequence type (ST)7363 sub-lineage susceptible to ceftriaxone and cefixime in which the mosaic penA allele responsible for reduced susceptibility had reverted to a susceptible allele by recombination. Approximately 85% of isolates showed resistance to fluoroquinolones (ciprofloxacin) explained by linked amino acid substitutions at positions 91 and 95 of GyrA with 99% sensitivity and 100% specificity. Approximately 10% showed resistance to macrolides (azithromycin), for which genetic determinants are less clear. Furthermore, we revealed different evolutionary paths of the two major lineages: single acquisition of penA X in the ST7363-associated lineage, followed by multiple independent acquisitions of the penA X and XXXIV in the ST1901-associated lineage. Our study provides a detailed picture of the distribution of resistance determinants and disentangles the evolution of the two major lineages spreading worldwide.
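
The sensitivity and specificity quoted above for the GyrA substitutions follow the usual confusion-matrix definitions. A minimal Python sketch is given here as an illustration only: the function name and the isolate counts are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) for a genotype-based resistance call.

    tp: resistant isolates that carry the marker
    fn: resistant isolates that lack the marker
    tn: susceptible isolates that lack the marker
    fp: susceptible isolates that carry the marker
    """
    sensitivity = tp / (tp + fn)  # fraction of resistant isolates detected
    specificity = tn / (tn + fp)  # fraction of susceptible isolates cleared
    return sensitivity, specificity


# Hypothetical counts chosen only to illustrate the calculation;
# they are NOT the counts reported by the authors.
sens, spec = sensitivity_specificity(tp=171, fn=2, tn=31, fp=0)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

A marker with no false positives gives a specificity of exactly 1.0, while even a few resistant isolates lacking the marker pull the sensitivity below 1.0.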


September 22, 2019  |  

Spread of carbapenem resistance by transposition and conjugation among Pseudomonas aeruginosa.

The emergence of carbapenem-resistant Pseudomonas aeruginosa represents a worldwide problem. To understand the carbapenem-resistance mechanisms and their spreading among P. aeruginosa strains, whole genome sequences were determined of two extensively drug-resistant strains that are endemic in Dutch hospitals. Strain Carb01 63 is of O-antigen serotype O12 and of sequence type ST111, whilst S04 90 is a serotype O11 strain of ST446. Both strains carry a gene for metallo-β-lactamase VIM-2 flanked by two aacA29 genes encoding aminoglycoside acetyltransferases on a class 1 integron. The integron is located on the chromosome in strain Carb01 63 and on a plasmid in strain S04 90. The backbone of the 159-kb plasmid, designated pS04 90, is similar to a previously described plasmid, pND6-2, from Pseudomonas putida. Analysis of the context of the integron showed that it is present in both strains on a ~30-kb mosaic DNA segment composed of four different transposons that can presumably act together as a novel, active, composite transposon. Apart from the presence of a 1237-bp insertion sequence element in the composite transposon on pS04 90, these transposons show > 99% sequence identity indicating that transposition between plasmid and chromosome could have occurred only very recently. The pS04 90 plasmid could be transferred by conjugation to a susceptible P. aeruginosa strain. A second class 1 integron containing a gene for a CARB-2 β-lactamase flanked by an aacA4′-8 and an aadA2 gene, encoding an aminoglycoside acetyltransferase and adenylyltransferase, respectively, was present only in strain Carb01 63. This integron is located also on a composite transposon that is inserted in an integrative and conjugative element on the chromosome. Additionally, this strain contains a frameshift mutation in the oprD gene encoding a porin involved in the transport of carbapenems across the outer membrane.
Together, the results demonstrate that integron-encoded carbapenem and carbapenicillin resistance can easily be disseminated by transposition and conjugation among Pseudomonas aeruginosa strains.


September 22, 2019  |  

The Arctic charr (Salvelinus alpinus) genome and transcriptome assembly.

Arctic charr have a circumpolar distribution, persevere under extreme environmental conditions, and reach ages unknown to most other salmonids. The Salvelinus genus is primarily composed of species with genomes that are structured more like the ancestral salmonid genome than most Oncorhynchus and Salmo species of sister genera. It is thought that this aspect of the genome may be important for local adaptation (due to increased recombination) and anadromy (the migration of fish from saltwater to freshwater). In this study, we describe the generation of a new genetic map, the sequencing and assembly of the Arctic charr genome (GenBank accession: GCF_002910315.2) using the newly created genetic map and a previous genetic map, and present several analyses of the Arctic charr genes and genome assembly. The newly generated genetic map consists of 8,574 unique genetic markers and is similar to previous genetic maps with the exception of three major structural differences. The N50, identified BUSCOs, repetitive DNA content, and total size of the Arctic charr assembled genome are all comparable to other assembled salmonid genomes. An analysis to identify orthologous genes revealed that a large number of orthologs could be identified between salmonids and many appear to have highly conserved gene expression profiles between species. Comparing orthologous gene expression profiles may give us a better insight into which genes are more likely to influence species specific phenotypes.


September 22, 2019  |  

Structural variants exhibit allelic heterogeneity and shape variation in complex traits

Despite extensive effort to reveal the genetic basis of complex phenotypic variation, studies typically explain only a fraction of trait heritability. It has been hypothesized that individually rare hidden structural variants (SVs) could account for a significant fraction of variation in complex traits. To investigate this hypothesis, we assembled 14 Drosophila melanogaster genomes and systematically identified more than 20,000 euchromatic SVs, of which ~40% are invisible to high specificity short read genotyping approaches. SVs are common in Drosophila genes, with almost one third of diploid individuals harboring an SV in genes larger than 5kb, and nearly a quarter harboring multiple SVs in genes larger than 10kb. We show that SV alleles are rarer than amino acid polymorphisms, implying that they are more strongly deleterious. A number of functionally important genes harbor previously hidden structural variants that likely affect complex phenotypes (e.g., Cyp6g1, Drsl5, Cyp28d1&2, InR, and Gss1&2). Furthermore, SVs are overrepresented in quantitative trait locus candidate genes from eight Drosophila Synthetic Population Resource (DSPR) mapping experiments. We conclude that SVs are pervasive in genomes, are frequently present as heterogeneous allelic series, and can act as rare alleles of large effect.


September 22, 2019  |  

Forward genetics by genome sequencing uncovers the central role of the Aspergillus niger goxB locus in hydrogen peroxide induced glucose oxidase expression.

Aspergillus niger is an industrially important source for gluconic acid and glucose oxidase (GOx), a secreted commercially important flavoprotein which catalyses the oxidation of β-D-glucose by molecular oxygen to D-glucolactone and hydrogen peroxide. Expression of goxC, the GOx-encoding gene, and the concomitant two-step conversion of glucose to gluconic acid requires oxygen and the presence of significant amounts of glucose in the medium and is optimally induced at pH 5.5. The molecular mechanisms underlying regulation of goxC expression are, however, still enigmatic. Genetic studies aimed at understanding GOx induction have indicated the involvement of at least seven complementation groups, for none of which the molecular basis has been resolved. In this study, a mapping-by-sequencing forward genetics approach was used to uncover the molecular role of the goxB locus in goxC expression. Using the Illumina and PacBio sequencing platforms, a hybrid high-quality draft genome assembly of laboratory strain N402 was obtained and used as a reference for mapping of genomic reads obtained from the derivative NW103:goxB mutant strain. The goxB locus encodes a thioredoxin reductase. A deletion of the encoding gene in the N402 parent strain led to a high constitutive expression level of the GOx and the lactonase encoding genes required for the two-step conversion of glucose to gluconic acid and of the catR gene encoding catalase R. This high constitutive level of expression was observed to be irrespective of the carbon source and oxidative stress applied. A model clarifying the role of GoxB in the regulation of the expression of goxC involving hydrogen peroxide as second messenger is presented.


September 22, 2019  |  

Draft genome sequence and transcriptional analysis of Rosellinia necatrix infected with a virulent mycovirus.

Understanding the molecular mechanisms of pathogenesis is useful in developing effective control methods for fungal diseases. The white root rot fungus Rosellinia necatrix is a soilborne pathogen that causes serious economic losses in various crops, including fruit trees, worldwide. Here, using next-generation sequencing techniques, we first produced a 44-Mb draft genome sequence of R. necatrix strain W97, an isolate from Japan, in which 12,444 protein-coding genes were predicted. To survey differentially expressed genes (DEGs) associated with the pathogenesis of the fungus, the hypovirulent W97 strain infected with Rosellinia necatrix megabirnavirus 1 (RnMBV1) was used for a comprehensive transcriptome analysis. In total, 545 and 615 genes are up- and down-regulated, respectively, in R. necatrix infected with RnMBV1. Gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses of the DEGs suggested that primary and secondary metabolism would be greatly disturbed in R. necatrix infected with RnMBV1. The genes encoding transcriptional regulators, plant cell wall-degrading enzymes, and toxin production, such as cytochalasin E, were also found in the DEGs. The genetic resources provided in this study will accelerate the discovery of genes associated with pathogenesis and other biological characteristics of R. necatrix, thus contributing to disease control.


September 22, 2019  |  

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Coenurosis, caused by the larval coenurus of the tapeworm Taenia multiceps, is a fatal central nervous system disease in both sheep and humans. Though treatment and prevention options are available, the control of coenurosis still presents great challenges. Here, we present a high-quality genome sequence of T. multiceps in which 240 Mb (96%) of the genome has been successfully assembled using PacBio single-molecule real-time (SMRT) and Hi-C data with an N50 length of 44.8 Mb. In total, 49.5 Mb (20.6%) repeat sequences and 13,013 gene models were identified. We found that Taenia spp. have an expansion of transposable elements and recent small-scale gene duplications following the divergence of Taenia from Echinococcus, but not in Echinococcus genomes, and the genes underlying environmental adaptability and dosage effect tend to be over-retained in the T. multiceps genome. Moreover, we identified several genes encoding proteins involved in proglottid formation and interactions with the host central nervous system, which may contribute to the adaptation of T. multiceps to its parasitic lifestyle. Our study not only provides insights into the biology and evolution of T. multiceps, but also identifies a set of species-specific gene targets for developing novel treatment and control tools for coenurosis.
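
The N50 length quoted above is a standard contiguity statistic: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly. A minimal Python sketch of the calculation follows; the function name and the toy lengths are illustrative assumptions, not data from the T. multiceps assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating length until
    # at least half of the total assembly size is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("n50() needs at least one sequence of positive length")


# Toy lengths (arbitrary units), for illustration only.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))
```

In the toy example the total is 290, so the two longest sequences (80 + 70 = 150) already cover half of it and the N50 is 70.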


September 22, 2019  |  

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi displaying highly diverse clinical outcomes. In this context, elucidating and comparing genome sequences between strains may improve understanding of the disease. Here, two new T. cruzi strains, Y (sequenced using Illumina) and Bug2148 (sequenced using PacBio), have been assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly statistics for the new sequences show an effective improvement over the currently available T. cruzi genomes, such as the largest contig assembled in de novo attempts (1.3 Mb in Bug2148) and the highest mean assembly coverage (71X for Y). Our analysis reveals a new genomic expansion and greater complexity for those multi-copy gene families related to infection process and disease development, such as Trans-sialidases, Mucins and Mucin Associated Surface Proteins, among others. On the one hand, we demonstrate that multi-copy gene families are located near telomeric regions of the “chromosome-like” 1.3 Mb contig assembled of Bug2148, where they are likely under high evolutionary pressure. On the other hand, we identified several strain-specific single copy genes that might help to understand the differences in infectivity and physiology among strains. In summary, our results indicate that T. cruzi has a complex genomic architecture that may have promoted its evolution.


September 22, 2019  |  

The sequence of a male-specific genome region containing the sex determination switch in Aedes aegypti.

Aedes aegypti is the principal vector of several important arboviruses. Among the methods of vector control to limit transmission of disease are genetic strategies that involve the release of sterile or genetically modified non-biting males, which has generated interest in manipulating mosquito sex ratios. Sex determination in Ae. aegypti is controlled by a non-recombining Y chromosome-like region called the M locus, yet characterisation of this locus has been thwarted by the repetitive nature of the genome. In 2015, an M locus gene named Nix was identified that displays the qualities of a sex determination switch. With the use of a whole-genome bacterial artificial chromosome (BAC) library, we amplified and sequenced a ~200 kb region containing the male-determining gene Nix. In this study, we show that Nix is comprised of two exons separated by a 99 kb intron primarily composed of repetitive DNA, especially transposable elements. Nix, an unusually large and highly repetitive gene, exhibits features in common with Y chromosome genes in other organisms. We speculate that the lack of recombination at the M locus has allowed the expansion of repeats in a manner characteristic of a sex-limited chromosome, in accordance with proposed models of sex chromosome evolution in insects.


September 22, 2019  |  

Whole genome sequencing for investigations of meningococcal outbreaks in the United States: a retrospective analysis.

Although rare in the U.S., outbreaks due to Neisseria meningitidis do occur. Rapid, early outbreak detection is important for timely public health response. In this study, we characterized U.S. meningococcal isolates (N = 201) from 15 epidemiologically defined outbreaks (2009-2015) along with temporally and geographically matched sporadic isolates using multilocus sequence typing, pulsed-field gel electrophoresis (PFGE), and six whole genome sequencing (WGS) based methods. Recombination-corrected maximum likelihood (ML) and Bayesian phylogenies were reconstructed to identify genetically related outbreak isolates. All WGS analysis methods showed a high degree of agreement and distinguished isolates with similar or indistinguishable PFGE patterns, or the same strain genotype. Ten outbreaks were caused by a single strain; 5 were due to multiple strains. Five sporadic isolates were phylogenetically related to 2 outbreaks. Analysis of 9 outbreaks using timed phylogenies identified the possible origin and estimated the approximate time that the most recent common ancestor emerged for outbreaks analyzed. U.S. meningococcal outbreaks were caused by single- or multiple-strain introduction, with organizational outbreaks mainly caused by a clonal strain and community outbreaks by divergent strains. WGS can infer linkage of meningococcal cases when epidemiological links are uncertain. Accurate identification of outbreak-associated cases requires both WGS typing and epidemiological data.


September 22, 2019  |  

First draft genome sequence of the rock bream in the family Oplegnathidae.

The rock bream (Oplegnathus fasciatus) is one of the most economically valuable marine fish in East Asia, and due to various environmental factors, there is substantial revenue loss in the production sector. Therefore, knowledge of its genome is required to uncover the genetic factors and the solutions to these problems. In this study, we constructed the first draft genome of O. fasciatus as a reference for the family Oplegnathidae. The genome size is estimated to be 749 Mb, and it was assembled into 766 Mb by combining Illumina and PacBio sequences. A total of 24,053 transcripts (23,338 genes) are predicted, and among those transcripts, 23,362 (97%) are annotated with functional terms. Finally, the completeness of the genome assembly was assessed by CEGMA, which resulted in the complete mapping of 220 (88.7%) core genes in the genome. To the best of our knowledge, this is the first draft genome for the family Oplegnathidae.
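
The percentages in this abstract can be reproduced from the reported counts. The short Python sketch below does so under one assumption that the abstract does not state: that the CEGMA denominator is the standard set of 248 core eukaryotic genes. The helper name `percent` is hypothetical.

```python
def percent(part, whole, ndigits=1):
    """Return part/whole as a percentage rounded to ndigits decimals."""
    return round(100 * part / whole, ndigits)


# Assumption: CEGMA's standard set of 248 core eukaryotic genes.
CEGMA_CORE_GENES = 248

# Counts below are the ones reported in the abstract.
cegma_completeness = percent(220, CEGMA_CORE_GENES)  # complete core genes
annotated_fraction = percent(23_362, 24_053)         # annotated transcripts

print(cegma_completeness, annotated_fraction)
```

Under that assumption, 220 of 248 core genes gives the 88.7% reported, and 23,362 of 24,053 transcripts is consistent with the rounded 97%.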


September 22, 2019  |  

Genomic structural variations within five continental populations of Drosophila melanogaster.

Chromosomal structural variations (SV) including insertions, deletions, inversions, and translocations occur within the genome and can have a significant effect on organismal phenotype. Some of these effects are caused by structural variations containing genes. Large structural variations represent a significant amount of the genetic diversity within a population. We used a global sampling of Drosophila melanogaster (Ithaca, Zimbabwe, Beijing, Tasmania, and Netherlands) to represent diverse populations within the species. We used long-read sequencing and optical mapping technologies to identify SVs in these genomes. Among the five lines examined, we found an average of 2,928 structural variants within these genomes. These structural variations varied greatly in size and location, included many exonic regions, and could impact adaptation and genomic evolution. Copyright © 2018 Long et al.


September 22, 2019  |  

The landscape of repetitive elements in the refined genome of chilli anthracnose fungus Colletotrichum truncatum.

The ascomycete fungus Colletotrichum truncatum is a major phytopathogen with a broad host range which causes anthracnose disease of chilli. The genome sequencing of this fungus led to the discovery of functional categories of genes that may play important roles in fungal pathogenicity. However, the presence of gaps in the C. truncatum draft assembly prevented the accurate prediction of repetitive elements, which are the key players to determine the genome architecture and drive evolution and host adaptation. We re-sequenced its genome using single-molecule real-time (SMRT) sequencing technology to obtain a refined assembly with fewer and smaller gaps and ambiguities. This enabled us to study its genome architecture by characterising the repetitive sequences like transposable elements (TEs) and simple sequence repeats (SSRs), which constituted 4.9 and 0.38% of the assembled genome, respectively. The comparative analysis among different Colletotrichum species revealed the extensive repeat rich regions, dominated by the Gypsy superfamily of long terminal repeats (LTRs), and the differential composition of SSRs in their genomes. Our study revealed a recent burst of LTR amplification in C. truncatum, C. higginsianum, and C. scovillei. TEs in C. truncatum were significantly associated with secretome, effectors and genes in secondary metabolism clusters. Some of the TE families in C. truncatum showed cytosine to thymine transitions indicative of repeat-induced point mutation (RIP). C. orbiculare and C. graminicola showed strong signatures of RIP across their genomes and “two-speed” genomes with extensive AT-rich and gene-sparse regions. Comparative genomic analyses of Colletotrichum species provided an insight into the species-specific SSR profiles. The SSRs in the coding and non-coding regions of the genome revealed the composition of trinucleotide repeat motifs in exons with potential to alter the translated protein structure through amino acid repeats.
This is the first genome-wide study of TEs and SSRs in C. truncatum and their comparative analysis with six other Colletotrichum species, which would serve as a useful resource for future research to get insights into the potential role of TEs in genome expansion and evolution of Colletotrichum fungi and for development of SSR-based molecular markers for population genomic studies.


September 22, 2019  |  

Conversion of methionine to cysteine in Lactobacillus paracasei depends on the highly mobile cysK-ctl-cysE gene cluster.

Milk and dairy products are rich in nutrients and are therefore habitats for various microbiomes. However, the composition of nutrients can be quite diverse, in particular among the sulfur containing amino acids. In milk, methionine is present in a 25-fold higher abundance than cysteine. Interestingly, a fraction of strains of the species L. paracasei – a flavor-enhancing adjunct culture species – can grow in medium with methionine as the sole sulfur source. In this study, we focus on genomic and evolutionary aspects of sulfur dependence in L. paracasei strains. Of 24 selected L. paracasei strains, 16 can grow in medium with methionine as the sole sulfur source. We sequenced these strains to perform gene-trait matching. We found that one gene cluster – consisting of a cysteine synthase, a cystathionine lyase, and a serine acetyltransferase – is present in all strains that grow in medium with methionine as the sole sulfur source. In contrast, strains that depend on other sulfur sources do not have this gene cluster. We expanded the study and searched for this gene cluster in other species and detected it in the genomes of many bacterial species used in food production. The comparison to these species showed that two different versions of the gene cluster exist in L. paracasei which were likely gained in two distinct events of horizontal gene transfer. Additionally, the comparison of 62 L. paracasei genomes and the two versions of the gene cluster revealed that this gene cluster is mobile within the species.


September 22, 2019  |  

The genomic basis of color pattern polymorphism in the Harlequin ladybird.

Many animal species comprise discrete phenotypic forms. A common example in natural populations of insects is the occurrence of different color patterns, which has motivated a rich body of ecological and genetic research [1-6]. The occurrence of dark, i.e., melanic, forms displaying discrete color patterns is found across multiple taxa, but the underlying genomic basis remains poorly characterized. In numerous ladybird species (Coccinellidae), the spatial arrangement of black and red patches on adult elytra varies wildly within species, forming strikingly different complex color patterns [7, 8]. In the harlequin ladybird, Harmonia axyridis, more than 200 distinct color forms have been described, which classic genetic studies suggest result from allelic variation at a single, unknown, locus [9, 10]. Here, we combined whole-genome sequencing, population-based genome-wide association studies, gene expression, and functional analyses to establish that the transcription factor Pannier controls melanic pattern polymorphism in H. axyridis. We show that pannier is necessary for the formation of melanic elements on the elytra. Allelic variation in pannier leads to protein expression in distinct domains on the elytra and thus determines the distinct color patterns in H. axyridis. Recombination between pannier alleles may be reduced by a highly divergent sequence of ~170 kb in the cis-regulatory regions of pannier, with a 50 kb inversion between color forms. This most likely helps maintain the distinct alleles found in natural populations. Thus, we propose that highly variable discrete color forms can arise in natural populations through cis-regulatory allelic variation of a single gene. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.

