July 7, 2019

Whole-genome assembly of Babesia ovata and comparative genomics between closely related pathogens.

Babesia ovata, belonging to the phylum Apicomplexa, is an infectious parasite of bovids. It is not associated with the manifestation of severe symptoms, in contrast to other types of bovine babesiosis caused by B. bovis and B. bigemina; however, upon co-infection with Theileria orientalis, it occasionally induces exacerbated symptoms. Asymptomatic chronic infection in bovines is usually observed only for B. ovata. Comparative genomic analysis could potentially reveal factors involved in these distinguishing characteristics; however, the genomic and molecular basis of these phenotypes remains elusive, especially in B. ovata. From a technical perspective, the current development of a very long read sequencer, MinION, will facilitate obtaining highly integrated genome sequences. Therefore, we applied next-generation sequencing to acquire a high-quality genome of the parasite, which provides fundamental information for understanding apicomplexans. The genome was assembled into 14,453,397 bp with 5031 protein-coding sequences (91 contigs and N50 = 2,090,503 bp). Gene family analysis revealed that ves1 alpha and beta, which belong to multigene families in B. bovis, were absent from B. ovata, as in B. bigemina. Instead, ves1a and ves1b, which were originally identified in B. bigemina, were present. The B. ovata and B. bigemina ves1a genes form a single cluster, although they divide into two sub-clusters by species. In contrast, the ves1b cluster was more dispersed and the overlap between B. ovata and B. bigemina was limited. The observed redundancy and rapid sequence evolution might reflect the adaptive history of these parasites. Moreover, some candidate genes potentially involved in the distinct phenotypes were identified by functional analysis. An anamorsin homolog is one of them. Human anamorsin is involved in hematopoiesis, and the homolog was present in B. ovata but absent in B. bigemina, which causes severe anemia. Taking these findings together, the differences demonstrated by comparative genomics potentially explain the evolutionary history of these parasites and the differences in their phenotypes. In addition, the draft genome provides fundamental information for further characterization and understanding of these parasites.
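
The abstract above reports assembly contiguity as an N50 value. As a reminder of what that statistic means, here is a minimal sketch of the standard N50 calculation: the length of the contig at which the running total of contig lengths, sorted from longest to shortest, first reaches half of the assembly size. The function name `n50` and the toy contig lengths are hypothetical and are not taken from the B. ovata assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk from the longest contig down, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first contig where the running sum covers half the assembly.
        if 2 * running >= total:
            return length


# Hypothetical toy assembly, for illustration only.
toy_lengths = [50, 40, 30, 20, 10]
print(n50(toy_lengths))
```

For the toy input the total is 150, and the two longest contigs together cover 90, so the N50 is the length of the second contig.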


July 7, 2019

Public health surveillance in the UK revolutionises our understanding of the invasive Salmonella Typhimurium epidemic in Africa.

The ST313 sequence type of Salmonella Typhimurium causes invasive non-typhoidal salmonellosis and was thought to be confined to sub-Saharan Africa. Two distinct phylogenetic lineages of African ST313 have been identified. We analysed the whole genome sequences of S. Typhimurium isolates from UK patients that were generated following the introduction of routine whole-genome sequencing (WGS) of Salmonella enterica by Public Health England in 2014. We found that 2.7% (84/3147) of S. Typhimurium from patients in England and Wales were ST313 and were associated with gastrointestinal infection. Phylogenetic analysis revealed novel diversity of ST313 that distinguished UK-linked gastrointestinal isolates from African-associated extra-intestinal isolates. The majority of genome degradation of African ST313 lineage 2 was conserved in the UK-ST313, but the African lineages carried a characteristic prophage and antibiotic resistance gene repertoire. These findings suggest that a strong selection pressure exists for certain horizontally acquired genetic elements in the African setting. One UK-isolated lineage 2 strain that probably originated in Kenya carried a chromosomally located bla CTX-M-15, demonstrating the continual evolution of this sequence type in Africa in response to widespread antibiotic usage. The discovery of ST313 isolates responsible for gastroenteritis in the UK reveals new diversity in this important sequence type. This study highlights the power of routine WGS by public health agencies to make epidemiologically significant deductions that would be missed by conventional microbiological methods. We speculate that the niche specialisation of sub-Saharan African ST313 lineages is driven in part by the acquisition of accessory genome elements.


July 7, 2019

The sea cucumber genome provides insights into morphological evolution and visceral regeneration.

Apart from sharing common ancestry with chordates, sea cucumbers exhibit a unique morphology and exceptional regenerative capacity. Here we present the complete genome sequence of an economically important sea cucumber, A. japonicus, generated using Illumina and PacBio platforms, to achieve an assembly of approximately 805 Mb (contig N50 of 190 Kb and scaffold N50 of 486 Kb), with 30,350 protein-coding genes and high continuity. We used this resource to explore key genetic mechanisms behind the unique biological characters of sea cucumbers. Phylogenetic and comparative genomic analyses revealed the presence of marker genes associated with notochord and gill slits, suggesting that these chordate features were present in ancestral echinoderms. The unique shape and weak mineralization of the sea cucumber adult body were also preliminarily explained by the contraction of biomineralization genes. Genome, transcriptome, and proteome analyses of organ regrowth after induced evisceration provided insight into the molecular underpinnings of visceral regeneration, including a specific tandem-duplicated prostatic secretory protein of 94 amino acids (PSP94)-like gene family and a significantly expanded fibrinogen-related protein (FREP) gene family. This high-quality genome resource will provide a useful framework for future research into biological processes and evolution in deuterostomes, including remarkable regenerative abilities that could have medical applications. Moreover, the multiomics data will be of prime value for commercial sea cucumber breeding programs.


July 7, 2019

Large-scale suppression of recombination predates genomic rearrangements in Neurospora tetrasperma.

A common feature of eukaryote genomes is large chromosomal regions where recombination is absent or strongly reduced, but the factors that cause this reduction are not well understood. Genomic rearrangements have often been implicated, but they may also be a consequence of recombination suppression rather than a cause. In this study, we generate eight high-quality genomic data sets of the filamentous ascomycete Neurospora tetrasperma, a fungus that lacks recombination over most of its largest chromosome. The genomes surprisingly reveal collinearity of the non-recombining regions and although large inversions are enriched in these regions, we conclude these inversions to be derived and not the cause of the suppression. To our knowledge, this is the first time that non-recombining, genic regions as large as 86% of a full chromosome (or 8 Mbp), are shown to be collinear. These findings are of significant interest for our understanding of the evolution of sex chromosomes and other supergene complexes.


July 7, 2019

Comparative analysis of mitochondrial genomes of geographic variants of the gypsy moth, Lymantria dispar, reveals a previously undescribed genotypic entity.

The gypsy moth, Lymantria dispar L., is one of the most destructive forest pests in the world. While the subspecies established in North America is the European gypsy moth (L. dispar dispar), whose females are flightless, the two Asian subspecies, L. dispar asiatica and L. dispar japonica, have flight-capable females, enhancing their invasiveness and warranting precautionary measures to prevent their permanent establishment in North America. Various molecular tools have been developed to help distinguish European from Asian subspecies, several of which are based on the mitochondrial barcode region. In an effort to identify additional informative markers, we undertook the sequencing and analysis of the mitogenomes of 10 geographic variants of L. dispar, including two or more variants of each subspecies, plus the closely related L. umbrosa as outgroup. Several regions of the gypsy moth mitogenomes displayed nucleotide substitutions with potential usefulness for the identification of subspecies and/or geographic origins. Interestingly, the mitogenome of one geographic variant displayed significant divergence relative to the remaining variants, raising questions about its taxonomic status. Phylogenetic analyses placed this population from northern Iran as basal to the L. dispar clades. The present findings will help improve diagnostic tests aimed at limiting risks of AGM invasions.


July 7, 2019

Genome expansion and lineage-specific genetic innovations in the forest pathogenic fungi Armillaria.

Armillaria species are both devastating forest pathogens and some of the largest terrestrial organisms on Earth. They forage for hosts and achieve immense colony sizes via rhizomorphs, root-like multicellular structures of clonal dispersal. Here, we sequenced and analysed the genomes of four Armillaria species and performed RNA sequencing and quantitative proteomic analysis on the invasive and reproductive developmental stages of A. ostoyae. Comparison with 22 related fungi revealed a significant genome expansion in Armillaria, affecting several pathogenicity-related genes, lignocellulose-degrading enzymes and lineage-specific genes expressed during rhizomorph development. Rhizomorphs express an evolutionarily young transcriptome that shares features with the transcriptomes of both fruiting bodies and vegetative mycelia. Several genes show concomitant upregulation in rhizomorphs and fruiting bodies and share cis-regulatory signatures in their promoters, providing genetic and regulatory insights into complex multicellularity in fungi. Our results suggest that the evolution of the unique dispersal and pathogenicity mechanisms of Armillaria might have drawn upon ancestral genetic toolkits for wood-decay, morphogenesis and complex multicellularity.


July 7, 2019

New insights into the diversity of the genus Faecalibacterium.

Faecalibacterium prausnitzii is a commensal bacterium, ubiquitous in the gastrointestinal tracts of animals and humans. This species is a functionally important member of the microbiota and studies suggest it has an impact on the physiology and health of the host. F. prausnitzii is the only identified species in the genus Faecalibacterium, but a recent study clustered strains of this species in two different phylogroups. Here, we propose the existence of distinct species in this genus through the use of comparative genomics. Briefly, we performed analyses of 16S rRNA gene phylogeny, phylogenomics, whole genome Multi-Locus Sequence Typing (wgMLST), Average Nucleotide Identity (ANI), gene synteny, and pangenome to better elucidate the phylogenetic relationships among strains of Faecalibacterium. For this, we used 12 newly sequenced, assembled, and curated genomes of F. prausnitzii, which were isolated from feces of healthy volunteers from France and Australia, and combined these with published data from 5 strains downloaded from public databases. The phylogenetic analysis of the 16S rRNA sequences, together with the wgMLST profiles and a phylogenomic tree based on comparisons of genome similarity, all supported the clustering of Faecalibacterium strains in different genospecies. Additionally, the global analysis of gene synteny among all strains showed a highly fragmented profile, whereas the intra-cluster analyses revealed larger and more conserved collinear blocks. Finally, ANI analysis substantiated the presence of three distinct clusters (A, B, and C) composed of five, four, and four strains, respectively. The pangenome analysis of each cluster corroborated the classification of these clusters into three distinct species, each containing less variability than that found within the global pangenome of all strains. Here, we propose that comparison of pangenome subsets and their associated α values may be used as an alternative approach, together with ANI, in the in silico classification of new species. Altogether, our results provide evidence not only for the reconsideration of the phylogenetic and genomic relatedness among strains currently assigned to F. prausnitzii, but also the need for lineage (strain-based) differentiation of this taxon to better define how specific members might be associated with positive or negative host interactions.


July 7, 2019

Complete genome sequence of Pseudomonas corrugata strain RM1-1-4, a stress protecting agent from the rhizosphere of an oilseed rape bait plant

Pseudomonas corrugata strain RM1-1-4 is a rhizosphere colonizer of oilseed rape. A previous study has shown that this motile, Gram-negative, non-sporulating bacterium is an effective stress protecting and biocontrol agent, which protects its hosts against abiotic and biotic stresses. Here, we announce and describe the complete genome sequence of P. corrugata RM1-1-4, consisting of a single 6.1 Mb circular chromosome that encodes 5189 protein-coding genes and 85 RNA-only encoding genes. Genome analysis revealed genes with predicted functions such as detoxifying mechanisms, stress inhibitors, exoproteases, lipoproteins or volatile components as well as rhizobactin siderophores and spermidine. Further analysis of its genome will help to identify traits promising for stress protection, biocontrol and plant growth promotion properties.


July 7, 2019

Hybrid de novo genome assembly and centromere characterization of the gray mouse lemur (Microcebus murinus).

The de novo assembly of repeat-rich mammalian genomes using only high-throughput short read sequencing data typically results in highly fragmented genome assemblies that limit downstream applications. Here, we present an iterative approach to hybrid de novo genome assembly that incorporates datasets stemming from multiple genomic technologies and methods. We used this approach to improve the gray mouse lemur (Microcebus murinus) genome from early draft status to a near chromosome-scale assembly. We used a combination of advanced genomic technologies to iteratively resolve conflicts and super-scaffold the M. murinus genome. We improved the M. murinus genome assembly to a scaffold N50 of 93.32 Mb. Whole genome alignments between our primary super-scaffolds and 23 human chromosomes revealed patterns that are congruent with historical comparative cytogenetic data, thus demonstrating the accuracy of our de novo scaffolding approach and allowing assignment of scaffolds to M. murinus chromosomes. Moreover, we utilized our independent datasets to discover and characterize sequences associated with centromeres across the mouse lemur genome. Quality assessment of the final assembly found 96% of mouse lemur canonical transcripts nearly complete, comparable to other published high-quality reference genome assemblies. We describe a new assembly of the gray mouse lemur (Microcebus murinus) genome with chromosome-scale scaffolds produced using a hybrid bioinformatic and sequencing approach. The approach is cost effective and produces superior results based on metrics of contiguity and completeness. Our results show that emerging genomic technologies can be used in combination to characterize centromeres of non-model species and to produce accurate de novo chromosome-scale genome assemblies of complex mammalian genomes.


July 7, 2019

Comparative genome analysis of programmed DNA elimination in nematodes.

Programmed DNA elimination is a developmentally regulated process leading to the reproducible loss of specific genomic sequences. DNA elimination occurs in unicellular ciliates and a variety of metazoans, including invertebrates and vertebrates. In metazoa, DNA elimination typically occurs in somatic cells during early development, leaving the germline genome intact. Reference genomes for metazoa that undergo DNA elimination are not available. Here, we generated germline and somatic reference genome sequences of the DNA eliminating pig parasitic nematode Ascaris suum and the horse parasite Parascaris univalens. In addition, we carried out in-depth analyses of DNA elimination in the parasitic nematode of humans, Ascaris lumbricoides, and the parasitic nematode of dogs, Toxocara canis. Our analysis of nematode DNA elimination reveals that in all species, repetitive sequences (that differ among the genera) and germline-expressed genes (approximately 1000-2000 or 5%-10% of the genes) are eliminated. Thirty-five percent of these eliminated genes are conserved among these nematodes, defining a core set of eliminated genes that are preferentially expressed during spermatogenesis. Our analysis supports the view that DNA elimination in nematodes silences germline-expressed genes. Over half of the chromosome break sites are conserved between Ascaris and Parascaris, whereas only 10% are conserved in the more divergent T. canis. Analysis of the chromosomal breakage regions suggests a sequence-independent mechanism for DNA breakage followed by telomere healing, with the formation of more accessible chromatin in the break regions prior to DNA elimination. Our genome assemblies and annotations also provide comprehensive resources for analysis of DNA elimination, parasitology research, and comparative nematode genome and epigenome studies. © 2017 Wang et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote. © 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.
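
The abstract above mentions in silico ORFs derived from a six-frame scan that considers alternative start codons. The sketch below only illustrates that general idea and is not the authors' iPtgxDB software. It scans the three reading frames of each strand and reports the longest ORF ending at each stop codon. The start codon set (ATG, GTG, TTG), the `min_codons` cutoff, and all function names are assumptions chosen for illustration.

```python
# Assumption: ATG plus two common bacterial alternative start codons.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def orfs_in_strand(seq, min_codons):
    """Yield (frame, start, end) for ORFs on one strand.

    start is the index of the first start codon after the previous stop,
    and end is the index just past the stop codon.
    """
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon in START_CODONS:
                start = i  # keep the most upstream start (longest ORF)
            elif start is not None and codon in STOP_CODONS:
                # Count codons before the stop, including the start codon.
                if (i - start) // 3 >= min_codons:
                    yield frame, start, i + 3
                start = None


def six_frame_orfs(seq, min_codons=2):
    """Return (strand, frame, start, end, orf_sequence) for all six frames.

    Coordinates for the "-" strand refer to the reverse-complemented sequence.
    """
    seq = seq.upper()
    rc = reverse_complement(seq)
    found = [("+", f, s, e, seq[s:e]) for f, s, e in orfs_in_strand(seq, min_codons)]
    found += [("-", f, s, e, rc[s:e]) for f, s, e in orfs_in_strand(rc, min_codons)]
    return found


# Hypothetical toy sequence, for illustration only.
for orf in six_frame_orfs("ATGAAATAA"):
    print(orf)
```

A real annotation pipeline would also translate each ORF and handle ambiguous bases and circular chromosomes, which this sketch leaves out.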


July 7, 2019

Genomic comparison between Staphylococcus aureus GN strains clinically isolated from a familial infection case: IS1272 transposition through a novel inverted repeat-replacing mechanism.

A bacterial insertion sequence (IS) is a mobile DNA sequence carrying only the transposase gene (tnp) that acts as a mutator to disrupt genes, alter gene expression, and cause genomic rearrangements. “Canonical” ISs have historically been characterized by their terminal inverted repeats (IRs), which may form a stem-loop structure, and duplications of a short (non-IR) target sequence at both ends, called target site duplications (TSDs). The IS distributions and virulence potentials of Staphylococcus aureus genomes in familial infection cases are unclear. Here, we determined the complete circular genome sequences of familial strains from a Panton-Valentine leukocidin (PVL)-positive ST50/agr4 S. aureus (GN) infection of a 4-year-old boy with skin abscesses. The genomes of the patient strain (GN1) and parent strain (GN3) were rich in “canonical” IS1272 with terminal IRs, both having 13 commonly-existing copies (ce-IS1272). Moreover, GN1 had a newly-inserted IS1272 (ni-IS1272) on the PVL-converting prophage, while GN3 had two copies of ni-IS1272 within the DNA helicase gene and near rot. The GN3 genome also had a small deletion. The targets of ni-IS1272 transposition were IR structures, in contrast with previous “canonical” ISs. There were no TSDs. Based on a database search, the targets for ce-IS1272 were IRs or “non-IRs”. IS1272 included a larger structure with tandem duplications of the left (IRL) side sequence; tnp included minor cases of a long fusion form and truncated form. One ce-IS1272 was associated with the segments responsible for immune evasion and drug resistance. Regarding virulence, GN1 expressed cytolytic peptides (phenol-soluble modulin α and δ-hemolysin) and PVL more strongly than some other familial strains. These results suggest that IS1272 transposes through an IR-replacing mechanism, with an irreversible process unlike that of “canonical” transpositions, resulting in genomic variations, and that, among the familial strains, the patient strain has strong virulence potential based on community-associated virulence factors.


July 7, 2019

Complete genome sequence of Spirosoma rigui KCTC 12531T, a bacterium isolated from fresh water from the Woopo wetland for taxonomic study

Spirosoma rigui KCTC 12531T was isolated from fresh water from the Woopo wetland, Korea. In this study, we report its complete genome sequence, which was obtained using the PacBio RS II platform. The genome comprises 5,828,404 bp with a G + C content of 54.4%. A total of 4,774 genes were predicted, of which 4,647 are protein-coding genes.
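
The abstract above reports a G + C content for the genome. As a minimal sketch of how such a figure is usually computed, the function below takes the percentage of G and C among unambiguous bases. The function name `gc_content`, the choice to ignore ambiguous bases such as N, and the toy sequence are assumptions for illustration and are not taken from the S. rigui genome.

```python
def gc_content(seq):
    """Return G + C content as a percentage of unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / counted


# Hypothetical toy sequence, for illustration only.
print(gc_content("GGCCAATT"))
```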

