Menu
September 22, 2019

Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina-and SMRT-based RNA-seq datasets

The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab initio predictions and included only predicted coding sequences, therefore an improved annotation is highly desirable. Here, a new annotation version named v2.0.a2 was created for the Fvb genome by a pipeline utilizing one PacBio library, 90 Illumina RNA-seq libraries, and 9 small RNA-seq libraries. Altogether, 18,641 genes (55.6% out of 33,538 genes) were augmented with information on the 5′ and/or 3′ UTRs, 13,168 (39.3%) protein-coding genes were modified or newly identified, and 7,370 genes were found to possess alternative isoforms. In addition, 1,938 long non-coding RNAs, 171 miRNAs, and 51,714 small RNA clusters were integrated into the annotation. This new annotation of F. vesca is substantially improved in both accuracy and integrity of gene predictions, beneficial to the gene functional studies in strawberry and to the comparative genomic analysis of other horticultural crops in Rosaceae family.


September 22, 2019

Capturing single cell genomes of active polysaccharide degraders: an unexpected contribution of Verrucomicrobia.

Microbial hydrolysis of polysaccharides is critical to ecosystem functioning and is of great interest in diverse biotechnological applications, such as biofuel production and bioremediation. Here we demonstrate the use of a new, efficient approach to recover genomes of active polysaccharide degraders from natural, complex microbial assemblages, using a combination of fluorescently labeled substrates, fluorescence-activated cell sorting, and single cell genomics. We employed this approach to analyze freshwater and coastal bacterioplankton for degraders of laminarin and xylan, two of the most abundant storage and structural polysaccharides in nature. Our results suggest that a few phylotypes of Verrucomicrobia make a considerable contribution to polysaccharide degradation, although they constituted only a minor fraction of the total microbial community. Genomic sequencing of five cells, representing the most predominant, polysaccharide-active Verrucomicrobia phylotype, revealed significant enrichment in genes encoding a wide spectrum of glycoside hydrolases, sulfatases, peptidases, carbohydrate lyases and esterases, confirming that these organisms were well equipped for the hydrolysis of diverse polysaccharides. Remarkably, this enrichment was on average higher than in the sequenced representatives of Bacteroidetes, which are frequently regarded as highly efficient biopolymer degraders. These findings shed light on the ecological roles of uncultured Verrucomicrobia and suggest specific taxa as promising bioprospecting targets. The employed method offers a powerful tool to rapidly identify and recover discrete genomes of active players in polysaccharide degradation, without the need for cultivation.


September 22, 2019

The microbiota of freshwater fish and freshwater niches contain omega-3 producing Shewanella species.

Approximately 30 years ago, it was discovered that free-living bacteria isolated from cold ocean depths could produce polyunsaturated fatty acids (PUFA) such as eicosapentaenoic acid (EPA) (20:5n-3) or docosahexaenoic acid (DHA) (22:6n-3), two PUFA essential for human health. Numerous laboratories have also discovered that EPA- and/or DHA-producing bacteria, many of them members of the Shewanella genus, could be isolated from the intestinal tracts of omega-3 fatty acid-rich marine fish. If bacteria contribute omega-3 fatty acids to the host fish in general or if they assist some bacterial species in adaptation to cold, then cold freshwater fish or habitats should also harbor these producers. Thus, we undertook a study to see if these niches also contained omega-3 fatty acid producers. We were successful in isolating and characterizing unique EPA-producing strains of Shewanella from three strictly freshwater native fish species, i.e., lake whitefish (Coregonus clupeaformis), lean lake trout (Salvelinus namaycush), and walleye (Sander vitreus), and from two other freshwater nonnative fish, i.e., coho salmon (Oncorhynchus kisutch) and seeforellen brown trout (Salmo trutta). We were also able to isolate four unique free-living strains of EPA-producing Shewanella from freshwater habitats. Phylogenetic and phenotypic analyses suggest that one producer is clearly a member of the Shewanella morhuae species and another is sister to members of the marine PUFA-producing Shewanella baltica species. However, the remaining isolates have more ambiguous relationships, sharing a common ancestor with non-PUFA-producing Shewanella putrefaciens isolates rather than marine S. baltica isolates despite having a phenotype more consistent with S. baltica strains. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


September 22, 2019

Draft genome assembly of the poultry red mite, Dermanyssus gallinae.

The poultry red mite, Dermanyssus gallinae, is a major worldwide concern in the egg-laying industry. Here, we report the first draft genome assembly and gene prediction of Dermanyssus gallinae, based on combined PacBio and MinION long-read de novo sequencing. The ~959-Mb genome is predicted to encode 14,608 protein-coding genes.


September 22, 2019

Resolving the complexity of human skin metagenomes using single-molecule sequencing.

Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation.The species comprising a microbial community are often difficult to deconvolute due to technical limitations inherent to most short-read sequencing technologies. Here, we leverage new advances in sequencing technology, single-molecule sequencing, to significantly improve reconstruction of a complex human skin microbial community. With this long-read technology, we were able to reconstruct and annotate a closed, high-quality genome of a previously uncharacterized skin species. We demonstrate that hybrid approaches with short-read technology are sufficiently powerful to reconstruct even single-nucleotide polymorphism level variation of species in this a community. Copyright © 2016 Tsai et al.


September 22, 2019

Contemporary evolution of a Lepidopteran species, Heliothis virescens, in response to modern agricultural practices.

Adaptation to human-induced environmental change has the potential to profoundly influence the genomic architecture of affected species. This is particularly true in agricultural ecosystems, where anthropogenic selection pressure is strong. Heliothis virescens primarily feeds on cotton in its larval stages, and US populations have been declining since the widespread planting of transgenic cotton, which endogenously expresses proteins derived from Bacillus thuringiensis (Bt). No physiological adaptation to Bt toxin has been found in the field, so adaptation in this altered environment could involve (i) shifts in host plant selection mechanisms to avoid cotton, (ii) changes in detoxification mechanisms required for cotton-feeding vs. feeding on other hosts or (iii) loss of resistance to previously used management practices including insecticides. Here, we begin to address whether such changes occurred in H. virescens populations between 1997 and 2012, as Bt-cotton cultivation spread through the agricultural landscape. For our study, we produced an H. virescens genome assembly and used this in concert with a ddRAD-seq-enabled genome scan to identify loci with significant allele frequency changes over the 15-year period. Genetic changes at a previously described H. virescens insecticide target of selection were detectable in our genome scan and increased our confidence in this methodology. Additional loci were also detected as being under selection, and we quantified the selection strength required to elicit observed allele frequency changes at each locus. Potential contributions of genes near loci under selection to adaptive phenotypes in the H. virescens cotton system are discussed.© 2017 John Wiley & Sons Ltd.


September 22, 2019

Prey range and genome evolution of Halobacteriovorax marinus predatory bacteria from an estuary

Halobacteriovorax strains are saltwater-adapted predatory bacteria that attack Gram-negative bacteria and may play an important role in shaping microbial communities. To understand how Halobacteriovorax strains impact ecosystems and develop them as biocontrol agents, it is important to characterize variation in predation phenotypes and investigate Halobacteriovorax genome evolution. We isolated Halobacteriovorax marinus BE01 from an estuary in Rhode Island using Vibrio from the same site as prey. Small, fast-moving, attack-phase BE01 cells attach to and invade prey cells, consistent with the intraperiplasmic predation strategy of the H. marinus type strain, SJ. BE01 is a prey generalist, forming plaques on Vibrio strains from the estuary, Pseudomonas from soil, and Escherichia coli. Genome analysis revealed extremely high conservation of gene order and amino acid sequences between BE01 and SJ, suggesting strong selective pressure to maintain the genome in this H. marinus lineage. Despite this, we identified two regions of gene content difference that likely resulted from horizontal gene transfer. Analysis of modal codon usage frequencies supports the hypothesis that these regions were acquired from bacteria with different codon usage biases than H. marinus. In one of these regions, BE01 and SJ carry different genes associated with mobile genetic elements. Acquired functions in BE01 include the dnd operon, which encodes a pathway for DNA modification, and a suite of genes involved in membrane synthesis and regulation of gene expression that was likely acquired from another Halobacteriovorax lineage. This analysis provides further evidence that horizontal gene transfer plays an important role in genome evolution in predatory bacteria. IMPORTANCE Predatory bacteria attack and digest other bacteria and therefore may play a role in shaping microbial communities. To investigate phenotypic and genotypic variation in saltwater-adapted predatory bacteria, we isolated Halobacteriovorax marinus BE01 from an estuary in Rhode Island, assayed whether it could attack different prey bacteria, and sequenced and analyzed its genome. We found that BE01 is a prey generalist, attacking bacteria from different phylogenetic groups and environments. Gene order and amino acid sequences are highly conserved between BE01 and the H. marinus type strain, SJ. By comparative genomics, we detected two regions of gene content difference that likely occurred via horizontal gene transfer events. Acquired genes encode functions such as modification of DNA, membrane synthesis and regulation of gene expression. Understanding genome evolution and variation in predation phenotypes among predatory bacteria will inform their development as biocontrol agents and clarify how they impact microbial communities.


September 22, 2019

MUMmer4: A fast and versatile genome alignment system.

The MUMmer system and the genome sequence aligner nucmer included within it are among the most widely used alignment packages in genomics. Since the last major release of MUMmer version 3 in 2004, it has been applied to many types of problems including aligning whole genome sequences, aligning reads to a reference genome, and comparing different assemblies of the same genome. Despite its broad utility, MUMmer3 has limitations that can make it difficult to use for large genomes and for the very large sequence data sets that are common today. In this paper we describe MUMmer4, a substantially improved version of MUMmer that addresses genome size constraints by changing the 32-bit suffix tree data structure at the core of MUMmer to a 48-bit suffix array, and that offers improved speed through parallel processing of input query sequences. With a theoretical limit on the input size of 141Tbp, MUMmer4 can now work with input sequences of any biologically realistic length. We show that as a result of these enhancements, the nucmer program in MUMmer4 is easily able to handle alignments of large genomes; we illustrate this with an alignment of the human and chimpanzee genomes, which allows us to compute that the two species are 98% identical across 96% of their length. With the enhancements described here, MUMmer4 can also be used to efficiently align reads to reference genomes, although it is less sensitive and accurate than the dedicated read aligners. The nucmer aligner in MUMmer4 can now be called from scripting languages such as Perl, Python and Ruby. These improvements make MUMer4 one the most versatile genome alignment packages available.


September 22, 2019

Genomic diversity in the endosymbiotic bacterium Rhizobium leguminosarum.

Rhizobium leguminosarum bv. viciae is a soil a-proteobacterium that establishes a diazotrophic symbiosis with different legumes of the Fabeae tribe. The number of genome sequences from rhizobial strains available in public databases is constantly increasing, although complete, fully annotated genome structures from rhizobial genomes are scarce. In this work, we report and analyse the complete genome of R. leguminosarum bv. viciae UPM791. Whole genome sequencing can provide new insights into the genetic features contributing to symbiotically relevant processes such as bacterial adaptation to the rhizosphere, mechanisms for efficient competition with other bacteria, and the ability to establish a complex signalling dialogue with legumes, to enter the root without triggering plant defenses, and, ultimately, to fix nitrogen within the host. Comparison of the complete genome sequences of two strains of R. leguminosarum bv. viciae, 3841 and UPM791, highlights the existence of different symbiotic plasmids and a common core chromosome. Specific genomic traits, such as plasmid content or a distinctive regulation, define differential physiological capabilities of these endosymbionts. Among them, strain UPM791 presents unique adaptations for recycling the hydrogen generated in the nitrogen fixation process.


September 22, 2019

Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza.

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that despite few large-scale chromosomal rearrangements rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young ‘AA’ subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 ‘Miracle Rice’, which relieved famine and drove the Green Revolution in Asia 50 years ago.


September 22, 2019

The sea lamprey germline genome provides insights into programmed genome rearrangement and vertebrate evolution.

The sea lamprey (Petromyzon marinus) serves as a comparative model for reconstructing vertebrate evolution. To enable more informed analyses, we developed a new assembly of the lamprey germline genome that integrates several complementary data sets. Analysis of this highly contiguous (chromosome-scale) assembly shows that both chromosomal and whole-genome duplications have played significant roles in the evolution of ancestral vertebrate and lamprey genomes, including chromosomes that carry the six lamprey HOX clusters. The assembly also contains several hundred genes that are reproducibly eliminated from somatic cells during early development in lamprey. Comparative analyses show that gnathostome (mouse) homologs of these genes are frequently marked by polycomb repressive complexes (PRCs) in embryonic stem cells, suggesting overlaps in the regulatory logic of somatic DNA elimination and bivalent states that are regulated by early embryonic PRCs. This new assembly will enhance diverse studies that are informed by lampreys’ unique biology and evolutionary/comparative perspective.


September 22, 2019

The DNA methylome of the hyperthermoacidophilic crenarchaeon Sulfolobus acidocaldarius.

DNA methylation is the most common epigenetic modification observed in the genomic DNA (gDNA) of prokaryotes and eukaryotes. Methylated nucleobases, N6-methyl-adenine (m6A), N4-methyl-cytosine (m4C), and 5-methyl-cytosine (m5C), detected on gDNA represent the discrimination mark between self and non-self DNA when they are part of restriction-modification systems in prokaryotes (Bacteria and Archaea). In addition, m5C in Eukaryotes and m6A in Bacteria play an important role in the regulation of key cellular processes. Although archaeal genomes present modified bases as in the two other domains of life, the significance of DNA methylations as regulatory mechanisms remains largely uncharacterized in Archaea. Here, we began by investigating the DNA methylome of Sulfolobus acidocaldarius. The strategy behind this initial study entailed the use of combined digestion assays, dot blots, and genome resequencing, which utilizes specific restriction enzymes, antibodies specifically raised against m6A and m5C and single-molecule real-time (SMRT) sequencing, respectively, to identify DNA methylations occurring in exponentially growing cells. The previously identified restriction-modification system, specific of S. acidocaldarius, was confirmed by digestion assay and SMRT sequencing while, the presence of m6A was revealed by dot blot and identified on the characteristic Dam motif by SMRT sequencing. No m5C was detected by dot blot under the conditions tested. Furthermore, by comparing the distribution of both detected methylations along the genome and, by analyzing DNA methylation profiles in synchronized cells, we investigated in which cellular pathways, in particular the cell cycle, this m6A methylation could be a key player. The analysis of sequencing data rejected a role for m6A methylation in another defense system and also raised new questions about a potential involvement of this modification in the regulation of other biological functions in S. acidocaldarius.


September 22, 2019

Emergence of an extensively drug-resistant Salmonella enterica serovar Typhi clone harboring a promiscuous plasmid encoding resistance to fluoroquinolones and third-generation cephalosporins.

Antibiotic resistance is a major problem in Salmonella enterica serovar Typhi, the causative agent of typhoid. Multidrug-resistant (MDR) isolates are prevalent in parts of Asia and Africa and are often associated with the dominant H58 haplotype. Reduced susceptibility to fluoroquinolones is also widespread, and sporadic cases of resistance to third-generation cephalosporins or azithromycin have also been reported. Here, we report the first large-scale emergence and spread of a novel S. Typhi clone harboring resistance to three first-line drugs (chloramphenicol, ampicillin, and trimethoprim-sulfamethoxazole) as well as fluoroquinolones and third-generation cephalosporins in Sindh, Pakistan, which we classify as extensively drug resistant (XDR). Over 300 XDR typhoid cases have emerged in Sindh, Pakistan, since November 2016. Additionally, a single case of travel-associated XDR typhoid has recently been identified in the United Kingdom. Whole-genome sequencing of over 80 of the XDR isolates revealed remarkable genetic clonality and sequence conservation, identified a large number of resistance determinants, and showed that these isolates were of haplotype H58. The XDR S. Typhi clone encodes a chromosomally located resistance region and harbors a plasmid encoding additional resistance elements, including the blaCTX-M-15 extended-spectrum ß-lactamase, and carrying the qnrS fluoroquinolone resistance gene. This antibiotic resistance-associated IncY plasmid exhibited high sequence identity to plasmids found in other enteric bacteria isolated from widely distributed geographic locations. This study highlights three concerning problems: the receding antibiotic arsenal for typhoid treatment, the ability of S. Typhi to transform from MDR to XDR in a single step by acquisition of a plasmid, and the ability of XDR clones to spread globally. IMPORTANCE Typhoid fever is a severe disease caused by the Gram-negative bacterium Salmonella enterica serovar Typhi. Antibiotic-resistant S. Typhi strains have become increasingly common. Here, we report the first large-scale emergence and spread of a novel extensively drug-resistant (XDR) S. Typhi clone in Sindh, Pakistan. The XDR S. Typhi is resistant to the majority of drugs available for the treatment of typhoid fever. This study highlights the evolving threat of antibiotic resistance in S. Typhi and the value of antibiotic susceptibility testing and whole-genome sequencing in understanding emerging infectious diseases. We genetically characterized the XDR S. Typhi to investigate the phylogenetic relationship between these isolates and a global collection of S. Typhi isolates and to identify multiple genes linked to antibiotic resistance. This S. Typhi clone harbored a promiscuous antibiotic resistance plasmid previously identified in other enteric bacteria. The increasing antibiotic resistance in S. Typhi observed here adds urgency to the need for typhoid prevention measures.


September 22, 2019

Functional genomics of lipid metabolism in the oleaginous yeast Rhodosporidium toruloides.

The basidiomycete yeast Rhodosporidium toruloides (also known as Rhodotorula toruloides) accumulates high concentrations of lipids and carotenoids from diverse carbon sources. It has great potential as a model for the cellular biology of lipid droplets and for sustainable chemical production. We developed a method for high-throughput genetics (RB-TDNAseq), using sequence-barcoded Agrobacterium tumefaciens T-DNA insertions. We identified 1,337 putative essential genes with low T-DNA insertion rates. We functionally profiled genes required for fatty acid catabolism and lipid accumulation, validating results with 35 targeted deletion strains. We identified a high-confidence set of 150 genes affecting lipid accumulation, including genes with predicted function in signaling cascades, gene expression, protein modification and vesicular trafficking, autophagy, amino acid synthesis and tRNA modification, and genes of unknown function. These results greatly advance our understanding of lipid metabolism in this oleaginous species and demonstrate a general approach for barcoded mutagenesis that should enable functional genomics in diverse fungi.


September 22, 2019

Primordial origin and diversification of plasmids in Lyme disease agent bacteria.

With approximately one-third of their genomes consisting of linear and circular plasmids, the Lyme disease agent cluster of species has the most complex genomes among known bacteria. We report here a comparative analysis of plasmids in eleven Borreliella (also known as Borrelia burgdorferi sensu lato) species.We sequenced the complete genomes of two B. afzelii, two B. garinii, and individual B. spielmanii, B. bissettiae, B. valaisiana and B. finlandensis isolates. These individual isolates carry between seven and sixteen plasmids, and together harbor 99 plasmids. We report here a comparative analysis of these plasmids, along with 70 additional Borreliella plasmids available in the public sequence databases. We identify only one new putative plasmid compatibility type (the 30th) among these 169 plasmid sequences, suggesting that all or nearly all such types have now been discovered. We find that the linear plasmids in the non-B. burgdorferi species have undergone the same kinds of apparently random, chaotic rearrangements mediated by non-homologous recombination that we previously discovered in B. burgdorferi. These rearrangements occurred independently in the different species lineages, and they, along with an expanded chromosomal phylogeny reported here, allow the identification of several whole plasmid transfer events among these species. Phylogenetic analyses of the plasmid partition genes show that a majority of the plasmid compatibility types arose early, most likely before separation of the Lyme agent Borreliella and relapsing fever Borrelia clades, and this, with occasional cross species plasmid transfers, has resulted in few if any species-specific or geographic region-specific Borreliella plasmid types.The primordial origin and persistent maintenance of the Borreliella plasmid types support their functional indispensability as well as evolutionary roles in facilitating genome diversity. The improved resolution of Borreliella plasmid phylogeny based on conserved partition-gene clusters will lead to better determination of gene orthology which is essential for prediction of biological function, and it will provide a basis for inferring detailed evolutionary mechanisms of Borreliella genomic variability including homologous gene and plasmid exchanges as well as non-homologous rearrangements.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.