Menu
July 7, 2019

Unique transposon landscapes are pervasive across Drosophila melanogaster genomes.

To understand how transposon landscapes (TLs) vary across animal genomes, we describe a new method called the Transposon Insertion and Depletion AnaLyzer (TIDAL) and a database of >300 TLs in Drosophila melanogaster (TIDAL-Fly). Our analysis reveals pervasive TL diversity across cell lines and fly strains, even for identically named sub-strains from different laboratories such as the ISO1 strain used for the reference genome sequence. On average, >500 novel insertions exist in every lab strain, inbred strains of the Drosophila Genetic Reference Panel (DGRP), and fly isolates in the Drosophila Genome Nexus (DGN). A minority (<25%) of transposon families comprise the majority (>70%) of TL diversity across fly strains. A sharp contrast between insertion and depletion patterns indicates that many transposons are unique to the ISO1 reference genome sequence. Although TL diversity from fly strains reaches asymptotic limits with increasing sequencing depth, rampant TL diversity causes unsaturated detection of TLs in pools of flies. Finally, we show novel transposon insertions negatively correlate with Piwi-interacting RNA (piRNA) levels for most transposon families, except for the highly-abundant roo retrotransposon. Our study provides a useful resource for Drosophila geneticists to understand how transposons create extensive genomic diversity in fly cell lines and strains.© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

Evidence for extensive horizontal gene transfer from the draft genome of a tardigrade.

Horizontal gene transfer (HGT), or the transfer of genes between species, has been recognized recently as more pervasive than previously suspected. Here, we report evidence for an unprecedented degree of HGT into an animal genome, based on a draft genome of a tardigrade, Hypsibius dujardini. Tardigrades are microscopic eight-legged animals that are famous for their ability to survive extreme conditions. Genome sequencing, direct confirmation of physical linkage, and phylogenetic analysis revealed that a large fraction of the H. dujardini genome is derived from diverse bacteria as well as plants, fungi, and Archaea. We estimate that approximately one-sixth of tardigrade genes entered by HGT, nearly double the fraction found in the most extreme cases of HGT into animals known to date. Foreign genes have supplemented, expanded, and even replaced some metazoan gene families within the tardigrade genome. Our results demonstrate that an unexpectedly large fraction of an animal genome can be derived from foreign sources. We speculate that animals that can survive extremes may be particularly prone to acquiring foreign genes.


July 7, 2019

Comparative genomics and metabolic profiling of the genus Lysobacter.

Lysobacter species are Gram-negative bacteria widely distributed in soil, plant and freshwater habitats. Lysobacter owes its name to the lytic effects on other microorganisms. To better understand their ecology and interactions with other (micro)organisms, five Lysobacter strains representing the four species L. enzymogenes, L. capsici, L. gummosus and L. antibioticus were subjected to genomics and metabolomics analyses.Comparative genomics revealed a diverse genome content among the Lysobacter species with a core genome of 2,891 and a pangenome of 10,028 coding sequences. Genes encoding type I, II, III, IV, V secretion systems and type IV pili were highly conserved in all five genomes, whereas type VI secretion systems were only found in L. enzymogenes and L. gummosus. Genes encoding components of the flagellar apparatus were absent in the two sequenced L. antibioticus strains. The genomes contained a large number of genes encoding extracellular enzymes including chitinases, glucanases and peptidases. Various nonribosomal peptide synthase (NRPS) and polyketide synthase (PKS) gene clusters encoding putative bioactive metabolites were identified but only few of these clusters were shared between the different species. Metabolic profiling by imaging mass spectrometry complemented, in part, the in silico genome analyses and allowed visualisation of the spatial distribution patterns of several secondary metabolites produced by or induced in Lysobacter species during interactions with the soil-borne fungus Rhizoctonia solani.Our work shows that mining the genomes of Lysobacter species in combination with metabolic profiling provides novel insights into the genomic and metabolic potential of this widely distributed but understudied and versatile bacterial genus.


July 7, 2019

The genomic sequence of lymphocryptovirus from cynomolgus macaque.

Lymphocryptoviruses such as Epstein-Barr virus (EBV) cause persistent infections in human and non-human primates, and suppression of the immune system can increase the risk of lymphocryptovirus (LCV)-associated tumor development in both human and non-human primates. To enable LCV infection as a non-clinical model to study effects of therapeutics on EBV immunity, we determined the genomic DNA sequence of the LCV from cynomolgus macaque, a species commonly used for non-clinical testing. Comparison to rhesus macaque LCV and human EBV sequences indicates that LCV from the cynomolgus macaque has the same genomic arrangement and a high degree of similarity in most genes, especially with rhesus macaque LCV. Genes showing lower similarity were those encoding proteins involved in latency and/or tumor promotion or immune evasion. The genomic sequence of LCV from cynomolgus macaque should aid the development of non-clinical tools for identifying therapeutics that impact LCV immunity and carry potential lymphoma risk. Copyright © 2015 Elsevier Inc. All rights reserved.


July 7, 2019

Genome analysis of Staphylococcus agnetis, an agent of lameness in broiler chickens.

Lameness in broiler chickens is a significant animal welfare and financial issue. Lameness can be enhanced by rearing young broilers on wire flooring. We have identified Staphylococcus agnetis as significantly involved in bacterial chondronecrosis with osteomyelitis (BCO) in proximal tibia and femorae, leading to lameness in broiler chickens in the wire floor system. Administration of S. agnetis in water induces lameness. Previously reported in some cases of cattle mastitis, this is the first report of this poorly described pathogen in chickens. We used long and short read next generation sequencing to assemble single finished contigs for the genome and a large plasmid from the chicken pathogen. Comparison of the S. agnetis genome to those of other pathogenic Staphylococci shows that S.agnetis contains a distinct repertoire of virulence determinants. Additionally, the S. agnetis genome has several regions that differ substantially from the genomes of other pathogenic Staphylococci. Comparison of our finished genome to a recent draft genome for a cattle mastitis isolate suggests that future investigations focus on the evolutionary epidemiology of this emerging pathogen of domestic animals.


July 7, 2019

De novo assembly of Dekkera bruxellensis: a multi technology approach using short and long-read sequencing and optical mapping.

It remains a challenge to perform de novo assembly using next-generation sequencing (NGS). Despite the availability of multiple sequencing technologies and tools (e.g., assemblers) it is still difficult to assemble new genomes at chromosome resolution (i.e., one sequence per chromosome). Obtaining high quality draft assemblies is extremely important in the case of yeast genomes to better characterise major events in their evolutionary history. The aim of this work is two-fold: on the one hand we want to show how combining different and somewhat complementary technologies is key to improving assembly quality and correctness, and on the other hand we present a de novo assembly pipeline we believe to be beneficial to core facility bioinformaticians. To demonstrate both the effectiveness of combining technologies and the simplicity of the pipeline, here we present the results obtained using the Dekkera bruxellensis genome.In this work we used short-read Illumina data and long-read PacBio data combined with the extreme long-range information from OpGen optical maps in the task of de novo genome assembly and finishing. Moreover, we developed NouGAT, a semi-automated pipeline for read-preprocessing, de novo assembly and assembly evaluation, which was instrumental for this work.We obtained a high quality draft assembly of a yeast genome, resolved on a chromosomal level. Furthermore, this assembly was corrected for mis-assembly errors as demonstrated by resolving a large collapsed repeat and by receiving higher scores by assembly evaluation tools. With the inclusion of PacBio data we were able to fill about 5 % of the optical mapped genome not covered by the Illumina data.


July 7, 2019

Complete genome sequence of Enterococcus durans KLDS6.0930, a strain with probiotic properties.

Enterococcus durans KLDS6.0930 strain was originally isolated from traditional naturally fermented cream in Inner Mongolia of China. The complete genome sequence of E. durans KLDS6.0930 was carried out using the PacBio RSII platform. The genome contains a circular chromosome and two circular plasmids. Genome sequencing information provides the genetic basis for bioinformatics analysis of bile salt and acid tolerance, cell adhesion, and molecular mechanisms responsible for lipid metabolism. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Genomic epidemiology of an endoscope-associated outbreak of Klebsiella pneumoniae carbapenemase (KPC)-producing K. pneumoniae.

Increased incidence of infections due to Klebsiella pneumoniae carbapenemase (KPC)-producing Klebsiella pneumoniae (KPC-Kp) was noted among patients undergoing endoscopic retrograde cholangiopancreatography (ERCP) at a single hospital. An epidemiologic investigation identified KPC-Kp and non-KPC-producing, extended-spectrum ß-lactamase (ESBL)-producing Kp in cultures from 2 endoscopes. Genotyping was performed on patient and endoscope isolates to characterize the microbial genomics of the outbreak. Genetic similarity of 51 Kp isolates from 37 patients and 3 endoscopes was assessed by pulsed-field gel electrophoresis (PFGE) and multi-locus sequence typing (MLST). Five patient and 2 endoscope isolates underwent whole genome sequencing (WGS). Two KPC-encoding plasmids were characterized by single molecule, real-time sequencing. Plasmid diversity was assessed by endonuclease digestion. Genomic and epidemiologic data were used in conjunction to investigate the outbreak source. Two clusters of Kp patient isolates were genetically related to endoscope isolates by PFGE. A subset of patient isolates were collected post-ERCP, suggesting ERCP endoscopes as a possible source. A phylogeny of 7 Kp genomes from patient and endoscope isolates supported ERCP as a potential source of transmission. Differences in gene content defined 5 ST258 subclades and identified 2 of the subclades as outbreak-associated. A novel KPC-encoding plasmid, pKp28 helped define and track one endoscope-associated ST258 subclade. WGS demonstrated high genetic relatedness of patient and ERCP endoscope isolates suggesting ERCP-associated transmission of ST258 KPC-Kp. Gene and plasmid content discriminated the outbreak from endemic ST258 populations and assisted with the molecular epidemiologic investigation of an extended KPC-Kp outbreak.


July 7, 2019

Wham: Identifying structural variants of biological consequence.

Existing methods for identifying structural variants (SVs) from short read datasets are inaccurate. This complicates disease-gene identification and efforts to understand the consequences of genetic variation. In response, we have created Wham (Whole-genome Alignment Metrics) to provide a single, integrated framework for both structural variant calling and association testing, thereby bypassing many of the difficulties that currently frustrate attempts to employ SVs in association testing. Here we describe Wham, benchmark it against three other widely used SV identification tools-Lumpy, Delly and SoftSearch-and demonstrate Wham’s ability to identify and associate SVs with phenotypes using data from humans, domestic pigeons, and vaccinia virus. Wham and all associated software are covered under the MIT License and can be freely downloaded from github (https://github.com/zeeev/wham), with documentation on a wiki (http://zeeev.github.io/wham/). For community support please post questions to https://www.biostars.org/.


July 7, 2019

Finished annotated genome sequence of Burkholderia pseudomallei strain Bp1651, a multidrug-resistant clinical isolate.

Burkholderia pseudomallei strain Bp1651, a human isolate, is resistant to all clinically relevant antibiotics. We report here on the finished genome sequence assembly and annotation of the two chromosomes of this strain. This genome sequence may assist in understanding the mechanisms of antimicrobial resistance for this pathogenic species. Copyright © 2015 Bugrysheva et al.


July 7, 2019

Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches.

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable amount of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pdeobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which implements corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome. Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.


July 7, 2019

Insights into sex chromosome evolution and aging from the genome of a short-lived fish.

The killifish Nothobranchius furzeri is the shortest-lived vertebrate that can be bred in the laboratory. Its rapid growth, early sexual maturation, fast aging, and arrested embryonic development (diapause) make it an attractive model organism in biomedical research. Here, we report a draft sequence of its genome that allowed us to uncover an intra-species Y chromosome polymorphism representing-in real time-different stages of sex chromosome formation that display features of early mammalian XY evolution “in action.” Our data suggest that gdf6Y, encoding a TGF-ß family growth factor, is the master sex-determining gene in N. furzeri. Moreover, we observed genomic clustering of aging-related genes, identified genes under positive selection, and revealed significant similarities of gene expression profiles between diapause and aging, particularly for genes controlling cell cycle and translation. The annotated genome sequence is provided as an online resource (http://www.nothobranchius.info/NFINgb). Copyright © 2015 Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of Salinicoccus halodurans H3B36, isolated from the Qaidam Basin in China.

Salinicoccus halodurans H3B36 is a moderately halophilic bacterium isolated from a sediment sample of Qaidam Basin at 3.2 m vertical depth. Strain H3B36 accumulate N (a)-acetyl-a-lysine as compatible solute against salinity and heat stresses and may have potential applications in industrial biotechnology. In this study, we sequenced the genome of strain H3B36 using single molecule, real-time sequencing technology on a PacBio RS II instrument. The complete genome of strain H3B36 was 2,778,379 bp and contained 2,853 protein-coding genes, 12 rRNA genes, and 61 tRNA genes with 58 tandem repeats, six minisatellite DNA sequences, 11 genome islands, and no CRISPR repeat region. Further analysis of epigenetic modifications revealed the presence of 11,000 m4C-type modified bases, 7,545 m6A-type modified bases, and 89,064 other modified bases. The data on the genome of this strain may provide an insight into the metabolism of N (a)-acetyl-a-lysine.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.