July 7, 2019

The genomic sequence of lymphocryptovirus from cynomolgus macaque.

Lymphocryptoviruses such as Epstein-Barr virus (EBV) cause persistent infections in human and non-human primates, and suppression of the immune system can increase the risk of lymphocryptovirus (LCV)-associated tumor development in both human and non-human primates. To enable LCV infection as a non-clinical model to study effects of therapeutics on EBV immunity, we determined the genomic DNA sequence of the LCV from cynomolgus macaque, a species commonly used for non-clinical testing. Comparison to rhesus macaque LCV and human EBV sequences indicates that LCV from the cynomolgus macaque has the same genomic arrangement and a high degree of similarity in most genes, especially with rhesus macaque LCV. Genes showing lower similarity were those encoding proteins involved in latency and/or tumor promotion or immune evasion. The genomic sequence of LCV from cynomolgus macaque should aid the development of non-clinical tools for identifying therapeutics that impact LCV immunity and carry potential lymphoma risk. Copyright © 2015 Elsevier Inc. All rights reserved.


July 7, 2019

Genome analysis of Staphylococcus agnetis, an agent of lameness in broiler chickens.

Lameness in broiler chickens is a significant animal welfare and financial issue. Lameness can be enhanced by rearing young broilers on wire flooring. We have identified Staphylococcus agnetis as significantly involved in bacterial chondronecrosis with osteomyelitis (BCO) in the proximal tibiae and femora, leading to lameness in broiler chickens in the wire floor system. Administration of S. agnetis in water induces lameness. Although S. agnetis was previously reported in some cases of cattle mastitis, this is the first report of this poorly described pathogen in chickens. We used long and short read next generation sequencing to assemble single finished contigs for the genome and a large plasmid from the chicken pathogen. Comparison of the S. agnetis genome to those of other pathogenic Staphylococci shows that S. agnetis contains a distinct repertoire of virulence determinants. Additionally, the S. agnetis genome has several regions that differ substantially from the genomes of other pathogenic Staphylococci. Comparison of our finished genome to a recent draft genome for a cattle mastitis isolate suggests that future investigations should focus on the evolutionary epidemiology of this emerging pathogen of domestic animals.


July 7, 2019

De novo assembly of Dekkera bruxellensis: a multi technology approach using short and long-read sequencing and optical mapping.

It remains a challenge to perform de novo assembly using next-generation sequencing (NGS). Despite the availability of multiple sequencing technologies and tools (e.g., assemblers) it is still difficult to assemble new genomes at chromosome resolution (i.e., one sequence per chromosome). Obtaining high quality draft assemblies is extremely important in the case of yeast genomes to better characterise major events in their evolutionary history. The aim of this work is two-fold: on the one hand we want to show how combining different and somewhat complementary technologies is key to improving assembly quality and correctness, and on the other hand we present a de novo assembly pipeline we believe to be beneficial to core facility bioinformaticians. To demonstrate both the effectiveness of combining technologies and the simplicity of the pipeline, here we present the results obtained using the Dekkera bruxellensis genome. In this work we used short-read Illumina data and long-read PacBio data combined with the extreme long-range information from OpGen optical maps in the task of de novo genome assembly and finishing. Moreover, we developed NouGAT, a semi-automated pipeline for read-preprocessing, de novo assembly and assembly evaluation, which was instrumental for this work. We obtained a high quality draft assembly of a yeast genome, resolved on a chromosomal level. Furthermore, this assembly was corrected for mis-assembly errors as demonstrated by resolving a large collapsed repeat and by receiving higher scores by assembly evaluation tools. With the inclusion of PacBio data we were able to fill about 5% of the optical mapped genome not covered by the Illumina data.


July 7, 2019

Complete genome sequence of Enterococcus durans KLDS6.0930, a strain with probiotic properties.

Enterococcus durans strain KLDS6.0930 was originally isolated from traditional naturally fermented cream in Inner Mongolia, China. Complete genome sequencing of E. durans KLDS6.0930 was carried out using the PacBio RSII platform. The genome contains a circular chromosome and two circular plasmids. Genome sequencing information provides the genetic basis for bioinformatics analysis of bile salt and acid tolerance, cell adhesion, and molecular mechanisms responsible for lipid metabolism. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Genomic epidemiology of an endoscope-associated outbreak of Klebsiella pneumoniae carbapenemase (KPC)-producing K. pneumoniae.

Increased incidence of infections due to Klebsiella pneumoniae carbapenemase (KPC)-producing Klebsiella pneumoniae (KPC-Kp) was noted among patients undergoing endoscopic retrograde cholangiopancreatography (ERCP) at a single hospital. An epidemiologic investigation identified KPC-Kp and non-KPC-producing, extended-spectrum β-lactamase (ESBL)-producing Kp in cultures from 2 endoscopes. Genotyping was performed on patient and endoscope isolates to characterize the microbial genomics of the outbreak. Genetic similarity of 51 Kp isolates from 37 patients and 3 endoscopes was assessed by pulsed-field gel electrophoresis (PFGE) and multi-locus sequence typing (MLST). Five patient and 2 endoscope isolates underwent whole genome sequencing (WGS). Two KPC-encoding plasmids were characterized by single molecule, real-time sequencing. Plasmid diversity was assessed by endonuclease digestion. Genomic and epidemiologic data were used in conjunction to investigate the outbreak source. Two clusters of Kp patient isolates were genetically related to endoscope isolates by PFGE. A subset of patient isolates was collected post-ERCP, suggesting ERCP endoscopes as a possible source. A phylogeny of 7 Kp genomes from patient and endoscope isolates supported ERCP as a potential source of transmission. Differences in gene content defined 5 ST258 subclades and identified 2 of the subclades as outbreak-associated. A novel KPC-encoding plasmid, pKp28, helped define and track one endoscope-associated ST258 subclade. WGS demonstrated high genetic relatedness of patient and ERCP endoscope isolates, suggesting ERCP-associated transmission of ST258 KPC-Kp. Gene and plasmid content discriminated the outbreak from endemic ST258 populations and assisted with the molecular epidemiologic investigation of an extended KPC-Kp outbreak.


July 7, 2019

High-quality draft genome sequence of Kallotenue papyrolyticum JKG1T reveals broad heterotrophic capacity focused on carbohydrate and amino acid metabolism.

The draft genome of Kallotenue papyrolyticum JKG1(T), a member of the order Kallotenuales, class Chloroflexia, consists of 4,475,263 bp in 4 contigs and encodes 4,010 predicted genes, 49 tRNA-encoding genes, and 3 rRNA operons. The genome is consistent with a heterotrophic lifestyle including catabolism of polysaccharides and amino acids. Copyright © 2015 Hedlund et al.


July 7, 2019

Wham: Identifying structural variants of biological consequence.

Existing methods for identifying structural variants (SVs) from short read datasets are inaccurate. This complicates disease-gene identification and efforts to understand the consequences of genetic variation. In response, we have created Wham (Whole-genome Alignment Metrics) to provide a single, integrated framework for both structural variant calling and association testing, thereby bypassing many of the difficulties that currently frustrate attempts to employ SVs in association testing. Here we describe Wham, benchmark it against three other widely used SV identification tools (Lumpy, Delly and SoftSearch), and demonstrate Wham's ability to identify and associate SVs with phenotypes using data from humans, domestic pigeons, and vaccinia virus. Wham and all associated software are covered under the MIT License and can be freely downloaded from GitHub (https://github.com/zeeev/wham), with documentation on a wiki (http://zeeev.github.io/wham/). For community support please post questions to https://www.biostars.org/.


July 7, 2019

Finished annotated genome sequence of Burkholderia pseudomallei strain Bp1651, a multidrug-resistant clinical isolate.

Burkholderia pseudomallei strain Bp1651, a human isolate, is resistant to all clinically relevant antibiotics. We report here on the finished genome sequence assembly and annotation of the two chromosomes of this strain. This genome sequence may assist in understanding the mechanisms of antimicrobial resistance for this pathogenic species. Copyright © 2015 Bugrysheva et al.


July 7, 2019

Complete genome sequences of 11 Bordetella pertussis strains representing the pandemic ptxP3 lineage.

Pathogen adaptation has contributed to the resurgence of pertussis. To facilitate our understanding of this adaptation we report here 11 completely closed and annotated Bordetella pertussis genomes representing the pandemic ptxP3 lineage. Our analyses included six strains which do not produce the vaccine components pertactin and/or filamentous hemagglutinin. Copyright © 2015 Bart et al.


July 7, 2019

Complete genome sequence and characterization of the haloacid-degrading Burkholderia caribensis MBA4.

Burkholderia caribensis MBA4 was isolated from soil for its capability to grow on haloacids. This bacterium has a genome size of 9,482,704 bp. Here we report the genome sequences and annotation, together with characteristics of the genome. The complete genome sequence consists of three replicons, comprising 9,056 protein-coding genes and 80 RNA genes. Genes responsible for dehalogenation and uptake of haloacids were arranged as an operon. While dehalogenation of haloacetate would produce glycolate, three glycolate operons were identified. Two of these operons contain an upstream glcC regulator gene. It is likely that the expression of one of these operons is responsive to haloacetate. Genes responsible for the metabolism of the dehalogenation product of halopropionate were also identified.


July 7, 2019

Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches.

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable number of long reads (>50X) is required for self-correction and for subsequent de novo assembly. Recently developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes, including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pedobacter heparinus DSM2366, were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which takes corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome. Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.
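
To make the coverage figures in this abstract concrete, the following is a minimal sketch of the usual depth-of-coverage arithmetic (coverage = total sequenced bases / genome size). The genome size of about 4.6 Mb for E. coli K12 MG1655 and the 5 kb mean read length are illustrative assumptions, not values taken from the abstract, and the function names are hypothetical.

```python
import math


def bases_needed(genome_size_bp: int, coverage: int) -> int:
    """Total sequenced bases required to reach a given mean coverage depth."""
    return genome_size_bp * coverage


def reads_needed(genome_size_bp: int, coverage: int, mean_read_length_bp: int) -> int:
    """Approximate number of reads required, rounded up to a whole read."""
    return math.ceil(bases_needed(genome_size_bp, coverage) / mean_read_length_bp)


# Assumption: approximate genome size, used only for illustration.
GENOME_SIZE_BP = 4_600_000
# Assumption: hypothetical mean long-read length.
MEAN_READ_LENGTH_BP = 5_000

if __name__ == "__main__":
    # 50X is the self-correction threshold and 5X the hybrid minimum
    # quoted in the abstract.
    for label, coverage in (("self-correction", 50), ("hybrid", 5)):
        n_bases = bases_needed(GENOME_SIZE_BP, coverage)
        n_reads = reads_needed(GENOME_SIZE_BP, coverage, MEAN_READ_LENGTH_BP)
        print(f"{label}: {coverage}X needs {n_bases:,} bases (~{n_reads:,} reads)")
```

Under these assumptions, the hybrid route needs one tenth of the long-read data that self-correction does, which is the practical motivation the abstract gives for hybrid approaches.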


July 7, 2019

Insights into sex chromosome evolution and aging from the genome of a short-lived fish.

The killifish Nothobranchius furzeri is the shortest-lived vertebrate that can be bred in the laboratory. Its rapid growth, early sexual maturation, fast aging, and arrested embryonic development (diapause) make it an attractive model organism in biomedical research. Here, we report a draft sequence of its genome that allowed us to uncover an intra-species Y chromosome polymorphism representing, in real time, different stages of sex chromosome formation that display features of early mammalian XY evolution "in action." Our data suggest that gdf6Y, encoding a TGF-β family growth factor, is the master sex-determining gene in N. furzeri. Moreover, we observed genomic clustering of aging-related genes, identified genes under positive selection, and revealed significant similarities of gene expression profiles between diapause and aging, particularly for genes controlling cell cycle and translation. The annotated genome sequence is provided as an online resource (http://www.nothobranchius.info/NFINgb). Copyright © 2015 Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of Salinicoccus halodurans H3B36, isolated from the Qaidam Basin in China.

Salinicoccus halodurans H3B36 is a moderately halophilic bacterium isolated from a sediment sample of the Qaidam Basin at 3.2 m vertical depth. Strain H3B36 accumulates Nα-acetyl-α-lysine as a compatible solute against salinity and heat stresses and may have potential applications in industrial biotechnology. In this study, we sequenced the genome of strain H3B36 using single molecule, real-time sequencing technology on a PacBio RS II instrument. The complete genome of strain H3B36 was 2,778,379 bp and contained 2,853 protein-coding genes, 12 rRNA genes, and 61 tRNA genes with 58 tandem repeats, six minisatellite DNA sequences, 11 genome islands, and no CRISPR repeat region. Further analysis of epigenetic modifications revealed the presence of 11,000 m4C-type modified bases, 7,545 m6A-type modified bases, and 89,064 other modified bases. The data on the genome of this strain may provide an insight into the metabolism of Nα-acetyl-α-lysine.
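
As a rough way to put the counts in this abstract on a common scale, the sketch below converts them to densities per kilobase of genome. All input numbers are taken from the abstract; the function name is hypothetical, and the calculation assumes counts are divided by the single-stranded genome length, although the abstract does not say whether modified bases were counted on one or both strands.

```python
GENOME_SIZE_BP = 2_778_379          # complete genome size from the abstract
PROTEIN_CODING_GENES = 2_853        # from the abstract
MODIFIED_BASES = {                  # from the abstract
    "m4C": 11_000,
    "m6A": 7_545,
    "other": 89_064,
}


def per_kb(count: int, genome_size_bp: int = GENOME_SIZE_BP) -> float:
    """Density of a feature per 1,000 bp of genome sequence."""
    return count / genome_size_bp * 1_000


total_modified = sum(MODIFIED_BASES.values())

if __name__ == "__main__":
    for kind, count in MODIFIED_BASES.items():
        print(f"{kind}: {per_kb(count):.2f} modified bases per kb")
    print(f"all modifications: {per_kb(total_modified):.2f} per kb")
    print(f"protein-coding genes: {per_kb(PROTEIN_CODING_GENES):.2f} per kb")
```

On these figures, roughly one protein-coding gene occurs per kilobase, which is typical of compact bacterial genomes.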

