July 7, 2019

Genome puzzle master (GPM): an integrated pipeline for building and editing pseudomolecules from fragmented sequences.

Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool, Genome Puzzle Master (GPM), that enables the integration of additional genomic signposts to edit and build ‘new-gen-assemblies’ that result in high-quality ‘annotation-ready’ pseudomolecules. With GPM, loaded datasets can be connected to each other via their logical relationships, which accomplishes tasks to ‘group,’ ‘merge,’ ‘order and orient’ sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user’s total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory. The GPM (with LIMS) package is available at https://github.com/Jianwei-Zhang/LIMS (contacts: jzhang@mail.hzau.edu.cn or rwing@mail.arizona.edu). Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.


July 7, 2019

First complete genome sequence of the skin-improving Lactobacillus curvatus strain FBA2, isolated from fermented vegetables, determined by PacBio single-molecule real-time technology.

The first complete genome sequence of Lactobacillus curvatus was determined by PacBio RS II. The single circular chromosome (1,848,756 bp, G+C content of 42.1%) of L. curvatus FBA2, isolated from fermented vegetables, contained low G+C regions (26.9% minimum) and 43 sets of >1,000-bp identical sequence pairs. No plasmids were detected. Copyright © 2016 Nakano et al.
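As an aside on the G+C figures quoted above: G+C content is the percentage of bases in a sequence that are G or C. The sketch below is a minimal illustration only. The function name `gc_content` and the toy sequence are hypothetical and are not taken from the study, which does not describe its own calculation.

```python
def gc_content(sequence: str) -> float:
    """Return the G+C content of a nucleotide sequence as a percentage.

    Hypothetical helper for illustration; counts only A, C, G and T
    (case-insensitive) so that ambiguous bases such as N do not skew
    the result.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * gc / (gc + at)


# Toy example (not real genome data): 2 of 4 bases are G or C.
example = gc_content("ATGC")
```

Computing the same value over sliding windows along a chromosome is one way low G+C regions like those mentioned in the abstract could be located.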


July 7, 2019

Draft genome sequence of Mycobacterium rufum JS14(T), a polycyclic-aromatic-hydrocarbon-degrading bacterium from petroleum-contaminated soil in Hawaii.

Mycobacterium rufum JS14(T) (=ATCC BAA-1377(T), CIP 109273(T), JCM 16372(T), DSM 45406(T)), a type strain of the species Mycobacterium rufum, belonging to the family Mycobacteriaceae, was isolated from polycyclic aromatic hydrocarbon (PAH)-contaminated soil in Hilo (HI, USA) because it harbors the capability of degrading PAH. Here, we describe the first genome sequence of strain JS14(T), with brief phenotypic characteristics. The genome is composed of 6,176,413 bp with 69.25% G+C content and contains 5,810 protein-coding genes with 54 RNA genes. The genome information on M. rufum JS14(T) will provide a better understanding of the complexity of bacterial catabolic pathways for degradation of specific chemicals.


July 7, 2019

Improved hybrid de novo genome assembly of domesticated apple (Malus x domestica).

Domesticated apple (Malus × domestica Borkh) is a popular temperate fruit with high nutrient levels and diverse flavors. In 2012, global apple production accounted for at least one tenth of all harvested fruits. A high-quality apple genome assembly is crucial for the selection and breeding of new cultivars. Currently, a single reference genome is available for apple, assembled from 16.9× genome coverage short reads via Sanger and 454 sequencing technologies. Although a useful resource, this assembly covers only ~89% of the non-repetitive portion of the genome, and has a relatively short (16.7 kb) contig N50 length. These downsides make it difficult to apply this reference in transcriptomic or whole-genome re-sequencing analyses. Here we present an improved hybrid de novo genomic assembly of apple (Golden Delicious), which was obtained from 76 Gb (~102× genome coverage) Illumina HiSeq data and 21.7 Gb (~29× genome coverage) PacBio data. The final draft genome is approximately 632.4 Mb, representing ~90% of the estimated genome. The contig N50 size is 111,619 bp, representing a 7-fold improvement. Further annotation analyses predicted 53,922 protein-coding genes and 2,765 non-coding RNA genes. The new apple genome assembly will serve as a valuable resource for investigating complex apple traits at the genomic level. It is not only suitable for genome editing and gene cloning, but also for RNA-seq and whole-genome re-sequencing studies.
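For readers unfamiliar with the contig N50 statistic cited above, it is conventionally defined as the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length. The sketch below illustrates that standard definition. The function name `n50` and the toy contig lengths are hypothetical and do not come from the apple assembly.

```python
from typing import Iterable


def n50(contig_lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length
    of the contig at which the running total first reaches half of the
    combined length of all contigs.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")
    total = sum(lengths)
    running = 0
    for length in lengths:
        running += length
        # Compare doubled running sum to avoid fractional halves.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable: running total always reaches half")


# Toy example (not real data): total length is 20, so half is 10.
# Longest-first running sums are 6, then 11, so the N50 is 5.
toy_n50 = n50([2, 3, 4, 5, 6])
```

Because N50 depends only on the distribution of contig lengths, it measures contiguity rather than correctness, which is why it is usually reported alongside genome coverage and gene counts, as in the abstract.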


July 7, 2019

Isolation and genomic characterization of ‘Desulfuromonas soudanensis WTL’, a metal- and electrode-respiring bacterium from anoxic deep subsurface brine.

Reaching a depth of 713 m below the surface, the Soudan Underground Iron Mine (Soudan, MN, USA) transects a massive Archaean (2.7 Ga) banded iron formation, providing a remarkably accessible window into the terrestrial deep biosphere. Despite organic carbon limitation, metal-reducing microbial communities are present in potentially ancient anoxic brines continuously emanating from exploratory boreholes on Level 27. Using graphite electrodes deposited in situ as bait, we electrochemically enriched and isolated a novel halophilic iron-reducing Deltaproteobacterium, ‘Desulfuromonas soudanensis’ strain WTL, from an acetate-fed three-electrode bioreactor poised at +0.24 V (vs. standard hydrogen electrode). Cyclic voltammetry revealed that ‘D. soudanensis’ releases electrons at redox potentials approximately 100 mV more positive than the model freshwater surface isolate Geobacter sulfurreducens, suggesting that its extracellular respiration is tuned for higher potential electron acceptors. ‘D. soudanensis’ contains a 3,958,620-bp circular genome, assembled to completion using single-molecule real-time (SMRT) sequencing reads, which encodes a complete TCA cycle, 38 putative multiheme c-type cytochromes, one of which contains 69 heme-binding motifs, and a LuxI/LuxR quorum sensing cassette that produces an unidentified N-acyl homoserine lactone. Another cytochrome is predicted to lie within a putative prophage, suggesting that horizontal gene transfer plays a role in respiratory flexibility among metal reducers. Isolation of ‘D. soudanensis’ underscores the utility of electrode-based approaches for enriching rare metal reducers from a wide range of habitats.


July 7, 2019

Complete genome of the starch-degrading myxobacteria Sandaracinus amylolyticus DSM 53668T.

Myxobacteria are members of δ-proteobacteria and are typified by large genomes, well-coordinated social behavior, gliding motility, and starvation-induced fruiting body formation. Here, we report the 10.33 Mb whole genome of a starch-degrading myxobacterium Sandaracinus amylolyticus DSM 53668(T) that encodes 8,962 proteins, 56 tRNA, and two rRNA operons. Phylogenetic analysis, in silico DNA-DNA hybridization and average nucleotide identity reveal its divergence from other myxobacterial species and support its taxonomic characterization into a separate family Sandaracinaceae, within the suborder Sorangiineae. Sequence similarity searches using the Carbohydrate-active enzymes (CAZyme) database help identify the enzyme repertoire of S. amylolyticus involved in starch, agar, chitin, and cellulose degradation. We identified 16 α-amylases and two γ-amylases in the S. amylolyticus genome that likely play a role in starch degradation. While many of the amylases are seen conserved in other δ-proteobacteria, we notice several novel amylases acquired via horizontal transfer from members belonging to phylum Deinococcus-Thermus, Acidobacteria, and Cyanobacteria. No agar-degrading enzymes were identified in the S. amylolyticus genome. Interestingly, several putative β-glucosidase and endoglucanase proteins involved in cellulose degradation were identified. However, the absence of cellobiohydrolases/exoglucanases is consistent with the lack of cellulose degradation by this bacterium. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Genome sequence and analysis of Peptoclostridium difficile strain ZJCDC-S82.

Peptoclostridium difficile (Clostridium difficile) is the major pathogen associated with infectious diarrhea in humans. Concomitant with the increased incidence of C. difficile infection worldwide, there is an increasing concern regarding this infection type. This study reports a draft assembly and detailed sequence analysis of C. difficile strain ZJCDC-S82. The de novo assembled genome was 4.19 Mb in size, which includes 4,013 protein-coding genes, 41 rRNA genes, and 84 tRNA genes. Along with the nuclear genome, we also assembled sequencing information for a single plasmid consisting of 11,930 nucleotides. Comparative genomic analysis of C. difficile ZJCDC-S82 and two previously published strains, M120 and CD630, showed extensive similarity. Phylogenetic analysis revealed that genetic diversity among C. difficile strains was not influenced by geographic location. Evolutionary analysis suggested that four genes encoding surface proteins exhibited positive selection in C. difficile ZJCDC-S82. Codon usage analysis indicated that C. difficile ZJCDC-S82 had high codon usage bias toward A/U-ended codons. Furthermore, codon usage patterns in C. difficile ZJCDC-S82 were predominantly affected by mutation pressure. Our results provide detailed information pertaining to the C. difficile genome associated with a strain from mainland China. This analysis will facilitate the understanding of genomic diversity and evolution of C. difficile strains in this region.


July 7, 2019

Improved complete genome sequence of the extremely radioresistant bacterium Deinococcus radiodurans R1 obtained using PacBio single-molecule sequencing.

The genome sequence of Deinococcus radiodurans R1 was published in 1999. We resequenced D. radiodurans R1 using PacBio and compared the sequence with the published one. Large insertions and single nucleotide polymorphisms (SNPs) were observed among the genome sequences. A more accurate genome sequence will be helpful to studies of D. radiodurans. Copyright © 2016 Hua and Hua.


July 7, 2019

Assemblytics: a web analytics tool for the detection of variants from an assembly.

Assemblytics is a web app for detecting and analyzing variants from a de novo genome assembly aligned to a reference genome. It incorporates a unique anchor filtering approach to increase robustness to repetitive elements, and identifies six classes of variants based on their distinct alignment signatures. Assemblytics can be applied both to compare aberrant genomes, such as human cancers, to a reference, and to identify differences between related species. Multiple interactive visualizations enable in-depth explorations of the genomic distributions of variants. Availability: http://assemblytics.com, https://github.com/marianattestad/assemblytics (contact: mnattest@cshl.edu). Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

High-quality genome assembly and annotation for Plasmodium coatneyi, generated using single-molecule real-time PacBio technology.

Plasmodium coatneyi is a protozoan parasite species that causes simian malaria and is an excellent model for studying disease caused by the human malaria parasite, P. falciparum. Here we report the complete (nontelomeric) genome sequence of P. coatneyi Hackeri generated by the application of only Pacific Biosciences RS II (PacBio RS II) single-molecule real-time (SMRT) high-resolution sequence technology and assembly using the Hierarchical Genome Assembly Process (HGAP). This is the first Plasmodium genome sequence reported to use only PacBio technology. This approach has proven to be superior to short-read only approaches for this species. Copyright © 2016 Chien et al.


July 7, 2019

The Ditylenchus destructor genome provides new insights into the evolution of plant parasitic nematodes.

Plant-parasitic nematodes were found in 4 of the 12 clades of phylum Nematoda. These nematodes in different clades may have originated independently from their free-living fungivorous ancestors. However, the exact evolutionary process of these parasites is unclear. Here, we sequenced the genome of a migratory plant nematode, Ditylenchus destructor. We performed comparative genomics among the free-living nematode Caenorhabditis elegans and all the plant nematodes with genome sequences available. We found that, compared with C. elegans, the core developmental control processes underwent heavy reduction, though most signal transduction pathways were conserved. We also found that D. destructor contained more homologs of the key genes in the above processes than the other plant nematodes. We suggest that Ditylenchus spp. may represent an intermediate evolutionary stage between free-living nematodes that feed on fungi and obligate plant-parasitic nematodes. Based on the facts that D. destructor can feed on fungi and has a relatively short life cycle, and that it has similar features to both C. elegans and sedentary plant-parasitic nematodes from clade 12, we propose it as a new model to study the biology and biocontrol of plant nematodes and the interaction between nematodes and plants. © 2016 The Author(s).


July 7, 2019

Whole-genome sequencing recommendations

Recent technological developments have revolutionized the way we perform genetic analyses. In particular, whole-genome sequencing provides access to the entire genetic makeup of an individual, and it is now an affordable approach for many research groups. As a consequence, genome sequencing is pervading many fields of biological research. Sequencing technologies are evolving rapidly, and so are their applications. Here we provide a first primer on whole-genome sequencing, focusing on two of the most popular applications: (1) de novo genome sequencing, in which the objective is obtaining a high-quality genome assembly that can serve as a reference for a species or variety, and (2) genome resequencing, when there is an available reference genome and the objective is to map sequence variation of an individual or a set of individuals. It is not our intention to provide a comprehensive overview of current methodologies that will likely soon become obsolete, but rather to focus on general principles that will have a more general applicability.


July 7, 2019

Whole genomic sequence analysis of Bacillus infantis: defining the genetic blueprint of strain NRRL B-14911, an emerging cardiopathogenic microbe.

We recently reported the identification of Bacillus sp. NRRL B-14911 that induces heart autoimmunity by generating cardiac-reactive T cells through molecular mimicry. This marine bacterium was originally isolated from the Gulf of Mexico, but no associations with human diseases were reported. Therefore, to characterize its biological and medical significance, we sought to determine and analyze the complete genome sequence of Bacillus sp. NRRL B-14911. Based on the phylogenetic analysis of 16S ribosomal RNA (rRNA) genes, sequence analysis of the 16S-23S rDNA intergenic transcribed spacers, phenotypic microarray, and matrix-assisted laser desorption ionization time-of-flight mass spectrometry, we propose that this organism belongs to the species Bacillus infantis, previously shown to be associated with sepsis in a newborn child. Analysis of the complete genome of Bacillus sp. NRRL B-14911 revealed several virulence factors including adhesins, invasins, colonization factors, siderophores and transporters. Likewise, the bacterial genome encodes a wide range of methyl transferases, transporters, enzymatic and biochemical pathways, and insertion sequence elements that are distinct from other closely related bacilli. The complete genome sequence of Bacillus sp. NRRL B-14911 provided in this study may facilitate genetic manipulations to assess gene functions associated with bacterial survival and virulence. Additionally, this bacterium may serve as a useful tool to establish a disease model that permits systematic analysis of autoimmune events in various susceptible rodent strains.


July 7, 2019

Molecular evolution of a Klebsiella pneumoniae ST278 isolate harboring blaNDM-7 and involved in nosocomial transmission.

During 2013, ST278 Klebsiella pneumoniae with blaNDM-7 was isolated from the urine (KpN01) and rectum (KpN02) of a patient in Calgary, Canada. The same strain (KpN04) was subsequently isolated from another patient in the same unit. Interestingly, a carbapenem-susceptible K. pneumoniae ST278 (KpN06) was obtained 1 month later from the blood of the second patient. Next-generation sequencing (NGS) revealed that the loss of carbapenem-resistance in KpN06 was due to a 5-kb deletion on the blaNDM-7-harboring IncX3 plasmid. In addition, an IncFIB plasmid in KpN06 had a 27-kb deletion that removed genes encoding for heavy metal resistance. Phylogenetic analysis showed that the K. pneumoniae ST278 from patient 2 was likely a descendant of KpN02 and that KpN06 was a close progenitor of an environmental ST278. It is unclear whether KpN06 lost the blaNDM-7 gene in vivo. This study detailed the remarkable plasticity and speed of evolutionary changes in multidrug-resistant K. pneumoniae, demonstrating the highly recombinant nature of this species. It also highlights the ability of NGS to clarify molecular microevolutionary events within antibiotic-resistant organisms. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.

