Menu
September 22, 2019

MultiMotifMaker: a multi-thread tool for identifying DNA methylation motifs from Pacbio reads.

The methylation of DNA is important mechanism to control biological processes. Recently, the Pacbio SMRT technology provides a new way to identify base methylation in the genome. MotifMaker is a tool developed by Pacbio for discovering DNA methylation motifs from methylated DNA sequences. However, MotifMaker is single-threaded and computational expensive for identifying methylation motifs from large genomes. Here, we present an efficient motif finding algorithm (MultiMotifMaker) by implementing multi threads of the MotifMaker. The MultiMotifMaker, speeds up the motif search about 8-9 times on a 32 core computer comparing to MotifMaker. MultiMotifMaker makes it possible to identify methylation motifs from Pacbio reads for large genomes.


September 22, 2019

Pol V-mediated translesion synthesis elicits localized untargeted mutagenesis during post-replicative gap repair.

In vivo, replication forks proceed beyond replication-blocking lesions by way of downstream repriming, generating daughter strand gaps that are subsequently processed by post-replicative repair pathways such as homologous recombination and translesion synthesis (TLS). The way these gaps are filled during TLS is presently unknown. The structure of gap repair synthesis was assessed by sequencing large collections of single DNA molecules that underwent specific TLS events in vivo. The higher error frequency of specialized relative to replicative polymerases allowed us to visualize gap-filling events at high resolution. Unexpectedly, the data reveal that a specialized polymerase, Pol V, synthesizes stretches of DNA both upstream and downstream of a site-specific DNA lesion. Pol V-mediated untargeted mutations are thus spread over several hundred nucleotides, strongly eliciting genetic instability on either side of a given lesion. Consequently, post-replicative gap repair may be a source of untargeted mutations critical for gene diversification in adaptation and evolution. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

The complete mitochondrial genome of the Basidiomycete edible fungus Hypsizygus marmoreus

The complete mitochondrial genome of the edible fungus Hypsizygus marmoreus was published in this paper. It was determined using Pacbio and Illumina sequencing. The complete mitochondrial DNA (mtDNA) is 106,417?bp in length with a GC content of 31.74%, which was the fourth large mitogenome in Agaricales. The circular mitogenome encoded 67 protein-coding genes and one ribosomal RNAs (rns). Among these genes, 13 conserved protein-coding genes were determined in the genome, including 6 subunits of NAD dehydrogenase (nad1-4, 4L and 6), three cytochrome oxidases (cox1-3), one apocytochrome b (cob) and three ATP synthases (atp6, apt 8 and apt 9). The phylogenic analysis confirmed that H. marmoreus (Lyophyllaceae) clustered together with Tricholoma matsutake (Tricholomataceae).


September 22, 2019

Nine draft genome sequences of Claviceps purpurea s.lat., including C. arundinis, C. humidiphila, and C. cf. spartinae, pseudomolecules for the pitch canker pathogen Fusarium circinatum, draft genome of Davidsoniella eucalypti, Grosmannia galeiformis, Quambalaria eucalypti, and Teratosphaeria destructans.

This genome announcement includes draft genomes from Claviceps purpurea s.lat., including C. arundinis, C. humidiphila and C. cf. spartinae. The draft genomes of Davidsoniella eucalypti, Quambalaria eucalypti and Teratosphaeria destructans, all three important eucalyptus pathogens, are presented. The insect associate Grosmannia galeiformis is also described. The pine pathogen genome of Fusarium circinatum has been assembled into pseudomolecules, based on additional sequence data and by harnessing the known synteny within the Fusarium fujikuroi species complex. This new assembly of the F. circinatum genome provides 12 pseudomolecules that correspond to the haploid chromosome number of F. circinatum. These are comparable to other chromosomal assemblies within the FFSC and will enable more robust genomic comparisons within this species complex.


September 22, 2019

Validation of Genomic Structural Variants Through Long Sequencing Technologies.

Although numerous algorithms have been developed to identify large chromosomal rearrangements (i.e., genomic structural variants, SVs), there remains a dearth of approaches to evaluate their results. This is significant, as the accurate identification of SVs is still an outstanding problem whereby no single algorithm has been shown to be able to achieve high sensitivity and specificity across different classes of SVs. The method introduced in this chapter, VaPoR, is specifically designed to evaluate the accuracy of SV predictions using third-generation long sequences. This method uses a recurrence approach and collects direct evidence from raw reads thus avoiding computationally costly whole genome assembly. This chapter would describe in detail as how to apply this tool onto different data types.


September 22, 2019

Potential survival and pathogenesis of a novel strain, Vibrio parahaemolyticus FORC_022, isolated from a soy sauce marinated crab by genome and transcriptome analyses.

Vibrio parahaemolyticus can cause gastrointestinal illness through consumption of seafood. Despite frequent food-borne outbreaks of V. parahaemolyticus, only 19 strains have subjected to complete whole-genome analysis. In this study, a novel strain of V. parahaemolyticus, designated FORC_022 (Food-borne pathogen Omics Research Center_022), was isolated from soy sauce marinated crabs, and its genome and transcriptome were analyzed to elucidate the pathogenic mechanisms. FORC_022 did not include major virulence factors of thermostable direct hemolysin (tdh) and TDH-related hemolysin (trh). However, FORC_022 showed high cytotoxicity and had several V. parahaemolyticus islands (VPaIs) and other virulence factors, such as various secretion systems (types I, II, III, IV, and VI), in comparative genome analysis with CDC_K4557 (the most similar strain) and RIMD2210633 (genome island marker strain). FORC_022 harbored additional virulence genes, including accessory cholera enterotoxin, zona occludens toxin, and tight adhesion (tad) locus, compared with CDC_K4557. In addition, O3 serotype specific gene and the marker gene of pandemic O3:K6 serotype (toxRS) were detected in FORC_022. The expressions levels of genes involved in adherence and carbohydrate transporter were high, whereas those of genes involved in motility, arginine biosynthesis, and proline metabolism were low after exposure to crabs. Moreover, the virulence factors of the type III secretion system, tad locus, and thermolabile hemolysin were overexpressed. Therefore, the risk of foodborne-illness may be high following consumption of FORC_022 contaminated crab. These results provided molecular information regarding the survival and pathogenesis of V. parahaemolyticus FORC_022 strain in contaminated crab and may have applications in food safety.


September 22, 2019

Sequencing of Panax notoginseng genome reveals genes involved in disease resistance and ginsenoside biosynthesis

Background: Panax notoginseng is a traditional Chinese herb with high medicinal and economic value. There has been considerable research on the pharmacological activities of ginsenosides contained in Panax spp.; however, very little is known about the ginsenoside biosynthetic pathway. Results: We reported the first de novo genome of 2.36 Gb of sequences from P. notoginseng with 35,451 protein-encoding genes. Compared to other plants, we found notable gene family contraction of disease-resistance genes in P. notoginseng, but notable expansion for several ATP-binding cassette (ABC) transporter subfamilies, such as the Gpdr subfamily, indicating that ABCs might be an additional mechanism for the plant to cope with biotic stress. Combining eight transcriptomes of roots and aerial parts, we identified several key genes, their transcription factor binding sites and all their family members involved in the synthesis pathway of ginsenosides in P. notoginseng, including dammarenediol synthase, CYP716 and UGT71. Conclusions: The complete genome analysis of P. notoginseng, the first in genus Panax, will serve as an important reference sequence for improving breeding and cultivation of this important nutraceutical and medicinal but vulnerable plant species.


September 22, 2019

Comparative genomics and genotype-phenotype associations in Bifidobacterium breve.

Bifidobacteria are common members of the gastro-intestinal microbiota of a broad range of animal hosts. Their successful adaptation to this particular niche is linked to their saccharolytic metabolism, which is supported by a wide range of glycosyl hydrolases. In the current study a large-scale gene-trait matching (GTM) effort was performed to explore glycan degradation capabilities in B. breve. By correlating the presence/absence of genes and associated genomic clusters with growth/no-growth patterns across a dataset of 20 Bifidobacterium breve strains and nearly 80 different potential growth substrates, we not only validated the approach for a number of previously characterized carbohydrate utilization clusters, but we were also able to discover novel genetic clusters linked to the metabolism of salicin and sucrose. Using GTM, genetic associations were also established for antibiotic resistance and exopolysaccharide production, thereby identifying (novel) bifidobacterial antibiotic resistance markers and showing that the GTM approach is applicable to a variety of phenotypes. Overall, the GTM findings clearly expand our knowledge on members of the B. breve species, in particular how their variable genetic features can be linked to specific phenotypes.


September 22, 2019

Complete genome sequence of Enterococcus durans KLDS6.0933, a potential probiotic strain with high cholesterol removal ability

Enterococci are commensal bacteria in the mammalian gastrointestinal tract which play an important role in the production of various fermented foods. Thus, certain enterococcal strains are commonly used as probiotics to confer health benefits to human and animals. Enterococcus durans KLDS6.0933 is a potential probiotic strain with high cholesterol removal ability, which was isolated from traditional naturally fermented cream in Inner Mongolia of China. To better understand the genetic basis of the probiotic properties of this strain, the whole-genome sequence was performed using the PacBio RSII platform.


September 22, 2019

The draft genomes of Elizabethkingia anophelis of equine origin are genetically similar to three isolates from human clinical specimens.

We report the isolation and characterization of two Elizabethkingia anophelis strains (OSUVM-1 and OSUVM-2) isolated from sources associated with horses in Oklahoma. Both strains appeared susceptible to fluoroquinolones and demonstrated high MICs to all cell wall active antimicrobials including vancomycin, along with aminoglycosides, fusidic acid, chloramphenicol, and tetracycline. Typical of the Elizabethkingia, both draft genomes contained multiple copies of ß-lactamase genes as well as genes predicted to function in antimicrobial efflux. Phylogenetic analysis of the draft genomes revealed that OSUVM-1 and OSUVM-2 differ by only 6 SNPs and are in a clade with 3 strains of Elizabethkingia anophelis that were responsible for human infections. These findings therefore raise the possibility that Elizabethkingia might have the potential to move between humans and animals in a manner similar to known zoonotic pathogens.


September 22, 2019

Fusarium species complex causing Pokkah Boeng in China

Sugarcane is one of the most important crops for sugar production in sugarcane-growing areas. Many biotic and abiotic stresses affected the sugarcane production which leads to severe losses. Pokkah boeng is now playing a very important role due to its economic threats. Currently, the occurrence and rigorousness of pokkah boeng disease have been spread like wildfire from major sugarcane-growing countries. Pokkah boeng is a fungal disease that can cause serious yield losses in susceptible varieties. Infection of the disease is caused either by spores or ascospores. It may cause serious yield losses in commercial plantings. However, there have been many reported outbreaks of the disease which have looked spectacular but have caused trade and industry loss. Fusarium species complex is the major causal agent of this disease around the world, but some researchers have documented the increased importance of Fusarium. Three Fusarium species have been identified to cause the sugarcane pokkah boeng disease in China. Moreover, Fusarium may be accompanied of its mycotoxin production, genomic sequencing, and association with nitrogen application in China. Many studies on disease investigations, breeding of disease-resistant varieties, and strategy of disease control have also been carried out in China.


September 22, 2019

Integrating long-range connectivity information into de Bruijn graphs.

The de Bruijn graph is a simple and efficient data structure that is used in many areas of sequence analysis including genome assembly, read error correction and variant calling. The data structure has a single parameter k, is straightforward to implement and is tractable for large genomes with high sequencing depth. It also enables representation of multiple samples simultaneously to facilitate comparison. However, unlike the string graph, a de Bruijn graph does not retain long range information that is inherent in the read data. For this reason, applications that rely on de Bruijn graphs can produce sub-optimal results given their input data.We present a novel assembly graph data structure: the Linked de Bruijn Graph (LdBG). Constructed by adding annotations on top of a de Bruijn graph, it stores long range connectivity information through the graph. We show that with error-free data it is possible to losslessly store and recover sequence from a Linked de Bruijn graph. With assembly simulations we demonstrate that the LdBG data structure outperforms both our de Bruijn graph and the String Graph Assembler (SGA). Finally we apply the LdBG to Klebsiella pneumoniae short read data to make large (12 kbp) variant calls, which we validate using PacBio sequencing data, and to characterize the genomic context of drug-resistance genes.Linked de Bruijn Graphs and associated algorithms are implemented as part of McCortex, which is available under the MIT license at https://github.com/mcveanlab/mccortex.Supplementary data are available at Bioinformatics online.


September 22, 2019

Periodic variation of mutation rates in bacterial genomes associated with replication timing

The causes and consequences of spatiotemporal variation in mutation rates remain to be explored in nearly all organisms. Here we examine relationships between local mutation rates and replication timing in three bacterial species whose genomes have multiple chromosomes: Vibrio fischeri, Vibrio cholerae, and Burkholderia cenocepacia Following five mutation accumulation experiments with these bacteria conducted in the near absence of natural selection, the genomes of clones from each lineage were sequenced and analyzed to identify variation in mutation rates and spectra. In lineages lacking mismatch repair, base substitution mutation rates vary in a mirrored wave-like pattern on opposing replichores of the large chromosomes of V. fischeri and V. cholerae, where concurrently replicated regions experience similar base substitution mutation rates. The base substitution mutation rates on the small chromosome are less variable in both species but occur at similar rates to those in the concurrently replicated regions of the large chromosome. Neither nucleotide composition nor frequency of nucleotide motifs differed among regions experiencing high and low base substitution rates, which along with the inferred ~800-kb wave period suggests that the source of the periodicity is not sequence specific but rather a systematic process related to the cell cycle. These results support the notion that base substitution mutation rates are likely to vary systematically across many bacterial genomes, which exposes certain genes to elevated deleterious mutational load.IMPORTANCE That mutation rates vary within bacterial genomes is well known, but the detailed study of these biases has been made possible only recently with contemporary sequencing methods. We applied these methods to understand how bacterial genomes with multiple chromosomes, like those of Vibrio and Burkholderia, might experience heterogeneous mutation rates because of their unusual replication and the greater genetic diversity found on smaller chromosomes. This study captured thousands of mutations and revealed wave-like rate variation that is synchronized with replication timing and not explained by sequence context. The scale of this rate variation over hundreds of kilobases of DNA strongly suggests that a temporally regulated cellular process may generate wave-like variation in mutation risk. These findings add to our understanding of how mutation risk is distributed across bacterial and likely also eukaryotic genomes, owing to their highly conserved replication and repair machinery. Copyright © 2018 Dillon et al.


September 22, 2019

Plasmodium vivax-like genome sequences shed new insights into Plasmodium vivax biology and evolution.

Although Plasmodium vivax is responsible for the majority of malaria infections outside Africa, little is known about its evolution and pathway to humans. Its closest genetic relative, P. vivax-like, was discovered in African great apes and is hypothesized to have given rise to P. vivax in humans. To unravel the evolutionary history and adaptation of P. vivax to different host environments, we generated using long- and short-read sequence technologies 2 new P. vivax-like reference genomes and 9 additional P. vivax-like genotypes. Analyses show that the genomes of P. vivax and P. vivax-like are highly similar and colinear within the core regions. Phylogenetic analyses clearly show that P. vivax-like parasites form a genetically distinct clade from P. vivax. Concerning the relative divergence dating, we show that the evolution of P. vivax in humans did not occur at the same time as the other agents of human malaria, thus suggesting that the transfer of Plasmodium parasites to humans happened several times independently over the history of the Homo genus. We further identify several key genes that exhibit signatures of positive selection exclusively in the human P. vivax parasites. Two of these genes have been identified to also be under positive selection in the other main human malaria agent, P. falciparum, thus suggesting their key role in the evolution of the ability of these parasites to infect humans or their anthropophilic vectors. Finally, we demonstrate that some gene families important for red blood cell (RBC) invasion (a key step of the life cycle of these parasites) have undergone lineage-specific evolution in the human parasite (e.g., reticulocyte-binding proteins [RBPs]).


September 22, 2019

A reference genome of the Chinese hamster based on a hybrid assembly strategy.

Accurate and complete genome sequences are essential in biotechnology to facilitate genome-based cell engineering efforts. The current genome assemblies for Cricetulus griseus, the Chinese hamster, are fragmented and replete with gap sequences and misassemblies, consistent with most short-read-based assemblies. Here, we completely resequenced C. griseus using single molecule real time sequencing and merged this with Illumina-based assemblies. This generated a more contiguous and complete genome assembly than either technology alone, reducing the number of scaffolds by >28-fold, with 90% of the sequence in the 122 longest scaffolds. Most genes are now found in single scaffolds, including up- and downstream regulatory elements, enabling improved study of noncoding regions. With >95% of the gap sequence filled, important Chinese hamster ovary cell mutations have been detected in draft assembly gaps. This new assembly will be an invaluable resource for continued basic and pharmaceutical research.© 2018 The Authors. Biotechnology and Bioengineering Published by Wiley Periodicals, Inc.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.