Menu
July 7, 2019

Comparative genome analysis of programmed DNA elimination in nematodes.

Programmed DNA elimination is a developmentally regulated process leading to the reproducible loss of specific genomic sequences. DNA elimination occurs in unicellular ciliates and a variety of metazoans, including invertebrates and vertebrates. In metazoa, DNA elimination typically occurs in somatic cells during early development, leaving the germline genome intact. Reference genomes for metazoa that undergo DNA elimination are not available. Here, we generated germline and somatic reference genome sequences of the DNA eliminating pig parasitic nematode Ascaris suum and the horse parasite Parascaris univalens. In addition, we carried out in-depth analyses of DNA elimination in the parasitic nematode of humans, Ascaris lumbricoides, and the parasitic nematode of dogs, Toxocara canis. Our analysis of nematode DNA elimination reveals that in all species, repetitive sequences (that differ among the genera) and germline-expressed genes (approximately 1000-2000 or 5%-10% of the genes) are eliminated. Thirty-five percent of these eliminated genes are conserved among these nematodes, defining a core set of eliminated genes that are preferentially expressed during spermatogenesis. Our analysis supports the view that DNA elimination in nematodes silences germline-expressed genes. Over half of the chromosome break sites are conserved between Ascaris and Parascaris, whereas only 10% are conserved in the more divergent T. canis. Analysis of the chromosomal breakage regions suggests a sequence-independent mechanism for DNA breakage followed by telomere healing, with the formation of more accessible chromatin in the break regions prior to DNA elimination. Our genome assemblies and annotations also provide comprehensive resources for analysis of DNA elimination, parasitology research, and comparative nematode genome and epigenome studies.© 2017 Wang et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Complete genome sequence of Vibrio campbellii LMB 29 isolated from red drum with four native megaplasmids.

Vibrio spp. are the most common pathogens for animals reared in aquaculture. Vibrio campbellii, which is often involved in shrimp, fish and mollusks diseases, is widely distributed in the marine environment worldwide, but our knowledge about its pathogenesis and antimicrobial resistance is very limited. The existence of this knowledge gap is at least partially because that V. campbellii was originally classified as Vibrio harveyi, and the detailed information of its comparative genome analysis to other Vibrio spp. is currently lacking. In this study, the complete genome of a V. campbellii predominant strain, LMB29, was determined by MiSeq in conjunction with PacBio SMRT sequencing. This genome consists of two circular DNA chromosomes and four megaplasmids. Comparative genome analysis indicates that LMB29 shares a 96.66% similarity (average nucleotide identity) with the V. campbellii ATCC strain BAA-1116 based on a 75% AF (average fraction) calculations, and its functional profile is very similar to V. campbellii E1 and V. campbellii CAIM115. Both type III secretion system (T3SS) and type VI secretion system (T6SS), along with the tlh gene which encodes a thermolabile hemolysin, are present in LMB29 which may contribute to the bacterial pathogenesis. The virulence of this strain was experimental confirmed by performing a LDH assay on a fish cell infection model, and cell death was observed as early as within 3 h post infection. Thirty-seven antimicrobial resistance genes (>45% identity) were predicted in LMB29 which includes a novel rifampicin ADP ribosyltransferase, arr-9, in plasmid pLMB157. The gene arr-9 was predicted on a genomic island with horizontal transferable potentials which may facilitate the rifampicin resistance dissemination. Future researches are needed to explore the pathogenesis of V. campbellii LMB29, but the availability of this genome sequence will certainly aid as a basis for further analysis.


July 7, 2019

Contributions of Zea mays subspecies mexicana haplotypes to modern maize.

Maize was domesticated from lowland teosinte (Zea mays ssp. parviglumis), but the contribution of highland teosinte (Zea mays ssp. mexicana, hereafter mexicana) to modern maize is not clear. Here, two genomes for Mo17 (a modern maize inbred) and mexicana are assembled using a meta-assembly strategy after sequencing of 10 lines derived from a maize-teosinte cross. Comparative analyses reveal a high level of diversity between Mo17, B73, and mexicana, including three Mb-size structural rearrangements. The maize spontaneous mutation rate is estimated to be 2.17?×?10-8 ~3.87?×?10-8 per site per generation with a nonrandom distribution across the genome. A higher deleterious mutation rate is observed in the pericentromeric regions, and might be caused by differences in recombination frequency. Over 10% of the maize genome shows evidence of introgression from the mexicana genome, suggesting that mexicana contributed to maize adaptation and improvement. Our data offer a rich resource for constructing the pan-genome of Zea mays and genetic improvement of modern maize varieties.


July 7, 2019

Assembly of an early-matured japonica (Geng) rice genome, Suijing18, based on PacBio and Illumina sequencing.

The early-matured japonica (Geng) rice variety, Suijing18 (SJ18), carries multiple elite traits including durable blast resistance, good grain quality, and high yield. Using PacBio SMRT technology, we produced over 25?Gb of long-read sequencing raw data from SJ18 with a coverage of 62×. Using Illumina paired-end whole-genome shotgun sequencing technology, we generated 59?Gb of short-read sequencing data from SJ18 (23.6?Gb from a 200?bp library with a coverage of 59× and 35.4?Gb from an 800?bp library with a coverage of 88×). With these data, we assembled a single SJ18 genome and then generated a set of annotation data. These data sets can be used to test new programs for variation deep mining, and will provide new insights into the genome structure, function, and evolution of SJ18, and will provide essential support for biological research in general.


July 7, 2019

Automation of PacBio SMRTbell NGS library preparation for bacterial genome sequencing.

The PacBio RS II provides for single molecule, real-time DNA technology to sequence genomes and detect DNA modifications. The starting point for high-quality sequence production is high molecular weight genomic DNA. To automate the library preparation process, there must be high-throughput methods in place to assess the genomic DNA, to ensure the size and amounts of the sheared DNA fragments and final library.The library construction automation was accomplished using the Agilent NGS workstation with Bravo accessories for heating, shaking, cooling, and magnetic bead manipulations for template purification. The quality control methods from gDNA input to final library using the Agilent Bioanalyzer System and Agilent TapeStation System were evaluated.Automated protocols of PacBio 10 kb library preparation produced libraries with similar technical performance to those generated manually. The TapeStation System proved to be a reliable method that could be used in a 96-well plate format to QC the DNA equivalent to the standard Bioanalyzer System results. The DNA Integrity Number that is calculated in the TapeStation System software upon analysis of genomic DNA is quite helpful to assure that the starting genomic DNA is not degraded. In this respect, the gDNA assay on the TapeStation System is preferable to the DNA 12000 assay on the Bioanalyzer System, which cannot run genomic DNA, nor can the Bioanalyzer work directly from the 96-well plates.


July 7, 2019

Evolutionary context of non-sorbitol-fermenting Shiga toxin-producing Escherichia coli O55:H7.

In July 2014, an outbreak of Shiga toxin-producing Escherichia coli (STEC) O55:H7 in England involved 31 patients, 13 (42%) of whom had hemolytic uremic syndrome. Isolates were sequenced, and the sequences were compared with publicly available sequences of E. coli O55:H7 and O157:H7. A core-genome phylogeny of the evolutionary history of the STEC O55:H7 outbreak strain revealed that the most parsimonious model was a progenitor enteropathogenic O55:H7 sorbitol-fermenting strain, lysogenized by a Shiga toxin (Stx) 2a-encoding phage, followed by loss of the ability to ferment sorbitol because of a non-sense mutation in srlA. The parallel, convergent evolutionary histories of STEC O157:H7 and STEC O55:H7 may indicate a common driver in the evolutionary process. Because emergence of STEC O157:H7 as a clinically significant pathogen was associated with acquisition of the Stx2a-encoding phage, the emergence of STEC O55:H7 harboring the stx2a gene is of public health concern.


July 7, 2019

Genome sequencing brought Gossypium biology research into a new era.

The first sequenced diploid cotton genome was published in 2012 by the group led by the Institute of Cotton Research, Chinese Academy of Agricultural Sciences. Cotton genomics research subsequently entered a period of rapid development. The accumulating data have provided new insights into the evolution and domestication of cotton, the development of important agronomic traits, and strategies for improving cotton quality and production.


July 7, 2019

A 3-way hybrid approach to generate a new high-quality chimpanzee reference genome (Pan_tro_3.0).

The chimpanzee is arguably the most important species for the study of human origins. A key resource for these studies is a high-quality reference genome assembly; however, as with most mammalian genomes, the current iteration of the chimpanzee reference genome assembly is highly fragmented. In the current iteration of the chimpanzee reference genome assembly (Pan_tro_2.1.4), the sequence is scattered across more then 183 000 contigs, incorporating more than 159 000 gaps, with a genome-wide contig N50 of 51 Kbp. In this work, we produce an extensive and diverse array of sequencing datasets to rapidly assemble a new chimpanzee reference that surpasses previous iterations in bases represented and organized in large scaffolds. To this end, we show substantial improvements over the current release of the chimpanzee genome (Pan_tro_2.1.4) by several metrics, such as increased contiguity by >750% and 300% on contigs and scaffolds, respectively, and closure of 77% of gaps in the Pan_tro_2.1.4 assembly gaps spanning >850 Kbp of the novel coding sequence based on RNASeq data. We further report more than 2700 genes that had putatively erroneous frame-shift predictions to human in Pan_tro_2.1.4 and show a substantial increase in the annotation of repetitive elements. We apply a simple 3-way hybrid approach to considerably improve the reference genome assembly for the chimpanzee, providing a valuable resource for the study of human origins. Furthermore, we produce extensive sequencing datasets that are all derived from the same cell line, generating a broad non-human benchmark dataset.© The Author 2017. Published by Oxford University Press.


July 7, 2019

Evaluation of the impact of ul54 gene-deletion on the global transcription and DNA replication of pseudorabies virus.

Pseudorabies virus (PRV) is an animal alphaherpesvirus with a wide host range. PRV has 67 protein-coding genes and several non-coding RNA molecules, which can be classified into three temporal groups, immediate early, early and late classes. The ul54 gene of PRV and its homolog icp27 of herpes simplex virus have a multitude of functions, including the regulation of viral DNA synthesis and the control of the gene expression. Therefore, abrogation of PRV ul54 function was expected to exert a significant effect on the global transcriptome and on DNA replication. Real-time PCR and real-time RT-PCR platforms were used to investigate these presumed effects. Our analyses revealed a drastic impact of the ul54 mutation on the genome-wide expression of PRV genes, especially on the transcription of the true late genes. A more than two hour delay was observed in the onset of DNA replication, and the amount of synthesized DNA molecules was significantly decreased in comparison to the wild-type virus. Furthermore, in this work, we were able to successfully demonstrate the utility of long-read SMRT sequencing for genotyping of mutant viruses.


July 7, 2019

Systems biology analysis of the key genes of surfactin production in Bacillus subtilis MJ01 (isolated from soil contaminated oil in south of Iran), spizizenii, and 168 isolates

Applying microorganism in oil recovery has attracted attentions recently. Surfactin produced by Bacillus subtilis is widely used industrially in a range of industrial applications in pharmecutical and environmental sectors. Little information about molecular mechanism of suffactin compound is available. In this study, we performed promoter and network analysis of surfactin production genes in Bacillus subtilis subsp. MJ01 (isolated from oil contaminated soil in South of Iran), spizizenii and 168. Our analysis revealed that comQ and comX are the genes with sequence alterations among these three strains of Bacillus subtilis and are involved in surfactin production. Promoter analysis indicated that lrp, argR, rpoD, purr and ihf are overrepresented and have the highest number of transcription factor binding sites (TFBs) on the key surfactin production genes in all 3 strains. Also the pattern of TFBs among these three strains was completely different. Interestingly, there is distinct difference between 168, spizizenii and MJ01 in their frequency of TFs that activate genes involve in surfactin production. Attribute weighting algorithms and decision tree analysis revealed ihf, rpoD and flHCD as the most important TF among surfactin production. Network analysis identified two significant network modules. The first one consists of key genes involved in surfactin production and the second module includes key TFs, involved in regulation of surfactin production. Our findings enhance understanding the molecular mechanism of surfactin production through systems biology analysis.


July 7, 2019

Complete genome sequence of Lactobacillus plantarum JBE245 isolated from Meju

Lactobacillus plantarum is widely found in fermented foods and has various phenotypic and genetic characteristics to adapt to the environment. Here we report the complete annotated genome sequence of the L. plantarum strain JBE245 (= KCCM43243) isolated for malolactic fermentation of apple juice. The genome comprises a single circular 3,262,611 bp chromosome with 2907 coding regions, 45 pseudogenes, and 91 RNA genes. The genome contains 4 malate dehydrogenase genes, 3 malate permease genes and various types of plantaricin-synthesizing genes. These genetic traits meet the selection criteria of the strains that should prevent the spoilage of apple juice during fermentation and efficiently convert malate to lactic acid.


July 7, 2019

Letting go: bacterial genome reduction solves the dilemma of adapting to predation mortality in a substrate-restricted environment.

Resource limitation and predation mortality are major determinants of microbial population dynamics, and optimization for either aspect is considered to imply a trade-off with respect to the other. Adaptation to these selective factors may, moreover, lead to disadvantages at rich growth conditions. We present an example of a concomitant evolutionary optimization to both, substrate limitation and predation in an aggregate-forming freshwater bacterial isolate, and we elucidate an underlying genomic mechanism. Bacteria were propagated in serial batch culture in a nutrient-restricted environment either with or without a bacterivorous flagellate. Strains isolated after 26 growth cycles of the predator-prey co-cultures formed as much total biomass as the ancestor at ancestral growth conditions, albeit largely reallocated to cell aggregates. A ~273?kbp genome fragment was lost in three strains that had independently evolved with predators. These strains had significantly higher growth yield on substrate-restricted media than others that were isolated from the same treatment before the excision event. Under predation pressure, the isolates with the deletion outcompeted both, the ancestor and the strains evolved without predators even at rich growth conditions. At the same time, genome reduction led to a growth disadvantage in the presence of benzoate due to the loss of the respective degradation pathway, suggesting that niche constriction might be the price for the bidirectional optimization.


July 7, 2019

Genome mining of astaxanthin biosynthetic genes from Sphingomonas sp. ATCC 55669 for heterologous overproduction in Escherichia coli.

As a highly valued keto-carotenoid, astaxanthin is widely used in nutritional supplements and pharmaceuticals. Therefore, the demand for biosynthetic astaxanthin and improved efficiency of astaxanthin biosynthesis has driven the investigation of metabolic engineering of native astaxanthin producers and heterologous hosts. However, microbial resources for astaxanthin are limited. In this study, we found that the a-Proteobacterium Sphingomonas sp. ATCC 55669 could produce astaxanthin naturally. We used whole-genome sequencing to identify the astaxanthin biosynthetic pathway using a combined PacBio-Illumina approach. The putative astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669 was predicted. For further confirmation, a high-efficiency targeted engineering carotenoid synthesis platform was constructed in E. coli for identifying the functional roles of candidate genes. All genes involved in astaxanthin biosynthesis showed discrete distributions on the chromosome. Moreover, the overexpression of exogenous E. coli idi in Sphingomonas sp. ATCC 55669 increased astaxanthin production by 5.4-fold. This study described a new astaxanthin producer and provided more biosynthesis components for bioengineering of astaxanthin in the future. © 2015 The Authors. Biotechnology Journal published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


July 7, 2019

Long read and single molecule DNA sequencing simplifies genome assembly and TAL effector gene analysis of Xanthomonas translucens.

The species Xanthomonas translucens encompasses a complex of bacterial strains that cause diseases and yield loss on grass species including important cereal crops. Three pathovars, X. translucens pv. undulosa, X. translucens pv. translucens and X. translucens pv.cerealis, have been described as pathogens of wheat, barley, and oats. However, no complete genome sequence for a strain of this complex is currently available.A complete genome sequence of X. translucens pv. undulosa strain XT4699 was obtained by using PacBio long read, single molecule, real time (SMRT) DNA sequences and Illumina sequences. Draft genome sequences of nineteen additional X. translucens strains, which were collected from wheat or barley in different regions and at different times, were generated by Illumina sequencing. Phylogenetic relationships among different Xanthomonas strains indicates that X. translucens are members of a distinct clade from so-called group 2 xanthomonads and three pathovars of this species, undulosa, translucens and cerealis, represent distinct subclades in the group 1 clade. Knockout mutation of type III secretion system of XT4699 eliminated the ability to cause water-soaking symptoms on wheat and barley and resulted in a reduction in populations on wheat in comparison to the wild type strain. Sequence comparison of X. translucens strains revealed the genetic variation on type III effector repertories among different pathovars or within one pathovar. The full genome sequence of XT4699 reveals the presence of eight members of the Transcription-Activator Like (TAL) effector genes, which are phylogenetically distant from previous known TAL effector genes of group 2 xanthomonads. Microarray and qRT-PCR analyses revealed TAL effector-specific wheat gene expression modulation.PacBio long read sequencing facilitates the assembly of Xanthomonas genomes and the multiple TAL effector genes, which are difficult to assemble from short read platforms. The complete genome sequence of X. translucens pv. undulosa strain XT4699 and draft genome sequences of nineteen additional X. translucens strains provides a resource for further genetic analyses of pathogenic diversity and host range of the X. translucens species complex. TAL effectors of XT4699 strain play roles in modulating wheat host gene expressions.


July 7, 2019

Draft genome sequence of Alternaria alternata ATCC 34957.

We report the draft genome sequence of Alternaria alternata ATCC 34957. This strain was previously reported to produce alternariol and alternariol monomethyl ether on weathered grain sorghum. The genome was sequenced with PacBio technology and assembled into 27 scaffolds with a total genome size of 33.5 Mb. Copyright © 2016 Nguyen et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.