Menu
July 7, 2019

Echinochloa crus-galli genome analysis provides insight into its adaptation and invasiveness as a weed.

Barnyardgrass (Echinochloa crus-galli) is a pernicious weed in agricultural fields worldwide. The molecular mechanisms underlying its success in the absence of human intervention are presently unknown. Here we report a draft genome sequence of the hexaploid species E. crus-galli, i.e., a 1.27?Gb assembly representing 90.7% of the predicted genome size. An extremely large repertoire of genes encoding cytochrome P450 monooxygenases and glutathione S-transferases associated with detoxification are found. Two gene clusters involved in the biosynthesis of an allelochemical 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) and a phytoalexin momilactone A are found in the E. crus-galli genome, respectively. The allelochemical DIMBOA gene cluster is activated in response to co-cultivation with rice, while the phytoalexin momilactone A gene cluster specifically to infection by pathogenic Pyricularia oryzae. Our results provide a new understanding of the molecular mechanisms underlying the extreme adaptation of the weed.


July 7, 2019

Determination of the genome and primary transcriptome of syngas fermenting Eubacterium limosum ATCC 8486.

Autotrophic conversion of CO2 to value-added biochemicals has received considerable attention as a sustainable route to replace fossil fuels. Particularly, anaerobic acetogenic bacteria are naturally capable of reducing CO2 or CO to various metabolites. To fully utilize their biosynthetic potential, an understanding of acetogenesis-related genes and their regulatory elements is required. Here, we completed the genome sequence of the syngas fermenting Eubacterium limosum ATCC 8486 and determined its transcription start sites (TSS). We constructed a 4.4?Mb long circular genome with a GC content of 47.2% and 4,090 protein encoding genes. To understand the transcriptional and translational regulation, the primary transcriptome was augmented, identifying 1,458 TSSs containing a high pyrimidine (T/C) and purine nucleotide (A/G) content at the -1 and +1 position, respectively, along with 1,253 5′-untranslated regions, and principal promoter elements such as -10 (TATAAT) and -35 (TTGACA), and Shine-Dalgarno motifs (GGAGR). Further analysis revealed 93 non-coding RNAs, including one for potential transcriptional regulation of the hydrogenase complex via interaction with molybdenum or tungsten cofactors, which in turn controls formate dehydrogenase activity of the initial step of Wood-Ljungdahl pathway. Our results provide comprehensive genomic information for strain engineering to enhance the syngas fermenting capacity of acetogenic bacteria.


July 7, 2019

Complete genome sequence of the pathogenic Vibrio vulnificus type strain ATCC 27562.

Vibrio vulnificus has the highest death rate and economic burden per case of any foodborne pathogen in the United States. A complete genome sequence of the type strain promotes comparative analyses with other clinical and environmental isolates, improving our understanding of this important human pathogen and successful environmental organism. Copyright © 2017 Rusch and Rowe-Magnus.


July 7, 2019

Genome architecture and evolution of a unichromosomal asexual nematode.

Asexual reproduction in animals, though rare, is the main or exclusive mode of reproduction in some long-lived lineages. The longevity of asexual clades may be correlated with the maintenance of heterozygosity by mechanisms that rearrange genomes and reduce recombination. Asexual species thus provide an opportunity to gain insight into the relationship between molecular changes, genome architecture, and cellular processes. Here we report the genome sequence of the parthenogenetic nematode Diploscapter pachys with only one chromosome pair. We show that this unichromosomal architecture is shared by a long-lived clade of asexual nematodes closely related to the genetic model organism Caenorhabditis elegans. Analysis of the genome assembly reveals that the unitary chromosome arose through fusion of six ancestral chromosomes, with extensive rearrangement among neighboring regions. Typical nematode telomeres and telomeric protection-encoding genes are lacking. Most regions show significant heterozygosity; homozygosity is largely concentrated to one region and attributed to gene conversion. Cell-biological and molecular evidence is consistent with the absence of key features of meiosis I, including synapsis and recombination. We propose that D. pachys preserves heterozygosity and produces diploid embryos without fertilization through a truncated meiosis. As a prelude to functional studies, we demonstrate that D. pachys is amenable to experimental manipulation by RNA interference. Copyright © 2017 Elsevier Ltd. All rights reserved.


July 7, 2019

Lightweight BWT and LCP merging via the gap algorithm

Recently, Holt and McMillan [Bioinformatics 2014, ACM-BCB 2014] have proposed a simple and elegant algorithm to merge the Burrows-Wheeler transforms of a collection of strings. In this paper we show that their algorithm can be improved so that, in addition to the BWTs, it also merges the Longest Common Prefix (LCP) arrays. Because of its small memory footprint this new algorithm can be used for the final merge of BWT and LCP arrays computed by a faster but memory intensive construction algorithm.


July 7, 2019

Identification of low allele frequency mosaic mutations in Alzheimer disease

Germline mutations ofAPP,PSEN1, andPSEN2 genes cause autosomal dominant Alzheimer disease (AD). Somatic variants of the same genes may underlie pathogenesis in sporadic AD, which is the most prevalent form of the disease. Importantly, such somatic variants may be present at very low allelic frequency, confined to the brain, and are thus very difficult or impossible to detect in blood-derived DNA. Ever-refined methodologies to identify mutations present in a fraction of the DNA of the original tissue are rapidly transforming our understanding of DNA mutation and their role in complex pathologies such as tumors. These methods stand poised to test to what extend somatic variants may play a role in AD and other neurodegenerative diseases.


July 7, 2019

Single-virion sequencing of lamivudine-treated HBV populations reveal population evolution dynamics and demographic history.

Viral populations are complex, dynamic, and fast evolving. The evolution of groups of closely related viruses in a competitive environment is termed quasispecies. To fully understand the role that quasispecies play in viral evolution, characterizing the trajectories of viral genotypes in an evolving population is the key. In particular, long-range haplotype information for thousands of individual viruses is critical; yet generating this information is non-trivial. Popular deep sequencing methods generate relatively short reads that do not preserve linkage information, while third generation sequencing methods have higher error rates that make detection of low frequency mutations a bioinformatics challenge. Here we applied BAsE-Seq, an Illumina-based single-virion sequencing technology, to eight samples from four chronic hepatitis B (CHB) patients – once before antiviral treatment and once after viral rebound due to resistance.With single-virion sequencing, we obtained 248-8796 single-virion sequences per sample, which allowed us to find evidence for both hard and soft selective sweeps. We were able to reconstruct population demographic history that was independently verified by clinically collected data. We further verified four of the samples independently through PacBio SMRT and Illumina Pooled deep sequencing.Overall, we showed that single-virion sequencing yields insight into viral evolution and population dynamics in an efficient and high throughput manner. We believe that single-virion sequencing is widely applicable to the study of viral evolution in the context of drug resistance and host adaptation, allows differentiation between soft or hard selective sweeps, and may be useful in the reconstruction of intra-host viral population demographic history.


July 7, 2019

The sea cucumber genome provides insights into morphological evolution and visceral regeneration.

Apart from sharing common ancestry with chordates, sea cucumbers exhibit a unique morphology and exceptional regenerative capacity. Here we present the complete genome sequence of an economically important sea cucumber, A. japonicus, generated using Illumina and PacBio platforms, to achieve an assembly of approximately 805 Mb (contig N50 of 190 Kb and scaffold N50 of 486 Kb), with 30,350 protein-coding genes and high continuity. We used this resource to explore key genetic mechanisms behind the unique biological characters of sea cucumbers. Phylogenetic and comparative genomic analyses revealed the presence of marker genes associated with notochord and gill slits, suggesting that these chordate features were present in ancestral echinoderms. The unique shape and weak mineralization of the sea cucumber adult body were also preliminarily explained by the contraction of biomineralization genes. Genome, transcriptome, and proteome analyses of organ regrowth after induced evisceration provided insight into the molecular underpinnings of visceral regeneration, including a specific tandem-duplicated prostatic secretory protein of 94 amino acids (PSP94)-like gene family and a significantly expanded fibrinogen-related protein (FREP) gene family. This high-quality genome resource will provide a useful framework for future research into biological processes and evolution in deuterostomes, including remarkable regenerative abilities that could have medical applications. Moreover, the multiomics data will be of prime value for commercial sea cucumber breeding programs.


July 7, 2019

Genome expansion and lineage-specific genetic innovations in the forest pathogenic fungi Armillaria.

Armillaria species are both devastating forest pathogens and some of the largest terrestrial organisms on Earth. They forage for hosts and achieve immense colony sizes via rhizomorphs, root-like multicellular structures of clonal dispersal. Here, we sequenced and analysed the genomes of four Armillaria species and performed RNA sequencing and quantitative proteomic analysis on the invasive and reproductive developmental stages of A.?ostoyae. Comparison with 22 related fungi revealed a significant genome expansion in Armillaria, affecting several pathogenicity-related genes, lignocellulose-degrading enzymes and lineage-specific genes expressed during rhizomorph development. Rhizomorphs express an evolutionarily young transcriptome that shares features with the transcriptomes of both fruiting bodies and vegetative mycelia. Several genes show concomitant upregulation in rhizomorphs and fruiting bodies and share cis-regulatory signatures in their promoters, providing genetic and regulatory insights into complex multicellularity in fungi. Our results suggest that the evolution of the unique dispersal and pathogenicity mechanisms of Armillaria might have drawn upon ancestral genetic toolkits for wood-decay, morphogenesis and complex multicellularity.


July 7, 2019

Genomic variation and evolution of Vibrio parahaemolyticus ST36 over the course of a transcontinental epidemic expansion.

Vibrio parahaemolyticus is the leading cause of seafood-related infections with illnesses undergoing a geographic expansion. In this process of expansion, the most fundamental change has been the transition from infections caused by local strains to the surge of pandemic clonal types. Pandemic clone sequence type 3 (ST3) was the only example of transcontinental spreading until 2012, when ST36 was detected outside the region where it is endemic in the U.S. Pacific Northwest causing infections along the U.S. northeast coast and Spain. Here, we used genome-wide analyses to reconstruct the evolutionary history of the V. parahaemolyticus ST36 clone over the course of its geographic expansion during the previous 25 years. The origin of this lineage was estimated to be in ~1985. By 1995, a new variant emerged in the region and quickly replaced the old clone, which has not been detected since 2000. The new Pacific Northwest (PNW) lineage was responsible for the first cases associated with this clone outside the Pacific Northwest region. After several introductions into the northeast coast, the new PNW clone differentiated into a highly dynamic group that continues to cause illness on the northeast coast of the United States. Surprisingly, the strains detected in Europe in 2012 diverged from this ancestral group around 2000 and have conserved genetic features present only in the old PNW lineage. Recombination was identified as the major driver of diversification, with some preliminary observations suggesting a trend toward a more specialized lifestyle, which may represent a critical element in the expansion of epidemics under scenarios of coastal warming.IMPORTANCEVibrio parahaemolyticus and Vibrio cholerae represent the only two instances of pandemic expansions of human pathogens originating in the marine environment. However, while the current pandemic of V. cholerae emerged more than 50 years ago, the global expansion of V. parahaemolyticus is a recent phenomenon. These modern expansions provide an exceptional opportunity to study the evolutionary process of these pathogens at first hand and gain an understanding of the mechanisms shaping the epidemic dynamics of these diseases, in particular, the emergence, dispersal, and successful introduction in new regions facilitating global spreading of infections. In this study, we used genomic analysis to examine the evolutionary divergence that has occurred over the course of the most recent transcontinental expansion of a pathogenic Vibrio, the spreading of the V. parahaemolyticus sequence type 36 clone from the region where it is endemic on the Pacific coast of North America to the east coast of the United States and finally to the west coast of Europe.


July 7, 2019

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote.© 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

N6-adenine DNA methylation is associated with the linker DNA of H2A.Z-containing well-positioned nucleosomes in Pol II-transcribed genes in Tetrahymena.

DNA N6-methyladenine (6mA) is newly rediscovered as a potential epigenetic mark across a more diverse range of eukaryotes than previously realized. As a unicellular model organism, Tetrahymena thermophila is among the first eukaryotes reported to contain 6mA modification. However, lack of comprehensive information about 6mA distribution hinders further investigations into its function and regulatory mechanism. In this study, we provide the first genome-wide, base pair-resolution map of 6mA in Tetrahymena by applying single-molecule real-time (SMRT) sequencing. We provide evidence that 6mA occurs mostly in the AT motif of the linker DNA regions. More strikingly, these linker DNA regions with 6mA are usually flanked by well-positioned nucleosomes and/or H2A.Z-containing nucleosomes. We also find that 6mA is exclusively associated with RNA polymerase II (Pol II)-transcribed genes, but is not an unambiguous mark for active transcription. These results support that 6mA is an integral part of the chromatin landscape shaped by adenosine triphosphate (ATP)-dependent chromatin remodeling and transcription.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.