September 22, 2019

Long-read sequencing data analysis for yeasts.

Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applied to an S. cerevisiae strain, LRSDAY takes ~41 h to generate a complete and well-annotated genome from ~100× Pacific Biosciences (PacBio) data when running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.


September 22, 2019

Mycobacterial biomaterials and resources for researchers.

There are many resources available to mycobacterial researchers, including culture collections around the world that distribute biomaterials to the general scientific community, genomic and clinical databases, and powerful bioinformatics tools. However, many of these resources may be unknown to the research community. This review article aims to summarize and publicize many of these resources, thus strengthening the quality and reproducibility of mycobacterial research by providing the scientific community access to authenticated and quality-controlled biomaterials and a wealth of information, analytical tools and research opportunities.


September 22, 2019

Multiplex assessment of protein variant abundance by massively parallel sequencing.

Determining the pathogenicity of genetic variants is a critical challenge, and functional assessment is often the only option. Experimentally characterizing millions of possible missense variants in thousands of clinically important genes requires generalizable, scalable assays. We describe variant abundance by massively parallel sequencing (VAMP-seq), which measures the effects of thousands of missense variants of a protein on intracellular abundance simultaneously. We apply VAMP-seq to quantify the abundance of 7,801 single-amino-acid variants of PTEN and TPMT, proteins in which functional variants are clinically actionable. We identify 1,138 PTEN and 777 TPMT variants that result in low protein abundance, and may be pathogenic or alter drug metabolism, respectively. We observe selection for low-abundance PTEN variants in cancer, and show that p.Pro38Ser, which accounts for ~10% of PTEN missense variants in melanoma, functions via a dominant-negative mechanism. Finally, we demonstrate that VAMP-seq is applicable to other genes, highlighting its generalizability.


September 22, 2019

Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus.

We sequenced the Hyposidra talaca NPV (HytaNPV) double-stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculoviruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV, which infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis obliqua, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motifs consistent with the temporal expression of the genes observed in the RNA-seq data.


September 22, 2019

Diversity and evolution of the emerging Pandoraviridae family.

With DNA genomes reaching 2.5 Mb packed in particles of bacterium-like shape and dimension, the first two Acanthamoeba-infecting pandoraviruses remained up to now the most complex viruses since their discovery in 2013. Our isolation of three new strains from distant locations and environments is now used to perform the first comparative genomics analysis of the emerging worldwide-distributed Pandoraviridae family. Thorough annotation of the genomes combining transcriptomic, proteomic, and bioinformatic analyses reveals many non-coding transcripts and significantly reduces the former set of predicted protein-coding genes. Here we show that the pandoraviruses exhibit an open pan-genome, the enormous size of which is not adequately explained by gene duplications or horizontal transfers. As most of the strain-specific genes have no extant homolog and exhibit statistical features comparable to intergenic regions, we suggest that de novo gene creation could contribute to the evolution of the giant pandoravirus genomes.


September 22, 2019

Parallels between experimental and natural evolution of legume symbionts.

The emergence of symbiotic interactions has been studied using population genomics in nature and experimental evolution in the laboratory, but the parallels between these processes remain unknown. Here we compare the emergence of rhizobia after the horizontal transfer of a symbiotic plasmid in natural populations of Cupriavidus taiwanensis, over 10 MY ago, with the experimental evolution of symbiotic Ralstonia solanacearum for a few hundred generations. In spite of major differences in terms of time span, environment, genetic background, and phenotypic achievement, both processes resulted in rapid genetic diversification dominated by purifying selection. We observe no adaptation in the plasmid carrying the genes responsible for the ecological transition. Instead, adaptation was associated with positive selection in a set of genes that led to the co-option of the same quorum-sensing system in both processes. Our results provide evidence for similarities in experimental and natural evolutionary transitions and highlight the potential of comparisons between both processes to understand symbiogenesis.


September 22, 2019

Phylogenomic analysis of Lactobacillus curvatus reveals two lineages distinguished by genes for fermenting plant-derived carbohydrates.

Lactobacillus curvatus is a lactic acid bacterium encountered in many different types of fermented food (meat, seafood, vegetables, and cereals). Although this species plays an important role in the preservation of these foods, few attempts have been made to assess its genomic diversity. This study uses comparative analyses of 13 published genomes (complete or draft) to better understand the evolutionary processes acting on the genome of this species. Phylogenomic analysis, based on a coalescent model of evolution, revealed that the 6,742 sites of single nucleotide polymorphism within the L. curvatus core genome delineate two major groups, with lineage 1 represented by the newly sequenced strain FLEC03, and lineage 2 represented by the type-strain DSM20019. The two lineages could also be distinguished by the content of their accessory genome, which sheds light on a long-term evolutionary process of lineage-dependent genetic acquisition and the possibility of population structure. Interestingly, one clade from lineage 2 shared more accessory genes with strains of lineage 1 than with other strains of lineage 2, indicating recent convergence in carbohydrate catabolism. Both lineages had a wide repertoire of accessory genes involved in the fermentation of plant-derived carbohydrates that are released from polymers of α/β-glucans, α/β-fructans, and N-acetylglucosan. Other gene clusters were distributed among strains according to the type of food from which the strains were isolated. These results give new insight into the ecological niches in which L. curvatus may naturally thrive (such as silage or compost heaps) in addition to fermented food.


September 22, 2019

A reference genome of the European beech (Fagus sylvatica L.).

The European beech is arguably the most important climax broad-leaved tree species in Central Europe, widely planted for its valuable wood. Here, we report the 542 Mb draft genome sequence of an up to 300-year-old individual (Bhaga) from an undisturbed stand in the Kellerwald-Edersee National Park in central Germany. Using a hybrid assembly approach, Illumina reads with short- and long-insert libraries, coupled with long Pacific Biosciences reads, we obtained an assembled genome size of 542 Mb, in line with flow cytometric genome size estimation. The largest scaffold was of 1.15 Mb, the N50 length was 145 kb, and the L50 count was 983. The assembly contained 0.12% of Ns. A Benchmarking with Universal Single-Copy Orthologs (BUSCO) analysis retrieved 94% complete BUSCO genes, well in the range of other high-quality draft genomes of trees. A total of 62,012 protein-coding genes were predicted, assisted by transcriptome sequencing. In addition, we are reporting an efficient method for extracting high-molecular-weight DNA from dormant buds, by which contamination by environmental bacteria and fungi was kept at a minimum. The assembled genome will be a valuable resource and reference for future population genomics studies on the evolution and past climate change adaptation of beech and will be helpful for identifying genes, e.g., involved in drought tolerance, in order to select and breed individuals to adapt forestry to climate change in Europe. A continuously updated genome browser and download page can be accessed from beechgenome.net, which will include future genome versions of the reference individual Bhaga, as new sequencing approaches develop.
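
The N50 length and L50 count quoted in this abstract are standard assembly contiguity statistics. As a minimal sketch of how they are conventionally defined (not the authors' pipeline), the Python below sorts scaffold lengths from longest to shortest and walks down the list until at least half of the total assembly length is covered. The function name `n50_l50` and the toy lengths are illustrative assumptions, not values from the beech assembly.

```python
from typing import Sequence, Tuple


def n50_l50(lengths: Sequence[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of scaffold lengths.

    N50 is the length of the scaffold at which the running total, taken
    from the longest scaffold downwards, first reaches half of the total
    assembly length. L50 is the number of scaffolds needed to get there.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    raise AssertionError("unreachable for non-empty input")


# Hypothetical toy assembly (lengths in arbitrary units), for illustration only.
toy_scaffolds = [100, 80, 60, 40, 20]
n50, l50 = n50_l50(toy_scaffolds)
print(f"N50 = {n50}, L50 = {l50}")
```

In the toy example the total length is 300, so the two longest scaffolds (100 + 80) are the first to cover half of it, giving an N50 of 80 and an L50 of 2. Read the same way, the reported L50 of 983 means that the 983 longest scaffolds together span at least half of the 542 Mb assembly, and the shortest of those is about 145 kb long.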


September 22, 2019

Footprints of parasitism in the genome of the parasitic flowering plant Cuscuta campestris.

A parasitic lifestyle, where plants procure some or all of their nutrients from other living plants, has evolved independently in many dicotyledonous plant families and is a major threat for agriculture globally. Nevertheless, no genome sequence of a parasitic plant has been reported to date. Here we describe the genome sequence of the parasitic field dodder, Cuscuta campestris. The genome contains signatures of a fairly recent whole-genome duplication and lacks genes for pathways superfluous to a parasitic lifestyle. Specifically, genes needed for high photosynthetic activity are lost, explaining the low photosynthesis rates displayed by the parasite. Moreover, several genes involved in nutrient uptake processes from the soil are lost. On the other hand, evidence for horizontal gene transfer by way of genomic DNA integration from the parasite’s hosts is found. We conclude that the parasitic lifestyle has left characteristic footprints in the C. campestris genome.


September 22, 2019

Landscape of the genome and host cell response of Mycobacterium shigaense reveals pathogenic features.

A systems approach was used to explore the genome and transcriptome of Mycobacterium shigaense, a new opportunistic pathogen isolated from a patient with a skin infection, and the host response transcriptome was assessed using a macrophage infection model. The M. shigaense genome comprises 5,207,883 bp, with 67.2% G+C content and 5,098 predicted coding genes. Evolutionarily, the bacterium belongs to a cluster in the phylogenetic tree along with three target opportunistic pathogenic strains, namely, M. avium, M. triplex and M. simiae. Potential virulence genes are indeed expressed by M. shigaense under culture conditions. Phenotypically, M. shigaense had similar infection and replication capacities in a macrophage model as the opportunistic species compared to M. tuberculosis. M. shigaense activated NF-κB, TNF, cytokines and chemokines in the host innate immune-related signaling pathways and elicited an early response shared with pathogenic bacilli except M. tuberculosis. M. shigaense upregulated specific host response genes such as TLR7, CCL4 and CXCL5. We performed an integrated and comparative analysis of M. shigaense. Multigroup comparison indicated certain differences with typical pathogenic bacilli in terms of gene features and the macrophage response.


September 22, 2019

Sea cucumber genome provides insights into saponin biosynthesis and aestivation regulation.

Echinoderms exhibit several fascinating evolutionary innovations that are rarely seen in the animal kingdom, but how these animals attained such features is not well understood. Here we report the sequencing and analysis of the genome and extensive transcriptomes of the sea cucumber Apostichopus japonicus, a species from a special echinoderm group with extraordinary potential for saponin synthesis, aestivation and organ regeneration. The sea cucumber does not possess a reorganized Hox cluster as previously assumed for all echinoderms, and the spatial expression of Hox7 and Hox11/13b potentially guides the embryo-to-larva axial transformation. Contrary to the typical production of lanosterol in animal cholesterol synthesis, the oxidosqualene cyclase of sea cucumber produces parkeol for saponin synthesis and has “plant-like” motifs suggestive of convergent evolution. The transcriptional factors Klf2 and Egr1 are identified as key regulators of aestivation, probably exerting their effects through a clock gene-controlled process. Intestinal hypometabolism during aestivation is driven by the DNA hypermethylation of various metabolic gene pathways, whereas the transcriptional network of intestine regeneration involves diverse signaling pathways, including Wnt, Hippo and FGF. Decoding the sea cucumber genome provides a new avenue for an in-depth understanding of the extraordinary features of sea cucumbers and other echinoderms.


September 22, 2019

Characteristics of carbapenem-resistant Enterobacteriaceae in ready-to-eat vegetables in China.

Vegetables harboring bacteria resistant to antibiotics are a growing food safety issue. However, data concerning carbapenem-resistant Enterobacteriaceae (CRE) in ready-to-eat fresh vegetables are still scarce. In this study, 411 vegetable samples from 36 supermarkets or farmer’s markets in 18 cities in China were analyzed for CRE. Carbapenemase-encoding genes and other resistance genes were analyzed among the CRE isolates. Plasmids carrying carbapenemase genes were studied by conjugation, replicon typing, S1-PFGE southern blot, restriction fragment length polymorphism (RFLP), and sequencing. CRE isolates were also analyzed by pulsed-field gel electrophoresis (PFGE). Ten vegetable samples yielded one or more CRE isolates. The highest detection rate of CRE (14.3%, 4/28) was found in curly endive. Twelve CRE isolates were obtained and all showed multidrug resistance: Escherichia coli, 5; Citrobacter freundii, 5; and Klebsiella pneumoniae, 2. All E. coli and C. freundii carried blaNDM, while K. pneumoniae harbored blaKPC-2. Notably, E. coli with blaNDM and ST23 hypervirulent Klebsiella pneumoniae (hvKP) carrying blaKPC-2 were found in the same cucumber sample, and clonal spread of E. coli, C. freundii, and K. pneumoniae isolates was observed between vegetable types and/or cities. IncX3 plasmids carrying blaNDM from E. coli and C. freundii showed identical or highly similar RFLP patterns, and the sequenced IncX3 plasmid from cucumber was also identical or highly similar (99%) to the IncX3 plasmids from clinical patients reported in other countries, while blaKPC-2 in K. pneumoniae was mediated by similar F35:A-:B1 plasmids. Our results suggest that both clonal expansion and horizontal transmission of IncX3- or F35:A-:B1-type plasmids may mediate the spread of CRE in ready-to-eat vegetables in China. The presence of CRE in ready-to-eat vegetables is alarming and constitutes a food safety issue. To our knowledge, this is the first report of either C. freundii carrying blaNDM or K. pneumoniae harboring blaKPC-2 in vegetables. This is also the first report of an ST23 carbapenem-resistant hvKP strain in vegetables.


September 22, 2019

Draft genome sequence of Annulohypoxylon stygium, Aspergillus mulundensis, Berkeleyomyces basicola (syn. Thielaviopsis basicola), Ceratocystis smalleyi, two Cercospora beticola strains, Coleophoma cylindrospora, Fusarium fracticaudum, Phialophora cf. hyalina, and Morchella septimelata.

Draft genomes of the species Annulohypoxylon stygium, Aspergillus mulundensis, Berkeleyomyces basicola (syn. Thielaviopsis basicola), Ceratocystis smalleyi, two Cercospora beticola strains, Coleophoma cylindrospora, Fusarium fracticaudum, Phialophora cf. hyalina and Morchella septimelata are presented. Both mating types (MAT1-1 and MAT1-2) of Cercospora beticola are included. Two strains of Coleophoma cylindrospora that produce sulfated homotyrosine echinocandin variants, FR209602, FR220897 and FR220899, are presented. The sequencing of Aspergillus mulundensis, Coleophoma cylindrospora and Phialophora cf. hyalina has enabled mapping of the gene clusters encoding the chemical diversity from the echinocandin pathways, providing data that reveal the complexity of secondary metabolism in these different species. Overall, these genomes provide a valuable resource for understanding the molecular processes underlying pathogenicity (in some cases), biology and toxin production of these economically important fungi.


September 22, 2019

Adaptation of Pseudomonas aeruginosa to phage PaP1 predation via O-antigen polymerase mutation.

Adaptation of bacteria to phage predation poses a major obstacle for phage therapy. Bacteria adopt multiple mechanisms, such as inhibition of phage adsorption and CRISPR/Cas systems, to resist phage infection. Here, a phage-resistant mutant of Pseudomonas aeruginosa strain PA1 under the infection of lytic phage PaP1 was selected for further study. The PaP1-resistant variant, termed PA1RG, showed decreased adsorption to PaP1 and was devoid of long chain O-antigen on its cell envelope. Whole genome sequencing and comparative analysis revealed a single nucleotide mutation in the gene PA1S_08510, which encodes the O-antigen polymerase Wzy that is involved in lipopolysaccharide (LPS) biosynthesis. PA1_Wzy was classified into the O6 serotype based on sequence homology analysis and adopts a transmembrane topology similar to that seen in P. aeruginosa strain PAO1. Complementation of gene wzy in trans enabled the mutant PA1RG to produce the normal LPS pattern with long chain O-antigen and restored the susceptibility of PA1RG to phage PaP1 infection. While wzy mutation did not affect bacterial growth, mutant PA1RG exhibited decreased biofilm production, suggesting a fitness cost of PA1 associated with resistance to phage PaP1 predation. This study uncovered the mechanism responsible for PA1RG resistance to phage PaP1 via wzy mutation and revealed the role of phages in regulating bacterial behavior.


September 22, 2019

Recurrent loss, horizontal transfer, and the obscure origins of mitochondrial introns in diatoms (Bacillariophyta).

We sequenced mitochondrial genomes from five diverse diatoms (Toxarium undulatum, Psammoneis japonica, Eunotia naegelii, Cylindrotheca closterium, and Nitzschia sp.), chosen to fill important phylogenetic gaps and help us characterize broadscale patterns of mitochondrial genome evolution in diatoms. Although gene content was strongly conserved, intron content varied widely across species. The vast majority of introns were of group II type and were located in the cox1 or rnl genes. Although recurrent intron loss appears to be the principal underlying cause of the sporadic distributions of mitochondrial introns across diatoms, phylogenetic analyses showed that intron distributions superficially consistent with a recurrent-loss model were sometimes more complicated, implicating horizontal transfer as a likely mechanism of intron acquisition as well. It was not clear, however, whether diatoms were the donors or recipients of horizontally transferred introns, highlighting a general challenge in resolving the evolutionary histories of many diatom mitochondrial introns. Although some of these histories may become clearer as more genomes are sampled, high rates of intron loss suggest that the origins of many diatom mitochondrial introns are likely to remain unclear.

