Menu
April 21, 2020

Sequential evolution of virulence and resistance during clonal spread of community-acquired methicillin-resistant Staphylococcus aureus.

The past two decades have witnessed an alarming expansion of staphylococcal disease caused by community-acquired methicillin-resistant Staphylococcus aureus (CA-MRSA). The factors underlying the epidemic expansion of CA-MRSA lineages such as USA300, the predominant CA-MRSA clone in the United States, are largely unknown. Previously described virulence and antimicrobial resistance genes that promote the dissemination of CA-MRSA are carried by mobile genetic elements, including phages and plasmids. Here, we used high-resolution genomics and experimental infections to characterize the evolution of a USA300 variant plaguing a patient population at increased risk of infection to understand the mechanisms underlying the emergence of genetic elements that facilitate clonal spread of the pathogen. Genetic analyses provided conclusive evidence that fitness (manifest as emergence of a dominant clone) changed coincidently with the stepwise emergence of (i) a unique prophage and mutation of the regulator of the pyrimidine nucleotide biosynthetic operon that promoted abscess formation and colonization, respectively, thereby priming the clone for success; and (ii) a unique plasmid that conferred resistance to two topical microbiocides, mupirocin and chlorhexidine, frequently used for decolonization and infection prevention. The resistance plasmid evolved through successive incorporation of DNA elements from non-S. aureus spp. into an indigenous cryptic plasmid, suggesting a mechanism for interspecies genetic exchange that promotes antimicrobial resistance. Collectively, the data suggest that clonal spread in a vulnerable population resulted from extensive clinical intervention and intense selection pressure toward a pathogen lifestyle that involved the evolution of consequential mutations and mobile genetic elements.


April 21, 2020

Maleness-on-the-Y (MoY) orchestrates male sex determination in major agricultural fruit fly pests.

In insects, rapidly evolving primary sex-determining signals are transduced by a conserved regulatory module controlling sexual differentiation. In the agricultural pest Ceratitis capitata (Mediterranean fruit fly, or Medfly), we identified a Y-linked gene, Maleness-on-the-Y (MoY), encoding a small protein that is necessary and sufficient for male development. Silencing or disruption of MoY in XY embryos causes feminization, whereas overexpression of MoY in XX embryos induces masculinization. Crosses between transformed XY females and XX males give rise to males and females, indicating that a Y chromosome can be transmitted by XY females. MoY is Y-linked and functionally conserved in other species of the Tephritidae family, highlighting its potential to serve as a tool for developing more effective control strategies against these major agricultural insect pests.Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


April 21, 2020

Genetic basis for the establishment of endosymbiosis in Paramecium.

The single-celled ciliate Paramecium bursaria is an indispensable model for investigating endosymbiosis between protists and green-algal symbionts. To elucidate the mechanism of this type of endosymbiosis, we combined PacBio and Illumina sequencing to assemble a high-quality and near-complete macronuclear genome of P. bursaria. The genomic characteristics and phylogenetic analyses indicate that P. bursaria is the basal clade of the Paramecium genus. Through comparative genomic analyses with its close relatives, we found that P. bursaria encodes more genes related to nitrogen metabolism and mineral absorption, but encodes fewer genes involved in oxygen binding and N-glycan biosynthesis. A comparison of the transcriptomic profiles between P. bursaria with and without endosymbiotic Chlorella showed differential expression of a wide range of metabolic genes. We selected 32 most differentially expressed genes to perform RNA interference experiment in P. bursaria, and found that P. bursaria can regulate the abundance of their symbionts through glutamine supply. This study provides novel insights into Paramecium evolution and will extend our knowledge of the molecular mechanism for the induction of endosymbiosis between P. bursaria and green algae.


April 21, 2020

The role of genomic structural variation in the genetic improvement of polyploid crops

Many of our major crop species are polyploids, containing more than one genome or set of chromosomes. Polyploid crops present unique challenges, including difficulties in genome assembly, in discriminating between multiple gene and sequence copies, and in genetic mapping, hindering use of genomic data for genetics and breeding. Polyploid genomes may also be more prone to containing structural variation, such as loss of gene copies or sequences (presence–absence variation) and the presence of genes or sequences in multiple copies (copy-number variation). Although the two main types of genomic structural variation commonly identified are presence–absence variation and copy-number variation, we propose that homeologous exchanges constitute a third major form of genomic structural variation in polyploids. Homeologous exchanges involve the replacement of one genomic segment by a similar copy from another genome or ancestrally duplicated region, and are known to be extremely common in polyploids. Detecting all kinds of genomic structural variation is challenging, but recent advances such as optical mapping and long-read sequencing offer potential strategies to help identify structural variants even in complex polyploid genomes. All three major types of genomic structural variation (presence–absence, copy-number, and homeologous exchange) are now known to influence phenotypes in crop plants, with examples of flowering time, frost tolerance, and adaptive and agronomic traits. In this review, we summarize the challenges of genome analysis in polyploid crops, describe the various types of genomic structural variation and the genomics technologies and data that can be used to detect them, and collate information produced to date related to the impact of genomic structural variation on crop phenotypes. We highlight the importance of genomic structural variation for the future genetic improvement of polyploid crops.


April 21, 2020

A Species-Wide Inventory of NLR Genes and Alleles in Arabidopsis thaliana.

Infectious disease is both a major force of selection in nature and a prime cause of yield loss in agriculture. In plants, disease resistance is often conferred by nucleotide-binding leucine-rich repeat (NLR) proteins, intracellular immune receptors that recognize pathogen proteins and their effects on the host. Consistent with extensive balancing and positive selection, NLRs are encoded by one of the most variable gene families in plants, but the true extent of intraspecific NLR diversity has been unclear. Here, we define a nearly complete species-wide pan-NLRome in Arabidopsis thaliana based on sequence enrichment and long-read sequencing. The pan-NLRome largely saturates with approximately 40 well-chosen wild strains, with half of the pan-NLRome being present in most accessions. We chart NLR architectural diversity, identify new architectures, and quantify selective forces that act on specific NLRs and NLR domains. Our study provides a blueprint for defining pan-NLRomes.Copyright © 2019 The Author(s). Published by Elsevier Inc. All rights reserved.


April 21, 2020

A survey and evaluations of histogram-based statistics in alignment-free sequence comparison.

Since the dawn of the bioinformatics field, sequence alignment scores have been the main method for comparing sequences. However, alignment algorithms are quadratic, requiring long execution time. As alternatives, scientists have developed tens of alignment-free statistics for measuring the similarity between two sequences.We surveyed tens of alignment-free k-mer statistics. Additionally, we evaluated 33 statistics and multiplicative combinations between the statistics and/or their squares. These statistics are calculated on two k-mer histograms representing two sequences. Our evaluations using global alignment scores revealed that the majority of the statistics are sensitive and capable of finding similar sequences to a query sequence. Therefore, any of these statistics can filter out dissimilar sequences quickly. Further, we observed that multiplicative combinations of the statistics are highly correlated with the identity score. Furthermore, combinations involving sequence length difference or Earth Mover’s distance, which takes the length difference into account, are always among the highest correlated paired statistics with identity scores. Similarly, paired statistics including length difference or Earth Mover’s distance are among the best performers in finding the K-closest sequences. Interestingly, similar performance can be obtained using histograms of shorter words, resulting in reducing the memory requirement and increasing the speed remarkably. Moreover, we found that simple single statistics are sufficient for processing next-generation sequencing reads and for applications relying on local alignment. Finally, we measured the time requirement of each statistic. The survey and the evaluations will help scientists with identifying efficient alternatives to the costly alignment algorithm, saving thousands of computational hours.The source code of the benchmarking tool is available as Supplementary Materials. © The Author 2017. Published by Oxford University Press.


April 21, 2020

Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome.

The DNA sequencing technologies in use today produce either highly accurate short reads or less-accurate long reads. We report the optimization of circular consensus sequencing (CCS) to improve the accuracy of single-molecule real-time (SMRT) sequencing (PacBio) and generate highly accurate (99.8%) long high-fidelity (HiFi) reads with an average length of 13.5?kilobases (kb). We applied our approach to sequence the well-characterized human HG002/NA24385 genome and obtained precision and recall rates of at least 99.91% for single-nucleotide variants (SNVs), 95.98% for insertions and deletions <50 bp (indels) and 95.99% for structural variants. Our CCS method matches or exceeds the ability of short-read sequencing to detect small variants and structural variants. We estimate that 2,434 discordances are correctable mistakes in the 'genome in a bottle' (GIAB) benchmark set. Nearly all (99.64%) variants can be phased into haplotypes, further improving variant detection. De novo genome assembly using CCS reads alone produced a contiguous and accurate genome with a contig N50 of >15?megabases (Mb) and concordance of 99.997%, substantially outperforming assembly with less-accurate long reads.


April 21, 2020

Systematic evasion of the restriction-modification barrier in bacteria.

Bacteria that are recalcitrant to genetic manipulation using modern in vitro techniques are termed genetically intractable. Genetic intractability is a fundamental barrier to progress that hinders basic, synthetic, and translational microbiology research and development beyond a few model organisms. The most common underlying causes of genetic intractability are restriction-modification (RM) systems, ubiquitous defense mechanisms against xenogeneic DNA that hinder the use of genetic approaches in the vast majority of bacteria and exhibit strain-level variation. Here, we describe a systematic approach to overcome RM systems. Our approach was inspired by a simple hypothesis: if a synthetic piece of DNA lacks the highly specific target recognition motifs for a host’s RM systems, then it is invisible to these systems and will not be degraded during artificial transformation. Accordingly, in this process, we determine the genome and methylome of an individual bacterial strain and use this information to define the bacterium’s RM target motifs. We then synonymously eliminate RM targets from the nucleotide sequence of a genetic tool in silico, synthesize an RM-silent “SyngenicDNA” tool, and propagate the tool as minicircle plasmids, termed SyMPL (SyngenicDNA Minicircle Plasmid) tools, before transformation. In a proof-of-principle of our approach, we demonstrate a profound improvement (five orders of magnitude) in the transformation of a clinically relevant USA300 strain of Staphylococcus aureus This stealth-by-engineering SyngenicDNA approach is effective, flexible, and we expect in future applications could enable microbial genetics free of the restraints of restriction-modification barriers.Copyright © 2019 the Author(s). Published by PNAS.


April 21, 2020

Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.

Metagenomic samples are snapshots of complex ecosystems at work. They comprise hundreds of known and unknown species, contain multiple strain variants and vary greatly within and across environments. Many microbes found in microbial communities are not easily grown in culture making their DNA sequence our only clue into their evolutionary history and biological function. Metagenomic assembly is a computational process aimed at reconstructing genes and genomes from metagenomic mixtures. Current methods have made significant strides in reconstructing DNA segments comprising operons, tandem gene arrays and syntenic blocks. Shorter, higher-throughput sequencing technologies have become the de facto standard in the field. Sequencers are now able to generate billions of short reads in only a few days. Multiple metagenomic assembly strategies, pipelines and assemblers have appeared in recent years. Owing to the inherent complexity of metagenome assembly, regardless of the assembly algorithm and sequencing method, metagenome assemblies contain errors. Recent developments in assembly validation tools have played a pivotal role in improving metagenomics assemblers. Here, we survey recent progress in the field of metagenomic assembly, provide an overview of key approaches for genomic and metagenomic assembly validation and demonstrate the insights that can be derived from assemblies through the use of assembly validation strategies. We also discuss the potential for impact of long-read technologies in metagenomics. We conclude with a discussion of future challenges and opportunities in the field of metagenomic assembly and validation. © The Author 2017. Published by Oxford University Press.


April 21, 2020

Systematic Identification of Pathogenic Streptomyces sp. AMCC400023 That Causes Common Scab and Genomic Analysis of Its Pathogenicity Island.

Potato scab, a serious soilborne disease caused by Streptomyces spp., occurs in potato-growing areas worldwide and results in severe economic losses. In this paper, the pathogenicity of Streptomyces strain AMCC400023, isolated from potato scabs in Hebei Province, China, was verified systematically by the radish seedling test, the potato tuber slice assay, the potted back experiment, and the detection of phytotoxin thaxtomin A. Morphological, physiological, and biochemical characteristics were determined, and the 16S ribosomal RNA analyses of Streptomyces sp. AMCC400023 were carried out. To obtain the accurate taxonomic status of the pathogen strain, the whole genome was sequenced, and the phylogenetic tree among 31 Streptomyces genomes was formed. The average nucleotide identity (ANI) and in silico DNA-DNA hybridization (isDDH) were analyzed, and at the same time, the toxicity-related genes between Streptomyces sp. AMCC400023 and Streptomyces scabiei were compared, all based on the whole-genome level. All of the data supported that, instead of a member of S. scabiei, test strain Streptomyces sp. AMCC400023 was a distinct phytopathogen of potato common scab, which had a relatively close relationship with S. scabiei while separating clearly from S. scabiei at least in the species level of taxonomic status. The complete pathogenicity island (PAI) composition of Streptomyces sp. AMCC400023 was identified, which contained a toxin region and a colonization region. It was conjectured that the PAI of Streptomyces sp. AMCC400023 might be directly or indirectly acquired from S. scabiei 87-22 by horizontal gene transfer, or at the very least, there was a very close homologous relationship between the two pathogens as indicated by a series of analyses, such as phylogenetic relationships among 31 Streptomyces species, ANI and isDDH analyses, PAI structure mapping, thaxtomin A synthetic gene cluster tree construction, and most important, the collinearity analysis at the genome level.


April 21, 2020

Real time monitoring of Aeromonas salmonicida evolution in response to successive antibiotic therapies in a commercial fish farm.

Our ability to predict evolutionary trajectories of pathogens in response to antibiotic pressure is one of the promising leverage to fight against the present antibiotic resistance worldwide crisis. Yet, few studies tackled this question in situ at the outbreak level, due to the difficulty to link a given pathogenic clone evolution with its precise antibiotic exposure over time. In this study, we monitored the real-time evolution of an Aeromonas salmonicida clone in response to successive antibiotic and vaccine therapies in a commercial fish farm. The clone was responsible for a four-year outbreak of furunculosis within a Recirculating Aquaculture System Salmo salar farm in China, and we reconstructed the precise tempo of mobile genetic elements (MGEs) acquisition events during this period. The resistance profile provided by the acquired MGEs closely mirrored the antibiotics used to treat the outbreak, and we evidenced that two subclonal groups developed similar resistances although unrelated MGE acquisitions. Finally, we also demonstrated the efficiency of vaccination in outbreak management and its positive effect on antibiotic resistance prevalence. Our study provides unprecedented knowledge critical to understand evolutionary trajectories of resistant pathogens outside the laboratory. © 2019 Society for Applied Microbiology and John Wiley & Sons Ltd.


April 21, 2020

Mutation of a bHLH transcription factor allowed almond domestication.

Wild almond species accumulate the bitter and toxic cyanogenic diglucoside amygdalin. Almond domestication was enabled by the selection of genotypes harboring sweet kernels. We report the completion of the almond reference genome. Map-based cloning using an F1 population segregating for kernel taste led to the identification of a 46-kilobase gene cluster encoding five basic helix-loop-helix transcription factors, bHLH1 to bHLH5. Functional characterization demonstrated that bHLH2 controls transcription of the P450 monooxygenase-encoding genes PdCYP79D16 and PdCYP71AN24, which are involved in the amygdalin biosynthetic pathway. A nonsynonymous point mutation (Leu to Phe) in the dimerization domain of bHLH2 prevents transcription of the two cytochrome P450 genes, resulting in the sweet kernel trait. Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


April 21, 2020

Recompleting the Caenorhabditis elegans genome.

Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. elegans available today. To provide a more accurate C. elegans genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted =53 new genes in VC2010. The recompleted genome of C. elegans should be a valuable resource for genetics, genomics, and systems biology. © 2019 Yoshimura et al.; Published by Cold Spring Harbor Laboratory Press.


April 21, 2020

A global survey of full-length transcriptome of Ginkgo biloba reveals transcript variants involved in flavonoid biosynthesis

Ginkgo biloba, which contains flavonoids as bioactive components, is widely used in traditional Chinese medicine. Increasing the flavonoid production of medicinal plants through genetic engineering generally focuses on the key genes involved in flavonoid biosynthesis. However, the molecular mechanisms underlying such biosynthesis are not yet well understood. To understand these mechanisms, a combination of second-generation sequencing (SGS) and single-molecule real-time (SMRT) sequencing was applied to G. biloba. Eight tissues were sampled for SMRT sequencing to generate a high-quality, full-length transcriptome database. From 23.36 Gb clean reads, 12,954 alternative polyadenylation events, 12,290 alternative splicing events, 929 fusion transcripts, 2,286 novel transcripts, and 1,270 lncRNAs were predicted by removing redundant reads. Further studies reveal that 7 AS, 5 lncRNA, and 6 fusion gene events were identified in flavonoid biosynthesis. A total of 12 gene modules were revealed to be involved in flavonoid metabolism structural genes and transcription factors by constructing co-expression networks. Weighted gene coexpression network analysis (WGCNA) analysis reveals that some hub genes operate during the biosynthesis by identifying transcription factors (TFs) and structure genes. Seven key hub genes were also identified by analyzing the correlation between gene expression level and flavonoids content. The results highlight the importance of SMRT sequencing of the full-length transcriptome in improving genome annotation and elucidating the gene regulation of flavonoid biosynthesis in G. biloba by providing a comprehensive set of reference transcripts.


April 21, 2020

Distribution and antimicrobial activity of lactic acid bacteria from raw camel milk.

Consumer demand for natural pathogen-control agents for substitution of synthetic food preservatives and traditional antibiotics is increasing. This study aimed to reveal the distribution of lactic acid bacteria (LAB) in raw camel milk and to characterize their antimicrobial traits. The genetic identification by 16S rRNA sequencing of 58 LAB isolates showed the predominance of Enterococcus (24.2%), Lactococcus (22.4%) and Pediococcus (20.7%) genera in raw camel milk. These genera exhibited inhibitory activity against a broad spectrum of Gram-positive and Gram-negative bacteria including multidrug-resistant Salmonella. Among these LAB, two isolates-identified as Pediococcus pentosaceus CM16 and Lactobacillus brevis CM22-were selected for their strong bacteriocinogenic anti-listerial activity estimated at 1600 and 800 AU/mL, respectively. The bacteriocins produced were partially purified by ammonium sulphate precipitation and gel filtration and then biochemically characterized. The proteinaceous nature of bacteriocins was confirmed by the susceptibility to enzymes. These bacteriocins showed significant technological characteristics such as heat-resistance, and stability over a wide range of pH (2.0-10.0). In conclusion, these results indicated that Pediococcus pentosaceus CM16 and Lactobacillus brevis CM22 could be useful as potential probiotics. Moreover, their partially purified bacteriocins may play an important role as food preservatives and feed additives. To our knowledge, this is the first report describing the distribution of LAB population in raw camel milk and the characterization of their bacteriocins from the Arabian Peninsula of western Asia.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.