RNA-Seq de novo assembly is an important method to generate transcriptomes for non-model organisms before any downstream analysis. Given many great de novo assembly methods developed by now, one critical issue is that there is no consensus on the evaluation of de novo assembly methods yet. Therefore, to set up a benchmark for evaluating the quality of de novo assemblies is very critical. Addressing this challenge will help us deepen the insights on the properties of different de novo assemblers and their evaluation methods, and provide hints on choosing the best assembly sets as transcriptomes of non-model organisms for the further functional analysis. In this article, we generate a textquotedblleftreal timetextquotedblright transcriptome using PacBio long reads as a benchmark for evaluating five de novo assemblers and two model-based de novo assembly evaluation methods. By comparing the de novo assmblies generated by RNA-Seq short reads with the textquotedblleftreal timetextquotedblright transcriptome from the same biological sample, we find that Trinity is best at the completeness by generating more assemblies than the alternative assemblers, but less continuous and having more misassemblies; Oases is best at the continuity and specificity, but less complete; The performance of SOAPdenovo-Trans, Trans-AByss and IDBA-Tran are in between of five assemblers. For evaluation methods, DETONATE leverages multiple aspects of the assembly set and ranks the assembly set with an average performance as the best, meanwhile the contig score can serve as a good metric to select assemblies with high completeness, specificity, continuity but not sensitive to misassemblies; TransRate contig score is useful for removing misassemblies, yet often the assemblies in the optimal set is too few to be used as a transcriptome.
Antibiotic susceptibility of plant-derived lactic acid bacteria conferring health benefits to human.
Lactic acid bacteria (LAB) confer health benefits to human when administered orally. We have recently isolated several species of LAB strains from plant sources, such as fruits, vegetables, flowers, and medicinal plants. Since antibiotics used to treat bacterial infection diseases induce the emergence of drug-resistant bacteria in intestinal microflora, it is important to evaluate the susceptibility of LAB strains to antibiotics to ensure the safety and security of processed foods. The aim of the present study is to determine the minimum inhibitory concentration (MIC) of antibiotics against several plant-derived LAB strains. When aminoglycoside antibiotics, such as streptomycin (SM), kanamycin (KM), and gentamicin (GM), were evaluated using LAB susceptibility test medium (LSM), the MIC was higher than when using Mueller-Hinton (MH) medium. Etest, which is an antibiotic susceptibility assay method consisting of a predefined gradient of antibiotic concentrations on a plastic strip, is used to determine the MIC of antibiotics world-wide. In the present study, we demonstrated that Etest was particularly valuable while testing LAB strains. We also show that the low susceptibility of the plant-derived LAB strains against each antibiotic tested is due to intrinsic resistance and not acquired resistance. This finding is based on the whole-genome sequence information reflecting the horizontal spread of the drug-resistance genes in the LAB strains.
Rapid transcriptional responses to serum exposure are associated with sensitivity and resistance to antibody-mediated complement killing in invasive Salmonella Typhimurium ST313
Background: Salmonella Typhimurium ST313 exhibits signatures of adaptation to invasive human infection, including higher resistance to humoral immune responses than gastrointestinal isolates. Full resistance to antibody-mediated complement killing (serum resistance) among nontyphoidal Salmonellae is uncommon, but selection of highly resistant strains could compromise vaccine-induced antibody immunity. Here, we address the hypothesis that serum resistance is due to a distinct genotype or transcriptome response in S. Typhimurium ST313.
A Novel Bacteriophage Exclusion (BREX) System Encoded by the pglX Gene in Lactobacillus casei Zhang.
The bacteriophage exclusion (BREX) system is a novel prokaryotic defense system against bacteriophages. To our knowledge, no study has systematically characterized the function of the BREX system in lactic acid bacteria. Lactobacillus casei Zhang is a probiotic bacterium originating from koumiss. By using single-molecule real-time sequencing, we previously identified N6-methyladenine (m6A) signatures in the genome of L. casei Zhang and a putative methyltransferase (MTase), namely, pglX This work further analyzed the genomic locus near the pglX gene and identified it as a component of the BREX system. To decipher the biological role of pglX, an L. casei Zhang pglX mutant (?pglX) was constructed. Interestingly, m6A methylation of the 5′-ACRCAG-3′ motif was eliminated in the ?pglX mutant. The wild-type and mutant strains exhibited no significant difference in morphology or growth performance in de Man-Rogosa-Sharpe (MRS) medium. A significantly higher plasmid acquisition capacity was observed for the ?pglX mutant than for the wild type if the transformed plasmids contained pglX recognition sites (i.e., 5′-ACRCAG-3′). In contrast, no significant difference was observed in plasmid transformation efficiency between the two strains when plasmids lacking pglX recognition sites were tested. Moreover, the ?pglX mutant had a lower capacity to retain the plasmids than the wild type, suggesting a decrease in genetic stability. Since the Rebase database predicted that the L. casei PglX protein was bifunctional, as both an MTase and a restriction endonuclease, the PglX protein was heterologously expressed and purified but failed to show restriction endonuclease activity. Taken together, the results show that the L. casei Zhang pglX gene is a functional adenine MTase that belongs to the BREX system.IMPORTANCELactobacillus casei Zhang is a probiotic that confers beneficial effects on the host, and it is thus increasingly used in the dairy industry. The possession of an effective bacterial immune system that can defend against invasion of phages and exogenous DNA is a desirable feature for industrial bacterial strains. The bacteriophage exclusion (BREX) system is a recently described phage resistance system in prokaryotes. This work confirmed the function of the BREX system in L. casei and that the methyltransferase (pglX) is an indispensable part of the system. Overall, our study characterizes a BREX system component gene in lactic acid bacteria. Copyright © 2019 American Society for Microbiology.
Plantibacter flavus, Curtobacterium herbarum, Paenibacillus taichungensis, and Rhizobium selenitireducens Endophytes Provide Host-Specific Growth Promotion of Arabidopsis thaliana, Basil, Lettuce, and Bok Choy Plants.
A collection of bacterial endophytes isolated from stem tissues of plants growing in soils highly contaminated with petroleum hydrocarbons were screened for plant growth-promoting capabilities. Twenty-seven endophytic isolates significantly improved the growth of Arabidopsis thaliana plants in comparison to that of uninoculated control plants. The five most beneficial isolates, one strain each of Curtobacterium herbarum, Paenibacillus taichungensis, and Rhizobium selenitireducens and two strains of Plantibacter flavus were further examined for growth promotion in Arabidopsis, lettuce, basil, and bok choy plants. Host-specific plant growth promotion was observed when plants were inoculated with the five bacterial strains. P. flavus strain M251 increased the total biomass and total root length of Arabidopsis plants by 4.7 and 5.8 times, respectively, over that of control plants and improved lettuce and basil root growth, while P. flavus strain M259 promoted Arabidopsis shoot and root growth, lettuce and basil root growth, and bok choy shoot growth. A genome comparison between P. flavus strains M251 and M259 showed that both genomes contain up to 70 actinobacterial putative plant-associated genes and genes involved in known plant-beneficial pathways, such as those for auxin and cytokinin biosynthesis and 1-aminocyclopropane-1-carboxylate deaminase production. This study provides evidence of direct plant growth promotion by Plantibacter flavusIMPORTANCE The discovery of new plant growth-promoting bacteria is necessary for the continued development of biofertilizers, which are environmentally friendly and cost-efficient alternatives to conventional chemical fertilizers. Biofertilizer effects on plant growth can be inconsistent due to the complexity of plant-microbe interactions, as the same bacteria can be beneficial to the growth of some plant species and neutral or detrimental to others. We examined a set of bacterial endophytes isolated from plants growing in a unique petroleum-contaminated environment to discover plant growth-promoting bacteria. We show that strains of Plantibacter flavus exhibit strain-specific plant growth-promoting effects on four different plant species.Copyright © 2019 American Society for Microbiology.
Genomic Islands in the Full-Genome Sequence of an NAD-Hemin-Independent Avibacterium paragallinarum Strain Isolated from Peru.
Here, we report the full-genome sequence of an NAD-hemin-independent Avibacterium paragallinarum serovar C-2 strain, FARPER-174, isolated from layer hens in Peru. This genome contained 12 potential genomic islands that include ribosomal protein-coding genes, a nadR gene, hemocin-coding genes, sequences of fagos, an rtx operon, and drug resistance genes. Copyright © 2019 Tataje-Lavanda et al.
Genomic and Functional Analysis of Emerging Virulent and Multidrug-Resistant Escherichia coli Lineage Sequence Type 648.
The pathogenic extended-spectrum-beta-lactamase (ESBL)-producing Escherichia coli lineage ST648 is increasingly reported from multiple origins. Our study of a large and global ST648 collection from various hosts (87 whole-genome sequences) combining core and accessory genomics with functional analyses and in vivo experiments suggests that ST648 is a nascent and generalist lineage, lacking clear phylogeographic and host association signals. By including large numbers of ST131 (n?=?107) and ST10 (n?=?96) strains for comparative genomics and phenotypic analysis, we demonstrate that the combination of multidrug resistance and high-level virulence are the hallmarks of ST648, similar to international high-risk clonal lineage ST131. Specifically, our in silico, in vitro, and in vivo results demonstrate that ST648 is well equipped with biofilm-associated features, while ST131 shows sophisticated signatures indicative of adaption to urinary tract infection, potentially conveying individual ecological niche adaptation. In addition, we used a recently developed NFDS (negative frequency-dependent selection) population model suggesting that ST648 will increase significantly in frequency as a cause of bacteremia within the next few years. Also, ESBL plasmids impacting biofilm formation aided in shaping and maintaining ST648 strains to successfully emerge worldwide across different ecologies. Our study contributes to understanding what factors drive the evolution and spread of emerging international high-risk clonal lineages.Copyright © 2019 American Society for Microbiology.
Genomic and transcriptomic characterization of Pseudomonas aeruginosa small colony variants derived from a chronic infection model.
Phenotypic change is a hallmark of bacterial adaptation during chronic infection. In the case of chronic Pseudomonas aeruginosa lung infection in patients with cystic fibrosis, well-characterized phenotypic variants include mucoid and small colony variants (SCVs). It has previously been shown that SCVs can be reproducibly isolated from the murine lung following the establishment of chronic infection with mucoid P. aeruginosa strain NH57388A. Using a combination of single-molecule real-time (PacBio) and Illumina sequencing we identify a large genomic inversion in the SCV through recombination between homologous regions of two rRNA operons and an associated truncation of one of the 16S rRNA genes and suggest this may be the genetic switch for conversion to the SCV phenotype. This phenotypic conversion is associated with large-scale transcriptional changes distributed throughout the genome. This global rewiring of the cellular transcriptomic output results in changes to normally differentially regulated genes that modulate resistance to oxidative stress, central metabolism and virulence. These changes are of clinical relevance because the appearance of SCVs during chronic infection is associated with declining lung function.
Supernumerary B chromosomes (Bs) are extra karyotype units in addition to A chromosomes, and are found in some fungi and thousands of animals and plant species. Bs are uniquely characterized due to their non-Mendelian inheritance, and represent one of the best examples of genomic conflict. Over the last decades, their genetic composition, function and evolution have remained an unresolved query, although a few successful attempts have been made to address these phenomena. A classical concept based on cytogenetics and genetics is that Bs are selfish and abundant with DNA repeats and transposons, and in most cases, they do not carry any function. However, recently, the modern quantum development of high scale multi-omics techniques has shifted B research towards a new-born field that we call “B-omics”. We review the recent literature and add novel perspectives to the B research, discussing the role of new technologies to understand the mechanistic perspectives of the molecular evolution and function of Bs. The modern view states that B chromosomes are enriched with genes for many significant biological functions, including but not limited to the interesting set of genes related to cell cycle and chromosome structure. Furthermore, the presence of B chromosomes could favor genomic rearrangements and influence the nuclear environment affecting the function of other chromatin regions. We hypothesize that B chromosomes might play a key function in driving their transmission and maintenance inside the cell, as well as offer an extra genomic compartment for evolution.
Genome assembly and annotation of the Trichoplusia ni Tni-FNL insect cell line enabled by long-read technologies.
Trichoplusiani derived cell lines are commonly used to enable recombinant protein expression via baculovirus infection to generate materials approved for clinical use and in clinical trials. In order to develop systems biology and genome engineering tools to improve protein expression in this host, we performed de novo genome assembly of the Trichoplusiani-derived cell line Tni-FNL.By integration of PacBio single-molecule sequencing, Bionano optical mapping, and 10X Genomics linked-reads data, we have produced a draft genome assembly of Tni-FNL.Our assembly contains 280 scaffolds, with a N50 scaffold size of 2.3 Mb and a total length of 359 Mb. Annotation of the Tni-FNL genome resulted in 14,101 predicted genes and 93.2% of the predicted proteome contained recognizable protein domains. Ortholog searches within the superorder Holometabola provided further evidence of high accuracy and completeness of the Tni-FNL genome assembly.This first draft Tni-FNL genome assembly was enabled by complementary long-read technologies and represents a high-quality, well-annotated genome that provides novel insight into the complexity of this insect cell line and can serve as a reference for future large-scale genome engineering work in this and other similar recombinant protein production hosts.
Identification, expression, alternative splicing and functional analysis of pepper WRKY gene family in response to biotic and abiotic stresses.
WRKY proteins are a large group of plant transcription factors that are involved in various biological processes, including biotic and abiotic stress responses, hormone response, plant development, and metabolism. WRKY proteins have been identified in several plants, but only a few have been identified in Capsicum annuum. Here, we identified a total of 62 WRKY genes in the latest pepper genome. These genes were classified into three groups (Groups 1-3) based on the structural features of their proteins. The structures of the encoded proteins, evolution, and expression under normal growth conditions were analyzed and 35 putative miRNA target sites were predicted in 20 CaWRKY genes. Moreover, the response to cold or CMV treatments of selected WRKY genes were examined to validate the roles under stresses. And alternative splicing (AS) events of some CaWRKYs were also identified under CMV infection. Promoter analysis confirmed that CaWRKY genes are involved in growth, development, and biotic or abiotic stress responses in hot pepper. The comprehensive analysis provides fundamental information for better understanding of the signaling pathways involved in the WRKY-mediated regulation of developmental processes, as well as biotic and abiotic stress responses.
A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set.
In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75 fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independent from the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome.
Human metapneumovirus (HMPV) has been a notable etiological agent of acute respiratory infection in humans, but it was not discovered until 2001, because HMPV replicates only in a limited number of cell lines and the cytopathic effect (CPE) is often mild. To promote the study of HMPV, several groups have generated green fluorescent protein (GFP)-expressing recombinant HMPV strains (HMPVGFP). However, the growing evidence has complicated the understanding of cell line specificity of HMPV, because it seems to vary notably among HMPV strains. In addition, unique A2b clade HMPV strains with a 180-nucleotide duplication in the G gene (HMPV A2b180nt-dup strains) have recently been detected. In this study, we re-evaluated and compared the cell line specificity of clinical isolates of HMPV strains, including the novel HMPV A2b180nt-dup strains, and six recombinant HMPVGFP strains, including the newly generated recombinant HMPV A2b180nt-dup strain, MG0256-EGFP. Our data demonstrate that VeroE6 and LLC-MK2 cells generally showed the highest infectivity with any clinical isolates and recombinant HMPVGFP strains. Other human-derived cell lines (BEAS-2B, A549, HEK293, MNT-1, and HeLa cells) showed certain levels of infectivity with HMPV, but these were significantly lower than those of VeroE6 and LLC-MK2 cells. Also, the infectivity in these suboptimal cell lines varied greatly among HMPV strains. The variations were not directly related to HMPV genotypes, cell lines used for isolation and propagation, specific genome mutations, or nucleotide duplications in the G gene. Thus, these variations in suboptimal cell lines are likely intrinsic to particular HMPV strains.
Comparative genomic analysis of eight novel haloalkaliphilic bacteriophages from Lake Elmenteita, Kenya.
We report complete genome sequences of eight bacteriophages isolated from Haloalkaline Lake Elmenteita found on the floor of Kenyan Rift Valley. The bacteriophages were sequenced, annotated and a comparative genomic analysis using various Bioinformatics tools carried out to determine relatedness of the bacteriophages to each other, and to those in public databases. Basic genome properties like genome size, percentage coding density, number of open reading frames, percentage GC content and gene organizations revealed the bacteriophages had no relationship to each other. Comparison to other nucleotide sequences in GenBank database showed no significant similarities hence novel. At the amino acid level, phages of our study revealed mosaicism to genes with conserved domains to already described phages. Phylogenetic analyses of large terminase gene responsible for DNA packaging and DNA polymerase gene for replication further showed diversity among the bacteriophages. Our results give insight into diversity of bacteriophages in Lake Elmenteita and provide information on their evolution. By providing primary sequence information, this study not only provides novel sequences for biotechnological exploitation, but also sets stage for future studies aimed at better understanding of virus diversity and genomes from haloalkaline lakes in the Rift Valley.
A High-Quality Grapevine Downy Mildew Genome Assembly Reveals Rapidly Evolving and Lineage-Specific Putative Host Adaptation Genes.
Downy mildews are obligate biotrophic oomycete pathogens that cause devastating plant diseases on economically important crops. Plasmopara viticola is the causal agent of grapevine downy mildew, a major disease in vineyards worldwide. We sequenced the genome of Pl. viticola with PacBio long reads and obtained a new 92.94?Mb assembly with high contiguity (359 scaffolds for a N50 of 706.5?kb) due to a better resolution of repeat regions. This assembly presented a high level of gene completeness, recovering 1,592 genes encoding secreted proteins involved in plant-pathogen interactions. Plasmopara viticola had a two-speed genome architecture, with secreted protein-encoding genes preferentially located in gene-sparse, repeat-rich regions and evolving rapidly, as indicated by pairwise dN/dS values. We also used short reads to assemble the genome of Plasmopara muralis, a closely related species infecting grape ivy (Parthenocissus tricuspidata). The lineage-specific proteins identified by comparative genomics analysis included a large proportion of RxLR cytoplasmic effectors and, more generally, genes with high dN/dS values. We identified 270 candidate genes under positive selection, including several genes encoding transporters and components of the RNA machinery potentially involved in host specialization. Finally, the Pl. viticola genome assembly generated here will allow the development of robust population genomics approaches for investigating the mechanisms involved in adaptation to biotic and abiotic selective pressures in this species. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.