September 22, 2019

Comparative genomics of completely sequenced Lactobacillus helveticus genomes provides insights into strain-specific genes and resolves metagenomics data down to the strain level.

Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences’ long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus, to our knowledge, identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus. Notably, the functional Clusters of Orthologous Groups of proteins categories “cell wall/membrane biogenesis” and “defense mechanisms” were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and Lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences proved to be very useful for the analysis of natural whey starter cultures with metagenomics, as a larger percentage of the sequenced reads of these complex mixtures could be unambiguously assigned down to the strain level.
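
The core genome and strain-specific gene sets described above can be illustrated with plain set operations. This is a minimal sketch, not the authors' pipeline: it assumes each strain has already been reduced to a set of ortholog cluster IDs, and the strain names, cluster IDs, and the function `core_and_unique` are all hypothetical.

```python
def core_and_unique(gene_sets):
    """Split a pan-genome into core and strain-specific genes.

    gene_sets: dict mapping strain name -> set of ortholog cluster IDs
    (hypothetical input; real studies derive these from ortholog clustering).
    """
    strains = list(gene_sets)
    # Core genome: clusters present in every strain.
    core = set.intersection(*gene_sets.values())
    unique = {}
    for strain in strains:
        # Union of all clusters seen in the other strains.
        others = set().union(*(gene_sets[o] for o in strains if o != strain))
        # Strain-specific: clusters found in this strain only.
        unique[strain] = gene_sets[strain] - others
    return core, unique


# Toy data, invented for illustration only.
toy = {
    "strainA": {"og1", "og2", "og3", "og4"},
    "strainB": {"og1", "og2", "og5"},
    "strainC": {"og1", "og2", "og3", "og6", "og7"},
}
core, unique = core_and_unique(toy)
```

In this toy case `og3` is neither core nor strain-specific, because it is shared by two of the three strains; such genes belong to the accessory genome. A gene missing from one incomplete assembly would drop out of `core`, which is the effect the abstract reports for short-read assemblies.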


September 22, 2019

Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza.

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that, despite few large-scale chromosomal rearrangements, rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young ‘AA’ subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 ‘Miracle Rice’, which relieved famine and drove the Green Revolution in Asia 50 years ago.


September 22, 2019

By land, air, and sea: hemipteran diversity through the genomic lens

Thanks to a recent spate of sequencing projects, the Hemiptera are the first hemimetabolous insect order to achieve a critical mass of species with sequenced genomes, establishing the basis for comparative genomics of the bugs. However, as the most speciose hemimetabolous order, there is still a vast swathe of the hemipteran phylogeny that awaits genomic representation across subterranean, terrestrial, and aquatic habitats, and with lineage-specific and developmentally plastic cases of both wing polyphenisms and flightlessness. In this review, we highlight opportunities for taxonomic sampling beyond obvious pest species candidates, motivated by intriguing biological features of certain groups as well as the rich research tradition of ecological, physiological, developmental, and particularly cytogenetic investigation that spans the diversity of the Hemiptera.


September 22, 2019

LTR_retriever: A highly accurate and sensitive program for identification of long terminal repeat retrotransposons.

Long terminal repeat retrotransposons (LTR-RTs) are prevalent in plant genomes. The identification of LTR-RTs is critical for achieving high-quality gene annotation. Based on the well-conserved structure, multiple programs were developed for the de novo identification of LTR-RTs; however, these programs are associated with low specificity and high false discovery rates. Here, we report LTR_retriever, a multithreading-empowered Perl program that identifies LTR-RTs and generates high-quality LTR libraries from genomic sequences. LTR_retriever demonstrated significant improvements by achieving high levels of sensitivity (91%), specificity (97%), accuracy (96%), and precision (90%) in rice (Oryza sativa). LTR_retriever is also compatible with long sequencing reads. With 40k self-corrected PacBio reads equivalent to 4.5× genome coverage in Arabidopsis (Arabidopsis thaliana), the constructed LTR library showed excellent sensitivity and specificity. In addition to canonical LTR-RTs with 5′-TG…CA-3′ termini, LTR_retriever also identifies noncanonical LTR-RTs (non-TGCA), which have been largely ignored in genome-wide studies. We identified seven types of noncanonical LTRs from 42 out of 50 plant genomes. The majority of noncanonical LTRs are Copia elements whose LTRs are four times shorter than those of other Copia elements, which may be a result of their target specificity. Strikingly, non-TGCA Copia elements are often located in genic regions and preferentially insert nearby or within genes, indicating their impact on the evolution of genes and their potential as mutagenesis tools. © 2018 American Society of Plant Biologists. All Rights Reserved.
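
For readers unfamiliar with the four benchmark metrics quoted above, the following sketch shows their standard confusion-matrix definitions. It is an assumption that the paper uses these textbook formulas; the abstract does not say how the values were computed. The counts and the function name `classification_metrics` are hypothetical and are not taken from the LTR_retriever benchmark.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard confusion-matrix metrics (textbook definitions).

    tp/fp/tn/fn: true positive, false positive, true negative and
    false negative counts (e.g. annotated sequence units).
    """
    return {
        # Fraction of real LTR sequence that was recovered.
        "sensitivity": tp / (tp + fn),
        # Fraction of non-LTR sequence correctly left unannotated.
        "specificity": tn / (tn + fp),
        # Fraction of all calls, positive or negative, that were correct.
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        # Fraction of predicted LTR sequence that is real.
        "precision": tp / (tp + fp),
    }


# Hypothetical counts chosen for round numbers, not real benchmark data.
m = classification_metrics(tp=90, fp=10, tn=290, fn=10)
```

A tool can score high on specificity and still have modest precision when true negatives vastly outnumber true positives, which is why the abstract reports all four values rather than one.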


September 22, 2019

Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium conductrix SAG 241.80: implications to maltose excretion by a green alga.

Green algae represent a key segment of the global species capable of photoautotrophic-driven biological carbon fixation. Algae partition fixed-carbon into chemical compounds required for biomass, while diverting excess carbon into internal storage compounds such as starch and lipids or, in certain cases, into targeted extracellular compounds. Two green algae were selected to probe for critical components associated with sugar production and release in a model alga. Chlorella sorokiniana UTEX 1602 – which does not release significant quantities of sugars to the extracellular space – was selected as a control to compare with the maltose-releasing Micractinium conductrix SAG 241.80 – which was originally isolated from an endosymbiotic association with the ciliate Paramecium bursaria. Both strains were subjected to three sequencing approaches to assemble their genomes and annotate their genes. This analysis was further complemented with transcriptional studies during maltose release by M. conductrix SAG 241.80 versus conditions where sugar release is minimal. The annotation revealed that both strains contain homologs for the key components of a putative pathway leading to cytosolic maltose accumulation, while transcriptional studies found few changes in mRNA levels for the genes associated with these established intracellular sugar pathways. A further analysis of potential sugar transporters found multiple homologs for SWEETs and tonoplast sugar transporters. The analysis of transcriptional differences revealed a lesser and more measured global response for M. conductrix SAG 241.80 versus C. sorokiniana UTEX 1602 during conditions resulting in sugar release, providing a catalog of genes that might play a role in extracellular sugar transport. © 2017 The Authors. The Plant Journal © 2017 John Wiley & Sons Ltd.


September 22, 2019

Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants.

Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated. De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed linkage disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups. We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. The pattern of linkage disequilibrium within PAVs differs strikingly from that of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.


September 22, 2019

Bat biology, genomes, and the Bat1K project: To generate chromosome-level genomes for all living bat species.

Bats are unique among mammals, possessing some of the rarest mammalian adaptations, including true self-powered flight, laryngeal echolocation, exceptional longevity, unique immunity, contracted genomes, and vocal learning. They provide key ecosystem services, pollinating tropical plants, dispersing seeds, and controlling insect pest populations, thus driving healthy ecosystems. They account for more than 20% of all living mammalian diversity, and their crown-group evolutionary history dates back to the Eocene. Despite their great numbers and diversity, many species are threatened and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n~1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any interested individuals committed to a better understanding of the genetic and evolutionary mechanisms that underlie the unique adaptations of bats. Our aim is to catalog the unique genetic diversity present in all living bats to better understand the molecular basis of their unique adaptations; uncover their evolutionary history; link genotype with phenotype; and ultimately better understand, promote, and conserve bats. Here we review the unique adaptations of bats and highlight how chromosome-level genome assemblies can uncover the molecular basis of these traits. We present a novel sequencing and assembly strategy and review the striking societal and scientific benefits that will result from the Bat1K initiative.


September 22, 2019

Complete genome sequence of Bacillus velezensis 157 isolated from Eucommia ulmoides with pathogenic bacteria inhibiting and lignocellulolytic enzymes production by SSF.

Bacillus velezensis 157 was isolated from the bark of Eucommia ulmoides and exhibited antagonistic activity against a broad spectrum of pathogenic bacteria and fungi. Moreover, B. velezensis 157 showed various lignocellulolytic activities, including cellulase, xylanase, α-amylase, and pectinase, and was able to use agro-industrial waste (soybean meal, wheat bran, sugarcane bagasse, wheat straw, rice husk, maize flour and maize straw) under solid-state fermentation to produce several industrially valuable enzymes. Soybean meal appeared to be the most efficient substrate for the single fermentation of B. velezensis 157. The highest yields of pectinase (19.15 ± 2.66 U g⁻¹), cellulase (46.69 ± 1.19 U g⁻¹) and amylase (2097.18 ± 15.28 U g⁻¹) were achieved on untreated soybean meal. The highest yield of xylanase (22.35 ± 2.24 U g⁻¹) was obtained on untreated wheat bran. Here, we report the complete genome sequence of B. velezensis 157, composed of a circular 4,013,317 bp chromosome with 3789 coding genes and a G + C content of 46.41%, and one circular 8439 bp plasmid with a G + C content of 40.32%. The genome contained a total of 8 candidate gene clusters (bacillaene, difficidin, macrolactin, butirosin, bacillibactin, bacilysin, fengycin and surfactin), and dedicates over 15.8% of the whole genome to secondary metabolite biosynthesis. In addition, genes encoding enzymes involved in the degradation of cellulose, xylan, lignin, starch, mannan, galactoside and arabinan were found in the B. velezensis 157 genome. Thus, the study of B. velezensis 157 shows that B. velezensis can not only be used as a biocontrol agent but also has a potentially wide range of applications in lignocellulosic biomass conversion.


September 22, 2019

Molecular epidemiology and mechanism of sulbactam resistance in Acinetobacter baumannii isolates with diverse genetic background in China

Sulbactam is a plausible option for treating Acinetobacter infections because of its intrinsic antibacterial activity against the members of the Acinetobacter genus, but the mechanisms of sulbactam resistance have not been fully studied in Acinetobacter baumannii. In this study, a total of 2,197 clinical A. baumannii isolates were collected from 27 provinces in China. Eighty-eight isolates with various MICs for sulbactam were selected on the basis of their diverse clonality and underwent multilocus sequence typing (MLST), antimicrobial susceptibility testing, and resistance gene screening. The copy number and relative expression of blaTEM-1D and ampC were measured via quantitative PCR and quantitative reverse transcription-PCR, respectively. The genetic structure of multicopy blaTEM-1D was determined using whole-genome sequencing. The cefoperazone-sulbactam resistance rate of the 2,197 isolates was 39.7%. The rate of positivity for blaTEM-1D or ISAba1-ampC in the sulbactam-nonsusceptible group (64.91% and 78.95%, respectively) was significantly higher than that in the sulbactam-susceptible group (0% and 0%, respectively; P < 0.001). The MIC of sulbactam (P < 0.001) varied considerably between the groups expressing ampC with or without upstream ISAba1. Notably, the genetic structure of the multicopy blaTEM-1D gene in strain ZS3 revealed that blaTEM-1D was embedded within four tandem copies of the cassette IS26-blaTEM-1D-Tn3-IS26. Therefore, blaTEM-1D and ISAba1-ampC represent the prevalent mechanism underlying sulbactam resistance in clinical A. baumannii isolates in China. The structure of four tandem copies of blaTEM-1D, identified here for the first time, may increase sulbactam resistance. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Using experimental evolution to identify druggable targets that could inhibit the evolution of antimicrobial resistance.

With multi-drug and pan-drug-resistant bacteria becoming increasingly common in hospitals, antibiotic resistance has threatened to return us to a pre-antibiotic era that would completely undermine modern medicine. There is an urgent need to develop new antibiotics and strategies to combat resistance that are substantially different from earlier drug discovery efforts. One such strategy that would complement current and future antibiotics would be a class of co-drugs that target the evolution of resistance and thereby extend the efficacy of specific classes of antibiotics. A critical step in the development of such strategies lies in understanding the key evolutionary trajectories responsible for resistance and which proteins or biochemical pathways within those trajectories would be good candidates for co-drug discovery. We identify the most important steps in the evolution of resistance for a specific pathogen and antibiotic combination by evolving highly polymorphic populations of pathogens to resistance in a novel bioreactor that favors biofilm development. As the populations evolve to increasing drug concentrations, we use deep sequencing to elucidate the network of genetic changes responsible for resistance and subsequent in vitro biochemistry and often structure determination to determine how the adaptive mutations produce resistance. Importantly, the identification of the molecular steps, their frequency within the populations and their chronology within the evolutionary trajectory toward resistance is critical to assessing their relative importance. In this work, we discuss findings from the evolution of the ESKAPE pathogen, Pseudomonas aeruginosa, to the drug of last resort, colistin, to illustrate the power of this approach.


September 22, 2019

De novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust.

Oat crown rust, caused by the fungus Puccinia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae. IMPORTANCE: Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae, which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae, resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae. Copyright © 2018 Miller et al.


September 22, 2019

Bacterial artificial chromosome clones randomly selected for sequencing reveal genomic differences between soybean cultivars

This study pioneered the use of multiple technologies to combine the bacterial artificial chromosome (BAC) pooling strategy with high-throughput next- and third-generation sequencing technologies to analyse genomic differences. To understand the genetic background of the Chinese soybean cultivar N23601, we built a BAC library and sequenced 10 randomly selected clones followed by de novo assembly. Comparative analysis was conducted against the reference genome of Glycine max var. Williams 82 (2.0). Therefore, our result is an assessment of the reference genome. Our results revealed that 3517 single nucleotide polymorphisms (SNPs) and 662 insertion–deletions (InDels) occurred in ~1.2 Mb of the genomic region and that four of the 10 BAC clones contained 15 large structural variations (72,887 bp) compared with the reference genome. Gene annotation of the reference genome showed that Glyma.18g181000 was missing from the corresponding position of the 10 BAC clones. Additionally, there may be a problem with the assembly of some positions of the reference genome. Several gap regions in the reference genome could be supplemented by using the complete sequence of the 10 BAC clones. We believe that accurate and complete BAC sequence is a valuable resource that contributes to the completeness of the reference genome.


September 22, 2019

Induced salt tolerance of perennial ryegrass by a novel bacterium strain from the rhizosphere of a desert shrub Haloxylon ammodendron.

Drought and soil salinity reduce agricultural output worldwide. Plant-growth-promoting rhizobacteria (PGPR) can enhance plant growth and augment plant tolerance to biotic and abiotic stresses. Haloxylon ammodendron, a C4 perennial succulent xerohalophyte shrub with excellent drought and salt tolerance, is naturally distributed in the desert area of northwest China. In our previous work, a bacterium strain numbered as M30-35 was isolated from the rhizosphere of H. ammodendron in Tengger desert, Gansu province, northwest China. In the current work, the effects of M30-35 inoculation on salt tolerance of perennial ryegrass were evaluated and its genome was sequenced to identify genes associated with plant growth promotion. Results showed that M30-35 significantly enhanced growth and salt tolerance of perennial ryegrass by increasing shoot fresh and dry weights, chlorophyll content, root volume, root activity, leaf catalase activity, soluble sugar and proline contents that contributed to reduced osmotic potential, tissue K⁺ content and K⁺/Na⁺ ratio, while decreasing malondialdehyde (MDA) content and relative electric conductivity (REC), especially under higher salinity. The genome of M30-35 contains 4421 protein encoding genes, 12 rRNA, 63 tRNA-encoding genes and four rRNA operons. M30-35 was initially classified as a new species in Pseudomonas and named Pseudomonas sp. M30-35. Thirty-four genes showing homology to genes associated with PGPR traits and abiotic stress tolerance were identified in the Pseudomonas sp. M30-35 genome, including 12 related to insoluble phosphorus solubilization, four to auxin biosynthesis, four to other processes of growth promotion, seven to oxidative stress alleviation, four to salt and drought tolerance and three to cold and heat tolerance. Further study is needed to clarify the correlation between these genes from M30-35 and the salt stress alleviation of inoculated plants under salt stress. Overall, our research indicated that desert shrubs appear rich in PGPRs that can help important crops tolerate abiotic stress.


September 22, 2019

Complete genome sequence and analysis of the industrial Saccharomyces cerevisiae strain N85 used in Chinese rice wine production.

Chinese rice wine is a popular traditional alcoholic beverage in China, yet its brewing processes have rarely been explored. We herein report the first gapless, near-finished genome sequence of the yeast strain Saccharomyces cerevisiae N85 used for Chinese rice wine production. Several assembly methods were used to integrate Pacific Biosciences (PacBio) and Illumina sequencing data to achieve high-quality genome sequencing of the strain. The genome encodes more than 6,000 predicted proteins and 238 long non-coding RNAs, which are validated by RNA-sequencing data. Moreover, our annotation predicts 171 novel genes that are not present in the reference S288c genome. We also identified 65,902 single nucleotide polymorphisms and small indels, many of which are located within genic regions. Dozens of larger copy-number variations and translocations were detected, mainly enriched in the subtelomeres, suggesting these regions may be related to genomic evolution. This study will serve as a milestone in the study of Chinese rice wine and related beverages in China and in other countries. It will help to develop more scientific and modern fermentation processes for Chinese rice wine, and to explore the metabolic pathways of desired and harmful components in Chinese rice wine to improve its taste and nutritional value. © The Author(s) 2018. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


September 22, 2019

Pantoea ananatis genetic diversity analysis reveals limited genomic diversity as well as accessory genes correlated with onion pathogenicity.

Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion, causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying disease potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne and correlating with onion pathogenicity, observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathogenic disease necessitates the pan-genomic analysis performed in this study.

