September 22, 2019

A hybrid-hierarchical genome assembly strategy to sequence the invasive golden mussel Limnoperna fortunei.

For more than 25 years, the golden mussel Limnoperna fortunei has aggressively invaded South American freshwaters, having travelled more than 5,000 km upstream across five countries. Along the way, the golden mussel has outcompeted native species and economically harmed aquaculture, hydroelectric power generation, and ship transit. We have sequenced the complete genome of the golden mussel to understand the molecular basis of its invasiveness and search for ways to control it. We assembled the 1.6 Gb genome into 20,548 scaffolds with an N50 length of 312 kb using a hybrid and hierarchical assembly strategy from short and long DNA reads and transcriptomes. A total of 60,717 coding genes were inferred from a customized transcriptome-trained AUGUSTUS run. We also compared predicted protein sets with those of complete molluscan genomes, revealing an exacerbation of protein-binding domains in L. fortunei. Conclusions: We built one of the best bivalve genome assemblies available using a cost-effective approach based on Illumina paired-end, mate-pair, and PacBio long reads. We expect that the continuous and careful annotation of L. fortunei’s genome will contribute to the investigation of bivalve genetics, evolution, and invasiveness, as well as to the development of biotechnological tools for aquatic pest control. © The Authors 2017. Published by Oxford University Press.
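For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed from a list of scaffold lengths. It assumes the common definition (the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the total assembly); the function name `n50` and the toy lengths are illustrative and are not taken from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the usual definition: walk the lengths from longest to
    shortest and report the length at which the running sum first
    reaches half of the total assembly size.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running sum to avoid fractional halves.
        if running * 2 >= total:
            return length


if __name__ == "__main__":
    # Hypothetical scaffold lengths, for illustration only.
    toy_scaffolds = [2, 2, 2, 3, 3, 4, 8, 8]
    print(n50(toy_scaffolds))
```

A higher N50 generally indicates a more contiguous assembly, although it says nothing about correctness on its own.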


September 22, 2019

LTR_retriever: A highly accurate and sensitive program for identification of long terminal repeat retrotransposons.

Long terminal repeat retrotransposons (LTR-RTs) are prevalent in plant genomes. The identification of LTR-RTs is critical for achieving high-quality gene annotation. Based on the well-conserved structure, multiple programs were developed for the de novo identification of LTR-RTs; however, these programs are associated with low specificity and high false discovery rates. Here, we report LTR_retriever, a multithreading-empowered Perl program that identifies LTR-RTs and generates high-quality LTR libraries from genomic sequences. LTR_retriever demonstrated significant improvements by achieving high levels of sensitivity (91%), specificity (97%), accuracy (96%), and precision (90%) in rice (Oryza sativa). LTR_retriever is also compatible with long sequencing reads. With 40k self-corrected PacBio reads equivalent to 4.5× genome coverage in Arabidopsis (Arabidopsis thaliana), the constructed LTR library showed excellent sensitivity and specificity. In addition to canonical LTR-RTs with 5′-TG…CA-3′ termini, LTR_retriever also identifies noncanonical LTR-RTs (non-TGCA), which have been largely ignored in genome-wide studies. We identified seven types of noncanonical LTRs from 42 out of 50 plant genomes. The majority of noncanonical LTRs are Copia elements, in which the LTR is four times shorter than that of other Copia elements, which may be a result of their target specificity. Strikingly, non-TGCA Copia elements are often located in genic regions and preferentially insert nearby or within genes, indicating their impact on the evolution of genes and their potential as mutagenesis tools. © 2018 American Society of Plant Biologists. All Rights Reserved.


September 22, 2019

Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants.

Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level, and the extent of presence/absence variation and the relative chromosomal landscape of inbred-specific regions remain to be elucidated. De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed linkage disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups. We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. The pattern of linkage disequilibrium within PAVs strikingly differs from that of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.


September 22, 2019

Bat biology, genomes, and the Bat1K project: To generate chromosome-level genomes for all living bat species.

Bats are unique among mammals, possessing some of the rarest mammalian adaptations, including true self-powered flight, laryngeal echolocation, exceptional longevity, unique immunity, contracted genomes, and vocal learning. They provide key ecosystem services, pollinating tropical plants, dispersing seeds, and controlling insect pest populations, thus driving healthy ecosystems. They account for more than 20% of all living mammalian diversity, and their crown-group evolutionary history dates back to the Eocene. Despite their great numbers and diversity, many species are threatened and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n~1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any interested individuals committed to a better understanding of the genetic and evolutionary mechanisms that underlie the unique adaptations of bats. Our aim is to catalog the unique genetic diversity present in all living bats to better understand the molecular basis of their unique adaptations; uncover their evolutionary history; link genotype with phenotype; and ultimately better understand, promote, and conserve bats. Here we review the unique adaptations of bats and highlight how chromosome-level genome assemblies can uncover the molecular basis of these traits. We present a novel sequencing and assembly strategy and review the striking societal and scientific benefits that will result from the Bat1K initiative.


September 22, 2019

Complete genome sequence of Bacillus velezensis 157 isolated from Eucommia ulmoides with pathogenic bacteria inhibiting and lignocellulolytic enzymes production by SSF.

Bacillus velezensis 157 was isolated from the bark of Eucommia ulmoides, and exhibited antagonistic activity against a broad spectrum of pathogenic bacteria and fungi. Moreover, B. velezensis 157 also showed various lignocellulolytic activities, including cellulase, xylanase, α-amylase, and pectinase, and was able to use agro-industrial waste (soybean meal, wheat bran, sugarcane bagasse, wheat straw, rice husk, maize flour and maize straw) under solid-state fermentation to produce several industrially valuable enzymes. Soybean meal appeared to be the most efficient substrate for the single fermentation of B. velezensis 157. The highest yields of pectinase (19.15 ± 2.66 U g⁻¹), cellulase (46.69 ± 1.19 U g⁻¹) and amylase (2097.18 ± 15.28 U g⁻¹) were achieved on untreated soybean meal. The highest yield of xylanase (22.35 ± 2.24 U g⁻¹) was obtained on untreated wheat bran. Here, we report the complete genome sequence of B. velezensis 157, composed of a circular 4,013,317 bp chromosome with 3789 coding genes and a G + C content of 46.41%, and one circular 8439 bp plasmid with a G + C content of 40.32%. The genome contained a total of 8 candidate gene clusters (bacillaene, difficidin, macrolactin, butirosin, bacillibactin, bacilysin, fengycin and surfactin), and dedicates over 15.8% of the whole genome to secondary metabolite biosynthesis. In addition, the genes encoding enzymes involved in degradation of cellulose, xylan, lignin, starch, mannan, galactoside and arabinan were found in the B. velezensis 157 genome. Thus, the study of B. velezensis 157 shows that B. velezensis not only can be used as a biocontrol agent, but also has a potentially wide range of applications in lignocellulosic biomass conversion.


September 22, 2019

De novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust.

Oat crown rust, caused by the fungus Puccinia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae. IMPORTANCE: Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae, which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae, resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae. Copyright © 2018 Miller et al.


September 22, 2019

Bacterial artificial chromosome clones randomly selected for sequencing reveal genomic differences between soybean cultivars

This study pioneered the use of multiple technologies to combine the bacterial artificial chromosome (BAC) pooling strategy with high-throughput next- and third-generation sequencing technologies to analyse genomic difference. To understand the genetic background of the Chinese soybean cultivar N23601, we built a BAC library and sequenced 10 randomly selected clones followed by de novo assembly. Comparative analysis was conducted against the reference genome of Glycine max var. Williams 82 (2.0). Therefore, our result is an assessment of the reference genome. Our results revealed that 3517 single nucleotide polymorphisms (SNPs) and 662 insertion–deletions (InDels) occurred in ~1.2 Mb of the genomic region and that four of the 10 BAC clones contained 15 large structural variations (72,887 bp) compared with the reference genome. Gene annotation of the reference genome showed that Glyma.18g181000 was missing from the corresponding position of the 10 BAC clones. Additionally, there may be a problem with the assembly of some positions of the reference genome. Several gap regions in the reference genome could be supplemented by using the complete sequence of the 10 BAC clones. We believe that accurate and complete BAC sequence is a valuable resource that contributes to the completeness of the reference genome.


September 22, 2019

Induced salt tolerance of perennial ryegrass by a novel bacterium strain from the rhizosphere of a desert shrub Haloxylon ammodendron.

Drought and soil salinity reduce agricultural output worldwide. Plant-growth-promoting rhizobacteria (PGPR) can enhance plant growth and augment plant tolerance to biotic and abiotic stresses. Haloxylon ammodendron, a C4 perennial succulent xerohalophyte shrub with excellent drought and salt tolerance, is naturally distributed in the desert area of northwest China. In our previous work, a bacterium strain numbered as M30-35 was isolated from the rhizosphere of H. ammodendron in Tengger desert, Gansu province, northwest China. In the current work, the effects of M30-35 inoculation on salt tolerance of perennial ryegrass were evaluated and its genome was sequenced to identify genes associated with plant growth promotion. Results showed that M30-35 significantly enhanced growth and salt tolerance of perennial ryegrass by increasing shoot fresh and dry weights, chlorophyll content, root volume, root activity, leaf catalase activity, soluble sugar and proline contents that contributed to reduced osmotic potential, tissue K⁺ content and K⁺/Na⁺ ratio, while decreasing malondialdehyde (MDA) content and relative electric conductivity (REC), especially under higher salinity. The genome of M30-35 contains 4421 protein encoding genes, 12 rRNA, 63 tRNA-encoding genes and four rRNA operons. M30-35 was initially classified as a new species in Pseudomonas and named as Pseudomonas sp. M30-35. Thirty-four genes showing homology to genes associated with PGPR traits and abiotic stress tolerance were identified in the Pseudomonas sp. M30-35 genome, including 12 related to insoluble phosphorus solubilization, four to auxin biosynthesis, four to other processes of growth promotion, seven to oxidative stress alleviation, four to salt and drought tolerance and three to cold and heat tolerance. Further study is needed to clarify the correlation between these genes from M30-35 and the salt stress alleviation of inoculated plants under salt stress. Overall, our research indicated that desert shrubs appear rich in PGPRs that can help important crops tolerate abiotic stress.


September 22, 2019

Pantoea ananatis genetic diversity analysis reveals limited genomic diversity as well as accessory genes correlated with onion pathogenicity.

Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion, causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying disease potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne, that correlate with onion pathogenicity and were observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathogenic disease necessitates the pan-genomic analysis performed in this study.


September 22, 2019

Enterobacter bugandensis: a novel enterobacterial species associated with severe clinical infection.

Nosocomial pathogens can cause life-threatening infections in neonates and immunocompromised patients. E. bugandensis (EB-247) is a recently described species of Enterobacter, associated with neonatal sepsis. Here we demonstrate that the extended-spectrum β-lactamase (ESBL)-producing isolate EB-247 is highly virulent in both Galleria mellonella and mouse models of infection. Infection studies in a streptomycin-treated mouse model showed that EB-247 is as efficient as Salmonella Typhimurium in inducing systemic infection and release of proinflammatory cytokines. Sequencing and analysis of the complete genome and plasmid revealed that virulence properties are associated with the chromosome, while antibiotic-resistance genes are exclusively present on a 299 kb IncHI plasmid. EB-247 grew in high concentrations of human serum, indicating septicemic potential. Using whole genome-based transcriptome analysis we found 7% of the genome was mobilized for growth in serum. Upregulated genes include those involved in iron uptake and storage as well as metabolism. The lasso peptide microcin J25 (MccJ25), an inhibitor of iron-uptake and RNA polymerase activity, inhibited EB-247 growth. Our studies indicate that Enterobacter bugandensis is a highly pathogenic species of the genus Enterobacter. Further studies on the colonization and virulence potential of E. bugandensis and its association with septicemic infection are now warranted.


September 22, 2019

Biodegradation of di-n-butyl phthalate (DBP) by a novel endophytic Bacillus megaterium strain YJB3.

Phthalic acid esters (PAEs) are a group of recalcitrant and hazardous organic compounds that pose a great threat to both ecosystems and human beings. A novel endophytic strain YJB3 that could utilize a wide range of PAEs as the sole carbon and energy sources for cell growth was isolated from Canna indica root tissue. It was identified as Bacillus megaterium based on morphological characteristics and 16S rDNA sequence homology analysis. The degradation capability of the strain YJB3 was investigated by incubation in mineral salt medium containing di-n-butyl phthalate (DBP), one of the important PAEs, under different environmental conditions, showing 82.5% DBP removal in 5 days of incubation under the optimum conditions (acetate 1.2 g·L⁻¹, inocula 1.8%, and temperature 34.2 °C) achieved by two-step sequential optimization technologies. The DBP metabolites including mono-butyl phthalate (MBP), phthalic acid (PA), protocatechuic acid (PCA), etc. were determined by GC-MS. The PCA catabolic genes responsible for the aromatic ring cleavage of PCA in the strain YJB3 were excavated by whole-genome sequencing. Thus, a degradation pathway of DBP by the strain YJB3 was proposed in which MBP was formed, followed by PA, and then the intermediates were further utilized till complete degradation. To our knowledge, this is the first study to show the biodegradation of PAEs using an endophyte. The results in the present study suggest that the strain YJB3 is greatly promising to act as a competent inoculum in removal of PAEs in both soils and crops. Copyright © 2017 Elsevier B.V. All rights reserved.


September 22, 2019

Rhizospheric microbial communities are driven by Panax ginseng at different growth stages and biocontrol bacteria alleviates replanting mortality

The cultivation of Panax plants is hindered by replanting problems, which may be caused by plant-driven changes in the soil microbial community. Inoculation with microbial antagonists may efficiently alleviate replanting issues. Through high-throughput sequencing, this study revealed that bacterial diversity decreased, whereas fungal diversity increased, in the rhizosphere soils of adult ginseng plants at the root growth stage under different ages. Few microbial community, such as Luteolibacter, Cytophagaceae, Luteibacter, Sphingomonas, Sphingomonadaceae, and Zygomycota, were observed; the relative abundance of microorganisms, namely, Brevundimonas, Enterobacteriaceae, Pandoraea, Cantharellales, Dendryphion, Fusarium, and Chytridiomycota, increased in the soils of adult ginseng plants compared with those in the soils of 2-year-old seedlings. Bacillus subtilis 50-1, a microbial antagonist against the pathogenic Fusarium oxysporum, was isolated through a dual culture technique. These bacteria acted with a biocontrol efficacy of 67.8%. The ginseng death rate and Fusarium abundance decreased by 63.3% and 46.1%, respectively, after inoculation with B. subtilis 50-1. Data revealed that microecological degradation could result from ginseng-driven changes in rhizospheric microbial communities; these changes are associated with the different ages and developmental stages of ginseng plants. Biocontrol using microbial antagonists alleviated the replanting problem.


September 22, 2019

The hardy rubber tree genome provides insights into the evolution of polyisoprene biosynthesis.

Eucommia ulmoides, also called hardy rubber tree, is an economically important tree; however, the lack of its genome sequence restricts the fundamental biological research and applied studies of this plant species. Here, we present a high-quality assembly of its ~1.2-Gb genome (scaffold N50 = 1.88 Mb) with at least 26 723 predicted genes for E. ulmoides, the first sequenced genome of the order Garryales, which was obtained using an integrated strategy combining Illumina sequencing, PacBio sequencing, and BioNano mapping. As a sister taxon to lamiids and campanulids, E. ulmoides underwent an ancient genome triplication shared by core eudicots but no further whole-genome duplication in the last ~125 million years. E. ulmoides exhibits high expression levels and/or gene number expansion for multiple genes involved in stress responses and the biosynthesis of secondary metabolites, which may account for its considerable environmental adaptability. In contrast to the rubber tree (Hevea brasiliensis), which produces cis-polyisoprene, E. ulmoides has evolved to synthesize long-chain trans-polyisoprene via farnesyl diphosphate synthases (FPSs). Moreover, FPS and rubber elongation factor/small rubber particle protein gene families were expanded independently from the H. brasiliensis lineage. These results provide new insights into the biology of E. ulmoides and the origin of polyisoprene biosynthesis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Genotype assembly, biological activity and adaptation of spatially separated isolates of Spodoptera litura nucleopolyhedrovirus.

The cotton leafworm Spodoptera litura is a polyphagous insect. It has recently made a comeback as a primary insect pest of cotton in Pakistan due to reductions in pesticide use on the advent of genetically modified cotton, resistant to Helicoverpa armigera. Spodoptera litura nucleopolyhedrovirus (SpltNPV) infects S. litura and is recognized as a potential candidate to control this insect. Twenty-two NPV isolates were collected from S. litura from different agro-ecological zones (with collection sites up to 600 km apart) and cropping systems in Pakistan to see whether there is spatial dispersal and adaptation of the virus and/or adaptation to crops. Therefore, the genetic make-up and biological activity of these isolates was measured. Among the SpltNPV isolates tested for speed of kill in 3rd instar larvae of S. litura, TAX1, SFD1, SFD2 and GRW1 were significantly faster killing isolates than other Pakistani isolates. Restriction fragment length analysis of the DNA showed that the Pakistan SpltNPV isolates are all variants of a single SpltNPV biotype. The isolates could be grouped into three genogroups (A-C). The speed of kill of genogroup A viruses was higher than in group C according to a Cox’ proportional hazards analysis. Sequence analysis showed that the Pakistan SpltNPV isolates are more closely related to each other than to the SpltNPV type species G2 (Pang et al., 2001). This suggests a single introduction of SpltNPV into Pakistan. The SpltNPV-PAK isolates are distinct from Spodoptera littoralis nucleopolyhedrovirus. There was a strong correlation between geographic spread and the genetic variation of SpltNPV, and a marginally significant correlation between the latter and the cropping system. The faster killing isolates may be good candidates for biological control of S. litura in Pakistan. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

Dynamic evolution of α-gliadin prolamin gene family in homeologous genomes of hexaploid wheat.

Wheat Gli-2 loci encode complex groups of α-gliadin prolamins that are important for breadmaking, but also major triggers of celiac disease (CD). Elucidation of α-gliadin evolution provides knowledge to produce wheat with better end-use properties and reduced immunogenic potential. The Gli-2 loci contain a large number of tandemly duplicated genes and highly repetitive DNA, making sequence assembly of their genomic regions challenging. Here, we constructed high-quality sequences spanning the three wheat homeologous α-gliadin loci by aligning PacBio-based sequence contigs with BioNano genome maps. A total of 47 α-gliadin genes were identified with only 26 encoding intact full-length protein products. Analyses of α-gliadin loci and phylogenetic tree reconstruction indicate significant duplications of α-gliadin genes in the last ~2.5 million years after the divergence of the A, B and D genomes, supporting its rapid lineage-independent expansion in different Triticeae genomes. We showed that dramatic divergence in expression of α-gliadin genes could not be attributed to sequence variations in the promoter regions. The study also provided insights into the evolution of CD epitopes and identified a single indel event in the hexaploid wheat D genome that likely resulted in the generation of the highly toxic 33-mer CD epitope.

