Menu
July 7, 2019

Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches.

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable amount of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pdeobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which implements corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome. Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.


July 7, 2019

Botrytis, the good, the bad and the ugly

Botrytis spp. are efficient pathogens, causing devastating diseases and significant crop losses in a wide variety of plant species. Here we outline our review of these pathogens, as well as highlight the major advances of the past 10 years in studying Botrytis in interaction with its hosts. Progress in molecular genetics and the development of relevant phylogenetic markers in particular, has resulted in the characterisation of approximately 30 species. The host range of Botrytis spp. includes plant species that are members of 170 families of cultivated plants.


July 7, 2019

Draft genome sequence of Kluyveromyces marxianus strain DMB1, isolated from sugarcane bagasse hydrolysate.

We determined the genome sequence of a thermotolerant yeast, Kluyveromyces marxianus strain DMB1, isolated from sugarcane bagasse hydrolysate, and the sequence provides further insights into the genomic differences between this strain and other reported K. marxianus strains. The genome described here is composed of 11,165,408 bases and has 4,943 protein-coding genes. Copyright © 2014 Suzuki et al.


July 7, 2019

Whole-genome analysis of Exserohilum rostratum from an outbreak of fungal meningitis and other infections.

Exserohilum rostratum was the cause of most cases of fungal meningitis and other infections associated with the injection of contaminated methylprednisolone acetate produced by the New England Compounding Center (NECC). Until this outbreak, very few human cases of Exserohilum infection had been reported, and very little was known about this dematiaceous fungus, which usually infects plants. Here, we report using whole-genome sequencing (WGS) for the detection of single nucleotide polymorphisms (SNPs) and phylogenetic analysis to investigate the molecular origin of the outbreak using 22 isolates of E. rostratum retrieved from 19 case patients with meningitis or epidural/spinal abscesses, 6 isolates from contaminated NECC vials, and 7 isolates unrelated to the outbreak. Our analysis indicates that all 28 isolates associated with the outbreak had nearly identical genomes of 33.8 Mb. A total of 8 SNPs were detected among the outbreak genomes, with no more than 2 SNPs separating any 2 of the 28 genomes. The outbreak genomes were separated from the next most closely related control strain by ~136,000 SNPs. We also observed significant genomic variability among strains unrelated to the outbreak, which may suggest the possibility of cryptic speciation in E. rostratum. Copyright © 2014, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Signature gene expression reveals novel clues to the molecular mechanisms of dimorphic transition in Penicillium marneffei.

Systemic dimorphic fungi cause more than one million new infections each year, ranking them among the significant public health challenges currently encountered. Penicillium marneffei is a systemic dimorphic fungus endemic to Southeast Asia. The temperature-dependent dimorphic phase transition between mycelium and yeast is considered crucial for the pathogenicity and transmission of P. marneffei, but the underlying mechanisms are still poorly understood. Here, we re-sequenced P. marneffei strain PM1 using multiple sequencing platforms and assembled the genome using hybrid genome assembly. We determined gene expression levels using RNA sequencing at the mycelial and yeast phases of P. marneffei, as well as during phase transition. We classified 2,718 genes with variable expression across conditions into 14 distinct groups, each marked by a signature expression pattern implicated at a certain stage in the dimorphic life cycle. Genes with the same expression patterns tend to be clustered together on the genome, suggesting orchestrated regulations of the transcriptional activities of neighboring genes. Using qRT-PCR, we validated expression levels of all genes in one of clusters highly expressed during the yeast-to-mycelium transition. These included madsA, a gene encoding MADS-box transcription factor whose gene family is exclusively expanded in P. marneffei. Over-expression of madsA drove P. marneffei to undergo mycelial growth at 37°C, a condition that restricts the wild-type in the yeast phase. Furthermore, analyses of signature expression patterns suggested diverse roles of secreted proteins at different developmental stages and the potential importance of non-coding RNAs in mycelium-to-yeast transition. We also showed that RNA structural transition in response to temperature changes may be related to the control of thermal dimorphism. Together, our findings have revealed multiple molecular mechanisms that may underlie the dimorphic transition in P. marneffei, providing a powerful foundation for identifying molecular targets for mechanism-based interventions.


July 7, 2019

Draft genome sequence of the pathogenic fungus Scedosporium apiospermum.

The first genome of one species of the Scedosporium apiospermum complex, responsible for localized to severe disseminated infections according to the immune status of the host, will contribute to a better understanding of the pathogenicity of these fungi and also to the discovery of the mechanisms underlying their low susceptibility to current antifungals. Copyright © 2014 Vandeputte et al.


July 7, 2019

Get your high-quality low-cost genome sequence.

The study of whole-genome sequences has become essential for almost all branches of biological research. Next-generation sequencing (NGS) has revolutionized the scalability, speed, and resolution of sequencing and brought genomic science within reach of academic laboratories that study non-model organisms. Here, we show that a high-quality draft genome of a eukaryote can be obtained at relatively low cost by exploiting a hybrid combination of sequencing strategies. Copyright © 2014 Elsevier Ltd. All rights reserved.


July 7, 2019

The genome of the anaerobic fungus Orpinomyces sp. strain C1A reveals the unique evolutionary history of a remarkable plant biomass degrader.

Anaerobic gut fungi represent a distinct early-branching fungal phylum (Neocallimastigomycota) and reside in the rumen, hindgut, and feces of ruminant and nonruminant herbivores. The genome of an anaerobic fungal isolate, Orpinomyces sp. strain C1A, was sequenced using a combination of Illumina and PacBio single-molecule real-time (SMRT) technologies. The large genome (100.95 Mb, 16,347 genes) displayed extremely low G+C content (17.0%), large noncoding intergenic regions (73.1%), proliferation of microsatellite repeats (4.9%), and multiple gene duplications. Comparative genomic analysis identified multiple genes and pathways that are absent in Dikarya genomes but present in early-branching fungal lineages and/or nonfungal Opisthokonta. These included genes for posttranslational fucosylation, the production of specific intramembrane proteases and extracellular protease inhibitors, the formation of a complete axoneme and intraflagellar trafficking machinery, and a near-complete focal adhesion machinery. Analysis of the lignocellulolytic machinery in the C1A genome revealed an extremely rich repertoire, with evidence of horizontal gene acquisition from multiple bacterial lineages. Experimental analysis indicated that strain C1A is a remarkable biomass degrader, capable of simultaneous saccharification and fermentation of the cellulosic and hemicellulosic fractions in multiple untreated grasses and crop residues examined, with the process significantly enhanced by mild pretreatments. This capability, acquired during its separate evolutionary trajectory in the rumen, along with its resilience and invasiveness compared to prokaryotic anaerobes, renders anaerobic fungi promising agents for consolidated bioprocessing schemes in biofuels production.


July 7, 2019

A gapless genome sequence of the fungus Botrytis cinerea.

Following earlier incomplete and fragmented versions of a genome sequence for the grey mould Botrytis cinerea, we here report a gapless, near-finished genome sequence for B. cinerea strain B05.10. The assembly comprises 18 chromosomes and was confirmed by an optical map and a genetic map based on ~75 000 SNP markers. All chromosomes contain fully assembled centromeric regions, and 10 chromosomes have telomeres on both ends. The genetic map consisted of 4153 cM and comparison of genetic distances with the physical distances identified 40 recombination hotspots. The linkage map also identified two mutations, located in the previously described genes Bos1 and BcsdhB, that confer resistance to the fungicides boscalid and iprodione. The genome was predicted to encode 11 701 proteins. RNAseq data from >20 different samples were used to validate and improve gene models. Manual curation of chromosome 1 revealed interesting features, such as the occurrence of a dicistronic transcript and fully overlapping genes in opposite orientations, as well as many spliced antisense transcripts. Manual curation also revealed that UTRs of genes can be complex and long, with many UTRs exceeding lengths of 1 kb and possessing multiple introns. Community annotation is in progress. This article is protected by copyright. All rights reserved. © 2016 BSPP AND JOHN WILEY & SONS LTD.


July 7, 2019

Whole genome sequencing analysis of the cutaneous pathogenic yeast Malassezia restricta and identification of the major lipase expressed on the scalp of patients with dandruff.

Malassezia species are opportunistic pathogenic fungi that are frequently associated with seborrhoeic dermatitis, including dandruff. Most Malassezia species are lipid dependent, a property that is compensated by breaking down host sebum into fatty acids by lipases. In this study, we aimed to sequence and analyse the whole genome of Malassezia restricta KCTC 27527, a clinical isolate from a Korean patient with severe dandruff, to search for lipase orthologues and identify the lipase that is the most frequently expressed on the scalp of patients with dandruff. The genome of M. restricta KCTC 27527 was sequenced using the Illumina MiSeq and PacBio platforms. Lipase orthologues were identified by comparison with known lipase genes in the genomes of Malassezia globosa and Malassezia sympodialis. The expression of the identified lipase genes was directly evaluated in swab samples from the scalps of 56 patients with dandruff. We found that, among the identified lipase-encoding genes, the gene encoding lipase homolog MRES_03670, named LIP5 in this study, was the most frequently expressed lipase in the swab samples. Our study provides an overview of the genome of a clinical isolate of M. restricta and fundamental information for elucidating the role of lipases during fungus-host interaction.© 2016 Blackwell Verlag GmbH.


July 7, 2019

Competition assays and physiological experiments of soil and phyllosphere yeasts identify Candida subhashii as a novel antagonist of filamentous fungi.

While recent advances in next generation sequencing technologies have enabled researchers to readily identify countless microbial species in soil, rhizosphere, and phyllosphere microbiomes, the biological functions of the majority of these species are unknown. Functional studies are therefore urgently needed in order to characterize the plethora of microorganisms that are being identified and to point out species that may be used for biotechnology or plant protection. Here, we used a dual culture assay and growth analyses to characterise yeasts (40 different isolates) and their antagonistic effect on 16 filamentous fungi; comprising plant pathogens, antagonists, and saprophytes.Overall, this competition screen of 640 pairwise combinations revealed a broad range of outcomes, ranging from small stimulatory effects of some yeasts up to a growth inhibition of more than 80% by individual species. On average, yeasts isolated from soil suppressed filamentous fungi more strongly than phyllosphere yeasts and the antagonistic activity was a species-/isolate-specific property and not dependent on the filamentous fungus a yeast was interacting with. The isolates with the strongest antagonistic activity were Metschnikowia pulcherrima, Hanseniaspora sp., Cyberlindnera sargentensis, Aureobasidium pullulans, Candida subhashii, and Pichia kluyveri. Among these, the soil yeasts (C. sargentensis, A. pullulans, C. subhashii) assimilated and/or oxidized more di-, tri- and tetrasaccharides and organic acids than yeasts from the phyllosphere. Only the two yeasts C. subhashii and M. pulcherrima were able to grow with N-acetyl-glucosamine as carbon source.The competition assays and physiological experiments described here identified known antagonists that have been implicated in the biological control of plant pathogenic fungi in the past, but also little characterised species such as C. subhashii. Overall, soil yeasts were more antagonistic and metabolically versatile than yeasts from the phyllosphere. Noteworthy was the strong antagonistic activity of the soil yeast C. subhashii, which had so far only been described from a clinical sample and not been studied with respect to biocontrol. Based on binary competition assays and growth analyses (e.g., on different carbon sources, growth in root exudates), C. subhashii was identified as a competitive and antagonistic soil yeast with potential as a novel biocontrol agent against plant pathogenic fungi.


July 7, 2019

Identification of small RNAs in extracellular vesicles from the commensal yeast Malassezia sympodialis.

Malassezia is the dominant fungus in the human skin mycobiome and is associated with common skin disorders including atopic eczema (AE)/dermatitis. Recently, it was found that Malassezia sympodialis secretes nanosized exosome-like vesicles, designated MalaEx, that carry allergens and can induce inflammatory cytokine responses. Extracellular vesicles from different cell-types including fungi have been found to deliver functional RNAs to recipient cells. In this study we assessed the presence of small RNAs in MalaEx and addressed if the levels of these RNAs differ when M. sympodialis is cultured at normal human skin pH versus the elevated pH present on the skin of patients with AE. The total number and the protein concentration of the released MalaEx harvested after 48?h culture did not differ significantly between the two pH conditions nor did the size of the vesicles. From small RNA sequence data, we identified a set of reads with well-defined start and stop positions, in a length range of 16 to 22 nucleotides consistently present in the MalaEx. The levels of small RNAs were not significantly differentially expressed between the two different pH conditions indicating that they are not influenced by the elevated pH level observed on the AE skin.


July 7, 2019

Genome sequence of a unique Magnaporthe oryzae RMg-Dl isolate from India that causes blast disease in diverse cereal crops, obtained using PacBio single-molecule and Illumina HiSeq2500 sequencing.

The whole-genome assembly of a unique rice isolate from India, Magnaporthe oryzae RMg-Dl that causes blast disease in diverse cereal crops is presented. Analysis of the 34.82 Mb genome sequence will aid in better understanding the genetic determinants of host range, host jump, survival, pathogenicity, and virulence factors of M. oryzae. Copyright © 2017 Kumar et al.


July 7, 2019

Simultaneous emergence of multidrug-resistant Candida auris on 3 continents confirmed by whole-genome sequencing and epidemiological analyses.

Candida auris, a multidrug-resistant yeast that causes invasive infections, was first described in 2009 in Japan and has since been reported from several countries.To understand the global emergence and epidemiology of C. auris, we obtained isolates from 54 patients with C. auris infection from Pakistan, India, South Africa, and Venezuela during 2012-2015 and the type specimen from Japan. Patient information was available for 41 of the isolates. We conducted antifungal susceptibility testing and whole-genome sequencing (WGS).Available clinical information revealed that 41% of patients had diabetes mellitus, 51% had undergone recent surgery, 73% had a central venous catheter, and 41% were receiving systemic antifungal therapy when C. auris was isolated. The median time from admission to infection was 19 days (interquartile range, 9-36 days), 61% of patients had bloodstream infection, and 59% died. Using stringent break points, 93% of isolates were resistant to fluconazole, 35% to amphotericin B, and 7% to echinocandins; 41% were resistant to 2 antifungal classes and 4% were resistant to 3 classes. WGS demonstrated that isolates were grouped into unique clades by geographic region. Clades were separated by thousands of single-nucleotide polymorphisms, but within each clade isolates were clonal. Different mutations in ERG11 were associated with azole resistance in each geographic clade.C. auris is an emerging healthcare-associated pathogen associated with high mortality. Treatment options are limited, due to antifungal resistance. WGS analysis suggests nearly simultaneous, and recent, independent emergence of different clonal populations on 3 continents. Risk factors and transmission mechanisms need to be elucidated to guide control measures. Published by Oxford University Press for the Infectious Diseases Society of America 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.


July 7, 2019

Genome sequencing and analysis of Talaromyces pinophilus provide insights into biotechnological applications.

Species from the genus Talaromyces produce useful biomass-degrading enzymes and secondary metabolites. However, these enzymes and secondary metabolites are still poorly understood and have not been explored in depth because of a lack of comprehensive genetic information. Here, we report a 36.51-megabase genome assembly of Talaromyces pinophilus strain 1-95, with coverage of nine scaffolds of eight chromosomes with telomeric repeats at their ends and circular mitochondrial DNA. In total, 13,472 protein-coding genes were predicted. Of these, 803 were annotated to encode enzymes that act on carbohydrates, including 39 cellulose-degrading and 24 starch-degrading enzymes. In addition, 68 secondary metabolism gene clusters were identified, mainly including T1 polyketide synthase genes and nonribosomal peptide synthase genes. Comparative genomic analyses revealed that T. pinophilus 1-95 harbors more biomass-degrading enzymes and secondary metabolites than other related filamentous fungi. The prediction of the T. pinophilus 1-95 secretome indicated that approximately 50% of the biomass-degrading enzymes are secreted into the extracellular environment. These results expanded our genetic knowledge of the biomass-degrading enzyme system of T. pinophilus and its biosynthesis of secondary metabolites, facilitating the cultivation of T. pinophilus for high production of useful products.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.