Menu
July 7, 2019  |  

Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches.

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable amount of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pdeobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which implements corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome. Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.


July 7, 2019  |  

Draft genome sequence of Kluyveromyces marxianus strain DMB1, isolated from sugarcane bagasse hydrolysate.

We determined the genome sequence of a thermotolerant yeast, Kluyveromyces marxianus strain DMB1, isolated from sugarcane bagasse hydrolysate, and the sequence provides further insights into the genomic differences between this strain and other reported K. marxianus strains. The genome described here is composed of 11,165,408 bases and has 4,943 protein-coding genes. Copyright © 2014 Suzuki et al.


July 7, 2019  |  

Whole genome sequencing analysis of the cutaneous pathogenic yeast Malassezia restricta and identification of the major lipase expressed on the scalp of patients with dandruff.

Malassezia species are opportunistic pathogenic fungi that are frequently associated with seborrhoeic dermatitis, including dandruff. Most Malassezia species are lipid dependent, a property that is compensated by breaking down host sebum into fatty acids by lipases. In this study, we aimed to sequence and analyse the whole genome of Malassezia restricta KCTC 27527, a clinical isolate from a Korean patient with severe dandruff, to search for lipase orthologues and identify the lipase that is the most frequently expressed on the scalp of patients with dandruff. The genome of M. restricta KCTC 27527 was sequenced using the Illumina MiSeq and PacBio platforms. Lipase orthologues were identified by comparison with known lipase genes in the genomes of Malassezia globosa and Malassezia sympodialis. The expression of the identified lipase genes was directly evaluated in swab samples from the scalps of 56 patients with dandruff. We found that, among the identified lipase-encoding genes, the gene encoding lipase homolog MRES_03670, named LIP5 in this study, was the most frequently expressed lipase in the swab samples. Our study provides an overview of the genome of a clinical isolate of M. restricta and fundamental information for elucidating the role of lipases during fungus-host interaction.© 2016 Blackwell Verlag GmbH.


July 7, 2019  |  

Competition assays and physiological experiments of soil and phyllosphere yeasts identify Candida subhashii as a novel antagonist of filamentous fungi.

While recent advances in next generation sequencing technologies have enabled researchers to readily identify countless microbial species in soil, rhizosphere, and phyllosphere microbiomes, the biological functions of the majority of these species are unknown. Functional studies are therefore urgently needed in order to characterize the plethora of microorganisms that are being identified and to point out species that may be used for biotechnology or plant protection. Here, we used a dual culture assay and growth analyses to characterise yeasts (40 different isolates) and their antagonistic effect on 16 filamentous fungi; comprising plant pathogens, antagonists, and saprophytes.Overall, this competition screen of 640 pairwise combinations revealed a broad range of outcomes, ranging from small stimulatory effects of some yeasts up to a growth inhibition of more than 80% by individual species. On average, yeasts isolated from soil suppressed filamentous fungi more strongly than phyllosphere yeasts and the antagonistic activity was a species-/isolate-specific property and not dependent on the filamentous fungus a yeast was interacting with. The isolates with the strongest antagonistic activity were Metschnikowia pulcherrima, Hanseniaspora sp., Cyberlindnera sargentensis, Aureobasidium pullulans, Candida subhashii, and Pichia kluyveri. Among these, the soil yeasts (C. sargentensis, A. pullulans, C. subhashii) assimilated and/or oxidized more di-, tri- and tetrasaccharides and organic acids than yeasts from the phyllosphere. Only the two yeasts C. subhashii and M. pulcherrima were able to grow with N-acetyl-glucosamine as carbon source.The competition assays and physiological experiments described here identified known antagonists that have been implicated in the biological control of plant pathogenic fungi in the past, but also little characterised species such as C. subhashii. Overall, soil yeasts were more antagonistic and metabolically versatile than yeasts from the phyllosphere. Noteworthy was the strong antagonistic activity of the soil yeast C. subhashii, which had so far only been described from a clinical sample and not been studied with respect to biocontrol. Based on binary competition assays and growth analyses (e.g., on different carbon sources, growth in root exudates), C. subhashii was identified as a competitive and antagonistic soil yeast with potential as a novel biocontrol agent against plant pathogenic fungi.


July 7, 2019  |  

Identification of small RNAs in extracellular vesicles from the commensal yeast Malassezia sympodialis.

Malassezia is the dominant fungus in the human skin mycobiome and is associated with common skin disorders including atopic eczema (AE)/dermatitis. Recently, it was found that Malassezia sympodialis secretes nanosized exosome-like vesicles, designated MalaEx, that carry allergens and can induce inflammatory cytokine responses. Extracellular vesicles from different cell-types including fungi have been found to deliver functional RNAs to recipient cells. In this study we assessed the presence of small RNAs in MalaEx and addressed if the levels of these RNAs differ when M. sympodialis is cultured at normal human skin pH versus the elevated pH present on the skin of patients with AE. The total number and the protein concentration of the released MalaEx harvested after 48?h culture did not differ significantly between the two pH conditions nor did the size of the vesicles. From small RNA sequence data, we identified a set of reads with well-defined start and stop positions, in a length range of 16 to 22 nucleotides consistently present in the MalaEx. The levels of small RNAs were not significantly differentially expressed between the two different pH conditions indicating that they are not influenced by the elevated pH level observed on the AE skin.


July 7, 2019  |  

Simultaneous emergence of multidrug-resistant Candida auris on 3 continents confirmed by whole-genome sequencing and epidemiological analyses.

Candida auris, a multidrug-resistant yeast that causes invasive infections, was first described in 2009 in Japan and has since been reported from several countries.To understand the global emergence and epidemiology of C. auris, we obtained isolates from 54 patients with C. auris infection from Pakistan, India, South Africa, and Venezuela during 2012-2015 and the type specimen from Japan. Patient information was available for 41 of the isolates. We conducted antifungal susceptibility testing and whole-genome sequencing (WGS).Available clinical information revealed that 41% of patients had diabetes mellitus, 51% had undergone recent surgery, 73% had a central venous catheter, and 41% were receiving systemic antifungal therapy when C. auris was isolated. The median time from admission to infection was 19 days (interquartile range, 9-36 days), 61% of patients had bloodstream infection, and 59% died. Using stringent break points, 93% of isolates were resistant to fluconazole, 35% to amphotericin B, and 7% to echinocandins; 41% were resistant to 2 antifungal classes and 4% were resistant to 3 classes. WGS demonstrated that isolates were grouped into unique clades by geographic region. Clades were separated by thousands of single-nucleotide polymorphisms, but within each clade isolates were clonal. Different mutations in ERG11 were associated with azole resistance in each geographic clade.C. auris is an emerging healthcare-associated pathogen associated with high mortality. Treatment options are limited, due to antifungal resistance. WGS analysis suggests nearly simultaneous, and recent, independent emergence of different clonal populations on 3 continents. Risk factors and transmission mechanisms need to be elucidated to guide control measures. Published by Oxford University Press for the Infectious Diseases Society of America 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.


July 7, 2019  |  

Complete genome sequence and comparative genomics of the probiotic yeast Saccharomyces boulardii.

The probiotic yeast, Saccharomyces boulardii (Sb) is known to be effective against many gastrointestinal disorders and antibiotic-associated diarrhea. To understand molecular basis of probiotic-properties ascribed to Sb we determined the complete genomes of two strains of Sb i.e. Biocodex and unique28 and the draft genomes for three other Sb strains that are marketed as probiotics in India. We compared these genomes with 145 strains of S. cerevisiae (Sc) to understand genome-level similarities and differences between these yeasts. A distinctive feature of Sb from other Sc is absence of Ty elements Ty1, Ty3, Ty4 and associated LTR. However, we could identify complete Ty2 and Ty5 elements in Sb. The genes for hexose transporters HXT11 and HXT9, and asparagine-utilization are absent in all Sb strains. We find differences in repeat periods and copy numbers of repeats in flocculin genes that are likely related to the differential adhesion of Sb as compared to Sc. Core-proteome based taxonomy places Sb strains along with wine strains of Sc. We find the introgression of five genes from Z. bailii into the chromosome IV of Sb and wine strains of Sc. Intriguingly, genes involved in conferring known probiotic properties to Sb are conserved in most Sc strains.


July 7, 2019  |  

HINGE: long-read assembly achieves optimal repeat resolution.

Long-read sequencing technologies have the potential to produce gold-standard de novo genome assemblies, but fully exploiting error-prone reads to resolve repeats remains a challenge. Aggressive approaches to repeat resolution often produce misassemblies, and conservative approaches lead to unnecessary fragmentation. We present HINGE, an assembler that seeks to achieve optimal repeat resolution by distinguishing repeats that can be resolved given the data from those that cannot. This is accomplished by adding “hinges” to reads for constructing an overlap graph where only unresolvable repeats are merged. As a result, HINGE combines the error resilience of overlap-based assemblers with repeat-resolution capabilities of de Bruijn graph assemblers. HINGE was evaluated on the long-read bacterial data sets from the NCTC project. HINGE produces more finished assemblies than Miniasm and the manual pipeline of NCTC based on the HGAP assembler and Circlator. HINGE also allows us to identify 40 data sets where unresolvable repeats prevent the reliable construction of a unique finished assembly. In these cases, HINGE outputs a visually interpretable assembly graph that encodes all possible finished assemblies consistent with the reads, while other approaches such as the NCTC pipeline and FALCON either fragment the assembly or resolve the ambiguity arbitrarily.© 2017 Kamath et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019  |  

Genome sequences of Cyberlindnera fabianii 65, Pichia kudriavzevii 129, and Saccharomyces cerevisiae 131 isolated from fermented masau fruits in Zimbabwe.

Cyberlindnera fabianii 65, Pichia kudriavzevii 129, and Saccharomyces cerevisiae 131 have been isolated from the microbiota of fermented masau fruits. C. fabianii and P. kudriavzevii especially harbor promising features for biotechnology and food applications. Here, we present the draft annotated genome sequences of these isolates. Copyright © 2017 van Rijswijck et al.


July 7, 2019  |  

De novo yeast genome assemblies from MinION, PacBio and MiSeq platforms.

Long-read sequencing technologies such as Pacific Biosciences and Oxford Nanopore MinION are capable of producing long sequencing reads with average fragment lengths of over 10,000 base-pairs and maximum lengths reaching 100,000 base- pairs. Compared with short reads, the assemblies obtained from long-read sequencing platforms have much higher contig continuity and genome completeness as long fragments are able to extend paths into problematic or repetitive regions. Many successful assembly applications of the Pacific Biosciences technology have been reported ranging from small bacterial genomes to large plant and animal genomes. Recently, genome assemblies using Oxford Nanopore MinION data have attracted much attention due to the portability and low cost of this novel sequencing instrument. In this paper, we re-sequenced a well characterized genome, the Saccharomyces cerevisiae S288C strain using three different platforms: MinION, PacBio and MiSeq. We present a comprehensive metric comparison of assemblies generated by various pipelines and discuss how the platform associated data characteristics affect the assembly quality. With a given read depth of 31X, the assemblies from both Pacific Biosciences and Oxford Nanopore MinION show excellent continuity and completeness for the 16 nuclear chromosomes, but not for the mitochondrial genome, whose reconstruction still represents a significant challenge.


July 7, 2019  |  

Whole genome sequence of the heterozygous clinical isolate Candida krusei 81-B-5.

Candida krusei is a diploid, heterozygous yeast that is an opportunistic fungal pathogen in immunocompromised patients. This species also is utilized for fermenting cocoa beans during chocolate production. One major concern in the clinical setting is the innate resistance of this species to the most commonly used antifungal drug fluconazole. Here we report a high-quality genome sequence and assembly for the first clinical isolate of C. krusei, strain 81-B-5, into 11 scaffolds generated with PacBio sequencing technology. Gene annotation and comparative analysis revealed a unique profile of transporters that could play a role in drug resistance or adaptation to different environments. In addition, we show that while 82% of the genome is highly heterozygous, a 2.0 Mb region of the largest scaffold has undergone loss of heterozygosity. This genome will serve as a reference for further genetic studies of this pathogen. Copyright © 2017 Author et al.


July 7, 2019  |  

Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis.

Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019  |  

A large gene family in fission yeast encodes spore killers that subvert Mendel’s law.

Spore killers in fungi are selfish genetic elements that distort Mendelian segregation in their favor. It remains unclear how many species harbor them and how diverse their mechanisms are. Here, we discover two spore killers from a natural isolate of the fission yeast Schizosaccharomyces pombe. Both killers belong to the previously uncharacterized wtf gene family with 25 members in the reference genome. These two killers act in strain-background-independent and genome-location-independent manners to perturb the maturation of spores not inheriting them. Spores carrying one killer are protected from its killing effect but not that of the other killer. The killing and protecting activities can be uncoupled by mutation. The numbers and sequences of wtf genes vary considerably between S. pombe isolates, indicating rapid divergence. We propose that wtf genes contribute to the extensive intraspecific reproductive isolation in S. pombe, and represent ideal models for understanding how segregation-distorting elements act and evolve.


July 7, 2019  |  

The dynamic three-dimensional organization of the diploid yeast genome.

The budding yeast Saccharomyces cerevisiae is a long-standing model for the three-dimensional organization of eukaryotic genomes. However, even in this well-studied model, it is unclear how homolog pairing in diploids or environmental conditions influence overall genome organization. Here, we performed high-throughput chromosome conformation capture on diverged Saccharomyces hybrid diploids to obtain the first global view of chromosome conformation in diploid yeasts. After controlling for the Rabl-like orientation using a polymer model, we observe significant homolog proximity that increases in saturated culture conditions. Surprisingly, we observe a localized increase in homologous interactions between the HAS1-TDA1 alleles specifically under galactose induction and saturated growth. This pairing is accompanied by relocalization to the nuclear periphery and requires Nup2, suggesting a role for nuclear pore complexes. Together, these results reveal that the diploid yeast genome has a dynamic and complex 3D organization.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.