Menu
July 7, 2019  |  

Improving and correcting the contiguity of long-read genome assemblies of three plant species using optical mapping and chromosome conformation capture data.

Long-read sequencing can overcome the weaknesses of short reads in the assembly of eukaryotic genomes, however, at present additional scaffolding is needed to achieve chromosome-level assemblies. We generated PacBio long-read data of the genomes of three relatives of the model plant Arabidopsis thaliana and assembled all three genomes into only a few hundred contigs. To improve the contiguities of these assemblies, we generated BioNano Genomics optical mapping and Dovetail Genomics chromosome conformation capture data for genome scaffolding. Despite their technical differences, optical mapping and chromosome conformation capture performed similarly and doubled N50 values. After improving both integration methods, assembly contiguity reached chromosome-arm-levels. We rigorously assessed the quality of contigs and scaffolds using Illumina mate-pair libraries and genetic map information. This showed that PacBio assemblies have high sequence accuracy but can contain several misassemblies, which join unlinked regions of the genome. Most, but not all of these mis-joints were removed during the integration of the optical mapping and chromosome conformation capture data. Even though none of the centromeres was fully assembled, the scaffolds revealed large parts of some centromeric regions, even including some of the heterochromatic regions, which are not present in gold standard reference sequences. Published by Cold Spring Harbor Laboratory Press.


July 7, 2019  |  

Sequencing and de novo assembly of a near complete indica rice genome.

A high-quality reference genome is critical for understanding genome structure, genetic variation and evolution of an organism. Here we report the de novo assembly of an indica rice genome Shuhui498 (R498) through the integration of single-molecule sequencing and mapping data, genetic map and fosmid sequence tags. The 390.3?Mb assembly is estimated to cover more than 99% of the R498 genome and is more continuous than the current reference genomes of japonica rice Nipponbare (MSU7) and Arabidopsis thaliana (TAIR10). We annotate high-quality protein-coding genes in R498 and identify genetic variations between R498 and Nipponbare and presence/absence variations by comparing them to 17 draft genomes in cultivated rice and its closest wild relatives. Our results demonstrate how to de novo assemble a highly contiguous and near-complete plant genome through an integrative strategy. The R498 genome will serve as a reference for the discovery of genes and structural variations in rice.


July 7, 2019  |  

Genome sequences of Cyberlindnera fabianii 65, Pichia kudriavzevii 129, and Saccharomyces cerevisiae 131 isolated from fermented masau fruits in Zimbabwe.

Cyberlindnera fabianii 65, Pichia kudriavzevii 129, and Saccharomyces cerevisiae 131 have been isolated from the microbiota of fermented masau fruits. C. fabianii and P. kudriavzevii especially harbor promising features for biotechnology and food applications. Here, we present the draft annotated genome sequences of these isolates. Copyright © 2017 van Rijswijck et al.


July 7, 2019  |  

High metabolic versatility of different toxigenic and non-toxigenic Clostridioides difficile isolates.

Clostridioides difficile (formerly Clostridium difficile) is a major nosocomial pathogen with an increasing number of community-acquired infections causing symptoms from mild diarrhea to life-threatening colitis. The pathogenicity of C. difficile is considered to be mainly associated with the production of genome-encoded toxins A and B. In addition, some strains also encode and express the binary toxin CDT. However; a large number of non-toxigenic C. difficile strains have been isolated from the human gut and the environment. In this study, we characterized the growth behavior, motility and fermentation product formation of 17 different C. difficile isolates comprising five different major genomic clades and five different toxin inventories in relation to the C. difficile model strains 630?erm and R20291. Within 33 determined fermentation products, we identified two yet undescribed products (5-methylhexanoate and 4-(methylthio)-butanoate) of C. difficile. Our data revealed major differences in the fermentation products obtained after growth in a medium containing casamino acids and glucose as carbon and energy source. While the metabolism of branched chain amino acids remained comparable in all isolates, the aromatic amino acid uptake and metabolism and the central carbon metabolism-associated fermentation pathways varied strongly between the isolates. The patterns obtained followed neither the classification of the clades nor the ribotyping patterns nor the toxin distribution. As the toxin formation is strongly connected to the metabolism, our data allow an improved differentiation of C. difficile strains. The observed metabolic flexibility provides the optimal basis for the adaption in the course of infection and to changing conditions in different environments including the human gut. Copyright © 2017 Elsevier GmbH. All rights reserved.


July 7, 2019  |  

Coping with living in the soil: the genome of the parthenogenetic springtail Folsomia candida.

Folsomia candida is a model in soil biology, belonging to the family of Isotomidae, subclass Collembola. It reproduces parthenogenetically in the presence of Wolbachia, and exhibits remarkable physiological adaptations to stress. To better understand these features and adaptations to life in the soil, we studied its genome in the context of its parthenogenetic lifestyle.We applied Pacific Bioscience sequencing and assembly to generate a reference genome for F. candida of 221.7 Mbp, comprising only 162 scaffolds. The complete genome of its endosymbiont Wolbachia, was also assembled and turned out to be the largest strain identified so far. Substantial gene family expansions and lineage-specific gene clusters were linked to stress response. A large number of genes (809) were acquired by horizontal gene transfer. A substantial fraction of these genes are involved in lignocellulose degradation. Also, the presence of genes involved in antibiotic biosynthesis was confirmed. Intra-genomic rearrangements of collinear gene clusters were observed, of which 11 were organized as palindromes. The Hox gene cluster of F. candida showed major rearrangements compared to arthropod consensus cluster, resulting in a disorganized cluster.The expansion of stress response gene families suggests that stress defense was important to facilitate colonization of soils. The large number of HGT genes related to lignocellulose degradation could be beneficial to unlock carbohydrate sources in soil, especially those contained in decaying plant and fungal organic matter. Intra- as well as inter-scaffold duplications of gene clusters may be a consequence of its parthenogenetic lifestyle. This high quality genome will be instrumental for evolutionary biologists investigating deep phylogenetic lineages among arthropods and will provide the basis for a more mechanistic understanding in soil ecology and ecotoxicology.


July 7, 2019  |  

Identification and resolution of microdiversity through metagenomic sequencing of parallel consortia.

To gain a predictive understanding of the interspecies interactions within microbial communities that govern community function, the genomic complement of every member population must be determined. Although metagenomic sequencing has enabled the de novo reconstruction of some microbial genomes from environmental communities, microdiversity confounds current genome reconstruction techniques. To overcome this issue, we performed short-read metagenomic sequencing on parallel consortia, defined as consortia cultivated under the same conditions from the same natural community with overlapping species composition. The differences in species abundance between the two consortia allowed reconstruction of near-complete (at an estimated >85% of gene complement) genome sequences for 17 of the 20 detected member species. Two Halomonas spp. indistinguishable by amplicon analysis were found to be present within the community. In addition, comparison of metagenomic reads against the consensus scaffolds revealed within-species variation for one of the Halomonas populations, one of the Rhodobacteraceae populations, and the Rhizobiales population. Genomic comparison of these representative instances of inter- and intraspecies microdiversity suggests differences in functional potential that may result in the expression of distinct roles in the community. In addition, isolation and complete genome sequence determination of six member species allowed an investigation into the sensitivity and specificity of genome reconstruction processes, demonstrating robustness across a wide range of sequence coverage (9× to 2,700×) within the metagenomic data set. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019  |  

Genome analysis of three Pneumocystis species reveals adaptation mechanisms to life exclusively in mammalian hosts.

Pneumocystis jirovecii is a major cause of life-threatening pneumonia in immunosuppressed patients including transplant recipients and those with HIV/AIDS, yet surprisingly little is known about the biology of this fungal pathogen. Here we report near complete genome assemblies for three Pneumocystis species that infect humans, rats and mice. Pneumocystis genomes are highly compact relative to other fungi, with substantial reductions of ribosomal RNA genes, transporters, transcription factors and many metabolic pathways, but contain expansions of surface proteins, especially a unique and complex surface glycoprotein superfamily, as well as proteases and RNA processing proteins. Unexpectedly, the key fungal cell wall components chitin and outer chain N-mannans are absent, based on genome content and experimental validation. Our findings suggest that Pneumocystis has developed unique mechanisms of adaptation to life exclusively in mammalian hosts, including dependence on the lungs for gas and nutrients and highly efficient strategies to escape both host innate and acquired immune defenses.


July 7, 2019  |  

Genome sequence of a virulent Pseudomonas aeruginosa strain, 12-4-4(59), isolated from the blood culture of a burn patient.

Pseudomonas aeruginosa is an opportunistic pathogen that frequently infects wounds, significantly impairs wound healing, and causes morbidity and mortality in burn patients. Here, we report the genome sequence of a virulent strain of P. aeruginosa, 12-4-4(59), isolated from the blood culture of a burn patient. Copyright © 2016 Karna et al.


July 7, 2019  |  

Genome sequence and analysis of a stress-tolerant, wild-derived strain of Saccharomyces cerevisiae used in biofuels research

The genome sequences of more than 100 strains of the yeast Saccharomyces cerevisiae have been published. Unfortunately, most of these genome assemblies contain dozens to hundreds of gaps at repetitive sequences, including transposable elements, tRNAs, and subtelomeric regions, which is where novel genes generally reside. Relatively few strains have been chosen for genome sequencing based on their biofuel production potential, leaving an additional knowledge gap. Here, we describe the nearly complete genome sequence of GLBRCY22-3 (Y22-3), a strain of S. cerevisiae derived from the stress-tolerant wild strain NRRL YB-210 and subsequently engineered for xylose metabolism. After benchmarking several genome assembly approaches, we developed a pipeline to integrate Pacific Biosciences (PacBio) and Illumina sequencing data and achieved one of the highest quality genome assemblies for any S. cerevisiae strain. Specifically, the contig N50 is 693 kbp, and the sequences of most chromosomes, the mitochondrial genome, and the 2-micron plasmid are complete. Our annotation predicts 92 genes that are not present in the reference genome of the laboratory strain S288c, over 70% of which were expressed. We predicted functions for 43 of these genes, 28 of which were previously uncharacterized and unnamed. Remarkably, many of these genes are predicted to be involved in stress tolerance and carbon metabolism and are shared with a Brazilian bioethanol production strain, even though the strains differ dramatically at most genetic loci. The Y22-3 genome sequence provides an exceptionally high-quality resource for basic and applied research in bioenergy and genetics. Copyright © 2016 McIlwain et al.


July 7, 2019  |  

Dynamics of mutations during development of resistance by Pseudomonas aeruginosa against five antibiotics.

Pseudomonas aeruginosa is an opportunistic pathogen that causes considerable morbidity and mortality, specifically in the intensive care. Antibiotic resistant variants of this organism are more difficult to treat and cause substantial extra costs compared to susceptible strains. In the laboratory, P. aeruginosa rapidly developed resistance against five medically relevant antibiotics upon exposure to step-wise increasing concentrations. At several time points during the acquisition of resistance samples were taken for whole genome sequencing. The increase of MIC for ciprofloxacin was linked to specific mutations in gyrA, parC and gyrB, appearing sequentially. In the case of tobramycin, mutations were induced in fusA, HP02880, rplB and capD The MIC for the beta-lactam compounds meropenem, ceftazidime and the combination piperacillin/tazobactam correlated linearly with the beta-lactamase activity, but not always with individual mutations. The genes that were mutated during development of beta-lactam resistance differed for each antibiotic. A quantitative relationship between the frequency of mutations and the increase in resistance could not be established for any of the antibiotics. When the adapted strains are grown in the absence of the antibiotic, some mutations remained and others were reverted, but this reversal did not necessarily lower the MIC. The increased MIC came at the cost of moderately reduced cellular functions, or somewhat lower growth rate. In all cases except ciprofloxacin, the increase of resistance seems to be the result of a complex interaction between several cellular systems, rather than individual mutations. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019  |  

Near-Complete Genome Sequence of Clostridium paradoxum Strain JW-YL-7.

Clostridium paradoxum strain JW-YL-7 is a moderately thermophilic anaerobic alkaliphile isolated from the municipal sewage treatment plant in Athens, GA. We report the near-complete genome sequence of C. paradoxum strain JW-YL-7 obtained by using PacBio DNA sequencing and Pilon for sequence assembly refinement with Illumina data. Copyright © 2016 Lancaster et al.


July 7, 2019  |  

Direct repeat-mediated DNA deletion of the mating type MAT1-2 genes results in unidirectional mating type switching in Sclerotinia trifoliorum.

The necrotrophic fungal pathogen Sclerotinia trifoliorum exhibits ascospore dimorphism and unidirectional mating type switching – self-fertile strains derived from large ascospores produce both self-fertile (large-spores) and self-sterile (small-spores) offsprings in a 4:4 ratio. The present study, comparing DNA sequences at MAT locus of both self-fertile and self-sterile strains, found four mating type genes (MAT1-1-1, MAT1-1-5, MAT1-2-1 and MAT1-2-4) in the self-fertile strain. However, a 2891-bp region including the entire MAT1-2-1 and MAT1-2-4 genes had been completely deleted from the MAT locus in the self-sterile strain. Meanwhile, two copies of a 146-bp direct repeat motif flanking the deleted region were found in the self-fertile strain, but only one copy of this 146-bp motif (a part of the MAT1-1-1 gene) was present in the self-sterile strain. The two direct repeats were believed to be responsible for the deletion through homologous intra-molecular recombination in meiosis. Tetrad analyses showed that all small ascospore-derived strains lacked the missing DNA between the two direct repeats that was found in all large ascospore-derived strains. In addition, heterokaryons at the MAT locus were observed in field isolates as well as in laboratory derived isolates.


July 7, 2019  |  

Genomics-informed isolation and characterization of a symbiotic Nanoarchaeota system from a terrestrial geothermal environment.

Biological features can be inferred, based on genomic data, for many microbial lineages that remain uncultured. However, cultivation is important for characterizing an organism’s physiology and testing its genome-encoded potential. Here we use single-cell genomics to infer cultivation conditions for the isolation of an ectosymbiotic Nanoarchaeota (‘Nanopusillus acidilobi’) and its host (Acidilobus, a crenarchaeote) from a terrestrial geothermal environment. The cells of ‘Nanopusillus’ are among the smallest known cellular organisms (100-300?nm). They appear to have a complete genetic information processing machinery, but lack almost all primary biosynthetic functions as well as respiration and ATP synthesis. Genomic and proteomic comparison with its distant relative, the marine Nanoarchaeum equitans illustrate an ancient, common evolutionary history of adaptation of the Nanoarchaeota to ectosymbiosis, so far unique among the Archaea.


July 7, 2019  |  

Draft genome sequence of an inbred line of Chenopodium quinoa, an allotetraploid crop with great environmental adaptability and outstanding nutritional properties.

Chenopodium quinoa Willd. (quinoa) originated from the Andean region of South America, and is a pseudocereal crop of the Amaranthaceae family. Quinoa is emerging as an important crop with the potential to contribute to food security worldwide and is considered to be an optimal food source for astronauts, due to its outstanding nutritional profile and ability to tolerate stressful environments. Furthermore, plant pathologists use quinoa as a representative diagnostic host to identify virus species. However, molecular analysis of quinoa is limited by its genetic heterogeneity due to outcrossing and its genome complexity derived from allotetraploidy. To overcome these obstacles, we established the inbred and standard quinoa accession Kd that enables rigorous molecular analysis, and presented the draft genome sequence of Kd, using an optimized combination of high-throughput next generation sequencing on the Illumina Hiseq 2500 and PacBio RS II sequencers. The de novo genome assembly contained 25 k scaffolds consisting of 1 Gbp with N50 length of 86 kbp. Based on these data, we constructed the free-access Quinoa Genome DataBase (QGDB). Thus, these findings provide insights into the mechanisms underlying agronomically important traits of quinoa and the effect of allotetraploidy on genome evolution. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.