September 22, 2019

Long-read sequencing data analysis for yeasts.

Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. Applied to an S. cerevisiae strain, LRSDAY takes ~41 h to generate a complete and well-annotated genome from ~100× Pacific Biosciences (PacBio) reads when running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.


September 22, 2019

The genome of Rhizophagus clarus HR1 reveals a common genetic basis for auxotrophy among arbuscular mycorrhizal fungi.

Mycorrhizal symbiosis is one of the most fundamental types of mutualistic plant-microbe interaction. Among the many classes of mycorrhizae, the arbuscular mycorrhizae have the most general symbiotic style and the longest history. However, the genomes of arbuscular mycorrhizal (AM) fungi are not well characterized due to difficulties in cultivation and genetic analysis. In this study, we sequenced the genome of the AM fungus Rhizophagus clarus HR1, compared the sequence with the genome sequence of the model species R. irregularis, and checked for missing genes that encode enzymes in metabolic pathways related to their obligate biotrophy. In the genome of R. clarus, we confirmed the absence of cytosolic fatty acid synthase (FAS), whereas all mitochondrial FAS components were present. A KEGG pathway map identified the absence of genes encoding enzymes for several other metabolic pathways in the two AM fungi, including thiamine biosynthesis and the conversion of vitamin B6 derivatives. We also found that a large proportion of the genes encoding glucose-producing polysaccharide hydrolases, which are present even in ectomycorrhizal fungi, appear to be absent in AM fungi. In this study, we found several new genes that are absent from the genomes of AM fungi in addition to the genes previously identified as missing. Missing genes for enzymes in primary metabolic pathways imply that AM fungi may have a higher dependency on host plants than other biotrophic fungi. These missing metabolic pathways provide a genetic basis to explore the physiological characteristics and auxotrophy of AM fungi.


September 22, 2019

Genome of an allotetraploid wild peanut Arachis monticola: a de novo assembly.

Arachis monticola (2n = 4x = 40) is the only allotetraploid wild peanut within the Arachis genus and section, with an AABB-type genome of ~2.7 Gb in size. The AA-type subgenome is derived from diploid wild peanut Arachis duranensis, and the BB-type subgenome is derived from diploid wild peanut Arachis ipaensis. A. monticola is regarded either as the direct progenitor of the cultivated peanut or as an introgressive derivative between the cultivated peanut and wild species. The large polyploid genome structure and the enormous nearly identical regions of the genome make the assembly of chromosomal pseudomolecules very challenging. Here we report the first reference-quality assembly of the A. monticola genome, using a series of advanced technologies. The final whole genome of A. monticola is ~2.62 Gb and has a contig N50 and scaffold N50 of 106.66 Kb and 124.92 Mb, respectively. The vast majority (91.83%) of the assembled sequence was anchored onto the 20 pseudo-chromosomes, and 96.07% of assemblies were accurately separated into AA- and BB-subgenomes. We demonstrated the efficiency of the current strategy for de novo assembly of the highly complex allotetraploid species, wild peanut (A. monticola), based on whole-genome shotgun sequencing, single molecule real-time sequencing, high-throughput chromosome conformation capture technology, and BioNano optical genome maps. These combined technologies produced a reference-quality genome of the allotetraploid wild peanut, which is valuable for understanding peanut domestication and evolution within the Arachis genus and among legume crops.


September 22, 2019

Draft genome sequence of Annulohypoxylon stygium, Aspergillus mulundensis, Berkeleyomyces basicola (syn. Thielaviopsis basicola), Ceratocystis smalleyi, two Cercospora beticola strains, Coleophoma cylindrospora, Fusarium fracticaudum, Phialophora cf. hyalina, and Morchella septimelata.

Draft genomes of the species Annulohypoxylon stygium, Aspergillus mulundensis, Berkeleyomyces basicola (syn. Thielaviopsis basicola), Ceratocystis smalleyi, two Cercospora beticola strains, Coleophoma cylindrospora, Fusarium fracticaudum, Phialophora cf. hyalina and Morchella septimelata are presented. Both mating types (MAT1-1 and MAT1-2) of Cercospora beticola are included. Two strains of Coleophoma cylindrospora that produce sulfated homotyrosine echinocandin variants, FR209602, FR220897 and FR220899, are presented. The sequencing of Aspergillus mulundensis, Coleophoma cylindrospora and Phialophora cf. hyalina has enabled mapping of the gene clusters encoding the chemical diversity from the echinocandin pathways, providing data that reveal the complexity of secondary metabolism in these different species. Overall, these genomes provide a valuable resource for understanding the molecular processes underlying pathogenicity (in some cases), biology and toxin production of these economically important fungi.


September 22, 2019

Recurrent loss, horizontal transfer, and the obscure origins of mitochondrial introns in diatoms (Bacillariophyta).

We sequenced mitochondrial genomes from five diverse diatoms (Toxarium undulatum, Psammoneis japonica, Eunotia naegelii, Cylindrotheca closterium, and Nitzschia sp.), chosen to fill important phylogenetic gaps and help us characterize broadscale patterns of mitochondrial genome evolution in diatoms. Although gene content was strongly conserved, intron content varied widely across species. The vast majority of introns were of group II type and were located in the cox1 or rnl genes. Although recurrent intron loss appears to be the principal underlying cause of the sporadic distributions of mitochondrial introns across diatoms, phylogenetic analyses showed that intron distributions superficially consistent with a recurrent-loss model were sometimes more complicated, implicating horizontal transfer as a likely mechanism of intron acquisition as well. It was not clear, however, whether diatoms were the donors or recipients of horizontally transferred introns, highlighting a general challenge in resolving the evolutionary histories of many diatom mitochondrial introns. Although some of these histories may become clearer as more genomes are sampled, high rates of intron loss suggest that the origins of many diatom mitochondrial introns are likely to remain unclear.


September 22, 2019

High-Resolution Full-Length HLA Typing Method Using Third Generation (Pac-Bio SMRT) Sequencing Technology.

The human HLA genes are among the most polymorphic genes in the human genome. Therefore, it is very difficult to find two unrelated individuals with identical HLA molecules. As a result, HLA Class I and Class II genes are routinely sequenced or serotyped for organ transplantation, autoimmune disease-association studies, drug hypersensitivity research, and other applications. However, these methods have yielded only two- or four-digit data, which is not sufficient to capture the complete haplotypes of HLA genes. To overcome these limitations, we describe here an end-to-end workflow for sequencing HLA class I and class II genes using third-generation sequencing (SMRT technology). This method produces fully phased, unambiguous, allele-level information on the PacBio System.


September 22, 2019

Long-read whole genome sequencing and comparative analysis of six strains of the human pathogen Orientia tsutsugamushi.

Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species. We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia. Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.


September 22, 2019

Improved de novo genome assembly and analysis of the Chinese cucurbit Siraitia grosvenorii, also known as monk fruit or luo-han-guo.

Luo-han-guo (Siraitia grosvenorii), also called monk fruit, is a member of the Cucurbitaceae family. Monk fruit has become an important area for research because of the pharmacological and economic potential of its noncaloric, extremely sweet components (mogrosides). It is also commonly used in traditional Chinese medicine for the treatment of lung congestion, sore throat, and constipation. Recently, a single reference genome became available for monk fruit, assembled from 36.9x genome coverage reads via Illumina sequencing platforms. This genome assembly has a relatively short (34.2 kb) contig N50 length and lacks integrated annotations. These drawbacks make it difficult to use as a reference in assembling transcriptomes and discovering novel functional genes. Here, we offer a new high-quality draft of the S. grosvenorii genome assembled using 31 Gb (~73.8x) long single molecule real time sequencing reads and polished with ~50 Gb Illumina paired-end reads. The final genome assembly is approximately 469.5 Mb, with a contig N50 length of 432,384 bp, representing a 12.6-fold improvement. We further annotated 237.3 Mb of repetitive sequence and 30,565 consensus protein coding genes with combined evidence. Phylogenetic analysis showed that S. grosvenorii diverged from members of the Cucurbitaceae family approximately 40.9 million years ago. With comprehensive transcriptomic analysis and differential expression testing, we identified 4,606 up-regulated genes in the early fruit compared to the leaf, a number of which were linked to metabolic pathways regulating fruit development and ripening. The availability of this new monk fruit genome assembly, as well as the annotations, will facilitate the discovery of new functional genes and the genetic improvement of monk fruit.
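
Several abstracts on this page report a contig N50, and the monk fruit abstract quotes a 12.6-fold improvement in it. As a rough illustration only, the sketch below computes N50 under its usual definition (the length of the contig at which the running total of lengths, sorted longest first, reaches half the assembly size) and reproduces the fold change from the two N50 values quoted above. The function name `n50` and the toy contig lengths are hypothetical and are not taken from any of the papers.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length of the contig at which the cumulative sum of
    lengths, taken from longest to shortest, first reaches at least
    half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Toy contig lengths, invented for illustration (not real data).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)

# Fold improvement using only the two contig N50 values quoted in the
# abstract: 34.2 kb (earlier assembly) and 432,384 bp (new assembly).
previous_n50_bp = 34_200
new_n50_bp = 432_384
fold_improvement = new_n50_bp / previous_n50_bp

print(toy_n50, round(fold_improvement, 1))
```

Because N50 depends only on the distribution of contig lengths, it says nothing about base-level accuracy, which is why the abstracts report it alongside other measures such as anchoring rates and annotation counts.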


September 22, 2019

De novo genome assembly of Oryza granulata reveals rapid genome expansion and adaptive evolution

The wild relatives of rice have adapted to different ecological environments and constitute a useful reservoir of agronomic traits for genetic improvement. Here we present the ~777 Mb de novo assembled genome sequence of Oryza granulata. Recent bursts of long-terminal repeat retrotransposons, especially RIRE2, led to a rapid twofold increase in genome size after O. granulata speciation. Universal centromeric tandem repeats are absent within its centromeres, while gypsy-type LTRs constitute the main centromere-specific repetitive elements. A total of 40,116 protein-coding genes were predicted in O. granulata, which is close to that of Oryza sativa. Both the copy number and function of genes involved in photosynthesis and energy production have undergone positive selection during the evolution of O. granulata, which might have facilitated its adaptation to the low-light habitats. Together, our findings reveal the rapid genome expansion, distinctive centromere organization, and adaptive evolution of O. granulata.


September 22, 2019

Comparative genomics of Pseudomonas syringae reveals convergent gene gain and loss associated with specialization onto cherry (Prunus avium).

Genome-wide analyses of the effector- and toxin-encoding genes were used to examine the phylogenetics and evolution of pathogenicity amongst diverse strains of Pseudomonas syringae causing bacterial canker of cherry (Prunus avium), including pathovars P. syringae pv morsprunorum (Psm) races 1 and 2, P. syringae pv syringae (Pss) and P. syringae pv avii. Phylogenetic analyses revealed Psm races and P. syringae pv avii clades were distinct and were each monophyletic, whereas cherry-pathogenic strains of Pss were interspersed amongst strains from other host species. A maximum likelihood approach was used to predict effectors associated with pathogenicity on cherry. Pss possesses a smaller repertoire of type III effectors but has more toxin biosynthesis clusters than Psm and P. syringae pv avii. Evolution of cherry pathogenicity was correlated with gain of genes such as hopAR1 and hopBB1 through putative phage transfer and horizontal transfer respectively. By contrast, loss of the avrPto/hopAB redundant effector group was observed in cherry-pathogenic clades. Ectopic expression of hopAB and hopC1 triggered the hypersensitive reaction in cherry leaves, confirming computational predictions. Cherry canker provides a fascinating example of convergent evolution of pathogenicity that is explained by the mix of effector and toxin repertoires acting on a common host. © 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.


September 22, 2019

Comparative genomics of Spiraeoideae-infecting Erwinia amylovora strains provides novel insight to genetic diversity and identifies the genetic basis of a low-virulence strain.

Erwinia amylovora is the causal agent of fire blight, one of the most devastating diseases of apple and pear. Erwinia amylovora is thought to have originated in North America and has now spread to at least 50 countries worldwide. An understanding of the diversity of the pathogen population and the transmission to different geographical regions is important for the future mitigation of this disease. In this research, we performed an expanded comparative genomic study of the Spiraeoideae-infecting (SI) E. amylovora population in North America and Europe. We discovered that, although still highly homogeneous, the genetic diversity of 30 E. amylovora genomes examined was about 30 times higher than previously determined. These isolates belong to four distinct clades, three of which display geographical clustering and one of which contains strains from various geographical locations (‘Widely Prevalent’ clade). Furthermore, we revealed that strains from the Widely Prevalent clade displayed a higher level of recombination with strains from a clade strictly from the eastern USA, which suggests that the Widely Prevalent clade probably originated from the eastern USA before it spread to other locations. Finally, we detected variations in virulence in the SI E. amylovora strains on immature pear, and identified the genetic basis of one of the low-virulence strains as being caused by a single nucleotide polymorphism in hfq, a gene encoding an important virulence regulator. Our results provide insights into the population structure, distribution and evolution of SI E. amylovora in North America and Europe. © 2017 BSPP AND JOHN WILEY & SONS LTD.


September 22, 2019

RAD sequencing and a hybrid Antarctic fur seal genome assembly reveal rapidly decaying linkage disequilibrium, global population structure and evidence for inbreeding.

Recent advances in high throughput sequencing have transformed the study of wild organisms by facilitating the generation of high quality genome assemblies and dense genetic marker datasets. These resources have the potential to significantly advance our understanding of diverse phenomena at the level of species, populations and individuals, ranging from patterns of synteny through rates of linkage disequilibrium (LD) decay and population structure to individual inbreeding. Consequently, we used PacBio sequencing to refine an existing Antarctic fur seal (Arctocephalus gazella) genome assembly and genotyped 83 individuals from six populations using restriction site associated DNA (RAD) sequencing. The resulting hybrid genome comprised 6,169 scaffolds with an N50 of 6.21 Mb and provided clear evidence for the conservation of large chromosomal segments between the fur seal and dog (Canis lupus familiaris). Focusing on the most extensively sampled population of South Georgia, we found that LD decayed rapidly, reaching the background level by around 400 kb, consistent with other vertebrates but at odds with the notion that fur seals experienced a strong historical bottleneck. We also found evidence for population structuring, with four main Antarctic island groups being resolved. Finally, appreciable variance in individual inbreeding could be detected, reflecting the strong polygyny and site fidelity of the species. Overall, our study contributes important resources for future genomic studies of fur seals and other pinnipeds while also providing a clear example of how high throughput sequencing can generate diverse biological insights at multiple levels of organization. Copyright © 2018 Humble et al.


September 22, 2019

Whole-genome comparison of high and low virulent Staphylococcus aureus isolates inducing implant-associated bone infections.

Staphylococcus aureus can cause a wide range of infections, from simple soft skin infections to severe endocarditis, bacteremia, osteomyelitis and implant-associated bone infections (IABI). The focus of the present investigation was to study virulence properties of S. aureus isolates from acute and chronic IABI by means of their in vivo lethality, in vitro osteoblast invasion and biofilm formation, followed by whole-genome comparison between high and low virulent strains. Application of the insect infection model Galleria mellonella revealed high, intermediate and low virulence phenotypes of these clinical isolates, which showed good correlation with osteoblast invasion and biofilm formation assays. Comparative genomics of selected high (EDCC 5458) and low (EDCC 5464) virulent strains enabled the identification of molecular factors responsible for the development of acute and chronic IABI. Accordingly, the low virulent strain EDCC 5464 harbored point mutations resulting in frameshift mutations in the agrC (histidine kinase in the agr system), graS (histidine kinase in graSR, a two-component system) and efeB (peroxidase in the efeOBU operon, an iron acquisition system) genes. Additionally, we found a mobile element (present in 11 copies in EDCC 5464) inserted at the end of the β-hemolysin (hlb) and sarU genes, which are involved in pathogenesis and in the regulation of virulence gene expression in coordination with the quorum sensing system. All these results are consistent with the low virulence behavior of EDCC 5464. From the previous literature, it is well known that agr-defective S. aureus clinical strains are isolated from chronic infections. Similarly, the low virulent EDCC 5464 was isolated from a chronic implant-associated bone infection, whereas EDCC 5458 was obtained from an acute implant-associated bone infection.
Laboratory-based in vitro and in vivo results and insights from comparative genomic analysis could be correlated with the clinical conclusion of IABIs and allow evidence-based treatment strategies, based on the pathogenesis of the strain, to cure life-devastating implant-associated infections. Copyright © 2018 Elsevier GmbH. All rights reserved.


September 22, 2019

Genome biology of a novel lineage of planctomycetes widespread in anoxic aquatic environments.

Anaerobic strains affiliated with a novel order-level lineage of the Phycisphaerae class were retrieved from the suboxic zone of a hypersaline cyanobacterial mat and anoxic sediments of solar salterns. Genome sequences of five isolates were obtained and compared with metagenome-assembled genomes representing related uncultured bacteria from various anoxic aquatic environments. Gene content surveys suggest a strictly fermentative saccharolytic metabolism for members of this lineage, which could be confirmed by the phenotypic characterization of isolates. Genetic analyses indicate that the retrieved isolates do not have a canonical origin of DNA replication, but initiate chromosome replication at alternative sites possibly leading to an accelerated evolution. Further potential factors driving evolution and speciation within this clade include genome reduction by metabolic specialization and rearrangements of the genome by mobile genetic elements, which have a high prevalence in strains from hypersaline sediments and mats. Based on genetic and phenotypic data a distinct group of strictly anaerobic heterotrophic planctomycetes within the Phycisphaerae class could be assigned to a novel order that is represented by the proposed genus Sedimentisphaera gen. nov. comprising two novel species, S. salicampi gen. nov., sp. nov. and S. cyanobacteriorum gen. nov., sp. nov. © 2018 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.

