Menu
July 7, 2019

Comparative and population genomic landscape of Phellinus noxius: A hypervariable fungus causing root rot in trees.

The order Hymenochaetales of white rot fungi contain some of the most aggressive wood decayers causing tree deaths around the world. Despite their ecological importance and the impact of diseases they cause, little is known about the evolution and transmission patterns of these pathogens. Here, we sequenced and undertook comparative genomic analyses of Hymenochaetales genomes using brown root rot fungus Phellinus noxius, wood-decomposing fungus Phellinus lamaensis, laminated root rot fungus Phellinus sulphurascens and trunk pathogen Porodaedalea pini. Many gene families of lignin-degrading enzymes were identified from these fungi, reflecting their ability as white rot fungi. Comparing against distant fungi highlighted the expansion of 1,3-beta-glucan synthases in P. noxius, which may account for its fast-growing attribute. We identified 13 linkage groups conserved within Agaricomycetes, suggesting the evolution of stable karyotypes. We determined that P. noxius has a bipolar heterothallic mating system, with unusual highly expanded ~60 kb A locus as a result of accumulating gene transposition. We investigated the population genomics of 60 P. noxius isolates across multiple islands of the Asia Pacific region. Whole-genome sequencing showed this multinucleate species contains abundant poly-allelic single nucleotide polymorphisms with atypical allele frequencies. Different patterns of intra-isolate polymorphism reflect mono-/heterokaryotic states which are both prevalent in nature. We have shown two genetically separated lineages with one spanning across many islands despite the geographical barriers. Both populations possess extraordinary genetic diversity and show contrasting evolutionary scenarios. These results provide a framework to further investigate the genetic basis underlying the fitness and virulence of white rot fungi.© 2017 John Wiley & Sons Ltd.


July 7, 2019

Meeting report on experimental approaches to evolution and ecology using yeast and other model systems.

The fourth EMBO-sponsored conference on Experimental Approaches to Evolution and Ecology Using Yeast and Other Model Systems (https://www.embl.de/training/events/2016/EAE16-01/), was held at the EMBL in Heidelberg, Germany, October 19-23, 2016. The conference was organized by Judith Berman (Tel Aviv University), Maitreya Dunham (University of Washington), Jun-Yi Leu (Academia Sinica), and Lars Steinmetz (EMBL Heidelberg and Stanford University). The meeting attracted ~120 researchers from 28 countries and covered a wide range of topics in the fields of genetics, evolutionary biology, and ecology with a unifying focus on yeast as a model system. Attendees enjoyed the Keith Haring inspired yeast florescence microscopy artwork (Figure 1), a unique feature of the meeting since its inception, and the one-minute flash talks that catalyzed discussions at two vibrant poster sessions. The meeting coincided with the 20th anniversary of the publication describing the sequence of the first eukaryotic genome, Saccharomyces cerevisiae (Goffeau et al. 1996). Many of the conference talks focused on important questions about what is contained in the genome, how genomes evolve, and the architecture and behavior of communities of phenotypically and genotypically diverse microorganisms. Here, we summarize highlights of the research talks around these themes. Nearly all presentations focused on novel findings, and we refer the reader to relevant manuscripts that have subsequently been published. Copyright © 2017, G3: Genes, Genomes, Genetics.


July 7, 2019

Genomic analysis of Bacillus licheniformis CBA7126 isolated from a human fecal sample.

Bacillus licheniformis is a Gram-positive, endospore-forming, saprophytic organism that occurs in plant and soil (Veith et al., 2004). A taxonomical approach shows that it is closely related to Bacillus subtilis (Lapidus et al., 2002; Xu and Côte, 2003; Rey et al., 2004). Generally, most bacilli are predominantly aerobic; however, B. licheniformis is a facultative anaerobe compared to other bacilli in ecological niches (Alexander, 1977). The commercial utility of the extracellular products of B. licheniformis makes this microorganism an economically interesting species (Kovács et al., 2009). For example, B. licheniformis is used industrially for manufacturing biochemicals, enzymes, antibiotics, and aminopeptidase. Several proteases such as a-amylase, penicillinase, pentosanase, cycloglucosyltransferase, ß-mannanase, and certain pectinolytic enzymes are synthesized industrially using B. licheniformis (Rodríguez-Absi and Prescott, 1978; Rey et al., 2004). The proteases are used in the detergent industry and the amylases are utilized for starch hydrolysis, desizing of textiles, and sizing of paper (Erickson, 1976). In addition, certain strains are utilized to produce peptide antibiotics, specialty chemicals, and poly-?-glutamic acid (Nierman and Maglott, 1989; Rey et al., 2004).


July 7, 2019

Repetitive sequences in malaria parasite proteins.

Five species of parasite cause malaria in humans with the most severe disease caused by Plasmodium falciparum. Many of the proteins encoded in the P. falciparum genome are unusually enriched in repetitive low-complexity sequences containing a limited repertoire of amino acids. These repetitive sequences expand and contract dynamically and are among the most rapidly changing sequences in the genome. The simplest repetitive sequences consist of single amino acid repeats such as poly-asparagine tracts that are found in approximately 25% of P. falciparum proteins. More complex repeats of two or more amino acids are also common in diverse parasite protein families. There is no universal explanation for the occurrence of repetitive sequences and it is possible that many confer no function to the encoded protein and no selective advantage or disadvantage to the parasite. However, there are increasing numbers of examples where repetitive sequences are important for parasite protein function. We discuss the diverse roles of low-complexity repetitive sequences throughout the parasite life cycle, from mediating protein-protein interactions to enabling the parasite to evade the host immune system.© FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Isolation and complete genome sequence of Halorientalis hydrocarbonoclasticus sp. nov., a hydrocarbon-degrading haloarchaeon.

Bioremediation in hypersaline environments is particularly challenging since the microbes that tolerate such harsh environments and degrade pollutants are quite scarce. Haloarchaea, however, due to their inherent ability to grow at high salt concentrations, hold great promise for remediating the contaminated hypersaline sites. This study aimed to isolate and characterize novel haloarchaeal strains with potentials in hydrocarbon degradation. A haloarchaeal strain IM1011 was isolated from Changlu Tanggu saltern near Da Gang Oilfield in Tianjin (China) by enrichment culture in hypersaline medium containing hexadecane. It could degrade 57 ± 5.2% hexadecane (5 g/L) in the presence of 3.6 M NaCl at 37 °C within 24 days. To get further insights into the mechanisms of petroleum hydrocarbon degradation in haloarchaea, complete genome (3,778,989 bp) of IM1011 was sequenced. Phylogenetic analysis of 16S rRNA gene, RNA polymerase beta-subunit (rpoB’) gene and of the complete genome suggested IM1011 to be a new species in Halorientalis genus, and the name Halorientalis hydrocarbonoclasticus sp. nov., is proposed. Notably, with insights from the IM1011 genome sequence, the involvement of diverse alkane hydroxylase enzymes and an intact ß-oxidation pathway in hexadecane biodegradation was predicted. This is the first hexadecane-degrading strain from Halorientalis genus, of which the genome sequence information would be helpful for further dissecting the hydrocarbon degradation by haloarchaea and for their application in bioremediation of oil-polluted hypersaline environments.


July 7, 2019

Insights into land plant evolution garnered from the Marchantia polymorpha genome.

The evolution of land flora transformed the terrestrial environment. Land plants evolved from an ancestral charophycean alga from which they inherited developmental, biochemical, and cell biological attributes. Additional biochemical and physiological adaptations to land, and a life cycle with an alternation between multicellular haploid and diploid generations that facilitated efficient dispersal of desiccation tolerant spores, evolved in the ancestral land plant. We analyzed the genome of the liverwort Marchantia polymorpha, a member of a basal land plant lineage. Relative to charophycean algae, land plant genomes are characterized by genes encoding novel biochemical pathways, new phytohormone signaling pathways (notably auxin), expanded repertoires of signaling pathways, and increased diversity in some transcription factor families. Compared with other sequenced land plants, M. polymorpha exhibits low genetic redundancy in most regulatory pathways, with this portion of its genome resembling that predicted for the ancestral land plant. PAPERCLIP. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.


July 7, 2019

Contributions of Zea mays subspecies mexicana haplotypes to modern maize.

Maize was domesticated from lowland teosinte (Zea mays ssp. parviglumis), but the contribution of highland teosinte (Zea mays ssp. mexicana, hereafter mexicana) to modern maize is not clear. Here, two genomes for Mo17 (a modern maize inbred) and mexicana are assembled using a meta-assembly strategy after sequencing of 10 lines derived from a maize-teosinte cross. Comparative analyses reveal a high level of diversity between Mo17, B73, and mexicana, including three Mb-size structural rearrangements. The maize spontaneous mutation rate is estimated to be 2.17?×?10-8 ~3.87?×?10-8 per site per generation with a nonrandom distribution across the genome. A higher deleterious mutation rate is observed in the pericentromeric regions, and might be caused by differences in recombination frequency. Over 10% of the maize genome shows evidence of introgression from the mexicana genome, suggesting that mexicana contributed to maize adaptation and improvement. Our data offer a rich resource for constructing the pan-genome of Zea mays and genetic improvement of modern maize varieties.


July 7, 2019

The genome sequence of Bipolaris cookei reveals mechanisms of pathogenesis underlying target leaf spot of sorghum.

Bipolaris cookei (=Bipolaris sorghicola) causes target leaf spot, one of the most prevalent foliar diseases of sorghum. Little is known about the molecular basis of pathogenesis in B. cookei, in large part due to a paucity of resources for molecular genetics, such as a reference genome. Here, a draft genome sequence of B. cookei was obtained and analyzed. A hybrid assembly strategy utilizing Illumina and Pacific Biosciences sequencing technologies produced a draft nuclear genome of 36.1?Mb, organized into 321 scaffolds with L50 of 31 and N50 of 378?kb, from which 11,189 genes were predicted. Additionally, a finished mitochondrial genome sequence of 135,790?bp was obtained, which contained 75 predicted genes. Comparative genomics revealed that B. cookei possessed substantially fewer carbohydrate-active enzymes and secreted proteins than closely related Bipolaris species. Novel genes involved in secondary metabolism, including genes implicated in ophiobolin biosynthesis, were identified. Among 37 B. cookei genes induced during sorghum infection, one encodes a putative effector with a limited taxonomic distribution among plant pathogenic fungi. The draft genome sequence of B. cookei provided novel insights into target leaf spot of sorghum and is an important resource for future investigation.


July 7, 2019

Chromosome level assembly and secondary metabolite potential of the parasitic fungus Cordyceps militaris.

Cordyceps militaris is an insect pathogenic fungus that is prized for its use in traditional medicine. This and other entomopathogenic fungi are understudied sources for the discovery of new bioactive molecules. In this study, PacBio SMRT long read sequencing technology was used to sequence the genome of C. militaris with a focus on the genetic potential for secondary metabolite production in the genome assembly of this fungus.This is first chromosome level assembly of a species in the Cordyceps genera. In this seven chromosome assembly of 33.6 Mba there were 9371 genes identified. Cordyceps militaris was determined to have the MAT 1-1-1 and MAT 1-1-2 mating type genes. Secondary metabolite analysis revealed the potential for at least 36 distinct metabolites from a variety of classes. Three of these gene clusters had homology with clusters producing desmethylbassianin, equisetin and emericellamide that had been studied in other fungi.Our assembly and analysis has revealed that C. militaris has a wealth of gene clusters for secondary metabolite production distributed among seven chromosomes. The identification of these gene clusters will facilitate the future study and identification of the secondary metabolites produced by this entomopathogenic fungus.


July 7, 2019

Single molecule sequencing-guided scaffolding and correction of draft assemblies.

Although single molecule sequencing is still improving, the lengths of the generated sequences are inevitably an advantage in genome assembly. Prior work that utilizes long reads to conduct genome assembly has mostly focused on correcting sequencing errors and improving contiguity of de novo assemblies.We propose a disassembling-reassembling approach for both correcting structural errors in the draft assembly and scaffolding a target assembly based on error-corrected single molecule sequences. To achieve this goal, we formulate a maximum alternating path cover problem. We prove that this problem is NP-hard, and solve it by a 2-approximation algorithm.Our experimental results show that our approach can improve the structural correctness of target assemblies in the cost of some contiguity, even with smaller amounts of long reads. In addition, our reassembling process can also serve as a competitive scaffolder relative to well-established assembly benchmarks.


July 7, 2019

The complete genome sequence of Ensifer meliloti strain CCMM B554 (FSM-MA), a highly effective nitrogen-fixing microsymbiont of Medicago truncatula Gaertn.

Strain CCMM B554, also known as FSM-MA, is a soil dwelling and nodule forming, nitrogen-fixing bacterium isolated from the nodules of the legume Medicago arborea L. in the Maamora Forest, Morocco. The strain forms effective nitrogen fixing nodules on species of the Medicago, Melilotus and Trigonella genera and is exceptional because it is a highly effective symbiotic partner of the two most widely used accessions, A17 and R108, of the model legume Medicago truncatula Gaertn. Based on 16S rRNA gene sequence, multilocus sequence and average nucleotide identity analyses, FSM-MA is identified as a new Ensifer meliloti strain. The genome is 6,70 Mbp and is comprised of the chromosome (3,64 Mbp) harboring 3574 predicted genes and two megaplasmids, pSymA (1,42 Mbp) and pSymB (1,64 Mbp) with respectively 1481 and 1595 predicted genes. The average GC content of the genome is 61.93%. The FSM-MA genome structure is highly similar and co-linear to other E. meliloti strains in the chromosome and the pSymB megaplasmid while, in contrast, it shows high variability in the pSymA plasmid. The large number of strain-specific sequences in pSymA as well as strain-specific genes on pSymB involved in the biosynthesis of the lipopolysaccharide and capsular polysaccharide surface polysaccharides may encode novel symbiotic functions explaining the high symbiotic performance of FSM-MA.


July 7, 2019

HISEA: HIerarchical SEed Aligner for PacBio data.

The next generation sequencing (NGS) techniques have been around for over a decade. Many of their fundamental applications rely on the ability to compute good genome assemblies. As the technology evolves, the assembly algorithms and tools have to continuously adjust and improve. The currently dominant technology of Illumina produces reads that are too short to bridge many repeats, setting limits on what can be successfully assembled. The emerging SMRT (Single Molecule, Real-Time) sequencing technique from Pacific Biosciences produces uniform coverage and long reads of length up to sixty thousand base pairs, enabling significantly better genome assemblies. However, SMRT reads are much more expensive and have a much higher error rate than Illumina’s – around 10-15% – mostly due to indels. New algorithms are very much needed to take advantage of the long reads while mitigating the effect of high error rate and lowering the required coverage.An essential step in assembling SMRT data is the detection of alignments, or overlaps, between reads. High error rate and very long reads make this a much more challenging problem than for Illumina data. We present a new pairwise read aligner, or overlapper, HISEA (Hierarchical SEed Aligner) for SMRT sequencing data. HISEA uses a novel two-step k-mer search, employing consistent clustering, k-mer filtering, and read alignment extension.We compare HISEA against several state-of-the-art programs – BLASR, DALIGNER, GraphMap, MHAP, and Minimap – on real datasets from five organisms. We compare their sensitivity, precision, specificity, F1-score, as well as time and memory usage. We also introduce a new, more precise, evaluation method. Finally, we compare the two leading programs, MHAP and HISEA, for their genome assembly performance in the Canu pipeline.Our algorithm has the best alignment detection sensitivity among all programs for SMRT data, significantly higher than the current best. The currently best assembler for SMRT data is the Canu program which uses the MHAP aligner in its pipeline. We have incorporated our new HISEA aligner in the Canu pipeline and benchmarked it against the best pipeline for multiple datasets at two relevant coverage levels: 30x and 50x. Our assemblies are better than those using MHAP for both coverage levels. Moreover, Canu+HISEA assemblies for 30x coverage are comparable with Canu+MHAP assemblies for 50x coverage, while being faster and cheaper.The HISEA algorithm produces alignments with highest sensitivity compared with the current state-of-the-art algorithms. Integrated in the Canu pipeline, currently the best for assembling PacBio data, it produces better assemblies than Canu+MHAP.


July 7, 2019

Chromosome evolution in the free-living flatworms: first evidence of intrachromosomal rearrangements in karyotype evolution of Macrostomum lignano (Platyhelminthes, Macrostomida).

The free-living flatworm Macrostomum lignano is a hidden tetraploid. Its genome was formed by a recent whole genome duplication followed by chromosome fusions. Its karyotype (2n = 8) consists of a pair of large chromosomes (MLI1), which contain regions of all other chromosomes, and three pairs of small metacentric chromosomes. Comparison of MLI1 with metacentrics was performed by painting with microdissected DNA probes and fluorescent in situ hybridization of unique DNA fragments. Regions of MLI1 homologous to small metacentrics appeared to be contiguous. Besides the loss of DNA repeat clusters (pericentromeric and telomeric repeats and the 5S rDNA cluster) from MLI1, the difference between small metacentrics MLI2 and MLI4 and regions homologous to them in MLI1 were revealed. Abnormal karyotypes found in the inbred DV1/10 subline were analyzed, and structurally rearranged chromosomes were described with the painting technique, suggesting the mechanism of their origin. The revealed chromosomal rearrangements generate additional diversity, opening the way toward massive loss of duplicated genes from a duplicated genome. Our findings suggest that the karyotype of M. lignano is in the early stage of genome diploidization after whole genome duplication, and further studies on M. lignano and closely related species can address many questions about karyotype evolution in animals.


July 7, 2019

Draft sequencing of the heterozygous diploid genome of Satsuma (Citrus unshiu Marc.) using a hybrid assembly approach.

Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase”) was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.


July 7, 2019

Scaffolding of long read assemblies using long range contact information.

Long read technologies have revolutionized de novo genome assembly by generating contigs orders of magnitude longer than that of short read assemblies. Although assembly contiguity has increased, it usually does not reconstruct a full chromosome or an arm of the chromosome, resulting in an unfinished chromosome level assembly. To increase the contiguity of the assembly to the chromosome level, different strategies are used which exploit long range contact information between chromosomes in the genome.We develop a scalable and computationally efficient scaffolding method that can boost the assembly contiguity to a large extent using genome-wide chromatin interaction data such as Hi-C.we demonstrate an algorithm that uses Hi-C data for longer-range scaffolding of de novo long read genome assemblies. We tested our methods on the human and goat genome assemblies. We compare our scaffolds with the scaffolds generated by LACHESIS based on various metrics.Our new algorithm SALSA produces more accurate scaffolds compared to the existing state of the art method LACHESIS.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.