Menu
September 22, 2019

Evolution of host support for two ancient bacterial symbionts with differentially degraded genomes in a leafhopper host.

Plant sap-feeding insects (Hemiptera) rely on bacterial symbionts for nutrition absent in their diets. These bacteria experience extreme genome reduction and require genetic resources from their hosts, particularly for basic cellular processes other than nutrition synthesis. The host-derived mechanisms that complete these processes have remained poorly understood. It is also unclear how hosts meet the distinct needs of multiple bacterial partners with differentially degraded genomes. To address these questions, we investigated the cell-specific gene-expression patterns in the symbiotic organs of the aster leafhopper (ALF), Macrosteles quadrilineatus (Cicadellidae). ALF harbors two intracellular symbionts that have two of the smallest known bacterial genomes: Nasuia (112 kb) and Sulcia (190 kb). Symbionts are segregated into distinct host cell types (bacteriocytes) and vary widely in their basic cellular capabilities. ALF differentially expresses thousands of genes between the bacteriocyte types to meet the functional needs of each symbiont, including the provisioning of metabolites and support of cellular processes. For example, the host highly expresses genes in the bacteriocytes that likely complement gene losses in nucleic acid synthesis, DNA repair mechanisms, transcription, and translation. Such genes are required to function in the bacterial cytosol. Many host genes comprising these support mechanisms are derived from the evolution of novel functional traits via horizontally transferred genes, reassigned mitochondrial support genes, and gene duplications with bacteriocyte-specific expression. Comparison across other hemipteran lineages reveals that hosts generally support the incomplete symbiont cellular processes, but the origins of these support mechanisms are generally specific to the host-symbiont system.Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Genomic insights into multidrug-resistance, mating and virulence in Candida auris and related emerging species.

Candida auris is an emergent multidrug-resistant fungal pathogen causing increasing reports of outbreaks. While distantly related to C. albicans and C. glabrata, C. auris is closely related to rarely observed and often multidrug-resistant species from the C. haemulonii clade. Here, we analyze near complete genome assemblies for the four C. auris clades and three related species, and map intra- and inter-species rearrangements across the seven chromosomes. Using RNA-Seq-guided gene predictions, we find that most mating and meiosis genes are conserved and that clades contain either the MTLa or MTLa mating loci. Comparing the genomes of these emerging species to those of other Candida species identifies genes linked to drug resistance and virulence, including expanded families of transporters and lipases, as well as mutations and copy number variants in ERG11. Gene expression analysis identifies transporters and metabolic regulators specific to C. auris and those conserved with related species which may contribute to differences in drug response in this emerging fungal clade.


September 22, 2019

Genomic and transcriptomic comparisons of closely related malaria parasites differing in virulence and sequestration pattern.

Background: Malaria parasite species differ greatly in the harm they do to humans. While P. falciparum kills hundreds of thousands per year, P. vivax kills much less often and P. malariae is relatively benign. Strains of the rodent malaria parasite Plasmodium chabaudi show phenotypic variation in virulence during infections of laboratory mice. This make it an excellent species to study genes which may be responsible for this trait. By understanding the mechanisms which underlie differences in virulence we can learn how parasites adapt to their hosts and how we might prevent disease. Methods: Here we present a complete reference genome sequence for a more virulent P. chabaudi strain, PcCB, and perform a detailed comparison with the genome of the less virulent PcAS strain. Results: We found the greatest variation in the subtelomeric regions, in particular amongst the sequences of the pir gene family, which has been associated with virulence and establishment of chronic infection. Despite substantial variation at the sequence level, the repertoire of these genes has been largely maintained, highlighting the requirement for functional conservation as well as diversification in host-parasite interactions. However, a subset of pir genes, previously associated with increased virulence, were more highly expressed in PcCB, suggesting a role for this gene family in virulence differences between strains. We found that core genes involved in red blood cell invasion have been under positive selection and that the more virulent strain has a greater preference for reticulocytes, which has elsewhere been associated with increased virulence. Conclusions: These results provide the basis for a mechanistic understanding of the phenotypic differences between Plasmodium chabaudi strains, which might ultimately be translated into a better understanding of malaria parasites affecting humans.


September 22, 2019

N6-methyladenine DNA methylation in Japonica and Indica rice genomes and its association with gene expression, plant development, and stress responses.

N6-Methyladenine (6mA) DNA methylation has recently been implicated as a potential new epigenetic marker in eukaryotes, including the dicot model Arabidopsis thaliana. However, the conservation and divergence of 6mA distribution patterns and functions in plants remain elusive. Here we report high-quality 6mA methylomes at single-nucleotide resolution in rice based on substantially improved genome sequences of two rice cultivars, Nipponbare (Nip; Japonica) and 93-11 (Indica). Analysis of 6mA genomic distribution and its association with transcription suggest that 6mA distribution and function is rather conserved between rice and Arabidopsis. We found that 6mA levels are positively correlated with the expression of key stress-related genes, which may be responsible for the difference in stress tolerance between Nip and 93-11. Moreover, we showed that mutations in DDM1 cause defects in plant growth and decreased 6mA level. Our results reveal that 6mA is a conserved DNA modification that is positively associated with gene expression and contributes to key agronomic traits in plants. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Genetics and genomics of an unusual selfish sex ratio distortion in an insect.

Diverse selfish genetic elements have evolved the ability to manipulate reproduction to increase their transmission, and this can result in highly distorted sex ratios [1]. Indeed, one of the major explanations for why sex determination systems are so dynamic is because they are shaped by ongoing coevolutionary arms races between sex-ratio-distorting elements and the rest of the genome [2]. Here, we use genetic crosses and genome analysis to describe an unusual sex ratio distortion with striking consequences on genome organization in a booklouse species, Liposcelis sp. (Insecta: Psocodea), in which two types of females coexist. Distorter females never produce sons but must mate with males (the sons of nondistorting females) to reproduce [3]. Although they are diploid and express the genes inherited from their fathers in somatic tissues, distorter females only ever transmit genes inherited from their mothers. As a result, distorter females have unusual chimeric genomes, with distorter-restricted chromosomes diverging from their nondistorting counterparts and exhibiting features of a giant non-recombining sex chromosome. The distorter-restricted genome has also acquired a gene from the bacterium Wolbachia, a well-known insect reproductive manipulator; we found that this gene has independently colonized the genomes of two other insect species with unusual reproductive systems, suggesting possible roles in sex ratio distortion in this remarkable genetic system. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Genomic and genetic insights into a cosmopolitan fungus, Paecilomyces variotii (Eurotiales).

Species in the genus Paecilomyces, a member of the fungal order Eurotiales, are ubiquitous in nature and impact a variety of human endeavors. Here, the biology of one common species, Paecilomyces variotii, was explored using genomics and functional genetics. Sequencing the genome of two isolates revealed key genome and gene features in this species. A striking feature of the genome was the two-part nature, featuring large stretches of DNA with normal GC content separated by AT-rich regions, a hallmark of many plant-pathogenic fungal genomes. These AT-rich regions appeared to have been mutated by repeat-induced point (RIP) mutations. We developed methods for genetic transformation of P. variotii, including forward and reverse genetics as well as crossing techniques. Using transformation and crossing, RIP activity was identified, demonstrating for the first time that RIP is an active process within the order Eurotiales. A consequence of RIP is likely reflected by a reduction in numbers of genes within gene families, such as in cell wall degradation, and reflected by growth limitations on P. variotii on diverse carbon sources. Furthermore, using these transformation tools we characterized a conserved protein containing a domain of unknown function (DUF1212) and discovered it is involved in pigmentation.


September 22, 2019

First draft genome for red sea bream of family Sparidae.

Reference genomes for all organisms on earth are now attainable owing to advances in genome sequencing technologies (Goodwin et al., 2016). Generally, species that contribute considerably to the economy or human welfare are sequenced and are considered more important than others. Furthermore, coastal indigenous people mainly depend on marine species for their food sources, which has resulted in the extinction of several marine species (Cisneros-Montemayor et al., 2016). Of these, an extinction risk assessment of marine fishes, mainly for sea breams (Family: Sparidae), has recently been conducted by way of a global extinction risk assessment from the dataset of the International Union for Conservation of Nature’s Red List Process, which mentions that around 25 species are threatened/near-threatened according to their body weight (Comeros-Raynal et al., 2016). Another report clearly showed the benefit of worldwide aquaculture production, which contributed to 47% of total seafood production, and also highlighted the over-fishing of sea breams (FAO, 2018). The Republic of Korea is the fourth largest seafood producer in the world, producing 3.3 million tons in 2015 and exporting seafood worth $1.6 billion in 2016; therefore, aquaculture- associated research is fundamental for Korea. In the present study, the red sea bream (Pagrus major), which belongs to the family Sparidae, which comprises 35 genera, 132 species, and 10 subspecies (de la Herran et al., 2001; NCBI, 2018), was assessed.


September 22, 2019

De novo assembly of the Pasteuria penetrans genome reveals high plasticity, host dependency, and BclA-like collagens.

Pasteuria penetrans is a gram-positive endospore forming bacterial parasite of Meloidogyne spp. the most economically damaging genus of plant parasitic nematodes globally. The obligate antagonistic nature of P. penetrans makes it an attractive candidate biological control agent. However, deployment of P. penetrans for this purpose is inhibited by a lack of understanding of its metabolism and the molecular mechanics underpinning parasitism of the host, in particular the initial attachment of the endospore to the nematode cuticle. Several attempts to assemble the genomes of species within this genus have been unsuccessful. Primarily this is due to the obligate parasitic nature of the bacterium which makes obtaining genomic DNA of sufficient quantity and quality which is free from contamination challenging. Taking advantage of recent developments in whole genome amplification, long read sequencing platforms, and assembly algorithms, we have developed a protocol to generate large quantities of high molecular weight genomic DNA from a small number of purified endospores. We demonstrate this method via genomic assembly of P. penetrans. This assembly reveals a reduced genome of 2.64Mbp estimated to represent 86% of the complete sequence; its reduced metabolism reflects widespread reliance on the host and possibly associated organisms. Additionally, apparent expansion of transposases and prediction of partial competence pathways suggest a high degree of genomic plasticity. Phylogenetic analysis places our sequence within the Bacilli, and most closely related to Thermoactinomyces species. Seventeen predicted BclA-like proteins are identified which may be involved in the determination of attachment specificity. This resource may be used to develop in vitro culture methods and to investigate the genetic and molecular basis of attachment specificity.


September 21, 2019

Recent advances in bioinformatics for fish genomics

In the past few years, we have contributed efforts to ~1/5 of the reported fish genomes. Based on our related experience, here we outline recent advances in bioinformatics for fish genomics, with an emphasis on development of software for genome assembly, genome annotation and evolutionary analysis. This review will be helpful for the new players of genome analysis on both animals and plants. In the past decade, whole genome sequences of approximately 50 fish species have been reported [1]. We have been involved in ~1/5 of these international works from 2014 to 2017, such as mudskippers (2014) [2], Chinese large yellow croaker [3], Chinese barbel fishes [4], Asian arowana [5,6], Channel catfish [7], seahorses [8], Japanese flounder [9], Chinese clearhead icefish [10] and Northern snakehead [11]. We are also in charge of the China Auqatic 10-100-1,000 Genomics Program [12], in which ~100 fish genomes are sequencing targets for the next 3~5 years. Based on our previous experience on fish genomic studies, here we outline recent advances in related bioinformatics for fish genomics to share with public readers. Since the basic informatics includes genome assembly, genome annotation and evolutionary analysis, we discuss them one by one in this order.


September 21, 2019

Whole genome sequence of the soybean aphid, Aphis glycines.

Aphids are emerging as model organisms for both basic and applied research. Of the 5,000 estimated species, only three aphids have published whole genome sequences: the pea aphid Acyrthosiphon pisum, the Russian wheat aphid, Diuraphis noxia, and the green peach aphid, Myzus persicae. We present the whole genome sequence of a fourth aphid, the soybean aphid (Aphis glycines), which is an extreme specialist and an important invasive pest of soybean (Glycine max). The availability of genomic resources is important to establish effective and sustainable pest control, as well as to expand our understanding of aphid evolution. We generated a 302.9 Mbp draft genome assembly for Ap. glycines using a hybrid sequencing approach. This assembly shows high completeness with 19,182 predicted genes, 92% of known Ap. glycines transcripts mapping to contigs, and substantial continuity with a scaffold N50 of 174,505 bp. The assembly represents 95.5% of the predicted genome size of 317.1 Mbp based on flow cytometry. Ap. glycines contains the smallest known aphid genome to date, based on updated genome sizes for 19 aphid species. The repetitive DNA content of the Ap. glycines genome assembly (81.6 Mbp or 26.94% of the 302.9 Mbp assembly) shows a reduction in the number of classified transposable elements compared to Ac. pisum, and likely contributes to the small estimated genome size. We include comparative analyses of gene families related to host-specificity (cytochrome P450’s and effectors), which may be important in Ap. glycines evolution. This Ap. glycines draft genome sequence will provide a resource for the study of aphid genome evolution, their interaction with host plants, and candidate genes for novel insect control methods. Copyright © 2017 Elsevier Ltd. All rights reserved.


September 21, 2019

Divergent selection causes whole genome differentiation without physical linkage among the targets in Spodoptera frugiperda (Noctuidae)

The process of speciation involves whole genome differentiation by overcoming gene flow between diverging populations. We have ample knowledge which evolutionary forces may cause genomic differentiation, and several speciation models have been proposed to explain the transition from genetic to genomic differentiation. However, it is still unclear what are critical conditions enabling genomic differentiation in nature. The Fall armyworm, Spodoptera frugiperda, is observed as two sympatric strains that have different host-plant ranges, suggesting the possibility of ecological divergent selection. In our previous study, we observed that these two strains show genetic differentiation across the whole genome with an unprecedentedly low extent, suggesting the possibility that whole genome sequences started to be differentiated between the strains. In this study, we analyzed whole genome sequences from these two strains from Mississippi to identify critical evolutionary factors for genomic differentiation. The genomic Fst is low (0.017) while 91.3% of 10kb windows have Fst greater than 0, suggesting genome-wide differentiation with a low extent. We identified nearly 400 outliers of genetic differentiation between strains, and found that physical linkage among these outliers is not a primary cause of genomic differentiation. Fst is not significantly correlated with gene density, a proxy for the strength of selection, suggesting that a genomic reduction in migration rate dominates the extent of local genetic differentiation. Our analyses reveal that divergent selection alone is sufficient to generate genomic differentiation, and any following diversifying factors may increase the level of genetic differentiation between diverging strains in the process of speciation.


September 21, 2019

Phased diploid genome assembly with single-molecule real-time sequencing.

While genome assembly projects have been successful in many haploid and inbred species, the assembly of noninbred or rearranged heterozygous genomes remains a major challenge. To address this challenge, we introduce the open-source FALCON and FALCON-Unzip algorithms (https://github.com/PacificBiosciences/FALCON/) to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes. We generate new reference sequences for heterozygous samples including an F1 hybrid of Arabidopsis thaliana, the widely cultivated Vitis vinifera cv. Cabernet Sauvignon, and the coral fungus Clavicorona pyxidata, samples that have challenged short-read assembly approaches. The FALCON-based assemblies are substantially more contiguous and complete than alternate short- or long-read approaches. The phased diploid assembly enabled the study of haplotype structure and heterozygosities between homologous chromosomes, including the identification of widespread heterozygous structural variation within coding sequences.


September 21, 2019

Population sequencing reveals clonal diversity and ancestral inbreeding in the grapevine cultivar Chardonnay.

Chardonnay is the basis of some of the world’s most iconic wines and its success is underpinned by a historic program of clonal selection. There are numerous clones of Chardonnay available that exhibit differences in key viticultural and oenological traits that have arisen from the accumulation of somatic mutations during centuries of asexual propagation. However, the genetic variation that underlies these differences remains largely unknown. To address this knowledge gap, a high-quality, diploid-phased Chardonnay genome assembly was produced from single-molecule real time sequencing, and combined with re-sequencing data from 15 different Chardonnay clones. There were 1620 markers identified that distinguish the 15 clones. These markers were reliably used for clonal identification of independently sourced genomic material, as well as in identifying a potential genetic basis for some clonal phenotypic differences. The predicted parentage of the Chardonnay haplomes was elucidated by mapping sequence data from the predicted parents of Chardonnay (Gouais blanc and Pinot noir) against the Chardonnay reference genome. This enabled the detection of instances of heterosis, with differentially-expanded gene families being inherited from the parents of Chardonnay. Most surprisingly however, the patterns of nucleotide variation present in the Chardonnay genome indicate that Pinot noir and Gouais blanc share an extremely high degree of kinship that has resulted in the Chardonnay genome displaying characteristics that are indicative of inbreeding.


July 19, 2019

Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia.

Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published.A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Paloindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a phophoenolpyruvate synthase) and substrate utilization pathway (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii.Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems.


July 19, 2019

The methylomes of six bacteria.

Six bacterial genomes, Geobacter metallireducens GS-15, Chromohalobacter salexigens, Vibrio breoganii 1C-10, Bacillus cereus ATCC 10987, Campylobacter jejuni subsp. jejuni 81-176 and C. jejuni NCTC 11168, all of which had previously been sequenced using other platforms were re-sequenced using single-molecule, real-time (SMRT) sequencing specifically to analyze their methylomes. In every case a number of new N(6)-methyladenine ((m6)A) and N(4)-methylcytosine ((m4)C) methylation patterns were discovered and the DNA methyltransferases (MTases) responsible for those methylation patterns were assigned. In 15 cases, it was possible to match MTase genes with MTase recognition sequences without further sub-cloning. Two Type I restriction systems required sub-cloning to differentiate their recognition sequences, while four MTase genes that were not expressed in the native organism were sub-cloned to test for viability and recognition sequences. Two of these proved active. No attempt was made to detect 5-methylcytosine ((m5)C) recognition motifs from the SMRT® sequencing data because this modification produces weaker signals using current methods. However, all predicted (m6)A and (m4)C MTases were detected unambiguously. This study shows that the addition of SMRT sequencing to traditional sequencing approaches gives a wealth of useful functional information about a genome showing not only which MTase genes are active but also revealing their recognition sequences.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.