Background Assemblies of diploid genomes are generally unphased, pseudo-haploid representations that do not correctly reconstruct the two parental haplotypes present in the individual sequenced. Instead, the assembly alternates between parental haplotypes and may contain duplications in regions where the parental haplotypes are sufficiently different. Trio binning is an approach to genome assembly that uses short reads from both parents to classify long reads from the offspring according to maternal or paternal haplotype origin, and is thus helped rather than impeded by heterozygosity. Using this approach, it is possible to derive two assemblies from an individual, accurately representing both parental contributions in their entirety with higher continuity and accuracy than is possible with other methods.Results We used trio binning to assemble reference genomes for two species from a single individual using an interspecies cross of yak (Bos grunniens) and cattle (Bos taurus). The high heterozygosity inherent to interspecies hybrids allowed us to confidently assign >99% of long reads from the F1 offspring to parental bins using unique k-mers from parental short reads. Both the maternal (yak) and paternal (cattle) assemblies contain over one third of the acrocentric chromosomes, including the two largest chromosomes, in single haplotigs.Conclusions These haplotigs are the first vertebrate chromosome arms to be assembled gap-free and fully phased, and the first time assemblies for two species have been created from a single individual. Both assemblies are the most continuous currently available for non-model vertebrates.MbmegabaseskbkilobasesMYAmillions of years agoMHCmajor histocompatibility complexSMRTsingle molecule real time
Genome sequence resources for four phytopathogenic fungi from the Colletotrichum orbiculare species complex.
Colletotrichum orbiculare species complex fungi are hemibiotrophic plant pathogens that cause anthracnose of field crops and weeds. Members of this group have genomes that are remarkably expanded relative to other Colletotrichum fungi and compartmentalized into AT-rich, gene poor and GC-rich, gene rich regions. Here we present an updated version of the Colletotrichum orbiculare genome, as well as draft genomes of three other members from the C. orbiculare species complex; the alfalfa pathogen Colletotrichum trifolii, the prickly mallow pathogen Colletotrichum sidae and the burweed pathogen Colletotrichum spinosum. The data reported here will be important for comparative genomics analyses to identify factors that play a role in the evolution and maintenance of the expanded, compartmentalized genomes of these fungi which may contribute to their pathogenicity.
Phytoplankton-virus interactions are major determinants of geochemical cycles in the oceans. Viruses are responsible for the redirection of carbon and nutrients away from larger organisms back towards microorganisms via the lysis of microalgae in a process coined the “viral shunt”. Virus-host interactions are generally expected to follow “boom and bust” dynamics, whereby a numerically dominant strain is lysed and replaced by a virus resistant strain. Here, we isolated a microalga and its infective nucleo-cytoplasmic large DNA virus (NCLDV) concomitantly from the environment in the surface NW Mediterranean Sea, Ostreococcus mediterraneus, and show continuous growth in culture of both the microalga and the virus. Evolution experiments through single cell bottlenecks demonstrate that, in the absence of the virus, susceptible cells evolve from one ancestral resistant single cell, and vice-versa; that is that resistant cells evolve from one ancestral susceptible cell. This provides evidence that the observed sustained viral production is the consequence of a minority of virus-susceptible cells. The emergence of these cells is explained by low-level phase switching between virus-resistant and virus-susceptible phenotypes, akin to a bet hedging strategy. Whole genome sequencing and analysis of the ~14 Mb microalga and the ~200 kb virus points towards ancient speciation of the microalga within the Ostreococcus species complex and frequent gene exchanges between prasinoviruses infecting Ostreococcus species. Re-sequencing of one susceptible strain demonstrated that the phase switch involved a large 60 Kb deletion of one chromosome. This chromosome is an outlier chromosome compared to the streamlined, gene dense, GC-rich standard chromosomes, as it contains many repeats and few orthologous genes. While this chromosome has been described in three different genera, its size increments have been previously associated to antiviral immunity and resistance in another species from the same genus. Mathematical modelling of this mechanism predicts microalga-virus population dynamics consistent with the observation of continuous growth of both virus and microalga. Altogether, our results suggest a previously overlooked strategy in phytoplankton-virus interactions.
Centromere-mediated chromosome break drives karyotype evolution in closely related Malassezia species
Intra-chromosomal or inter-chromosomal genomic rearrangements often lead to speciation. Loss or gain of a centromere leads to alterations in chromosome number in closely related species. Thus, centromeres can enable tracing the path of evolution from the ancestral to a derived state. The Malassezia species complex of the phylum Basiodiomycota shows remarkable diversity in chromosome number ranging between six and nine chromosomes. To understand these transitions, we experimentally identified all eight centromeres as binding sites of an evolutionarily conserved outer kinetochore protein Mis12/Mtw1 in M. sympodialis. The 3 to 5 kb centromere regions share an AT-rich, poorly transcribed core region enriched with a 12 bp consensus motif. We also mapped nine such AT-rich centromeres in M. globosa and the related species Malassezia restricta and Malassezia slooffiae. While eight predicted centromeres were found within conserved synteny blocks between these species and M. sympodialis, the remaining centromere in M. globosa (MgCEN2) or its orthologous centromere in M. slooffiae (MslCEN4) and M. restricta (MreCEN8) mapped to a synteny breakpoint compared with M. sympodialis. Taken together, we provide evidence that breakage and loss of a centromere (CEN2) in an ancestral Malassezia species possessing nine chromosomes resulted in fewer chromosomes in M. sympodialis. Strikingly, the predicted centromeres of all closely related Malassezia species map to an AT-rich core on each chromosome that also shows enrichment of the 12 bp sequence motif. We propose that centromeres are fragile AT-rich sites driving karyotype diversity through breakage and inactivation in these and other species.
Gene transfer between bacterial species is an important mechanism for adaptation. For example, sets of genes that confer the ability to form nitrogen-fixing root nodules on host plants have frequently moved between Rhizobium species. It is not clear, though, whether such transfer is exceptional, or if frequent inter-species introgression is typical. To address this, we sequenced the genomes of 196 isolates of the Rhizobium leguminosarum species complex obtained from root nodules of white clover (Trifolium repens). Core gene phylogeny placed the isolates into five distinct genospecies that show high intra-genospecies recombination rates and remarkably different demographic histories. Most gene phylogenies were largely concordant with the genospecies, indicating that recent gene transfer between genospecies was rare. In contrast, very similar symbiosis gene sequences were found in two or more genospecies, suggesting recent horizontal transfer. The replication and conjugative transfer genes of the plasmids carrying the symbiosis genes showed a similar pattern, implying that introgression occurred by conjugative plasmid transfer. The only other regions that showed strong phylogenetic discordance with the genospecies classification were two small chromosomal clusters, one neighbouring a conjugative transfer system. Phage-related sequences were observed in the genomes, but appeared to have very limited impact on introgression. Introgression among these closely-related species has been very limited, confined to the symbiosis plasmids and a few chromosomal islands. Both introgress through conjugative transfer, but have been subject to different types of selective forces.
Strengths and potential pitfalls of hay-transfer for ecological restoration revealed by RAD-seq analysis in floodplain Arabis species
Achieving high intraspecific genetic diversity is a critical goal in ecological restoration as it increases the adaptive potential and long-term resilience of populations. Thus, we investigated genetic diversity within and between pristine sites in a fossil floodplain and compared it to sites restored by hay-transfer between 1997 and 2014. RAD-seq genotyping revealed that the stenoecious flood-plain species Arabis nemorensis is co-occurring with individuals that, based on ploidy, ITS-sequencing and morphology, probably belong to the close relative Arabis sagittata, which has a documented preference for dry calcareous grasslands but has not been reported in floodplain meadows. We show that hay-transfer maintains genetic diversity for both species. Additionally, in A. sagittata, transfer from multiple genetically isolated pristine sites resulted in restored sites with increased diversity and admixed local genotypes. In A. nemorensis, transfer did not create novel admixture dynamics because genetic diversity between pristine sites was less differentiated. Thus, the effects of hay-transfer on genetic diversity also depend on the genetic makeup of the donor communities of each species, especially when local material is mixed. Our results demonstrate the efficiency of hay-transfer for habitat restoration and emphasize the importance of pre-restoration characterization of micro-geographic patterns of intraspecific diversity of the community to guarantee that restoration practices reach their goal, i.e. maximize the adaptive potential of the entire restored plant community. Overlooking these patterns may alter the balance between species in the community. Additionally, our comparison of summary statistics obtained from de novo and reference-based RAD-seq pipelines shows that the genomic impact of restoration can be reliably monitored in species lacking prior genomic knowledge.
Suppressed recombination allows divergence between homologous sex chromosomes and the functionality of their genes. Here, we reveal patterns of the earliest stages of sex-chromosome evolution in the diploid dioecious herb Mercurialis annua on the basis of cytological analysis, de novo genome assembly and annotation, genetic mapping, exome resequencing of natural populations, and transcriptome analysis. The genome assembly contained 34,105 expressed genes, of which 10,076 were assigned to linkage groups. Genetic mapping and exome resequencing of individuals across the species range both identified the largest linkage group, LG1, as the sex chromosome. Although the sex chromosomes of M. annua are karyotypically homomorphic, we estimate that about a third of the Y chromosome has ceased recombining, containing 568 transcripts and spanning 22.3 cM in the corresponding female map. Nevertheless, we found limited evidence for Y-chromosome degeneration in terms of gene loss and pseudogenization, and most X- and Y-linked genes appear to have diverged in the period subsequent to speciation between M. annua and its sister species M. huetii which shares the same sex-determining region. Taken together, our results suggest that the M. annua Y chromosome has at least two evolutionary strata: a small old stratum shared with M. huetii, and a more recent larger stratum that is probably unique to M. annua and that stopped recombining about one million years ago. Patterns of gene expression within the non-recombining region are consistent with the idea that sexually antagonistic selection may have played a role in favoring suppressed recombination.Copyright © 2019, Genetics.
Using bacteria to transform reactive corrosion products into stable compounds represents an alternative to traditional methods employed in iron conservation. Two environmental Aeromonas strains (CA23 and CU5) were used to transform ferric iron corrosion products (goethite and lepidocrocite) into stable ferrous iron-bearing minerals (vivianite and siderite). A genomic and transcriptomic approach was used to analyze the metabolic traits of these strains and to evaluate their pathogenic potential. Although genes involved in solid-phase iron reduction were identified, key genes present in other environmental iron-reducing species are missing from the genome of CU5. Several pathogenicity factors were identified in the genomes of both strains, but none of these was expressed under iron reduction conditions. Additional in vivo tests showed hemolytic and cytotoxic activities for strain CA23 but not for strain CU5. Both strains were easily inactivated using ethanol and heat. Nonetheless, given a lesser potential for a pathogenic lifestyle, CU5 is the most promising candidate for the development of a bio-based iron conservation method stabilizing iron corrosion. Based on all the results, a prototype treatment was established using archaeological items. On those, the conversion of reactive corrosion products and the formation of a homogenous layer of biogenic iron minerals were achieved. This study shows how naturally occurring microorganisms and their metabolic capabilities can be used to develop bio-inspired solutions to the problem of metal corrosion.IMPORTANCE Microbiology can greatly help in the quest for a sustainable solution to the problem of iron corrosion, which causes important economic losses in a wide range of fields, including the protection of cultural heritage and building materials. Using bacteria to transform reactive and unstable corrosion products into more-stable compounds represents a promising approach. The overall aim of this study was to develop a method for the conservation and restoration of corroded iron items, starting from the isolation of iron-reducing bacteria from natural environments. This resulted in the identification of a suitable candidate (Aeromonas sp. strain CU5) that mediates the formation of desirable minerals at the surfaces of the objects. This led to the proof of concept of an application method on real objects.Copyright © 2019 Kooli et al.
Foodborne infections caused by lung flukes of the genus Paragonimus are a significant and widespread public health problem in tropical areas. Approximately 50 Paragonimus species have been reported to infect animals and humans, but Paragonimus westermani is responsible for the bulk of human disease. Despite their medical and economic importance, no genome sequence for any Paragonimus species is available.We sequenced and assembled the genome of P. westermani, which is among the largest of the known pathogen genomes with an estimated size of 1.1 Gb. A 922.8 Mb genome assembly was generated from Illumina and Pacific Biosciences (PacBio) sequence data, covering 84% of the estimated genome size. The genome has a high proportion (45%) of repeat-derived DNA, particularly of the long interspersed element and long terminal repeat subtypes, and the expansion of these elements may explain some of the large size. We predicted 12,852 protein coding genes, showing a high level of conservation with related trematode species. The majority of proteins (80%) had homologs in the human liver fluke Opisthorchis viverrini, with an average sequence identity of 64.1%. Assembly of the P. westermani mitochondrial genome from long PacBio reads resulted in a single high-quality circularized 20.6 kb contig. The contig harbored a 6.9 kb region of non-coding repetitive DNA comprised of three distinct repeat units. Our results suggest that the region is highly polymorphic in P. westermani, possibly even within single worm isolates.The generated assembly represents the first Paragonimus genome sequence and will facilitate future molecular studies of this important, but neglected, parasite group.
Newly emerged wheat blast disease is a serious threat to global wheat production. Wheat blast is caused by a distinct, exceptionally diverse lineage of the fungus causing rice blast disease. Through sequencing a recent field isolate, we report a reference genome that includes seven core chromosomes and mini-chromosome sequences that harbor effector genes normally found on ends of core chromosomes in other strains. No mini-chromosomes were observed in an early field strain, and at least two from another isolate each contain different effector genes and core chromosome end sequences. The mini-chromosome is enriched in transposons occurring most frequently at core chromosome ends. Additionally, transposons in mini-chromosomes lack the characteristic signature for inactivation by repeat-induced point (RIP) mutation genome defenses. Our results, collectively, indicate that dispensable mini-chromosomes and core chromosomes undergo divergent evolutionary trajectories, and mini-chromosomes and core chromosome ends are coupled as a mobile, fast-evolving effector compartment in the wheat pathogen genome.
A High-Quality Grapevine Downy Mildew Genome Assembly Reveals Rapidly Evolving and Lineage-Specific Putative Host Adaptation Genes.
Downy mildews are obligate biotrophic oomycete pathogens that cause devastating plant diseases on economically important crops. Plasmopara viticola is the causal agent of grapevine downy mildew, a major disease in vineyards worldwide. We sequenced the genome of Pl. viticola with PacBio long reads and obtained a new 92.94?Mb assembly with high contiguity (359 scaffolds for a N50 of 706.5?kb) due to a better resolution of repeat regions. This assembly presented a high level of gene completeness, recovering 1,592 genes encoding secreted proteins involved in plant-pathogen interactions. Plasmopara viticola had a two-speed genome architecture, with secreted protein-encoding genes preferentially located in gene-sparse, repeat-rich regions and evolving rapidly, as indicated by pairwise dN/dS values. We also used short reads to assemble the genome of Plasmopara muralis, a closely related species infecting grape ivy (Parthenocissus tricuspidata). The lineage-specific proteins identified by comparative genomics analysis included a large proportion of RxLR cytoplasmic effectors and, more generally, genes with high dN/dS values. We identified 270 candidate genes under positive selection, including several genes encoding transporters and components of the RNA machinery potentially involved in host specialization. Finally, the Pl. viticola genome assembly generated here will allow the development of robust population genomics approaches for investigating the mechanisms involved in adaptation to biotic and abiotic selective pressures in this species. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Recombination between loci underlying mate choice and ecological traits is a major evolutionary force acting against speciation with gene flow. The evolution of linkage disequilibrium between such loci is therefore a fundamental step in the origin of species. Here, we show that this process can take place in the absence of physical linkage in hamlets-a group of closely related reef fishes from the wider Caribbean that differ essentially in colour pattern and are reproductively isolated through strong visually-based assortative mating. Using full-genome analysis, we identify four narrow genomic intervals that are consistently differentiated among sympatric species in a backdrop of extremely low genomic divergence. These four intervals include genes involved in pigmentation (sox10), axial patterning (hoxc13a), photoreceptor development (casz1) and visual sensitivity (SWS and LWS opsins) that develop islands of long-distance and inter-chromosomal linkage disequilibrium as species diverge. The relatively simple genomic architecture of species differences facilitates the evolution of linkage disequilibrium in the presence of gene flow.
Genomic and Functional Characterization of the Endophytic Bacillus subtilis 7PJ-16 Strain, a Potential Biocontrol Agent of Mulberry Fruit Sclerotiniose.
Bacillus sp. 7PJ-16, an endophytic bacterium isolated from a healthy mulberry stem and previously identified as Bacillus tequilensis 7PJ-16, exhibits strong antifungal activity and has the capacity to promote plant growth. This strain was studied for its effectiveness as a biocontrol agent to reduce mulberry fruit sclerotiniose in the field and as a growth-promoting agent for mulberry in the greenhouse. In field studies, the cell suspension and supernatant of strain 7PJ-16 exhibited biocontrol efficacy and the lowest disease incidence was reduced down to only 0.80%. In greenhouse experiments, the cell suspension (1.0?×?106 and 1.0?×?105 CFU/mL) and the cell-free supernatant (100-fold and 1000-fold dilution) stimulated mulberry seed germination and promoted mulberry seedling growth. In addition, to accurately identify the 7PJ-16 strain and further explore the mechanisms of its antifungal and growth-promoting properties, the complete genome of this strain was sequenced and annotated. The 7PJ-16 genome is comprised of two circular plasmids and a 4,209,045-bp circular chromosome, containing 4492 protein-coding genes and 116 RNA genes. This strain was ultimately designed as Bacillus subtilis based on core genome sequence analyses using a phylogenomic approach. In this genome, we identified a series of gene clusters that function in the synthesis of non-ribosomal peptides (surfactin, fengycin, bacillibactin, and bacilysin) as well as the ribosome-dependent synthesis of tasA and bacteriocins (subtilin, subtilosin A), which are responsible for the biosynthesis of numerous antimicrobial metabolites. Additionally, several genes with function that promote plant growth, such as indole-3-acetic acid biosynthesis, the production of volatile substances, and siderophores synthesis, were also identified. The information described in this study has established a good foundation for understanding the beneficial interactions between endophytes and host plants, and facilitates the further application of B. subtilis 7PJ-16 as an agricultural biofertilizer and biocontrol agent.
Mitochondrial DNA and their nuclear copies in the parasitic wasp Pteromalus puparum: A comparative analysis in Chalcidoidea.
Chalcidoidea (chalcidoid wasps) are an abundant and megadiverse insect group with both ecological and economical importance. Here we report a complete mitochondrial genome in Chalcidoidea from Pteromalus puparum (Pteromalidae). Eight tandem repeats followed by 6 reversed repeats were detected in its 3308?bp control region. This long and complex control region may explain failures of amplifying and sequencing of complete mitochondrial genomes in some chalcidoids. In addition to 37 typical mitochondrial genes, an extra identical isoleucine tRNA (trnI) was detected at the opposite end of the control region. This recent mitochondrial gene duplication indicates that gene arrangements in chalcidoids are ongoing. A comparison among available chalcidoid mitochondrial genomes reveals rapid gene order rearrangements overall and high protein substitution rates in most chalcidoid taxa. In addition, we identified 24 nuclear sequences of mitochondrial origin (NUMTs) in P. puparum, summing up to 9989?bp, with 3617?bp of these NUMTs originating from mitochondrial coding regions. NUMTs abundance in P. puparum is only one-twelfth of that in its relative, Nasonia vitripennis. Based on phylogenetic analysis, we provide evidence that a faster nuclear degradation rate contributes to the reduced NUMT numbers in P. puparum. Overall, our study shows unusually high rates of mitochondrial evolution and considerable variation in NUMT accumulation in Chalcidoidea. Copyright © 2018. Published by Elsevier B.V.
Diversity of phytobeneficial traits revealed by whole-genome analysis of worldwide-isolated phenazine-producing Pseudomonas spp.
Plant-beneficial Pseudomonas spp. competitively colonize the rhizosphere and display plant-growth promotion and/or disease-suppression activities. Some strains within the P. fluorescens species complex produce phenazine derivatives, such as phenazine-1-carboxylic acid. These antimicrobial compounds are broadly inhibitory to numerous soil-dwelling plant pathogens and play a role in the ecological competence of phenazine-producing Pseudomonas spp. We assembled a collection encompassing 63 strains representative of the worldwide diversity of plant-beneficial phenazine-producing Pseudomonas spp. In this study, we report the sequencing of 58 complete genomes using PacBio RS II sequencing technology. Distributed among four subgroups within the P. fluorescens species complex, the diversity of our collection is reflected by the large pangenome which accounts for 25 413 protein-coding genes. We identified genes and clusters encoding for numerous phytobeneficial traits, including antibiotics, siderophores and cyclic lipopeptides biosynthesis, some of which were previously unknown in these microorganisms. Finally, we gained insight into the evolutionary history of the phenazine biosynthetic operon. Given its diverse genomic context, it is likely that this operon was relocated several times during Pseudomonas evolution. Our findings acknowledge the tremendous diversity of plant-beneficial phenazine-producing Pseudomonas spp., paving the way for comparative analyses to identify new genetic determinants involved in biocontrol, plant-growth promotion and rhizosphere competence. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.