Menu
July 7, 2019

Untangling heteroplasmy, structure, and evolution of an atypical mitochondrial genome by PacBio Sequencing.

The highly compact mitochondrial (mt) genome of terrestrial isopods (Oniscidae) presents two unusual features. First, several loci can individually encode two tRNAs, thanks to single nucleotide polymorphisms at anticodon sites. Within-individual variation (heteroplasmy) at these loci is thought to have been maintained for millions of years because individuals that do not carry all tRNA genes die, resulting in strong balancing selection. Second, the oniscid mtDNA genome comes in two conformations: a ~14 kb linear monomer and a ~28 kb circular dimer comprising two monomer units fused in palindrome. We hypothesized that heteroplasmy actually results from two genome units of the same dimeric molecule carrying different tRNA genes at mirrored loci. This hypothesis, however, contradicts the earlier proposition that dimeric molecules result from the replication of linear monomers-a process that should yield totally identical genome units within a dimer. To solve this contradiction, we used the SMRT (PacBio) technology to sequence mirrored tRNA loci in single dimeric molecules. We show that dimers do present different tRNA genes at mirrored loci; thus covalent linkage, rather than balancing selection, maintains vital variation at anticodons. We also leveraged unique features of the SMRT technology to detect linear monomers closed by hairpins and carrying noncomplementary bases at anticodons. These molecules contain the necessary information to encode two tRNAs at the same locus, and suggest new mechanisms of transition between linear and circular mtDNA. Overall, our analyses clarify the evolution of an atypical mt genome where dimerization counterintuitively enabled further mtDNA compaction. Copyright © 2017 by the Genetics Society of America.


July 7, 2019

Genome sequencing reveals the origin of the allotetraploid Arabidopsis suecica.

Polyploidy is an example of instantaneous speciation when it involves the formation of a new cytotype that is incompatible with the parental species. Because new polyploid individuals are likely to be rare, establishment of a new species is unlikely unless polyploids are able to reproduce through self-fertilization (selfing), or asexually. Conversely, selfing (or asexuality) makes it possible for polyploid species to originate from a single individual-a bona fide speciation event. The extent to which this happens is not known. Here, we consider the origin of Arabidopsis suecica, a selfing allopolyploid between Arabidopsis thaliana and Arabidopsis arenosa, which has hitherto been considered to be an example of a unique origin. Based on whole-genome re-sequencing of 15 natural A. suecica accessions, we identify ubiquitous shared polymorphism with the parental species, and hence conclusively reject a unique origin in favor of multiple founding individuals. We further estimate that the species originated after the last glacial maximum in Eastern Europe or central Eurasia (rather than Sweden, as the name might suggest). Finally, annotation of the self-incompatibility loci in A. suecica revealed that both loci carry non-functional alleles. The locus inherited from the selfing A. thaliana is fixed for an ancestral non-functional allele, whereas the locus inherited from the outcrossing A. arenosa is fixed for a novel loss-of-function allele. Furthermore, the allele inherited from A. thaliana is predicted to transcriptionally silence the allele inherited from A. arenosa, suggesting that loss of self-incompatibility may have been instantaneous.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements.We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements.Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ~22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements.We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats.© 2017 Botanical Society of America.


July 7, 2019

Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters.

Trichoderma reesei (Ascomycota, Pezizomycotina) QM6a is a model fungus for a broad spectrum of physiological phenomena, including plant cell wall degradation, industrial production of enzymes, light responses, conidiation, sexual development, polyketide biosynthesis, and plant-fungal interactions. The genomes of QM6a and its high enzyme-producing mutants have been sequenced by second-generation-sequencing methods and are publicly available from the Joint Genome Institute. While these genome sequences have offered useful information for genomic and transcriptomic studies, their limitations and especially their short read lengths make them poorly suited for some particular biological problems, including assembly, genome-wide determination of chromosome architecture, and genetic modification or engineering.We integrated Pacific Biosciences and Illumina sequencing platforms for the highest-quality genome assembly yet achieved, revealing seven telomere-to-telomere chromosomes (34,922,528 bp; 10877 genes) with 1630 newly predicted genes and >1.5 Mb of new sequences. Most new sequences are located on AT-rich blocks, including 7 centromeres, 14 subtelomeres, and 2329 interspersed AT-rich blocks. The seven QM6a centromeres separately consist of 24 conserved repeats and 37 putative centromere-encoded genes. These findings open up a new perspective for future centromere and chromosome architecture studies. Next, we demonstrate that sexual crossing readily induced cytosine-to-thymine point mutations on both tandem and unlinked duplicated sequences. We also show by bioinformatic analysis that T. reesei has evolved a robust repeat-induced point mutation (RIP) system to accumulate AT-rich sequences, with longer AT-rich blocks having more RIP mutations. The widespread distribution of AT-rich blocks correlates genome-wide partitions with gene clusters, explaining why clustering of genes has been reported to not influence gene expression in T. reesei.Compartmentation of ancestral gene clusters by AT-rich blocks might promote flexibilities that are evolutionarily advantageous in this fungus’ soil habitats and other natural environments. Our analyses, together with the complete genome sequence, provide a better blueprint for biotechnological and industrial applications.


July 7, 2019

Novel chaperonins are prevalent in the virioplankton and demonstrate links to viral biology and ecology.

Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70?kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells.


July 7, 2019

Unravelling the complete genome of Archangium gephyra DSM 2261T and evolutionary insights into myxobacterial chitinases.

Family Cystobacteraceae is a group of eubacteria within order Myxococcales and class Deltaproteobacteria that includes more than 20 species belonging to 6 genera, that is, Angiococcus, Archangium, Cystobacter, Hyalangium, Melittangium, and Stigmatella. Earlier these members have been classified based on chitin degrading efficiency such as Cystobacter fuscus and Stigmatella aurantiaca, which are efficient chitin degraders, C. violaceus a partial chitin degrader and Archangium gephyra a chitin nondegrader. Here we report the 12.5 Mbp complete genome of A. gephyra DSM 2261T and compare it with four available genomes within the family Cystobacteraceae. Phylogeny and DNA-DNA hybridization studies reveal that A. gephyra is closest to Angiococcus disciformis, C. violaceus and C. ferrugineus, which are partial chitin degraders of the family Cystobacteraceae. Homology studies reveal the conservation of approximately half of the proteins in these genomes, with about 15% unique proteins in each genome. The total carbohydrate-active enzymes (CAZome) analysis reveals the presence of one GH18 chitinase in the A. gephyra genome whereas eight copies are present in C. fuscus and S. aurantiaca. Evolutionary studies of myxobacterial GH18 chitinases reveal that most of them are likely related to Terrabacteria and Proteobacteria whereas the Archangium GH18 homolog shares maximum similarity with those of chitin nondegrading Acidobacteria.© The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Evolution of the wheat blast fungus through functional losses in a host specificity determinant.

Wheat blast first emerged in Brazil in the mid-1980s and has recently caused heavy crop losses in Asia. Here we show how this devastating pathogen evolved in Brazil. Genetic analysis of host species determinants in the blast fungus resulted in the cloning of avirulence genes PWT3 and PWT4, whose gene products elicit defense in wheat cultivars containing the corresponding resistance genes Rwt3 and Rwt4 Studies on avirulence and resistance gene distributions, together with historical data on wheat cultivation in Brazil, suggest that wheat blast emerged due to widespread deployment of rwt3 wheat (susceptible to Lolium isolates), followed by the loss of function of PWT3 This implies that the rwt3 wheat served as a springboard for the host jump to common wheat. Copyright © 2017, American Association for the Advancement of Science.


July 7, 2019

Discovery of chemoautotrophic symbiosis in the giant shipworm Kuphus polythalamia (Bivalvia: Teredinidae) extends wooden-steps theory.

The “wooden-steps” hypothesis [Distel DL, et al. (2000) Nature 403:725-726] proposed that large chemosynthetic mussels found at deep-sea hydrothermal vents descend from much smaller species associated with sunken wood and other organic deposits, and that the endosymbionts of these progenitors made use of hydrogen sulfide from biogenic sources (e.g., decaying wood) rather than from vent fluids. Here, we show that wood has served not only as a stepping stone between habitats but also as a bridge between heterotrophic and chemoautotrophic symbiosis for the giant mud-boring bivalve Kuphus polythalamia This rare and enigmatic species, which achieves the greatest length of any extant bivalve, is the only described member of the wood-boring bivalve family Teredinidae (shipworms) that burrows in marine sediments rather than wood. We show that K. polythalamia harbors sulfur-oxidizing chemoautotrophic (thioautotrophic) bacteria instead of the cellulolytic symbionts that allow other shipworm species to consume wood as food. The characteristics of its symbionts, its phylogenetic position within Teredinidae, the reduction of its digestive system by comparison with other family members, and the loss of morphological features associated with wood digestion indicate that K. polythalamia is a chemoautotrophic bivalve descended from wood-feeding (xylotrophic) ancestors. This is an example in which a chemoautotrophic endosymbiosis arose by displacement of an ancestral heterotrophic symbiosis and a report of pure culture of a thioautotrophic endosymbiont.


July 7, 2019

Repeated divergent selection on pigmentation genes in a rapid finch radiation.

Instances of recent and rapid speciation are suitable for associating phenotypes with their causal genotypes, especially if gene flow homogenizes areas of the genome that are not under divergent selection. We study a rapid radiation of nine sympatric bird species known as capuchino seedeaters, which are differentiated in sexually selected characters of male plumage and song. We sequenced the genomes of a phenotypically diverse set of species to search for differentiated genomic regions. Capuchinos show differences in a small proportion of their genomes, yet selection has acted independently on the same targets in different members of this radiation. Many divergent regions contain genes involved in the melanogenesis pathway, with the strongest signal originating from putative regulatory regions. Selection has acted on these same genomic regions in different lineages, likely shaping the evolution of cis-regulatory elements, which control how more conserved genes are expressed and thereby generate diversity in classically sexually selected traits.


July 7, 2019

A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana.

The mycalesine butterfly Bicyclus anynana , the ‘Squinting bush brown’, is a model organism in the study of lepidopteran ecology, development and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species.Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology. 128 Gb raw Illumina data were filtered to 124 Gb and assembled to a final size of 475 Mb (~260X assembly coverage). Contigs were scaffolded using mate-pair, transcriptome and PacBio data into 10,800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements, and encodes a total of 22,642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes.We report a high-quality draft genome sequence for Bicyclus anynana . The genome assembly and annotated gene models are available at LepBase ( http://ensembl.lepbase.org/index.html ).


July 7, 2019

The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole-genome sequencing

Mangroves invade some very marginal habitats for woody plants—at the interface between land and sea. Since mangroves anchor tropical coastal communities globally, their origin, diversification and adaptation are of scientific significance, particularly at a time of global climate change. In this study, a combination of single-molecule long reads and the more conventional short reads are generated from Rhizophora apiculata for the de novo assembly of its genome to a near chromosome level. The longest scaffold, N50 and N90 for the R. apiculata genome, are 13.3 Mb, 5.4 Mb and 1.0 Mb, respectively. Short reads for the genomes and transcriptomes of eight related species are also generated. We find that the ancestor of Rhizophoreae experienced a whole-genome duplication ~70 Myrs ago, which is followed rather quickly by colonization and species diversification. Mangroves exhibit pan-exome modifications of amino acid (AA) usage as well as unusual AA substitutions among closely related species. The usage and substitution of AAs, unique among plants surveyed, is correlated with the rapid evolution of proteins in mangroves. A small subset of these substitutions is associated with mangroves’ highly specialized traits (vivipary and red bark) thought to be adaptive in the intertidal habitats. Despite the many adaptive features, mangroves are among the least genetically diverse plants, likely the result of continual habitat turnovers caused by repeated rises and falls of sea level in the geologically recent past. Mangrove genomes thus inform about their past evolutionary success as well as portend a possibly difficult future.


July 7, 2019

Tandem duplications lead to novel expression patterns through exon shuffling in Drosophila yakuba.

One common hypothesis to explain the impacts of tandem duplications is that whole gene duplications commonly produce additive changes in gene expression due to copy number changes. Here, we use genome wide RNA-seq data from a population sample of Drosophila yakuba to test this ‘gene dosage’ hypothesis. We observe little evidence of expression changes in response to whole transcript duplication capturing 5′ and 3′ UTRs. Among whole gene duplications, we observe evidence that dosage sharing across copies is likely to be common. The lack of expression changes after whole gene duplication suggests that the majority of genes are subject to tight regulatory control and therefore not sensitive to changes in gene copy number. Rather, we observe changes in expression level due to both shuffling of regulatory elements and the creation of chimeric structures via tandem duplication. Additionally, we observe 30 de novo gene structures arising from tandem duplications, 23 of which form with expression in the testes. Thus, the value of tandem duplications is likely to be more intricate than simple changes in gene dosage. The common regulatory effects from chimeric gene formation after tandem duplication may explain their contribution to genome evolution.


July 7, 2019

A large gene family in fission yeast encodes spore killers that subvert Mendel’s law.

Spore killers in fungi are selfish genetic elements that distort Mendelian segregation in their favor. It remains unclear how many species harbor them and how diverse their mechanisms are. Here, we discover two spore killers from a natural isolate of the fission yeast Schizosaccharomyces pombe. Both killers belong to the previously uncharacterized wtf gene family with 25 members in the reference genome. These two killers act in strain-background-independent and genome-location-independent manners to perturb the maturation of spores not inheriting them. Spores carrying one killer are protected from its killing effect but not that of the other killer. The killing and protecting activities can be uncoupled by mutation. The numbers and sequences of wtf genes vary considerably between S. pombe isolates, indicating rapid divergence. We propose that wtf genes contribute to the extensive intraspecific reproductive isolation in S. pombe, and represent ideal models for understanding how segregation-distorting elements act and evolve.


July 7, 2019

Evolutionary strata on young mating-type chromosomes despite the lack of sexual antagonism.

Sex chromosomes can display successive steps of recombination suppression known as “evolutionary strata,” which are thought to result from the successive linkage of sexually antagonistic genes to sex-determining genes. However, there is little evidence to support this explanation. Here we investigate whether evolutionary strata can evolve without sexual antagonism using fungi that display suppressed recombination extending beyond loci determining mating compatibility despite lack of male/female roles associated with their mating types. By comparing full-length chromosome assemblies from five anther-smut fungi with or without recombination suppression in their mating-type chromosomes, we inferred the ancestral gene order and derived chromosomal arrangements in this group. This approach shed light on the chromosomal fusion underlying the linkage of mating-type loci in fungi and provided evidence for multiple clearly resolved evolutionary strata over a range of ages (0.9-2.1 million years) in mating-type chromosomes. Several evolutionary strata did not include genes involved in mating-type determination. The existence of strata devoid of mating-type genes, despite the lack of sexual antagonism, calls for a unified theory of sex-related chromosome evolution, incorporating, for example, the influence of partially linked deleterious mutations and the maintenance of neutral rearrangement polymorphism due to balancing selection on sexes and mating types.


July 7, 2019

PipeCraft: Flexible open-source toolkit for bioinformatics analysis of custom high-throughput amplicon sequencing data.

High-throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in public and private sector. These methods provide vast amount of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis for more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user-friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24 hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable.© 2017 John Wiley & Sons Ltd.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.