September 22, 2019

Understanding explosive diversification through cichlid fish genomics.

Owing to their taxonomic, phenotypic, ecological and behavioural diversity and propensity for explosive diversification, the assemblages of cichlid fish in the East African Great Lakes Victoria, Malawi and Tanganyika are important role models in evolutionary biology. With the release of five reference genomes and many additional genomic resources, as well as the establishment of functional genomic tools, the cichlid system has fully entered the genomic era. The in-depth genomic exploration of the East African cichlid fauna – in combination with the examination of their ecology, morphology and behaviour – permits novel insights into the way organisms diversify.


September 22, 2019

How complete are “complete” genome assemblies? An avian perspective.

The genomics revolution has led to the sequencing of a large variety of nonmodel organisms, yielding what are often referred to as “whole” or “complete” genome assemblies. But how complete are these, really? Here, we use birds as an example for nonmodel vertebrates and find that, although suitable in principle for genomic studies, the current standard of short-read assemblies misses a significant proportion of the expected genome size (7% to 42%; mean 20 ± 9%). In particular, regions with strongly deviating nucleotide composition (e.g., guanine-cytosine-[GC]-rich) and regions highly enriched in repetitive DNA (e.g., transposable elements and satellite DNA) are usually underrepresented in assemblies. However, long-read sequencing technologies successfully characterize many of these underrepresented GC-rich or repeat-rich regions in several bird genomes. For instance, only ~2% of the expected total base pairs are missing in the latest chicken reference (galGal5). These assemblies still contain thousands of gaps (i.e., fragmented sequences) because some chromosomal structures (e.g., centromeres) likely contain arrays of repetitive DNA that are too long to bridge with currently available technologies. We discuss how to minimize the number of assembly gaps by combining the latest available technologies with complementary strengths. Finally, we emphasize the importance of knowing the location, size and potential content of assembly gaps when making population genetic inferences about adjacent genomic regions. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
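
As a rough illustration of the completeness figures quoted above, the missing proportion of a genome can be computed from an assembly's total length and an independent estimate of the expected genome size. The sketch below is not taken from the paper: the function name `missing_fraction` and the input sizes are hypothetical, chosen only to mirror the order of magnitude reported for a long-read assembly.

```python
def missing_fraction(assembled_bp: int, expected_bp: int) -> float:
    """Return the share of the expected genome size absent from an assembly."""
    if expected_bp <= 0:
        raise ValueError("expected_bp must be positive")
    # Clamp at zero: an assembly can exceed the size estimate,
    # for example when haplotypes are not collapsed.
    return max(expected_bp - assembled_bp, 0) / expected_bp


# Hypothetical sizes for illustration only, not measurements from the study.
expected = 1_000_000_000
assembled = 980_000_000
print(f"{missing_fraction(assembled, expected):.1%} of expected base pairs missing")
```

The expected size would have to come from an assembly-independent estimate, so the result is only as reliable as that estimate.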


September 22, 2019

Genomic analysis of Picochlorum species reveals how microalgae may adapt to variable environments.

Understanding how microalgae adapt to rapidly changing environments is not only important to science but can help clarify the potential impact of climate change on the biology of primary producers. We sequenced and analyzed the nuclear genome of multiple Picochlorum isolates (Chlorophyta) to elucidate strategies of environmental adaptation. It was previously found that coordinated gene regulation is involved in adaptation to salinity stress, and here we show that gene gain and loss also play key roles in adaptation. We determined the extent of horizontal gene transfer (HGT) from prokaryotes and its role in the origin of novel functions in the Picochlorum clade. HGT is an ongoing and dynamic process in this algal clade, with adaptation being driven by transfer, divergence, and loss. One HGT candidate that is differentially expressed under salinity stress is indolepyruvate decarboxylase, which is involved in the production of a plant auxin that mediates bacteria-diatom symbiotic interactions. Large differences in levels of heterozygosity were found in diploid haplotypes among Picochlorum isolates. Biallelic divergence was pronounced in P. oklahomensis (salt plains environment) when compared with its closely related sister taxon Picochlorum SENEW3 (brackish water environment), suggesting a role of diverged alleles in response to environmental stress. Our results elucidate how microbial eukaryotes with limited gene inventories expand habitat range from mesophilic to halophilic through allelic diversity, with minor but important contributions made by HGT. We also explore how the nature and quality of genome data may impact inference of nuclear ploidy.


September 22, 2019

Genomic evidence for asymmetric introgression by sexual selection in the common wall lizard.

Strongly selected characters can be transferred from one lineage to another with limited genetic exchange, resulting in asymmetric introgression and a mosaic genome in the receiving population. However, systems are rarely sufficiently well studied to link the pattern of introgression to its underlying process. Male common wall lizards in western Italy exhibit exaggeration of a suite of sexually selected characters that allow them to outcompete males from a distantly related lineage that lack these characters. This results in asymmetric hybridization and adaptive introgression of the suite of characters following secondary contact. We developed genomewide markers to infer the demographic history of gene flow between different genetic lineages, identify the spread of the sexually selected syndrome, and test the prediction that introgression should be asymmetric and heterogeneous across the genome. Our results show that secondary contact was accompanied by gene flow in both directions across most of the genome, but with approximately 3% of the genome showing highly asymmetric introgression in the predicted direction. Demographic simulations reveal that this asymmetric gene flow is more recent than the initial secondary contact, and the data suggest that the exaggerated male sexual characters originated within the Italian lineage and subsequently spread throughout this lineage before eventually reaching the contact zone. These results demonstrate that sexual selection can cause a suite of characters to spread throughout both closely and distantly related lineages with limited gene flow across the genome at large. © 2018 John Wiley & Sons Ltd.


September 22, 2019

Genome-scale analysis of Acetobacterium bakii reveals the cold adaptation of psychrotolerant acetogens by post-transcriptional regulation.

Acetogens synthesize acetyl-CoA via CO2 or CO fixation, producing organic compounds. Despite their ecological and industrial importance, their transcriptional and post-transcriptional regulation has not been systematically studied. With completion of the genome sequence of Acetobacterium bakii (4.28-Mb), we measured changes in the transcriptome of this psychrotolerant acetogen in response to temperature variations under autotrophic and heterotrophic growth conditions. Unexpectedly, acetogenesis genes were highly up-regulated at low temperatures under heterotrophic, as well as autotrophic, growth conditions. To mechanistically understand the transcriptional regulation of acetogenesis genes via changes in RNA secondary structures of 5′-untranslated regions (5′-UTRs), the primary transcriptome was experimentally determined, and 1379 transcription start sites (TSSs) and 1100 5′-UTRs were found. Interestingly, acetogenesis genes contained longer 5′-UTRs with lower RNA-folding free energy than other genes, revealing that the 5′-UTRs control the RNA abundance of the acetogenesis genes under low temperature conditions. Our findings suggest that post-transcriptional regulation via RNA conformational changes of 5′-UTRs is necessary for cold-adaptive acetogenesis. © 2018 Shin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.


September 22, 2019

Investigation of inter- and intraspecies variation through genome sequencing of Aspergillus section Nigri.

Aspergillus section Nigri comprises filamentous fungi relevant to biomedicine, bioenergy, health, and biotechnology. To learn more about what genetically sets these species apart, as well as about potential applications in biotechnology and biomedicine, we sequenced 23 genomes de novo, forming a full genome compendium for the section (26 species), as well as 6 Aspergillus niger isolates. This allowed us to quantify both inter- and intraspecies genomic variation. We further predicted 17,903 carbohydrate-active enzymes and 2,717 secondary metabolite gene clusters, which we condensed into 455 distinct families corresponding to compound classes, 49% of which are only found in single species. We performed metabolomics and genetic engineering to correlate genotypes to phenotypes, as demonstrated for the metabolite aurasperone, and by heterologous transfer of citrate production to Aspergillus nidulans. Experimental and computational analyses showed that both secondary metabolism and regulation are key factors that are significant in the delineation of Aspergillus species.


September 22, 2019

Genomic and genetic insights into a cosmopolitan fungus, Paecilomyces variotii (Eurotiales).

Species in the genus Paecilomyces, a member of the fungal order Eurotiales, are ubiquitous in nature and impact a variety of human endeavors. Here, the biology of one common species, Paecilomyces variotii, was explored using genomics and functional genetics. Sequencing the genome of two isolates revealed key genome and gene features in this species. A striking feature of the genome was its two-part nature, featuring large stretches of DNA with normal GC content separated by AT-rich regions, a hallmark of many plant-pathogenic fungal genomes. These AT-rich regions appeared to have been mutated by repeat-induced point (RIP) mutations. We developed methods for genetic transformation of P. variotii, including forward and reverse genetics as well as crossing techniques. Using transformation and crossing, RIP activity was identified, demonstrating for the first time that RIP is an active process within the order Eurotiales. A consequence of RIP is likely reflected in a reduction in the number of genes within gene families, such as those involved in cell wall degradation, and in growth limitations of P. variotii on diverse carbon sources. Furthermore, using these transformation tools we characterized a conserved protein containing a domain of unknown function (DUF1212) and discovered it is involved in pigmentation.


September 21, 2019

in silico Whole Genome Sequencer & Analyzer (iWGS): a computational pipeline to guide the design and analysis of de novo genome sequencing studies.

The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms has augmented the complexity of de novo genome sequencing projects in non-model organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimental design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to give the user great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies and popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS. Copyright © 2016 Author et al.


September 21, 2019

Mistranslation drives the evolution of robustness in TEM-1 β-lactamase.

How biological systems such as proteins achieve robustness to ubiquitous perturbations is a fundamental biological question. Such perturbations include errors that introduce phenotypic mutations into nascent proteins during the translation of mRNA. These errors are remarkably frequent. They are also costly, because they reduce protein stability and help create toxic misfolded proteins. Adaptive evolution might reduce these costs of protein mistranslation by two principal mechanisms. The first increases the accuracy of translation via synonymous “high fidelity” codons at especially sensitive sites. The second increases the robustness of proteins to phenotypic errors via amino acids that increase protein stability. To study how these mechanisms are exploited by populations evolving in the laboratory, we evolved the antibiotic resistance gene TEM-1 in Escherichia coli hosts with either normal or high rates of mistranslation. We analyzed TEM-1 populations that evolved under relaxed and stringent selection for antibiotic resistance by single molecule real-time sequencing. Under relaxed selection, mistranslating populations reduce mistranslation costs by reducing TEM-1 expression. Under stringent selection, they efficiently purge destabilizing amino acid changes. More importantly, they accumulate stabilizing amino acid changes rather than synonymous changes that increase translational accuracy. In the large populations we study, and on short evolutionary timescales, the path of least resistance in TEM-1 evolution consists of reducing the consequences of translation errors rather than the errors themselves.


September 21, 2019

The kinetoplastid-infecting Bodo saltans virus (BsV), a window into the most abundant giant viruses in the sea.

Giant viruses are ecologically important players in aquatic ecosystems that have challenged concepts of what constitutes a virus. Herein, we present the giant Bodo saltans virus (BsV), the first characterized representative of the most abundant group of giant viruses in ocean metagenomes, and the first isolate of a klosneuvirus, a subgroup of the Mimiviridae proposed from metagenomic data. BsV infects an ecologically important microzooplankton, the kinetoplastid Bodo saltans. Its 1.39 Mb genome encodes 1227 predicted ORFs, including a complex replication machinery. Yet, much of its translational apparatus has been lost, including all tRNAs. Essential genes are invaded by homing endonuclease-encoding self-splicing introns that may defend against competing viruses. Putative anti-host factors show extensive gene duplication via a genomic accordion indicating an ongoing evolutionary arms race and highlighting the rapid evolution and genomic plasticity that has led to genome gigantism and the enigma that is giant viruses. © 2018, Deeg et al.


September 21, 2019

A Sequel to Sanger: amplicon sequencing that scales.

Although high-throughput sequencers (HTS) have largely displaced their Sanger counterparts, the short read lengths and high error rates of most platforms constrain their utility for amplicon sequencing. The present study tests the capacity of single molecule, real-time (SMRT) sequencing implemented on the SEQUEL platform to overcome these limitations, employing 658 bp amplicons of the mitochondrial cytochrome c oxidase I gene as a model system. By examining templates from more than 5000 species and 20,000 specimens, the performance of SMRT sequencing was tested with amplicons showing wide variation in GC composition and varied sequence attributes. SMRT and Sanger sequences were very similar, but SMRT sequencing provided more complete coverage, especially for amplicons with homopolymer tracts. Because it can characterize amplicon pools from 10,000 DNA extracts in a single run, the SEQUEL can greatly reduce sequencing costs in comparison to first (Sanger) and second generation platforms (Illumina, Ion). SMRT analysis generates high-fidelity sequences from amplicons with varying GC content and is resilient to homopolymer tracts. Analytical costs are low, substantially less than those for first or second generation sequencers. When implemented on the SEQUEL platform, SMRT analysis enables massive amplicon characterization because each instrument can recover sequences from more than 5 million DNA extracts a year.
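
The two throughput figures in this abstract can be related by simple arithmetic. The sketch below is not from the study: it only divides the stated annual capacity by the stated per-run capacity, and the function name `runs_needed` is hypothetical.

```python
def runs_needed(extracts_per_year: int, extracts_per_run: int) -> int:
    """Number of runs required to process a yearly total of DNA extracts."""
    if extracts_per_run <= 0:
        raise ValueError("extracts_per_run must be positive")
    # Integer ceiling division, so a partly filled final run still counts as a run.
    return -(-extracts_per_year // extracts_per_run)


# Figures quoted in the abstract: 10,000 extracts per run, 5 million per year.
print(runs_needed(5_000_000, 10_000))  # 500
```

At those figures an instrument would need about 500 runs a year, so the stated annual capacity implies more than one run per day on average.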

