Menu
July 7, 2019

Genome sequencing reveals the origin of the allotetraploid Arabidopsis suecica.

Polyploidy is an example of instantaneous speciation when it involves the formation of a new cytotype that is incompatible with the parental species. Because new polyploid individuals are likely to be rare, establishment of a new species is unlikely unless polyploids are able to reproduce through self-fertilization (selfing), or asexually. Conversely, selfing (or asexuality) makes it possible for polyploid species to originate from a single individual-a bona fide speciation event. The extent to which this happens is not known. Here, we consider the origin of Arabidopsis suecica, a selfing allopolyploid between Arabidopsis thaliana and Arabidopsis arenosa, which has hitherto been considered to be an example of a unique origin. Based on whole-genome re-sequencing of 15 natural A. suecica accessions, we identify ubiquitous shared polymorphism with the parental species, and hence conclusively reject a unique origin in favor of multiple founding individuals. We further estimate that the species originated after the last glacial maximum in Eastern Europe or central Eurasia (rather than Sweden, as the name might suggest). Finally, annotation of the self-incompatibility loci in A. suecica revealed that both loci carry non-functional alleles. The locus inherited from the selfing A. thaliana is fixed for an ancestral non-functional allele, whereas the locus inherited from the outcrossing A. arenosa is fixed for a novel loss-of-function allele. Furthermore, the allele inherited from A. thaliana is predicted to transcriptionally silence the allele inherited from A. arenosa, suggesting that loss of self-incompatibility may have been instantaneous.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

ConcatSeq: A method for increasing throughput of single molecule sequencing by concatenating short DNA fragments.

Single molecule sequencing (SMS) platforms enable base sequences to be read directly from individual strands of DNA in real-time. Though capable of long read lengths, SMS platforms currently suffer from low throughput compared to competing short-read sequencing technologies. Here, we present a novel strategy for sequencing library preparation, dubbed ConcatSeq, which increases the throughput of SMS platforms by generating long concatenated templates from pools of short DNA molecules. We demonstrate adaptation of this technique to two target enrichment workflows, commonly used for oncology applications, and feasibility using PacBio single molecule real-time (SMRT) technology. Our approach is capable of increasing the sequencing throughput of the PacBio RSII platform by more than five-fold, while maintaining the ability to correctly call allele frequencies of known single nucleotide variants. ConcatSeq provides a versatile new sample preparation tool for long-read sequencing technologies.


July 7, 2019

The MHC locus and genetic susceptibility to autoimmune and infectious diseases.

In the past 50 years, variants in the major histocompatibility complex (MHC) locus, also known as the human leukocyte antigen (HLA), have been reported as major risk factors for complex diseases. Recent advances, including large genetic screens, imputation, and analyses of non-additive and epistatic effects, have contributed to a better understanding of the shared and specific roles of MHC variants in different diseases. We review these advances and discuss the relationships between MHC variants involved in autoimmune and infectious diseases. Further work in this area will help to distinguish between alternative hypotheses for the role of pathogens in autoimmune disease development.


July 7, 2019

The genetic basis of resistance and matching-allele interactions of a host-parasite system: The Daphnia magna-Pasteuria ramosa model.

Negative frequency-dependent selection (NFDS) is an evolutionary mechanism suggested to govern host-parasite coevolution and the maintenance of genetic diversity at host resistance loci, such as the vertebrate MHC and R-genes in plants. Matching-allele interactions of hosts and parasites that prevent the emergence of host and parasite genotypes that are universally resistant and infective are a genetic mechanism predicted to underpin NFDS. The underlying genetics of matching-allele interactions are unknown even in host-parasite systems with empirical support for coevolution by NFDS, as is the case for the planktonic crustacean Daphnia magna and the bacterial pathogen Pasteuria ramosa. We fine-map one locus associated with D. magna resistance to P. ramosa and genetically characterize two haplotypes of the Pasteuria resistance (PR-) locus using de novo genome and transcriptome sequencing. Sequence comparison of PR-locus haplotypes finds dramatic structural polymorphisms between PR-locus haplotypes including a large portion of each haplotype being composed of non-homologous sequences resulting in haplotypes differing in size by 66 kb. The high divergence of PR-locus haplotypes suggest a history of multiple, diverse and repeated instances of structural mutation events and restricted recombination. Annotation of the haplotypes reveals striking differences in gene content. In particular, a group of glycosyltransferase genes that is present in the susceptible but absent in the resistant haplotype. Moreover, in natural populations, we find that the PR-locus polymorphism is associated with variation in resistance to different P. ramosa genotypes, pointing to the PR-locus polymorphism as being responsible for the matching-allele interactions that have been previously described for this system. Our results conclusively identify a genetic basis for the matching-allele interaction observed in a coevolving host-parasite system and provide a first insight into its molecular basis.


July 7, 2019

Genome-wide identification of the mutation underlying fleece variation and discriminating ancestral hairy species from modern woolly sheep.

The composition and structure of fleece variation observed in mammals is a consequence of a strong selective pressure for fiber production after domestication. In sheep, fleece variation discriminates ancestral species carrying a long and hairy fleece from modern domestic sheep (Ovis aries) owning a short and woolly fleece. Here, we report that the “woolly” allele results from the insertion of an antisense EIF2S2 retrogene (called asEIF2S2) into the 3′ UTR of the IRF2BP2 gene leading to an abnormal IRF2BP2 transcript. We provide evidence that this chimeric IRF2BP2/asEIF2S2 messenger 1) targets the genuine sense EIF2S2 RNA and 2) creates a long endogenous double-stranded RNA which alters the expression of both EIF2S2 and IRF2BP2 mRNA. This represents a unique example of a phenotype arising via a RNA-RNA hybrid, itself generated through a retroposition mechanism. Our results bring new insights on the sheep population history thanks to the identification of the molecular origin of an evolutionary phenotypic variation.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

PipeCraft: Flexible open-source toolkit for bioinformatics analysis of custom high-throughput amplicon sequencing data.

High-throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in public and private sector. These methods provide vast amount of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis for more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user-friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24 hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable.© 2017 John Wiley & Sons Ltd.


July 7, 2019

Generation of a collection of mutant tomato lines using pooled CRISPR libraries.

The high efficiency of clustered regularly interspaced short palindromic repeats (CRISPR)-mediated mutagenesis in plants enables the development of high-throughput mutagenesis strategies. By transforming pooled CRISPR libraries into tomato (Solanum lycopersicum), collections of mutant lines were generated with minimal transformation attempts and in a relatively short period of time. Identification of the targeted gene(s) was easily determined by sequencing the incorporated guide RNA(s) in the primary transgenic events. From a single transformation with a CRISPR library targeting the immunity-associated leucine-rich repeat subfamily XII genes, heritable mutations were recovered in 15 of the 54 genes targeted. To increase throughput, a second CRISPR library was made containing three guide RNAs per construct to target 18 putative transporter genes. This resulted in stable mutations in 15 of the 18 targeted genes, with some primary transgenic plants having as many as five mutated genes. Furthermore, the redundancy in this collection of plants allowed for the association of aberrant T0 phenotypes with the underlying targeted genes. Plants with mutations in a homolog of an Arabidopsis (Arabidopsis thaliana) boron efflux transporter displayed boron deficiency phenotypes. The strategy described here provides a technically simple yet high-throughput approach for generating a collection of lines with targeted mutations and should be applicable to any plant transformation system.© 2017 American Society of Plant Biologists. All Rights Reserved.


July 7, 2019

Human to yeast pathway transplantation: cross-species dissection of the adenine de novo pathway regulatory node

Pathway transplantation from one organism to another represents a means to a more complete understanding of a biochemical or regulatory process. The purine biosynthesis pathway, a core metabolic function, was transplanted from human to yeast. We replaced the entire Saccharomyces cerevisiae adenine de novo pathway with the cognate human pathway components. A yeast strain was humanized for the full pathway by deleting all relevant yeast genes completely and then providing the human pathway in trans using a neochromosome expressing the human protein coding regions under the transcriptional control of their cognate yeast promoters and terminators. The humanized yeast strain grows in the absence of adenine, indicating complementation of the yeast pathway by the full set of human proteins. While the strain with the neochromosome is indeed prototrophic, it grows slowly in the absence of adenine. Dissection of the phenotype revealed that the human ortholog of ADE4, PPAT, shows only partial complementation. We have used several strategies to understand this phenotype, that point to PPAT/ADE4 as the central regulatory node. Pathway metabolites are responsible for regulating PPATs protein abundance through transcription and proteolysis as well as its enzymatic activity by allosteric regulation in these yeast cells. Extensive phylogenetic analysis of PPATs from diverse organisms hints at adaptations of the enzyme-level regulation to the metabolite levels in the organism. Finally, we isolated specific mutations in PPAT as well as in other genes involved in the purine metabolic network that alleviate incomplete complementation by PPAT and provide further insight into the complex regulation of this critical metabolic pathway.


July 7, 2019

Sequencing the genomic regions flanking S-linked PvGLO sequences confirms the presence of two GLO loci, one of which lies adjacent to the style-length determinant gene CYP734A50.

Primula vulgaris contains two GLOBOSA loci, one located adjacent to the style length determinant gene CYP734A50 which lies within the S -locus. Using a combination of BAC walking and PacBio sequencing, we have sequenced two substantial genomic contigs in and around the S-locus of Primula vulgaris. Using these data, we were able to demonstrate that two alleles of PvGlo (P) as well as PvGlo (T) can be present in the genome of a single plant, providing empirical evidence that these two forms of the MADS-box gene GLOBOSA are separate loci and not allelic as previously reported. We propose they should be renamed PvGLO1 and PvGLO2. BAC contigs extending from each GLOBOSA locus were identified and fully sequenced. No homologous genes were found between the contigs other than the GLOBOSA genes themselves, consistent with their identity as separate loci. Exons of the recently identified style-length determinant gene CYP734A50 were identified on one end of the contig containing PvGLO2 and these genes are adjacent in the genome, suggesting that PvGLO2 lies either within or at least very close to the S-locus. Current evidence suggests that both CYP734A50 and GLO2 are specific to the S-morph mating type and are hemizygous rather than heterozygous in the Primula genome. This finding contrasts classical models of the HSI locus, which propose that components of the S-locus are allelic, suggesting that these models may need to be reconsidered.


July 7, 2019

Long-read sequencing offers path to more accurate drug metabolism profiles

In the complex drug discovery process, one of the looming questions for any new compound is how it will be metabolised in a human bodyWhi|e there are several methods for evaluating this, one of the most common involves CYP2D6,the enzyme encoded by the cytochrome P450—2D6 gene.This enzyme is involved in metabolising a quarter of all commonly used medications, making it an important target for ADME and pharmacogenomics studies. It is known to activate some drugs and to play a role in the deactivation or excretion of others.


July 7, 2019

The evolution of the natural killer complex; a comparison between mammals using new high-quality genome assemblies and targeted annotation.

Natural killer (NK) cells are a diverse population of lymphocytes with a range of biological roles including essential immune functions. NK cell diversity is in part created by the differential expression of cell surface receptors which modulate activation and function, including multiple subfamilies of C-type lectin receptors encoded within the NK complex (NKC). Little is known about the gene content of the NKC beyond rodent and primate lineages, other than it appears to be extremely variable between mammalian groups. We compared the NKC structure between mammalian species using new high-quality draft genome assemblies for cattle and goat; re-annotated sheep, pig, and horse genome assemblies; and the published human, rat, and mouse lemur NKC. The major NKC genes are largely in the equivalent positions in all eight species, with significant independent expansions and deletions between species, allowing us to propose a model for NKC evolution during mammalian radiation. The ruminant species, cattle and goats, have independently evolved a second KLRC locus flanked by KLRA and KLRJ, and a novel KLRH-like gene has acquired an activating tail. This novel gene has duplicated several times within cattle, while other activating receptor genes have been selectively disrupted. Targeted genome enrichment in cattle identified varying levels of allelic polymorphism between the NKC genes concentrated in the predicted extracellular ligand-binding domains. This novel recombination and allelic polymorphism is consistent with NKC evolution under balancing selection, suggesting that this diversity influences individual immune responses and may impact on differential outcomes of pathogen infection and vaccination.


July 7, 2019

Regulation of hetDNA length during mitotic double-strand break repair in yeast.

Heteroduplex DNA (hetDNA) is a key molecular intermediate during the repair of mitotic double-strand breaks by homologous recombination, but its relationship to 5′ end resection and/or 3′ end extension is poorly understood. In the current study, we examined how perturbations in these processes affect the hetDNA profile associated with repair of a defined double-strand break (DSB) by the synthesis-dependent strand-annealing (SDSA) pathway. Loss of either the Exo1 or Sgs1 long-range resection pathway significantly shortened hetDNA, suggesting that these pathways normally collaborate during DSB repair. In addition, altering the processivity or proofreading activity of DNA polymerase d shortened hetDNA length or reduced break-adjacent mismatch removal, respectively, demonstrating that this is the primary polymerase that extends both 3′ ends. Data are most consistent with the extent of DNA synthesis from the invading end being the primary determinant of hetDNA length during SDSA. Copyright © 2017 Elsevier Inc. All rights reserved.


July 7, 2019

High-quality genome sequence of the radioresistant bacterium Deinococcus ficus KS 0460.

The genetic platforms of Deinococcus species remain the only systems in which massive ionizing radiation (IR)-induced genome damage can be investigated in vivo at exposures commensurate with cellular survival. We report the whole genome sequence of the extremely IR-resistant rod-shaped bacterium Deinococcus ficus KS 0460 and its phenotypic characterization. Deinococcus ficus KS 0460 has been studied since 1987, first under the name Deinobacter grandis, then Deinococcus grandis. The D. ficus KS 0460 genome consists of a 4.019 Mbp sequence (69.7% GC content and 3894 predicted genes) divided into six genome partitions, five of which are confirmed to be circular. Circularity was determined manually by mate pair linkage. Approximately 76% of the predicted proteins contained identifiable Pfam domains and 72% were assigned to COGs. Of all D. ficus KS 0460 proteins, 79% and 70% had homologues in Deinococcus radiodurans ATCC BAA-816 and Deinococcus geothermalis DSM 11300, respectively. The most striking differences between D. ficus KS 0460 and D. radiodurans BAA-816 identified by the comparison of the KEGG pathways were as follows: (i) D. ficus lacks nine enzymes of purine degradation present in D. radiodurans, and (ii) D. ficus contains eight enzymes involved in nitrogen metabolism, including nitrate and nitrite reductases, that D. radiodurans lacks. Moreover, genes previously considered to be important to IR resistance are missing in D. ficus KS 0460, namely, for the Mn-transporter nramp, and proteins DdrF, DdrJ and DdrK, all of which are also missing in Deinococcus deserti. Otherwise, D. ficus KS 0460 exemplifies the Deinococcus lineage.


July 7, 2019

Complete fusion of a transposon and herpesvirus created the Teratorn mobile element in medaka fish.

Mobile genetic elements (e.g., transposable elements and viruses) display significant diversity with various life cycles, but how novel elements emerge remains obscure. Here, we report a giant (180-kb long) transposon, Teratorn, originally identified in the genome of medaka, Oryzias latipes. Teratorn belongs to the piggyBac superfamily and retains the transposition activity. Remarkably, Teratorn is largely derived from a herpesvirus of the Alloherpesviridae family that could infect fish and amphibians. Genomic survey of Teratorn-like elements reveals that some of them exist as a fused form between piggyBac transposon and herpesvirus genome in teleosts, implying the generality of transposon-herpesvirus fusion. We propose that Teratorn was created by a unique fusion of DNA transposon and herpesvirus, leading to life cycle shift. Our study supports the idea that recombination is the key event in generation of novel mobile genetic elements. Teratorn is a large mobile genetic element originally identified in the small teleost fish medaka. Here, the authors show that Teratorn is derived from the fusion of a piggyBac superfamily DNA transposon and an alloherpesvirus and that it is widely found across teleost fish.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.