July 7, 2019

Diversity and evolution of centromere repeats in the maize genome.

Centromere repeats are found in most eukaryotes and play a critical role in kinetochore formation. Though centromere repeats exhibit considerable diversity both within and among species, little is understood about the mechanisms that drive centromere repeat evolution. Here, we use maize as a model to investigate how a complex history involving polyploidy, fractionation, and recent domestication has impacted the diversity of the maize centromeric repeat CentC. We first validate the existence of long tandem arrays of repeats in maize and other taxa in the genus Zea. Although we find considerable sequence diversity among CentC copies genome-wide, genetic similarity among repeats is highest within these arrays, suggesting that tandem duplications are the primary mechanism for the generation of new copies. Nonetheless, clustering analyses identify similar sequences among distant repeats, and simulations suggest that this pattern may be due to homoplasious mutation. Although the two ancestral subgenomes of maize have contributed nearly equal numbers of centromeres, our analysis shows that the majority of all CentC repeats derive from one of the parental genomes, with an even stronger bias when examining the largest assembled contiguous clusters. Finally, by comparing maize with its wild progenitor teosinte, we find that the abundance of CentC likely decreased after domestication, while the pericentromeric repeat Cent4 has drastically increased.


July 7, 2019

Retrohoming of a mobile group II intron in human cells suggests how eukaryotes limit group II intron proliferation.

Mobile bacterial group II introns are evolutionary ancestors of spliceosomal introns and retroelements in eukaryotes. They consist of an autocatalytic intron RNA (a “ribozyme”) and an intron-encoded reverse transcriptase, which function together to promote intron integration into new DNA sites by a mechanism termed “retrohoming”. Although mobile group II introns splice and retrohome efficiently in bacteria, all examined thus far function inefficiently in eukaryotes, where their ribozyme activity is limited by low Mg2+ concentrations, and intron-containing transcripts are subject to nonsense-mediated decay (NMD) and translational repression. Here, by using RNA polymerase II to express a humanized group II intron reverse transcriptase and T7 RNA polymerase to express intron transcripts resistant to NMD, we find that simply supplementing culture medium with Mg2+ induces the Lactococcus lactis Ll.LtrB intron to retrohome into plasmid and chromosomal sites, the latter at frequencies up to ~0.1%, in viable HEK-293 cells. Surprisingly, under these conditions, the Ll.LtrB intron reverse transcriptase is required for retrohoming but not for RNA splicing as in bacteria. By using a genetic assay for in vivo selections combined with deep sequencing, we identified intron RNA mutations that enhance retrohoming in human cells, but <4-fold and not without added Mg2+. Further, the selected mutations lie outside the ribozyme catalytic core, which appears not readily modified to function efficiently at low Mg2+ concentrations. Our results reveal differences between group II intron retrohoming in human cells and bacteria and suggest constraints on critical nucleotide residues of the ribozyme core that limit how much group II intron retrohoming in eukaryotes can be enhanced. 
These findings have implications for group II intron use for gene targeting in eukaryotes and suggest how differences in intracellular Mg2+ concentrations between bacteria and eukarya may have impacted the evolution of introns and gene expression mechanisms.


July 7, 2019

Mutation assay using single-molecule real-time (SMRT) sequencing technology

Introduction: We present here a simple, phenotype-independent mutation assay using a PacBio RSII DNA sequencer employing single-molecule real-time (SMRT) sequencing technology. Salmonella typhimurium YG7108 was treated with the alkylating agent N-ethyl-N-nitrosourea (ENU) and grown through several generations to fix the induced mutations; the DNA was then extracted and the mutations were analyzed using the SMRT DNA sequencer. Results: The ENU-induced base-substitution frequency was 15.4 per megabase pair, which is highly consistent with our previous results based on colony isolation and next-generation sequencing. The induced mutation spectrum (95% G:C→A:T, 5% A:T→G:C) is also consistent with the known ENU signature. The base-substitution frequency of the control was calculated to be less than 0.12 per megabase pair. A current limitation of the approach is the high frequency of artifactual insertion and deletion mutations it detects. Conclusions: Ultra-low frequency base-substitution mutations can be detected directly by using the SMRT DNA sequencer, and this technology provides a phenotype-independent mutation assay.
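A per-megabase frequency like the one reported above is a count of observed substitutions normalized by the number of bases inspected. The sketch below shows only that arithmetic. The function name and the input counts are hypothetical, chosen so that the result matches the reported 15.4 per megabase pair; they are not the study's actual counts.

```python
def substitutions_per_megabase(n_substitutions, n_bases):
    """Return base substitutions per megabase pair (1e6 bases) of sequence inspected."""
    if n_bases <= 0:
        raise ValueError("n_bases must be positive")
    # Multiply before dividing to limit floating-point rounding.
    return n_substitutions * 1e6 / n_bases


# Hypothetical counts, for illustration only: 154 substitutions in 10 Mb of sequence.
treated = substitutions_per_megabase(154, 10_000_000)
print(f"{treated:.1f} substitutions per Mb")
```

The same normalization applies to the control sample, which is what makes the treated and control frequencies directly comparable.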


July 7, 2019

Dual functions of Macpiwi1 in transposon silencing and stem cell maintenance in the flatworm Macrostomum lignano.

PIWI proteins and piRNA pathways are essential for transposon silencing and some aspects of gene regulation during animal germline development. In contrast to most animal species, some flatworms also express PIWIs and piRNAs in somatic stem cells, where they are required for tissue renewal and regeneration. Here, we have identified and characterized piRNAs and PIWI proteins in the emerging model flatworm Macrostomum lignano. We found that M. lignano encodes at least three PIWI proteins. One of these, Macpiwi1, acts as a key component of the canonical piRNA pathway in the germline and in somatic stem cells. Knockdown of Macpiwi1 dramatically reduces piRNA levels, derepresses transposons, and severely impacts stem cell maintenance. Knockdown of the piRNA biogenesis factor Macvasa caused an even greater reduction in piRNA levels with a corresponding increase in transposons. Yet, in Macvasa knockdown animals, we detected no major impact on stem cell self-renewal. These results may suggest stem cell maintenance functions of PIWI proteins in flatworms that are distinguishable from their impact on transposons and that might function independently of what are considered canonical piRNA populations. © 2015 Zhou et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.


July 7, 2019

A transferable plasticity region in Campylobacter coli allows isolates of an otherwise non-glycolytic food-borne pathogen to catabolize glucose.

Thermophilic Campylobacter species colonize the intestine of agricultural and domestic animals commensally but cause severe gastroenteritis in humans. In contrast to other enteropathogenic bacteria, Campylobacter has been considered to be non-glycolytic, a metabolic property originally used for their taxonomic classification. Contrary to this dogma, we demonstrate that several Campylobacter coli strains are able to utilize glucose as a growth substrate. Isotopologue profiling experiments with 13C-labeled glucose suggested that these strains catabolize glucose via the pentose phosphate and Entner-Doudoroff (ED) pathways and use glucose efficiently for de novo synthesis of amino acids and cell surface carbohydrates. Whole genome sequencing of glycolytic C. coli isolates identified a genomic island located within a ribosomal RNA gene cluster that encodes all ED pathway enzymes and a glucose permease. We could show in vitro that a non-glycolytic C. coli strain could acquire glycolytic activity through natural transformation with chromosomal DNA of C. coli and C. jejuni subsp. doylei strains possessing the ED pathway encoding plasticity region. These results reveal for the first time the ability of a Campylobacter species to catabolize glucose and provide new insights into how genetic macrodiversity through intra- and interspecies gene transfer expands the metabolic capacity of this food-borne pathogen. © 2015 John Wiley & Sons Ltd.


July 7, 2019

Genome and transcriptome of the regeneration-competent flatworm, Macrostomum lignano.

The free-living flatworm Macrostomum lignano has an impressive regenerative capacity. Following injury, it can regenerate an almost entirely new organism because of the presence of an abundant somatic stem cell population, the neoblasts. This set of unique properties makes many flatworms attractive organisms for studying the evolution of pathways involved in tissue self-renewal, cell-fate specification, and regeneration. The use of these organisms as models, however, is hampered by the lack of well-assembled and annotated genome sequences, which are fundamental to modern genetic and molecular studies. Here we report the genomic sequence of M. lignano and an accompanying characterization of its transcriptome. The genome structure of M. lignano is remarkably complex, with ~75% of its sequence consisting of simple repeats and transposon sequences. This has made high-quality assembly from Illumina reads alone impossible (N50 = 222 bp). We therefore generated 130× coverage by long sequencing reads from the Pacific Biosciences platform to create a substantially improved assembly with an N50 of 64 Kbp. We complemented the reference genome with an assembled and annotated transcriptome, and used both of these datasets in combination to probe gene-expression patterns during regeneration, examining pathways important to stem cell function.
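The N50 values quoted above (222 bp versus 64 Kbp) summarize assembly contiguity: N50 is the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembled bases. A minimal sketch of that standard definition follows; the function name and the contig lengths are hypothetical and are not taken from the M. lignano assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk contigs from longest to shortest until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical contig lengths (bp), for illustration only.
contigs = [80, 70, 50, 40, 30, 20]
print(n50(contigs))  # prints 70
```

Because the statistic is driven by the longest contigs, an assembly fragmented by repeats tends to have a low N50, and long reads that span those repeats tend to raise it.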


July 7, 2019

Jitterbug: somatic and germline transposon insertion detection at single-nucleotide resolution.

Transposable elements are major players in genome evolution. Transposon insertion polymorphisms can translate into phenotypic differences in plants and animals and are linked to different diseases including human cancer, making their characterization highly relevant to the study of genome evolution and genetic diseases. Here we present Jitterbug, a novel tool that identifies transposable element insertion sites at single-nucleotide resolution based on the paired-end mapping and clipped-read signatures produced by NGS alignments. Jitterbug can be easily integrated into existing NGS analysis pipelines, using the standard BAM format produced by frequently applied alignment tools (e.g. bwa, bowtie2), with no need to realign reads to a set of consensus transposon sequences. Jitterbug is highly sensitive and able to recall transposon insertions with a very high specificity, as demonstrated by benchmarks in the human and Arabidopsis genomes, and validation using long PacBio reads. In addition, Jitterbug estimates the zygosity of transposon insertions with high accuracy and can also identify somatic insertions. We demonstrate that Jitterbug can identify mosaic somatic transposon movement using sequenced tumor-normal sample pairs and allows for estimating the cancer cell fraction of clones containing a somatic TE insertion. We suggest that the independent methods we use to evaluate performance are a step towards creating a gold standard dataset for benchmarking structural variant prediction tools.


July 7, 2019

Unique transposon landscapes are pervasive across Drosophila melanogaster genomes.

To understand how transposon landscapes (TLs) vary across animal genomes, we describe a new method called the Transposon Insertion and Depletion AnaLyzer (TIDAL) and a database of >300 TLs in Drosophila melanogaster (TIDAL-Fly). Our analysis reveals pervasive TL diversity across cell lines and fly strains, even for identically named sub-strains from different laboratories such as the ISO1 strain used for the reference genome sequence. On average, >500 novel insertions exist in every lab strain, inbred strains of the Drosophila Genetic Reference Panel (DGRP), and fly isolates in the Drosophila Genome Nexus (DGN). A minority (<25%) of transposon families comprise the majority (>70%) of TL diversity across fly strains. A sharp contrast between insertion and depletion patterns indicates that many transposons are unique to the ISO1 reference genome sequence. Although TL diversity from fly strains reaches asymptotic limits with increasing sequencing depth, rampant TL diversity causes unsaturated detection of TLs in pools of flies. Finally, we show novel transposon insertions negatively correlate with Piwi-interacting RNA (piRNA) levels for most transposon families, except for the highly abundant roo retrotransposon. Our study provides a useful resource for Drosophila geneticists to understand how transposons create extensive genomic diversity in fly cell lines and strains. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

Lesions from patients with sporadic cerebral cavernous malformations harbor somatic mutations in the CCM genes: evidence for a common biochemical pathway for CCM pathogenesis.

Cerebral cavernous malformations (CCMs) are vascular lesions affecting the central nervous system. CCM occurs either sporadically or in an inherited, autosomal dominant manner. Constitutional (germline) mutations in any of three genes, KRIT1, CCM2 and PDCD10, can cause the inherited form. Analysis of CCM lesions from inherited cases revealed biallelic somatic mutations, indicating that CCM follows a Knudsonian two-hit mutation mechanism. It is still unknown, however, if the sporadic cases of CCM also follow this genetic mechanism. We extracted DNA from 11 surgically excised lesions from sporadic CCM patients, and sequenced the three CCM genes in each specimen using a next-generation sequencing approach. Four sporadic CCM lesion samples (36%) were found to contain novel somatic mutations. Three of the lesions contained a single somatic mutation, and one lesion contained two biallelic somatic mutations. Herein, we also describe evidence of somatic mosaicism in a patient presenting with over 130 CCM lesions localized to one hemisphere of the brain. Finally, in a lesion regrowth sample, we found that the regrown CCM lesion contained the same somatic mutation as the original lesion. Together, these data bolster the idea that all forms of CCM have a genetic underpinning of the two-hit mutation mechanism in the known CCM genes. Recent studies have found aberrant Rho kinase activation in inherited CCM pathogenesis, and we present evidence that this pathway is activated in sporadic CCM patients. These results suggest that all CCM patients, including those with the more common sporadic form, are potentially amenable to the same therapy. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

Site-specific genetic engineering of the Anopheles gambiae Y chromosome.

Despite its function in sex determination and its role in driving genome evolution, the Y chromosome remains poorly understood in most species. Y chromosomes are gene-poor, repeat-rich and largely heterochromatic and therefore represent a difficult target for genetic engineering. The Y chromosome of the human malaria vector Anopheles gambiae appears to be involved in sex determination although very little is known about both its structure and function. Here, we characterize a transgenic strain of this mosquito species, obtained by transposon-mediated integration of a transgene construct onto the Y chromosome. Using meganuclease-induced homologous repair we introduce a site-specific recombination signal onto the Y chromosome and show that the resulting docking line can be used for secondary integration. To demonstrate its utility, we study the activity of a germ-line-specific promoter when located on the Y chromosome. We also show that Y-linked fluorescent transgenes allow automated sex separation of this important vector species, providing the means to generate large single-sex populations. Our findings will aid studies of sex chromosome function and enable the development of male-exclusive genetic traits for vector control.


July 7, 2019

A fault-tolerant method for HLA typing with PacBio data.

Human leukocyte antigen (HLA) genes are critical genes involved in important biomedical aspects, including organ transplantation, autoimmune diseases and infectious diseases. The gene family contains the most polymorphic genes in humans, and in many cases two alleles differ by only a single base pair substitution. Next-generation sequencing (NGS) technologies can be used for high-throughput HLA typing, but in silico methods are still needed to correctly assign the alleles of a sample. Computer scientists have developed such methods for various NGS platforms, such as Illumina, Roche 454 and Ion Torrent, based on the characteristics of the reads they generate. However, methods for PacBio reads have received less attention, probably owing to the platform's high error rates. The PacBio system has the longest read length among available NGS platforms, and therefore is the only platform capable of having exon 2 and exon 3 of HLA genes on the same read to unequivocally solve the ambiguity problem caused by the “phasing” issue. We proposed a new method, BayesTyping1, to assign HLA alleles for PacBio circular consensus sequencing reads using Bayes’ theorem. The method was applied to simulated data of the three loci HLA-A, HLA-B and HLA-DRB1. The experimental results showed its capability to tolerate the disturbance of sequencing errors and external noise reads. The BayesTyping1 method could overcome, to some extent, the problems of HLA typing using PacBio reads, which mostly arise from sequencing errors of PacBio reads and the divergence of HLA genes.
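The abstract does not describe how BayesTyping1 applies Bayes’ theorem, so the following is not that method. It is only a generic sketch of Bayesian allele assignment under strong simplifying assumptions: every read is already aligned to the same coordinates as the candidate alleles, errors are independent substitutions at a fixed rate, and priors are uniform unless supplied. The function name, the allele names and all sequences are hypothetical.

```python
import math


def allele_posteriors(reads, alleles, error_rate=0.1, priors=None):
    """Return P(allele | reads) under an independent substitution-error model."""
    names = list(alleles)
    if priors is None:
        priors = {name: 1.0 / len(names) for name in names}  # uniform prior (assumption)
    log_post = {}
    for name in names:
        total = math.log(priors[name])
        for read in reads:
            for read_base, allele_base in zip(read, alleles[name]):
                # A match has probability 1 - e; a specific wrong base has e / 3.
                if read_base == allele_base:
                    total += math.log(1 - error_rate)
                else:
                    total += math.log(error_rate / 3)
        log_post[name] = total
    # Normalize in log space to avoid underflow with many reads.
    peak = max(log_post.values())
    weights = {name: math.exp(value - peak) for name, value in log_post.items()}
    norm = sum(weights.values())
    return {name: weight / norm for name, weight in weights.items()}


# Hypothetical alleles differing by a single base, and three hypothetical reads.
alleles = {"allele_1": "ACGTAC", "allele_2": "ACGTTC"}
reads = ["ACGTAC", "ACGTAC", "ACGTTC"]
posteriors = allele_posteriors(reads, alleles)
print(max(posteriors, key=posteriors.get))  # prints allele_1
```

Because the posterior pools evidence across all reads, a single erroneous read shifts it only slightly, which is one way to see why a probabilistic assignment can tolerate sequencing errors.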

