Cas9 can induce extensive on-target damage, including large deletions, inversions, and insertions.
CRISPR/Cas9 is an attractive platform to potentially correct dominant genetic diseases by gene editing with unprecedented precision. In the current proof-of-principle study, we explored the use of CRISPR/Cas9 for gene-editing in myotonic dystrophy type-1 (DM1), an autosomal-dominant muscle disorder, by excising the CTG-repeat expansion in the 3′-untranslated-region (UTR) of the human myotonic dystrophy protein kinase (DMPK) gene in DM1 patient-specific induced pluripotent stem cells (DM1-iPSC), DM1-iPSC-derived myogenic cells and DM1 patient-specific myoblasts. To eliminate the pathogenic gain-of-function mutant DMPK transcript, we designed a dual guide RNA based strategy that excises the CTG-repeat expansion with high efficiency, as confirmed by Southern blot…
The citrus fruit fly Bactrocera (Tetradacus) minax is a major and devastating agricultural pest in Asian subtropical countries. Previous studies have shown that B. minax interacts with hosts via an efficient chemosensory system. However, knowledge regarding the molecular components of the B. minax chemosensory system has not yet been well established. Herein, based on our newly generated whole-genome dataset for B. minax and by comparison with the characterized genomes of 6 other fruit fly species, we identified, for the first time, a total of 25 putative odorant-binding receptors (OBPs), 4 single-copy chemosensory proteins (CSPs) and 53 candidate odorant receptors (ORs).…
Although engineered nucleases can efficiently cleave intracellular DNA at desired target sites, major concerns remain on potential ‘off-target’ cleavage that may occur throughout the genome. We developed an online tool: predicted report of genome-wide nuclease off-target sites (PROGNOS) that effectively identifies off-target sites. The initial bioinformatics algorithms in PROGNOS were validated by predicting 44 of 65 previously confirmed off-target sites, and by uncovering a new off-target site for the extensively studied zinc finger nucleases (ZFNs) targeting C-C chemokine receptor type 5. Using PROGNOS, we rapidly interrogated 128 potential off-target sites for newly designed transcription activator-like effector nucleases containing either Asn-Asn…
The accurate and comprehensive identification of functional regulatory sequences in mammalian genomes remains a major challenge. Here we describe site-specific integration fluorescence-activated cell sorting followed by sequencing (SIF-seq), an unbiased, medium-throughput functional assay for the discovery of distant-acting enhancers. Targeted single-copy genomic integration into pluripotent cells, reporter assays and flow cytometry are coupled with high-throughput DNA sequencing to enable parallel screening of large numbers of DNA sequences. By functionally interrogating >500 kilobases (kb) of mouse and human sequence in mouse embryonic stem cells for enhancer activity we identified enhancers at pluripotency loci including NANOG. In in vitro-differentiated cardiomyocytes and neural…
Genome-wide analysis of adeno-associated virus (AAV) type 2 integration in HeLa cells has shown that wild-type AAV integrates at numerous genomic sites, including AAVS1 on chromosome 19q13.42. Multiple GAGY/C repeats, resembling consensus AAV Rep-binding sites are preferred, whereas rep-deficient AAV vectors (rAAV) regularly show a random integration profile. This study is the first study to analyze wild-type AAV integration in diploid human fibroblasts. Applying high-throughput third-generation PacBio-based DNA sequencing, integration profiles of wild-type AAV and rAAV are compared side by side. Bioinformatic analysis reveals that both wild-type AAV and rAAV prefer open chromatin regions. Although genomic features of AAV integration…
Gene therapy with genetically modified human CD34(+) hematopoietic stem and progenitor cells (HSPCs) may be safer using targeted integration (TI) of transgenes into a genomic ‘safe harbor’ site rather than random viral integration. We demonstrate that temporally optimized delivery of zinc finger nuclease mRNA via electroporation and adeno-associated virus (AAV) 6 delivery of donor constructs in human HSPCs approaches clinically relevant levels of TI into the AAVS1 safe harbor locus. Up to 58% Venus(+) HSPCs with 6-16% human cell marking were observed following engraftment into mice. In HSPCs from patients with X-linked chronic granulomatous disease (X-CGD), caused by mutations in…
The possibility to predict the outcome of targeted DNA double-stranded break (DSB) repair would be desirable for genome editing. Furthermore the consequences of mis-repair of potentially cell-lethal DSBs and the underlying pathways are not yet fully understood. Here we study the clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-induced mutation spectra at three selected endogenous loci in Arabidopsis thaliana by deep sequencing of long amplicon libraries. Notably, we found sequence-dependent genomic features that affected the DNA repair outcome. Deletions of 1-bp to 1 kbp (all due to NHEJ) and deletions combined with insertions between 5-bp to >100 bp [caused by a synthesis-dependent strand…
Recombinant adeno-associated virus (rAAV)-based gene therapy has entered a phase of clinical translation and commercialization. Despite this progress, vector integrity following production is often overlooked. Compromised vectors may negatively impact therapeutic efficacy and safety. Using single molecule, real-time (SMRT) sequencing, we can comprehensively profile packaged genomes as a single intact molecule and directly assess vector integrity without extensive preparation. We have exploited this methodology to profile all heterogeneic populations of self-complementary AAV genomes via bioinformatics pipelines and have coined this approach AAV-genome population sequencing (AAV-GPseq). The approach can reveal the relative distribution of truncated genomes versus full-length genomes in vector…
The yellow catfish, Pelteobagrus fulvidraco, belonging to the Siluriformes order, is an economically important freshwater aquaculture fish species in Asia, especially in Southern China. The aquaculture industry has recently been facing tremendous challenges in germplasm degeneration and poor disease resistance. As the yellow catfish exhibits notable sex dimorphism in growth, with adult males about two- to three-fold bigger than females, the way in which the aquaculture industry takes advantage of such sex dimorphism is another challenge. To address these issues, a high-quality reference genome of the yellow catfish would be a very useful resource.To construct a high-quality reference genome for…
A Gram-stain-positive, catalase-positive and pleomorphic rod organism was isolated from malted barley in Finland, classified initially by partial 16S rRNA gene sequencing and originally deposited in the VTT Culture Collection as a strain of Propionibacterium acidipropionici (currently Acidipropionibacterium acidipropionici). The subsequent comparison of the whole 16S rRNA gene with other representatives of the genus Acidipropionibacterium revealed that the strain belongs to a novel species, most closely related to Acidipropionibacterium microaerophilum and Acidipropionibacterium acidipropionici, with similarity values of 98.46 and 98.31?%, respectively. The whole genome sequencing using PacBio RS II platform allowed further comparison of the genome with all of the…
Cereal grasses of the Triticeae tribe have been the major food source in temperate regions since the dawn of agriculture. Their large genomes are characterized by a high content of repetitive elements and large pericentromeric regions that are virtually devoid of meiotic recombination. Here we present a high-quality reference genome assembly for barley (Hordeum vulgare L.). We use chromosome conformation capture mapping to derive the linear order of sequences across the pericentromeric space and to investigate the spatial organization of chromatin in the nucleus at megabase resolution. The composition of genes and repetitive elements differs between distal and proximal regions.…
The regulome—the part of the genome that regulates function—includes noncoding RNAs with varied functions yet to be deciphered.
Autism spectrum disorder (ASD) is one of the most heritable neuropsychiatric conditions. The complex genetic landscape of the disorder includes both common and rare variants at hundreds of genetic loci. This marked heterogeneity has thus far hampered efforts to develop genetic diagnostic panels and targeted pharmacological therapies. Here, we give an overview of the current literature on the genetic basis of ASD, and review recent human brain transcriptome studies and their role in identifying convergent pathways downstream of the heterogeneous genetic variants. We also discuss emerging evidence on the involvement of non-coding genomic regions and non-coding RNAs in ASD.
Accurate transcript structure and abundance inference from RNA sequencing (RNA-seq) data is foundational for molecular discovery. Here we present TACO, a computational method to reconstruct a consensus transcriptome from multiple RNA-seq data sets. TACO employs novel change-point detection to demarcate transcript start and end sites, leading to improved reconstruction accuracy compared with other tools in its class. The tool is available at http://tacorna.github.io and can be readily incorporated into RNA-seq analysis workflows.