Menu
April 21, 2020

Construction of a Genomic Bacterial Artificial Chromosome (BAC) Library for the Prawn Macrobrachium rosenbergii and Initial Analysis of ZW Chromosome-Derived BAC Inserts.

Knowledge on sex determination has proven valuable for commercial production of the prawn Macrobrachium rosenbergii due to sex dimorphism of the male and female individuals. Previous studies indicated that prawn sex is determined by a ZW-ZZ chromosomal system, but no genomic information is available for the sex chromosome. Herein, we constructed a genomic bacterial artificial chromosome (BAC) library and identified the ZW-derived BAC clones for initial analysis of the sex chromosomal DNA sequence. The arrayed BAC library contains 200,448 clones with average insert size of 115.4 kb, corresponding to ~?4× coverage of the estimated 5.38 Gb genome. Based on a short female-specific marker, a Z- and a W-fragment were retrieved with the genomic walking method. Screening the BAC library using a ZW-specific marker as probe resulted in 12 positive clones. From these, a Z-derived (P331M17) and a W-derived (P122G2) BAC clones were randomly selected and sequenced by PacBio method. We report the construction of a large insert, deep-coverage, and high-quality BAC library for M. rosenbergii that provides a useful resource for positional cloning of target genes, genomic organization, and comparative genomics analysis. Our study not only confirmed the ZW/ZZ system but also discovered sex-linked genes on ZW chromosomes for the first time, contributing to a comprehensive understanding of the genomic structure of sex chromosomes in M. rosenbergii.


April 21, 2020

Using Cre-recombinase-driven Polylox barcoding for in vivo fate mapping in mice.

Fate mapping is a powerful genetic tool for linking stem or progenitor cells with their progeny, and hence for defining cell lineages in vivo. The resolution of fate mapping depends on the numbers of distinct markers that are introduced in the beginning into stem or progenitor cells; ideally, numbers should be sufficiently large to allow the tracing of output from individual cells. Highly diverse genetic barcodes can serve this purpose. We recently developed an endogenous genetic barcoding system, termed Polylox. In Polylox, random DNA recombination can be induced by transient activity of Cre recombinase in a 2.1-kb-long artificial recombination substrate that has been introduced into a defined locus in mice (Rosa26Polylox reporter mice). Here, we provide a step-by-step protocol for the use of Polylox, including barcode induction and estimation of induction efficiency, barcode retrieval with single-molecule real-time (SMRT) DNA sequencing followed by computational barcode identification, and the calculation of barcode-generation probabilities, which is key for estimations of single-cell labeling for a given number of stem cells. Thus, Polylox barcoding enables high-resolution fate mapping in essentially all tissues in mice for which inducible Cre driver lines are available. Alternative methods include ex vivo cell barcoding, inducible transposon insertion and CRISPR-Cas9-based barcoding; Polylox currently allows combining non-invasive and cell-type-specific labeling with high label diversity. The execution time of this protocol is ~2-3 weeks for experimental data generation and typically <2 d for computational Polylox decoding and downstream analysis.


April 21, 2020

The complete mitochondrial genome of the tree frog, Polypedates braueri (Anura, Rhacophoridae)

We determined the complete mitochondrial genome of the tree frog, Polypedates braueri using next generation sequencing (NGS) and Sanger sequencing. The mitogenome of P. braueri was 19,904?bp in length, which contained 12 protein-coding genes, 22 tRNAs, two rRNAs, and two control regions (D-Loop). A noncoding sequence (NC) was discovered between tRNALys and ATP6 gene, as well as replaced the original position of ATP8 gene. The ND5 gene was found between the two control regions. More mitochondrial genomic information will contribute to revealing the phylogenetic relationships among species of the genus Polypedates.


April 21, 2020

Low-copy nuclear sequence data confirm complex patterns of farina evolution in notholaenid ferns (Pteridaceae).

Notholaenids are an unusual group of ferns that have adapted to, and diversified within, the deserts of Mexico and the southwestern United States. With approximately 40 species, this group is noted for being desiccation-tolerant and having “farina”-powdery exudates of lipophilic flavonoid aglycones-that occur on both the gametophytic and sporophytic phases of their life cycle. The most recent circumscription of notholaenids based on plastid markers surprisingly suggests that several morphological characters, including the expression of farina, are homoplasious. In a striking case of convergence, Notholaena standleyi appears to be distantly related to core Notholaena, with several taxa not before associated with Notholaena nested between them. Such conflicts can be due to morphological homoplasy resulting from adaptive convergence or, alternatively, the plastid phylogeny itself might be misleading, diverging from the true species tree due to incomplete lineage sorting, hybridization, or other factors. In this study, we present a species phylogeny for notholaenid ferns, using four low-copy nuclear loci and concatenated data from three plastid loci. A total of 61 individuals (49 notholaenids and 12 outgroup taxa) were sampled, including 31 out of 37 recognized notholaenid species. The homeologous/allelic nuclear sequences were retrieved using PacBio sequencing and the PURC bioinformatics pipeline. Each dataset was first analyzed individually using maximum likelihood and Bayesian inference, and the species phylogeny was inferred using *BEAST. Although we observed several incongruences between the nuclear and plastid phylogenies, our principal results are broadly congruent with previous inferences based on plastid data. By mapping the presence of farina and their biochemical constitutions on our consensus phylogenetic tree, we confirmed that the characters are indeed homoplastic and have complex evolutionary histories. Hybridization among recognized species of the notholaenid clade appears to be relatively rare compared to that observed in other well-studied fern genera.Copyright © 2019 Elsevier Inc. All rights reserved.


April 21, 2020

TSD: A Computational Tool To Study the Complex Structural Variants Using PacBio Targeted Sequencing Data.

PacBio sequencing is a powerful approach to study DNA or RNA sequences in a longer scope. It is especially useful in exploring the complex structural variants generated by random integration or multiple rearrangement of endogenous or exogenous sequences. Here, we present a tool, TSD, for complex structural variant discovery using PacBio targeted sequencing data. It allows researchers to identify and visualize the genomic structures of targeted sequences by unlimited splitting, alignment and assembly of long PacBio reads. Application to the sequencing data derived from an HBV integrated human cell line(PLC/PRF/5) indicated that TSD could recover the full profile of HBV integration events, especially for the regions with the complex human-HBV genome integrations and multiple HBV rearrangements. Compared to other long read analysis tools, TSD showed a better performance for detecting complex genomic structural variants. TSD is publicly available at: https://github.com/menggf/tsd. Copyright © 2019 Meng et al.


April 21, 2020

NCF1 (p47phox)-deficient chronic granulomatous disease: comprehensive genetic and flow cytometric analysis.

Mutations in NCF1 (p47phox) cause autosomal recessive chronic granulomatous disease (CGD) with abnormal dihydrorhodamine (DHR) assay and absent p47phox protein. Genetic identification of NCF1 mutations is complicated by adjacent highly conserved (>98%) pseudogenes (NCF1B and NCF1C). NCF1 has GTGT at the start of exon 2, whereas the pseudogenes each delete 1 GT (?GT). In p47phox CGD, the most common mutation is ?GT in NCF1 (c.75_76delGT; p.Tyr26fsX26). Sequence homology between NCF1 and its pseudogenes precludes reliable use of standard Sanger sequencing for NCF1 mutations and for confirming carrier status. We first established by flow cytometry that neutrophils from p47phox CGD patients had negligible p47phox expression, whereas those from p47phox CGD carriers had ~60% of normal p47phox expression, independent of the specific mutation in NCF1 We developed a droplet digital polymerase chain reaction (ddPCR) with 2 distinct probes, recognizing either the wild-type GTGT sequence or the ?GT sequence. A second ddPCR established copy number by comparison with the single-copy telomerase reverse transcriptase gene, TERT We showed that 84% of p47phox CGD patients were homozygous for ?GT NCF1 The ddPCR assay also enabled determination of carrier status of relatives. Furthermore, only 79.2% of normal volunteers had 2 copies of GTGT per 6 total (NCF1/NCF1B/NCF1C) copies, designated 2/6; 14.7% had 3/6, and 1.6% had 4/6 GTGT copies. In summary, flow cytometry for p47phox expression quickly identifies patients and carriers of p47phox CGD, and genomic ddPCR identifies patients and carriers of ?GT NCF1, the most common mutation in p47phox CGD.


April 21, 2020

Construction of full-length Japanese reference panel of class I HLA genes with single-molecule, real-time sequencing.

Human leukocyte antigen (HLA) is a gene complex known for its exceptional diversity across populations, importance in organ and blood stem cell transplantation, and associations of specific alleles with various diseases. We constructed a Japanese reference panel of class I HLA genes (ToMMo HLA panel), comprising a distinct set of HLA-A, HLA-B, HLA-C, and HLA-H alleles, by single-molecule, real-time (SMRT) sequencing of 208 individuals included in the 1070 whole-genome Japanese reference panel (1KJPN). For high-quality allele reconstruction, we developed a novel pipeline, Primer-Separation Assembly and Refinement Pipeline (PSARP), in which the SMRT sequencing and additional short-read data were used. The panel consisted of 139 alleles, which were all extended from known IPD-IMGT/HLA sequences, contained 40 with novel variants, and captured more than 96.5% of allelic diversity in 1KJPN. These newly available sequences would be important resources for research and clinical applications including high-resolution HLA typing, genetic association studies, and analyzes of cis-regulatory elements.


April 21, 2020

Megabase Length Hypermutation Accompanies Human Structural Variation at 17p11.2.

DNA rearrangements resulting in human genome structural variants (SVs) are caused by diverse mutational mechanisms. We used long- and short-read sequencing technologies to investigate end products of de novo chromosome 17p11.2 rearrangements and query the molecular mechanisms underlying both recurrent and non-recurrent events. Evidence for an increased rate of clustered single-nucleotide variant (SNV) mutation in cis with non-recurrent rearrangements was found. Indel and SNV formation are associated with both copy-number gains and losses of 17p11.2, occur up to ~1 Mb away from the breakpoint junctions, and favor C > G transversion substitutions; results suggest that single-stranded DNA is formed during the genesis of the SV and provide compelling support for a microhomology-mediated break-induced replication (MMBIR) mechanism for SV formation. Our data show an additional mutational burden of MMBIR consisting of hypermutation confined to the locus and manifesting as SNVs and indels predominantly within genes. Copyright © 2019 Elsevier Inc. All rights reserved.


April 21, 2020

A Species-Wide Inventory of NLR Genes and Alleles in Arabidopsis thaliana.

Infectious disease is both a major force of selection in nature and a prime cause of yield loss in agriculture. In plants, disease resistance is often conferred by nucleotide-binding leucine-rich repeat (NLR) proteins, intracellular immune receptors that recognize pathogen proteins and their effects on the host. Consistent with extensive balancing and positive selection, NLRs are encoded by one of the most variable gene families in plants, but the true extent of intraspecific NLR diversity has been unclear. Here, we define a nearly complete species-wide pan-NLRome in Arabidopsis thaliana based on sequence enrichment and long-read sequencing. The pan-NLRome largely saturates with approximately 40 well-chosen wild strains, with half of the pan-NLRome being present in most accessions. We chart NLR architectural diversity, identify new architectures, and quantify selective forces that act on specific NLRs and NLR domains. Our study provides a blueprint for defining pan-NLRomes.Copyright © 2019 The Author(s). Published by Elsevier Inc. All rights reserved.


April 21, 2020

Noncoding CGG repeat expansions in neuronal intranuclear inclusion disease, oculopharyngodistal myopathy and an overlapping disease.

Noncoding repeat expansions cause various neuromuscular diseases, including myotonic dystrophies, fragile X tremor/ataxia syndrome, some spinocerebellar ataxias, amyotrophic lateral sclerosis and benign adult familial myoclonic epilepsies. Inspired by the striking similarities in the clinical and neuroimaging findings between neuronal intranuclear inclusion disease (NIID) and fragile X tremor/ataxia syndrome caused by noncoding CGG repeat expansions in FMR1, we directly searched for repeat expansion mutations and identified noncoding CGG repeat expansions in NBPF19 (NOTCH2NLC) as the causative mutations for NIID. Further prompted by the similarities in the clinical and neuroimaging findings with NIID, we identified similar noncoding CGG repeat expansions in two other diseases: oculopharyngeal myopathy with leukoencephalopathy and oculopharyngodistal myopathy, in LOC642361/NUTM2B-AS1 and LRP12, respectively. These findings expand our knowledge of the clinical spectra of diseases caused by expansions of the same repeat motif, and further highlight how directly searching for expanded repeats can help identify mutations underlying diseases.


April 21, 2020

Single-molecule real-time sequencing reveals diverse allelic variations in carotenoid biosynthetic genes in pepper (Capsicum spp.).

The diverse colours of mature pepper (Capsicum spp.) fruit result from the accumulation of different carotenoids. The carotenoid biosynthetic pathway has been well elucidated in Solanaceous plants, and analysis of candidate genes involved in this process has revealed variations in carotenoid biosynthetic genes in Capsicum spp. However, the allelic variations revealed by previous studies could not fully explain the variation in fruit colour in Capsicum spp. due to technical difficulties in detecting allelic variation in multiple candidate genes in numerous samples. In this study, we uncovered allelic variations in six carotenoid biosynthetic genes, including phytoene synthase (PSY1, PSY2), lycopene ß-cyclase, ß-carotene hydroxylase, zeaxanthin epoxidase and capsanthin-capsorubin synthase (CCS) genes, in 94 pepper accessions by single-molecule real-time (SMRT) sequencing. To investigate the relationship between allelic variations in the candidate genes and differences in fruit colour, we performed ultra-performance liquid chromatography analysis using 43 accessions representing each allelic variation. Different combinations of dysfunctional mutations in PSY1 and CCS could explain variation in the compositions and levels of carotenoids in the accessions examined in this study. Our results demonstrate that SMRT sequencing technology can be used to rapidly identify allelic variation in target genes in various germplasms. The newly identified allelic variants will be useful for pepper breeding and for further analysis of carotenoid biosynthesis pathways. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Report from the Eleventh Killer Immunoglobulin-like Receptor (KIR) Workshop: Novel insights on KIR polymorphism, ligand recognition, expression and function.

The Eleventh Killer Immunoglobulin-like Receptor (KIR) Workshop was held in Camogli (Genoa, Italy) in October 2018. This congress brought together 113 participants working on KIR field. Fifty-eight studies have been presented, the majority of which included unpublished data. Thus, KIR workshop, allowing the meeting of people sharing their knowledge and experience in a friendly atmosphere, still represents a special event of fruitful discussion and exchange of novel breakthrough, results, and ideas. In this report, we summarize all the scientific contributions highlighting the most recent advances in KIR field. Forty abstracts presented at the KIR Workshop are published in this issue. © 2019 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.


April 21, 2020

Development of CRISPR-Cas systems for genome editing and beyond

The development of clustered regularly interspaced short-palindromic repeat (CRISPR)-Cas systems for genome editing has transformed the way life science research is conducted and holds enormous potential for the treatment of disease as well as for many aspects of biotech- nology. Here, I provide a personal perspective on the development of CRISPR-Cas9 for genome editing within the broader context of the field and discuss our work to discover novel Cas effectors and develop them into additional molecular tools. The initial demonstra- tion of Cas9-mediated genome editing launched the development of many other technologies, enabled new lines of biological inquiry, and motivated a deeper examination of natural CRISPR-Cas systems, including the discovery of new types of CRISPR-Cas systems. These new discoveries in turn spurred further technological developments. I review these exciting discoveries and technologies as well as provide an overview of the broad array of applications of these technologies in basic research and in the improvement of human health. It is clear that we are only just beginning to unravel the potential within microbial diversity, and it is quite likely that we will continue to discover other exciting phenomena, some of which it may be possible to repurpose as molecular technologies. The transformation of mysterious natural phenomena to powerful tools, however, takes a collective effort to discover, characterize, and engineer them, and it has been a privilege to join the numerous researchers who have contributed to this transformation of CRISPR-Cas systems.


April 21, 2020

Large Enriched Fragment Targeted Sequencing (LEFT-SEQ) Applied to Capture of Wolbachia Genomes.

Symbiosis is a major force of evolutionary change, influencing virtually all aspects of biology, from population ecology and evolution to genomics and molecular/biochemical mechanisms of development and reproduction. A remarkable example is Wolbachia endobacteria, present in some parasitic nematodes and many arthropod species. Acquisition of genomic data from diverse Wolbachia clades will aid in the elucidation of the different symbiotic mechanisms(s). However, challenges of de novo assembly of Wolbachia genomes include the presence in the sample of host DNA: nematode/vertebrate or insect. We designed biotinylated probes to capture large fragments of Wolbachia DNA for sequencing using PacBio technology (LEFT-SEQ: Large Enriched Fragment Targeted Sequencing). LEFT-SEQ was used to capture and sequence four Wolbachia genomes: the filarial nematode Brugia malayi, wBm, (21-fold enrichment), Drosophila mauritiana flies (2 isolates), wMau (11-fold enrichment), and Aedes albopictus mosquitoes, wAlbB (200-fold enrichment). LEFT-SEQ resulted in complete genomes for wBm and for wMau. For wBm, 18 single-nucleotide polymorphisms (SNPs), relative to the wBm reference, were identified and confirmed by PCR. A limit of LEFT-SEQ is illustrated by the wAlbB genome, characterized by a very high level of insertion sequences elements (ISs) and DNA repeats, for which only a 20-contig draft assembly was achieved.


April 21, 2020

CRISPR/CAS9 targeted CAPTURE of mammalian genomic regions for characterization by NGS.

The robust detection of structural variants in mammalian genomes remains a challenge. It is particularly difficult in the case of genetically unstable Chinese hamster ovary (CHO) cell lines with only draft genome assemblies available. We explore the potential of the CRISPR/Cas9 system for the targeted capture of genomic loci containing integrated vectors in CHO-K1-based cell lines followed by next generation sequencing (NGS), and compare it to popular target-enrichment sequencing methods and to whole genome sequencing (WGS). Three different CRISPR/Cas9-based techniques were evaluated; all of them allow for amplification-free enrichment of target genomic regions in the range from 5 to 60 fold, and for recovery of ~15 kb-long sequences with no sequencing artifacts introduced. The utility of these protocols has been proven by the identification of transgene integration sites and flanking sequences in three CHO cell lines. The long enriched fragments helped to identify Escherichia coli genome sequences co-integrated with vectors, and were further characterized by Whole Genome Sequencing (WGS). Other advantages of CRISPR/Cas9-based methods are the ease of bioinformatics analysis, potential for multiplexing, and the production of long target templates for real-time sequencing.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.