Menu
April 21, 2020

A physical and genetic map of Cannabis sativa identifies extensive rearrangements at the THC/CBD acid synthase loci.

Cannabis sativa is widely cultivated for medicinal, food, industrial, and recreational use, but much remains unknown regarding its genetics, including the molecular determinants of cannabinoid content. Here, we describe a combined physical and genetic map derived from a cross between the drug-type strain Purple Kush and the hemp variety “Finola.” The map reveals that cannabinoid biosynthesis genes are generally unlinked but that aromatic prenyltransferase (AP), which produces the substrate for THCA and CBDA synthases (THCAS and CBDAS), is tightly linked to a known marker for total cannabinoid content. We further identify the gene encoding CBCA synthase (CBCAS) and characterize its catalytic activity, providing insight into how cannabinoid diversity arises in cannabis. THCAS and CBDAS (which determine the drug vs. hemp chemotype) are contained within large (>250 kb) retrotransposon-rich regions that are highly nonhomologous between drug- and hemp-type alleles and are furthermore embedded within ~40 Mb of minimally recombining repetitive DNA. The chromosome structures are similar to those in grains such as wheat, with recombination focused in gene-rich, repeat-depleted regions near chromosome ends. The physical and genetic map should facilitate further dissection of genetic and molecular mechanisms in this commercially and medically important plant. © 2019 Laverty et al.; Published by Cold Spring Harbor Laboratory Press.


April 21, 2020

Characterization of vanM carrying clinical Enterococcus isolates and diversity of the suppressed vanM gene cluster.

Here we report the prevalence of the suppressed vanM gene cluster as a reservoir of vancomycin resistance genes. Among 1284 clinical isolates of enterococci from four hospitals in Hangzhou, China, 55 isolates of Enterococcus faecium and one isolate of Enterococcus faecalis were screened positive for the vanM genotype. Antimicrobial susceptibility testing showed that 55 of the 56 vanM-positive isolates were susceptible to vancomycin and teicoplanin. Most of them (54/56) belonged to the main epidemic lineage CC17, mostly the ST78 type. The vanM gene clusters in the 55 vancomycin-susceptible isolates showed sequence diversity owing to different insertion locations of IS1216E. The vanM transposons could be classified into five types and they all carried two or more IS1216E elements, leading to complete or partial deletions of vanR, vanS, or vanX. Quantitative reverse transcription polymerase chain reaction showed that the expression level of vanM was significantly lower in the vancomycin-susceptible isolates than in the vancomycin-resistant isolate. Considering the prevalence of the vanM genotype and the potential for conversion to a resistant phenotype, vanM might act as an important determinant of glycopeptide resistance in the future. It is essential to strengthen the surveillance of vanM-containing enterococci to control the dissemination of vancomycin resistance. Copyright © 2018. Published by Elsevier B.V.


April 21, 2020

Persistence of Moraxella catarrhalis in Chronic Obstructive Pulmonary Disease and Regulation of the Hag/MID Adhesin.

Persistence of bacterial pathogens in the airways has profound consequences on the course and pathogenesis of chronic obstructive pulmonary disease (COPD). Patients with COPD continuously acquire and clear strains of Moraxella catarrhalis, a major pathogen in COPD. Some strains are cleared quickly and some persist for months to years. The mechanism of the variability in duration of persistence is unknown. Guided by genome sequences of selected strains, we studied the expression of Hag/MID, hag/mid gene sequences, adherence to human cells, and autoaggregation in longitudinally collected strains of M. catarrhalis from adults with COPD. Twenty-eight of 30 cleared strains of M. catarrhalis expressed Hag/MID whereas 17 of 30 persistent strains expressed Hag/MID upon acquisition by patients. All persistent strains ceased expression of Hag/MID during persistence. Expression of Hag/MID in human airways was regulated by slipped-strand mispairing. Virulence-associated phenotypes (adherence to human respiratory epithelial cells and autoaggregation) paralleled Hag/MID expression in airway isolates.Most strains of M. catarrhalis express Hag/MID upon acquisition by adults with COPD and all persistent strains shut off expression during persistence. These observations suggest that Hag/MID is important for initial colonization by M. catarrhalis and that cessation of expression facilitates persistence in COPD airways. © The Author(s) 2018. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.


April 21, 2020

A reference genome for pea provides insight into legume genome evolution.

We report the first annotated chromosome-level reference genome assembly for pea, Gregor Mendel’s original genetic model. Phylogenetics and paleogenomics show genomic rearrangements across legumes and suggest a major role for repetitive elements in pea genome evolution. Compared to other sequenced Leguminosae genomes, the pea genome shows intense gene dynamics, most likely associated with genome size expansion when the Fabeae diverged from its sister tribes. During Pisum evolution, translocation and transposition differentially occurred across lineages. This reference sequence will accelerate our understanding of the molecular basis of agronomically important traits and support crop improvement.


April 21, 2020

Genome analyses for the Tohoku Medical Megabank Project towards establishment of personalized healthcare.

Personalized healthcare (PHC) based on an individual’s genetic make-up is one of the most advanced, yet feasible, forms of medical care. The Tohoku Medical Megabank (TMM) Project aims to combine population genomics, medical genetics and prospective cohort studies to develop a critical infrastructure for the establishment of PHC. To date, a TMM CommCohort (adult general population) and a TMM BirThree Cohort (birth+three-generation families) have conducted recruitments and baseline surveys. Genome analyses as part of the TMM Project will aid in the development of a high-fidelity whole-genome Japanese reference panel, in designing custom single-nucleotide polymorphism (SNP) arrays specific to Japanese, and in estimation of the biological significance of genetic variations through linked investigations of the cohorts. Whole-genome sequencing from >3,500 unrelated Japanese and establishment of a Japanese reference genome sequence from long-read data have been done. We next aim to obtain genotype data for all TMM cohort participants (>150,000) using our custom SNP arrays. These data will help identify disease-associated genomic signatures in the Japanese population, while genomic data from TMM BirThree Cohort participants will be used to improve the reference genome panel. Follow-up of the cohort participants will allow us to test the genetic markers and, consequently, contribute to the realization of PHC.


April 21, 2020

TSD: A Computational Tool To Study the Complex Structural Variants Using PacBio Targeted Sequencing Data.

PacBio sequencing is a powerful approach to study DNA or RNA sequences in a longer scope. It is especially useful in exploring the complex structural variants generated by random integration or multiple rearrangement of endogenous or exogenous sequences. Here, we present a tool, TSD, for complex structural variant discovery using PacBio targeted sequencing data. It allows researchers to identify and visualize the genomic structures of targeted sequences by unlimited splitting, alignment and assembly of long PacBio reads. Application to the sequencing data derived from an HBV integrated human cell line(PLC/PRF/5) indicated that TSD could recover the full profile of HBV integration events, especially for the regions with the complex human-HBV genome integrations and multiple HBV rearrangements. Compared to other long read analysis tools, TSD showed a better performance for detecting complex genomic structural variants. TSD is publicly available at: https://github.com/menggf/tsd. Copyright © 2019 Meng et al.


April 21, 2020

NCF1 (p47phox)-deficient chronic granulomatous disease: comprehensive genetic and flow cytometric analysis.

Mutations in NCF1 (p47phox) cause autosomal recessive chronic granulomatous disease (CGD) with abnormal dihydrorhodamine (DHR) assay and absent p47phox protein. Genetic identification of NCF1 mutations is complicated by adjacent highly conserved (>98%) pseudogenes (NCF1B and NCF1C). NCF1 has GTGT at the start of exon 2, whereas the pseudogenes each delete 1 GT (?GT). In p47phox CGD, the most common mutation is ?GT in NCF1 (c.75_76delGT; p.Tyr26fsX26). Sequence homology between NCF1 and its pseudogenes precludes reliable use of standard Sanger sequencing for NCF1 mutations and for confirming carrier status. We first established by flow cytometry that neutrophils from p47phox CGD patients had negligible p47phox expression, whereas those from p47phox CGD carriers had ~60% of normal p47phox expression, independent of the specific mutation in NCF1 We developed a droplet digital polymerase chain reaction (ddPCR) with 2 distinct probes, recognizing either the wild-type GTGT sequence or the ?GT sequence. A second ddPCR established copy number by comparison with the single-copy telomerase reverse transcriptase gene, TERT We showed that 84% of p47phox CGD patients were homozygous for ?GT NCF1 The ddPCR assay also enabled determination of carrier status of relatives. Furthermore, only 79.2% of normal volunteers had 2 copies of GTGT per 6 total (NCF1/NCF1B/NCF1C) copies, designated 2/6; 14.7% had 3/6, and 1.6% had 4/6 GTGT copies. In summary, flow cytometry for p47phox expression quickly identifies patients and carriers of p47phox CGD, and genomic ddPCR identifies patients and carriers of ?GT NCF1, the most common mutation in p47phox CGD.


April 21, 2020

Construction of full-length Japanese reference panel of class I HLA genes with single-molecule, real-time sequencing.

Human leukocyte antigen (HLA) is a gene complex known for its exceptional diversity across populations, importance in organ and blood stem cell transplantation, and associations of specific alleles with various diseases. We constructed a Japanese reference panel of class I HLA genes (ToMMo HLA panel), comprising a distinct set of HLA-A, HLA-B, HLA-C, and HLA-H alleles, by single-molecule, real-time (SMRT) sequencing of 208 individuals included in the 1070 whole-genome Japanese reference panel (1KJPN). For high-quality allele reconstruction, we developed a novel pipeline, Primer-Separation Assembly and Refinement Pipeline (PSARP), in which the SMRT sequencing and additional short-read data were used. The panel consisted of 139 alleles, which were all extended from known IPD-IMGT/HLA sequences, contained 40 with novel variants, and captured more than 96.5% of allelic diversity in 1KJPN. These newly available sequences would be important resources for research and clinical applications including high-resolution HLA typing, genetic association studies, and analyzes of cis-regulatory elements.


April 21, 2020

Genetic basis for the establishment of endosymbiosis in Paramecium.

The single-celled ciliate Paramecium bursaria is an indispensable model for investigating endosymbiosis between protists and green-algal symbionts. To elucidate the mechanism of this type of endosymbiosis, we combined PacBio and Illumina sequencing to assemble a high-quality and near-complete macronuclear genome of P. bursaria. The genomic characteristics and phylogenetic analyses indicate that P. bursaria is the basal clade of the Paramecium genus. Through comparative genomic analyses with its close relatives, we found that P. bursaria encodes more genes related to nitrogen metabolism and mineral absorption, but encodes fewer genes involved in oxygen binding and N-glycan biosynthesis. A comparison of the transcriptomic profiles between P. bursaria with and without endosymbiotic Chlorella showed differential expression of a wide range of metabolic genes. We selected 32 most differentially expressed genes to perform RNA interference experiment in P. bursaria, and found that P. bursaria can regulate the abundance of their symbionts through glutamine supply. This study provides novel insights into Paramecium evolution and will extend our knowledge of the molecular mechanism for the induction of endosymbiosis between P. bursaria and green algae.


April 21, 2020

Epidemiologic and genomic insights on mcr-1-harbouring Salmonella from diarrhoeal outpatients in Shanghai, China, 2006-2016.

Colistin resistance mediated by mcr-1-harbouring plasmids is an emerging threat in Enterobacteriaceae, like Salmonella. Based on its major contribution to the diarrhoea burden, the epidemic state and threat of mcr-1-harbouring Salmonella in community-acquired infections should be estimated.This retrospective study analysed the mcr-1 gene incidence in Salmonella strains collected from a surveillance on diarrhoeal outpatients in Shanghai Municipality, China, 2006-2016. Molecular characteristics of the mcr-1-positive strains and their plasmids were determined by genome sequencing. The transfer abilities of these plasmids were measured with various conjugation strains, species, and serotypes.Among the 12,053 Salmonella isolates, 37 mcr-1-harbouring strains, in which 35 were serovar Typhimurium, were detected first in 2012 and with increasing frequency after 2015. Most patients infected with mcr-1-harbouring strains were aged <5?years. All strains, including fluoroquinolone-resistant and/or extended-spectrum ß-lactamase-producing strains, were multi-drug resistant. S. Typhimurium had higher mcr-1 plasmid acquisition ability compared with other common serovars. Phylogeny based on the genomes combined with complete plasmid sequences revealed some clusters, suggesting the presence of mcr-1-harbouring Salmonella outbreaks in the community. Most mcr-1-positive strains were clustered together with the pork strains, strongly suggesting pork consumption as a main infection source.The mcr-1-harbouring Salmonella prevalence in community-acquired diarrhoea displays a rapid increase trend, and the ESBL-mcr-1-harbouring Salmonella poses a threat for children. These findings highlight the necessary and significance of prohibiting colistin use in animals and continuous monitoring of mcr-1-harbouring Salmonella.Copyright © 2019. Published by Elsevier B.V.


April 21, 2020

Diploid Genome Assembly of the Wine Grape Carménère.

In this genome report, we describe the sequencing and annotation of the genome of the wine grape Carménère (clone 02, VCR-702). Long considered extinct, this old French wine grape variety is now cultivated mostly in Chile where it was imported in the 1850s just before the European phylloxera epidemic. Genomic DNA was sequenced using Single Molecule Real Time technology and assembled with FALCON-Unzip, a diploid-aware assembly pipeline. To optimize the contiguity and completeness of the assembly, we tested about a thousand combinations of assembly parameters, sequencing coverage, error correction and repeat masking methods. The final scaffolds provide a complete and phased representation of the diploid genome of this wine grape. Comparison of the two haplotypes revealed numerous heterozygous variants, including loss-of-function ones, some of which in genes associated with polyphenol biosynthesis. Comparisons with other publicly available grape genomes and transcriptomes showed the impact of structural variation on gene content differences between Carménère and other wine grape cultivars. Among the putative cultivar-specific genes, we identified genes potentially involved in aroma production and stress responses. The genome assembly of Carménère expands the representation of the genomic variability in grapes and will enable studies that aim to understand its distinctive organoleptic and agronomical features and assess its still elusive extant genetic variability. A genome browser for Carménère, its annotation, and an associated blast tool are available at http://cantulab.github.io/data.Copyright © 2019 Minio et al.


April 21, 2020

The role of genomic structural variation in the genetic improvement of polyploid crops

Many of our major crop species are polyploids, containing more than one genome or set of chromosomes. Polyploid crops present unique challenges, including difficulties in genome assembly, in discriminating between multiple gene and sequence copies, and in genetic mapping, hindering use of genomic data for genetics and breeding. Polyploid genomes may also be more prone to containing structural variation, such as loss of gene copies or sequences (presence–absence variation) and the presence of genes or sequences in multiple copies (copy-number variation). Although the two main types of genomic structural variation commonly identified are presence–absence variation and copy-number variation, we propose that homeologous exchanges constitute a third major form of genomic structural variation in polyploids. Homeologous exchanges involve the replacement of one genomic segment by a similar copy from another genome or ancestrally duplicated region, and are known to be extremely common in polyploids. Detecting all kinds of genomic structural variation is challenging, but recent advances such as optical mapping and long-read sequencing offer potential strategies to help identify structural variants even in complex polyploid genomes. All three major types of genomic structural variation (presence–absence, copy-number, and homeologous exchange) are now known to influence phenotypes in crop plants, with examples of flowering time, frost tolerance, and adaptive and agronomic traits. In this review, we summarize the challenges of genome analysis in polyploid crops, describe the various types of genomic structural variation and the genomics technologies and data that can be used to detect them, and collate information produced to date related to the impact of genomic structural variation on crop phenotypes. We highlight the importance of genomic structural variation for the future genetic improvement of polyploid crops.


April 21, 2020

A Species-Wide Inventory of NLR Genes and Alleles in Arabidopsis thaliana.

Infectious disease is both a major force of selection in nature and a prime cause of yield loss in agriculture. In plants, disease resistance is often conferred by nucleotide-binding leucine-rich repeat (NLR) proteins, intracellular immune receptors that recognize pathogen proteins and their effects on the host. Consistent with extensive balancing and positive selection, NLRs are encoded by one of the most variable gene families in plants, but the true extent of intraspecific NLR diversity has been unclear. Here, we define a nearly complete species-wide pan-NLRome in Arabidopsis thaliana based on sequence enrichment and long-read sequencing. The pan-NLRome largely saturates with approximately 40 well-chosen wild strains, with half of the pan-NLRome being present in most accessions. We chart NLR architectural diversity, identify new architectures, and quantify selective forces that act on specific NLRs and NLR domains. Our study provides a blueprint for defining pan-NLRomes.Copyright © 2019 The Author(s). Published by Elsevier Inc. All rights reserved.


April 21, 2020

Genome of the Komodo dragon reveals adaptations in the cardiovascular and chemosensory systems of monitor lizards.

Monitor lizards are unique among ectothermic reptiles in that they have high aerobic capacity and distinctive cardiovascular physiology resembling that of endothermic mammals. Here, we sequence the genome of the Komodo dragon Varanus komodoensis, the largest extant monitor lizard, and generate a high-resolution de novo chromosome-assigned genome assembly for V. komodoensis using a hybrid approach of long-range sequencing and single-molecule optical mapping. Comparing the genome of V. komodoensis with those of related species, we find evidence of positive selection in pathways related to energy metabolism, cardiovascular homoeostasis, and haemostasis. We also show species-specific expansions of a chemoreceptor gene family related to pheromone and kairomone sensing in V. komodoensis and other lizard lineages. Together, these evolutionary signatures of adaptation reveal the genetic underpinnings of the unique Komodo dragon sensory and cardiovascular systems, and suggest that selective pressure altered haemostasis genes to help Komodo dragons evade the anticoagulant effects of their own saliva. The Komodo dragon genome is an important resource for understanding the biology of monitor lizards and reptiles worldwide.


April 21, 2020

Noncoding CGG repeat expansions in neuronal intranuclear inclusion disease, oculopharyngodistal myopathy and an overlapping disease.

Noncoding repeat expansions cause various neuromuscular diseases, including myotonic dystrophies, fragile X tremor/ataxia syndrome, some spinocerebellar ataxias, amyotrophic lateral sclerosis and benign adult familial myoclonic epilepsies. Inspired by the striking similarities in the clinical and neuroimaging findings between neuronal intranuclear inclusion disease (NIID) and fragile X tremor/ataxia syndrome caused by noncoding CGG repeat expansions in FMR1, we directly searched for repeat expansion mutations and identified noncoding CGG repeat expansions in NBPF19 (NOTCH2NLC) as the causative mutations for NIID. Further prompted by the similarities in the clinical and neuroimaging findings with NIID, we identified similar noncoding CGG repeat expansions in two other diseases: oculopharyngeal myopathy with leukoencephalopathy and oculopharyngodistal myopathy, in LOC642361/NUTM2B-AS1 and LRP12, respectively. These findings expand our knowledge of the clinical spectra of diseases caused by expansions of the same repeat motif, and further highlight how directly searching for expanded repeats can help identify mutations underlying diseases.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.