April 21, 2020  |  

A 12-kb structural variation in progressive myoclonic epilepsy was newly identified by long-read whole-genome sequencing.

We report a family with progressive myoclonic epilepsy who underwent whole-exome sequencing but was negative for pathogenic variants. Similar clinical courses of a devastating neurodegenerative phenotype of two affected siblings were highly suggestive of a genetic etiology, which indicates that the survey of genetic variation by whole-exome sequencing was not comprehensive. To investigate the presence of a variant that remained unrecognized by standard genetic testing, PacBio long-read sequencing was performed. Structural variant (SV) detection using low-coverage (6×) whole-genome sequencing called 17,165 SVs (7,216 deletions and 9,949 insertions). Our SV selection narrowed down potential candidates to only five SVs (two deletions and three insertions) on the genes tagged with autosomal recessive phenotypes. Among them, a 12.4-kb deletion involving the CLN6 gene was the top candidate because its homozygous abnormalities cause neuronal ceroid lipofuscinosis. This deletion included the initiation codon and was found in a GC-rich region containing multiple repetitive elements. These results indicate the presence of a causal variant in a difficult-to-sequence region and suggest that such variants that remain enigmatic after the application of current whole-exome sequencing technology could be uncovered by unbiased application of long-read whole-genome sequencing.
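The SV selection step described above can be illustrated with a minimal sketch. It assumes that candidate SVs are kept when they overlap a gene tagged with an autosomal recessive phenotype; the abstract does not give the authors' actual criteria, so the classes, the `recessive` flag, the gene names, and all coordinates below are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SV:
    chrom: str
    start: int   # 0-based, inclusive
    end: int     # 0-based, exclusive
    svtype: str  # "DEL" or "INS"


@dataclass(frozen=True)
class Gene:
    name: str
    chrom: str
    start: int
    end: int
    recessive: bool  # hypothetical tag: gene has an autosomal recessive phenotype


def overlaps(sv: SV, gene: Gene) -> bool:
    """True if the half-open intervals share at least one base on the same chromosome."""
    return sv.chrom == gene.chrom and sv.start < gene.end and gene.start < sv.end


def select_candidates(svs, genes):
    """Keep (SV, gene name) pairs where the SV overlaps a recessive-tagged gene."""
    recessive_genes = [g for g in genes if g.recessive]
    return [(sv, g.name) for sv in svs for g in recessive_genes if overlaps(sv, g)]


if __name__ == "__main__":
    # Toy data with invented coordinates, not taken from the study.
    svs = [
        SV("chr1", 1000, 13400, "DEL"),   # 12.4-kb deletion
        SV("chr1", 50000, 50001, "INS"),  # insertion anchored at one base
        SV("chr2", 10, 500, "DEL"),
    ]
    genes = [
        Gene("GENE_A", "chr1", 12000, 20000, True),
        Gene("GENE_B", "chr1", 49000, 51000, False),
        Gene("GENE_C", "chr2", 1000, 2000, True),
    ]
    for sv, name in select_candidates(svs, genes):
        print(sv.svtype, sv.chrom, sv.start, sv.end, name)
```

A real pipeline would read SV calls from a VCF and gene annotations from a curated database, and would likely add further filters (for example, zygosity and population frequency); this sketch shows only the interval-overlap idea.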


April 21, 2020  |  

Fast and accurate genomic analyses using genome graphs.

The human reference genome serves as the foundation for genomics by providing a scaffold for alignment of sequencing reads, but currently only reflects a single consensus haplotype, thus impairing analysis accuracy. Here we present a graph reference genome implementation that enables read alignment across 2,800 diploid genomes encompassing 12.6 million SNPs and 4.0 million insertions and deletions (indels). The pipeline processes one whole-genome sequencing sample in 6.5 h using a system with 36 CPU cores. We show that using a graph genome reference improves read mapping sensitivity and produces a 0.5% increase in variant calling recall, with unaffected specificity. Structural variations incorporated into a graph genome can be genotyped accurately under a unified framework. Finally, we show that iterative augmentation of graph genomes yields incremental gains in variant calling accuracy. Our implementation is an important advance toward fulfilling the promise of graph genomes to radically enhance the scalability and accuracy of genomic analyses.
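To make the idea of a graph reference concrete, the toy sketch below builds a four-node sequence graph containing one SNP "bubble" and shows a read that matches the alternate haplotype but not the linear reference. It is an illustration only: the node sequences are invented, and enumerating every path with exact substring matching is an assumption made for brevity, not the alignment method used by the implementation described above.

```python
def haplotypes(edges, seqs, node, end):
    """Yield the sequence spelled by every path from `node` to `end` in a DAG."""
    if node == end:
        yield seqs[node]
        return
    for nxt in edges[node]:
        for tail in haplotypes(edges, seqs, nxt, end):
            yield seqs[node] + tail


def read_maps(read, sequences):
    """Exact-match 'alignment': True if the read occurs in any given sequence."""
    return any(read in s for s in sequences)


# Toy graph: node 1 carries the reference allele, node 2 the alternate allele.
seqs = {0: "ACG", 1: "T", 2: "C", 3: "GGA"}
edges = {0: [1, 2], 1: [3], 2: [3], 3: []}

linear_reference = "ACGTGGA"                      # path 0 -> 1 -> 3 only
graph_paths = list(haplotypes(edges, seqs, 0, 3))

read = "GCGG"  # spans the alternate allele
print(read_maps(read, [linear_reference]))  # False
print(read_maps(read, graph_paths))         # True
```

Practical graph aligners index the graph and tolerate mismatches rather than enumerating paths, since the number of paths grows exponentially with the number of variant sites.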


April 21, 2020  |  

Current advances in HIV vaccine preclinical studies using Macaque models.

The macaque simian or simian/human immunodeficiency virus (SIV/SHIV) challenge model has been widely used to inform and guide human vaccine trials. Substantial advances have been made recently in the application of the repeated-low-dose (RLD) challenge approach to assess SIV/SHIV vaccine efficacy (VE). Some candidate HIV vaccines have shown protective effects in preclinical studies using the macaque SIV/SHIV model, but the model’s true predictive value for screening potential HIV vaccine candidates needs to be evaluated further. Here, we review key parameters used in the RLD approach and discuss their relevance for evaluating VE to improve preclinical studies of candidate HIV vaccines. Crown Copyright © 2019. Published by Elsevier Ltd. All rights reserved.


April 21, 2020  |  

Genome Sequence of Jaltomata Addresses Rapid Reproductive Trait Evolution and Enhances Comparative Genomics in the Hyper-Diverse Solanaceae.

Within the economically important plant family Solanaceae, Jaltomata is a rapidly evolving genus that has extensive diversity in flower size and shape, as well as fruit and nectar color, among its ~80 species. Here, we report the whole-genome sequencing, assembly, and annotation of one representative species (Jaltomata sinuosa) from this genus. Combining PacBio long reads (25×) and Illumina short reads (148×) achieved an assembly of ~1.45 Gb, spanning ~96% of the estimated genome. Ninety-six percent of curated single-copy orthologs in plants were detected in the assembly, supporting a high level of completeness of the genome. Similar to other Solanaceous species, repetitive elements made up a large fraction (~80%) of the genome, with the most recently active element, Gypsy, expanding across the genome in the last 1-2 Myr. Computational gene prediction, in conjunction with a merged transcriptome data set from 11 tissues, identified 34,725 protein-coding genes. Comparative phylogenetic analyses with six other sequenced Solanaceae species determined that Jaltomata is most likely sister to Solanum, although a large fraction of gene trees supported a conflicting bipartition consistent with substantial introgression between Jaltomata and Capsicum after these species split. We also identified gene family dynamics specific to Jaltomata, including expansion of gene families potentially involved in novel reproductive trait development, and loss of gene families that accompanied the loss of self-incompatibility. This high-quality genome will facilitate studies of phenotypic diversification in this rapidly radiating group and provide a new point of comparison for broader analyses of genomic evolution across the Solanaceae.


April 21, 2020  |  

One Aeromonas salmonicida subsp. salmonicida isolate with a pAsa5 variant bearing antibiotic resistance and a pRAS3 variant making a link with a swine pathogen.

The Gram-negative bacterium Aeromonas salmonicida subsp. salmonicida is an aquatic pathogen which causes furunculosis in salmonids, especially in fish farms. The emergence of strains of this bacterium exhibiting antibiotic resistance is increasing, limiting the effectiveness of antibiotherapy as a treatment against this worldwide disease. In the present study, we discovered an isolate of A. salmonicida subsp. salmonicida that harbors two novel plasmid variants carrying antibiotic resistance genes. The use of long-read sequencing (PacBio) allowed us to fully characterize those variants, named pAsa5-3432 and pRAS3-3432, which both differ from their classic counterparts in their content of mobile genetic elements. The plasmid pAsa5-3432 carries a new multidrug region composed of multiple mobile genetic elements, including a Class 1 integron similar to an integrated element of Salmonella enterica. With this new region, probably acquired through plasmid recombination, pAsa5-3432 is the first reported plasmid of this bacterium that bears both an essential virulence factor (the type three secretion system) and multiple antibiotic resistance genes. As for pRAS3-3432, compared to the classic pRAS3, it carries a new mobile element that has only been identified in Chlamydia suis. Hence, with the identification of those two novel plasmids harboring mobile genetic elements that are normally encountered in other bacterial species, the present study puts emphasis on the important impact of mobile genetic elements in the genomic plasticity of A. salmonicida subsp. salmonicida and suggests that this aquatic bacterium could be an important reservoir of antibiotic resistance genes that can be exchanged with other bacteria, including human and animal pathogens. Copyright © 2019 Elsevier B.V. All rights reserved.


April 21, 2020  |  

Function and Distribution of a Lantipeptide in Strawberry Fusarium Wilt Disease-Suppressive Soils.

Streptomyces griseus S4-7 is representative of strains responsible for the specific soil suppressiveness of Fusarium wilt of strawberry caused by Fusarium oxysporum f. sp. fragariae. Members of the genus Streptomyces secrete diverse secondary metabolites including lantipeptides, heat-stable lanthionine-containing compounds that can exhibit antibiotic activity. In this study, a class II lantipeptide provisionally named grisin, of previously unknown biological function, was shown to inhibit F. oxysporum. The inhibitory activity of grisin distinguishes it from other class II lantipeptides from Streptomyces spp. Results of quantitative reverse transcription-polymerase chain reaction with lanM-specific primers showed that the density of grisin-producing Streptomyces spp. in the rhizosphere of strawberry was positively correlated with the number of years of monoculture and a minimum of seven years was required for development of specific soil suppressiveness to Fusarium wilt disease. We suggest that lanM can be used as a diagnostic marker of whether a soil is conducive or suppressive to the disease.


April 21, 2020  |  

Hybrid sequencing-based personal full-length transcriptomic analysis implicates proteostatic stress in metastatic ovarian cancer.

Comprehensive molecular characterization of myriad somatic alterations and aberrant gene expression at the personal level is key to precision cancer therapy, yet it is limited by current short-read sequencing technology, and an individualized catalog of complete genomic and transcriptomic features has thus far been elusive. Here, we integrated second- and third-generation sequencing platforms to generate a multidimensional dataset on a patient affected by metastatic epithelial ovarian cancer. Whole-genome and hybrid transcriptome dissection captured global genetic and transcriptional variants at previously unparalleled resolution. Particularly, single-molecule mRNA sequencing identified a vast array of unannotated transcripts, novel long noncoding RNAs and gene chimeras, permitting accurate determination of transcription start, splice, polyadenylation and fusion sites. Phylogenetic and enrichment inference of isoform-level measurements implicated early functional divergence and cytosolic proteostatic stress in shaping ovarian tumorigenesis. A complementary imaging-based high-throughput drug screen was performed and subsequently validated, which consistently pinpointed proteasome inhibitors as an effective therapeutic regime by inducing protein aggregates in ovarian cancer cells. Therefore, our study suggests that clinical application of the emerging long-read full-length analysis for improving molecular diagnostics is feasible and informative. An in-depth understanding of the tumor transcriptome complexity allowed by leveraging the hybrid sequencing approach lays the basis to reveal novel and valid therapeutic vulnerabilities in advanced ovarian malignancies.


April 21, 2020  |  

Comparative analysis of KPC-2-encoding chimera plasmids with multi-replicon IncR:IncpA1763-KPC:IncN1 or IncFIIpHN7A8:IncpA1763-KPC:IncN1.

IncR, IncFII, IncpA1763-KPC, and IncN1 plasmids have been increasingly found among Enterobacteriaceae species, but plasmids with hybrid structures derived from the above-mentioned incompatibility groups have not yet been described. Plasmids p721005-KPC, p504051-KPC, and pA3295-KPC were fully sequenced and compared with previously sequenced related plasmids pHN84KPC (IncR), pKPHS2 (IncFIIK), pKOX_NDM1 (IncFIIY), pHN7A8 (IncFIIpHN7A8), and R46 (IncN1). The backbone of p721005-KPC/p504051-KPC was a hybrid of the entire 10-kb IncR-type backbone from pHN84KPC, the entire 64.3-kb IncFIIK-type maintenance and conjugal transfer regions from pKPHS2, a 15.5-kb IncFIIY-type maintenance region from pKOX_NDM1 and a 5.6-kb IncpA1763-KPC-type backbone region from pA1763-KPC, and it contained a primary IncR replicon and two auxiliary IncpA1763-KPC and IncN1 replicons. The backbone of pA3295-KPC was a hybrid of a 7.2-kb IncFIIpHN7A8-type backbone region from pHN7A8, the almost entire 33.3-kb IncN1-type maintenance and conjugal transfer regions highly similar to R46, a 26.2-kb IncFIIK-type maintenance region from pKPHS2, the above 15.5-kb IncFIIY-type maintenance region, and the above 5.6-kb IncpA1763-KPC-type backbone region, and it contained a primary IncFIIpHN7A8 replicon and two auxiliary IncpA1763-KPC and IncN1 replicons. Each of p721005-KPC, p504051-KPC, and pA3295-KPC acquired a wealth of accessory modules, carrying a range of intact and residual mobile elements (such as insertion sequences, unit transposons, and integrons) and resistance markers (such as blaKPC, tetA, dfrA, and qnr). In each of p721005-KPC, p504051-KPC, and pA3295-KPC, multiple replicons in coordination with maintenance and conjugation regions of various origins would maintain a broad host range and a stable replication at a steady-state plasmid copy number.


April 21, 2020  |  

Long-read sequencing identified intronic repeat expansions in SAMD12 from Chinese pedigrees affected with familial cortical myoclonic tremor with epilepsy.

The locus for familial cortical myoclonic tremor with epilepsy (FCMTE) has long been mapped to 8q24 in linkage studies, but the causative mutations remain unclear. Recently, expansions of intronic TTTCA and TTTTA repeat motifs within SAMD12 were found to be involved in the pathogenesis of FCMTE in Japanese pedigrees. We aim to identify the causative mutations of FCMTE in Chinese pedigrees. We performed genetic linkage analysis by microsatellite markers in a five-generation Chinese pedigree with 55 members. We also used array-comparative genomic hybridisation (CGH) and next-generation sequencing (NGS) technologies (whole-exome sequencing, capture region deep sequencing and whole-genome sequencing) to identify the causative mutations in the disease locus. Recently, we used low-coverage (~10×) long-read genome sequencing (LRS) on the PacBio Sequel and Oxford Nanopore platforms to identify the causative mutations, and used repeat-primed PCR for validation of the repeat expansions. Linkage analysis mapped the disease locus to 8q23.3-24.23. Array-CGH and NGS failed to identify causative mutations in this locus. LRS identified the intronic TTTCA and TTTTA repeat expansions in SAMD12 as the causative mutations, thus corroborating the recently published results in Japanese pedigrees. We identified the pentanucleotide repeat expansion in SAMD12 as the causative mutation in Chinese FCMTE pedigrees. Our study also suggested that LRS is an effective tool for molecular diagnosis of genetic disorders, especially for neurological diseases that cannot be positively diagnosed by conventional clinical microarray and NGS technologies. © Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.


April 21, 2020  |  

Toxin and genome evolution in a Drosophila defensive symbiosis.

Defenses conferred by microbial symbionts play a vital role in the health and fitness of their animal hosts. An important outstanding question in the study of defensive symbiosis is what determines long term stability and effectiveness against diverse natural enemies. In this study, we combine genome and transcriptome sequencing, symbiont transfection and parasite protection experiments, and toxin activity assays to examine the evolution of the defensive symbiosis between Drosophila flies and their vertically transmitted Spiroplasma bacterial symbionts, focusing in particular on ribosome-inactivating proteins (RIPs), symbiont-encoded toxins that have been implicated in protection against both parasitic wasps and nematodes. Although many strains of Spiroplasma, including the male-killing symbiont (sMel) of Drosophila melanogaster, protect against parasitic wasps, only the strain (sNeo) that infects the mycophagous fly Drosophila neotestacea appears to protect against parasitic nematodes. We find that RIP repertoire is a major differentiating factor between strains that do and do not offer nematode protection, and that sMel RIPs do not show activity against nematode ribosomes in vivo. We also discovered a strain of Spiroplasma infecting a mycophagous phorid fly, Megaselia nigra. Although both the host and its Spiroplasma are distantly related to D. neotestacea and its symbiont, genome sequencing revealed that the M. nigra symbiont encodes abundant and diverse RIPs, including plasmid-encoded toxins that are closely related to the RIPs in sNeo. Our results suggest that distantly related Spiroplasma RIP toxins may perform specialized functions with regard to parasite specificity and suggest an important role for horizontal gene transfer in the emergence of novel defensive phenotypes.


April 21, 2020  |  

Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense.

Allotetraploid cotton species (Gossypium hirsutum and Gossypium barbadense) have long been cultivated worldwide for natural renewable textile fibers. The draft genome sequences of both species are available but they are highly fragmented and incomplete [1-4]. Here we report reference-grade genome assemblies and annotations for G. hirsutum accession Texas Marker-1 (TM-1) and G. barbadense accession 3-79 by integrating single-molecule real-time sequencing, BioNano optical mapping and high-throughput chromosome conformation capture techniques. Compared with previous assembled draft genomes [1,3], these genome sequences show considerable improvements in contiguity and completeness for regions with high content of repeats such as centromeres. Comparative genomics analyses identify extensive structural variations that probably occurred after polyploidization, highlighted by large paracentric/pericentric inversions in 14 chromosomes. We constructed an introgression line population to introduce favorable chromosome segments from G. barbadense to G. hirsutum, allowing us to identify 13 quantitative trait loci associated with superior fiber quality. These resources will accelerate evolutionary and functional genomic studies in cotton and inform future breeding programs for fiber improvement.


April 21, 2020  |  

A chromosome-scale genome assembly reveals a highly dynamic effector repertoire of wheat powdery mildew.

Blumeria graminis f. sp. tritici (B.g. tritici) is the causal agent of the wheat powdery mildew disease. The highly fragmented B.g. tritici genome available so far has prevented a systematic analysis of effector genes that are known to be involved in host adaptation. To study the diversity and evolution of effector genes, we produced a chromosome-scale assembly of the B.g. tritici genome. The genome assembly and annotation were achieved by combining long-read sequencing with high-density genetic mapping, bacterial artificial chromosome fingerprinting and transcriptomics. We found that the 166.6 Mb B.g. tritici genome encodes 844 candidate effector genes, over 40% more than previously reported. Candidate effector genes have characteristic local genomic organization such as gene clustering and enrichment for recombination-active regions and certain transposable element families. A large group of 412 candidate effector genes shows high plasticity in terms of copy number variation in a global set of 36 isolates and of transcription levels. Our data suggest that copy number variation and transcriptional flexibility are the main drivers for adaptation in B.g. tritici. The high repeat content may play a role in providing a genomic environment that allows rapid evolution of effector genes with selection as the driving force. © 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.


April 21, 2020  |  

Multiple modes of convergent adaptation in the spread of glyphosate-resistant Amaranthus tuberculatus.

The selection pressure exerted by herbicides has led to the repeated evolution of herbicide resistance in weeds. The evolution of herbicide resistance on contemporary timescales in turn provides an outstanding opportunity to investigate key questions about the genetics of adaptation, in particular the relative importance of adaptation from new mutations, standing genetic variation, or geographic spread of adaptive alleles through gene flow. Glyphosate-resistant Amaranthus tuberculatus poses one of the most significant threats to crop yields in the Midwestern United States, with both agricultural populations and herbicide resistance only recently emerging in Canada. To understand the evolutionary mechanisms driving the spread of resistance, we sequenced and assembled the A. tuberculatus genome and investigated the origins and population genomics of 163 resequenced glyphosate-resistant and susceptible individuals from Canada and the United States. In Canada, we discovered multiple modes of convergent evolution: in one locality, resistance appears to have evolved through introductions of preadapted US genotypes, while in another, there is evidence for the independent evolution of resistance on genomic backgrounds that are historically nonagricultural. Moreover, resistance on these local, nonagricultural backgrounds appears to have occurred predominantly through the partial sweep of a single haplotype. In contrast, resistant haplotypes arising from the Midwestern United States show multiple amplification haplotypes segregating both between and within populations. Therefore, while the remarkable species-wide diversity of A. tuberculatus has facilitated geographic parallel adaptation of glyphosate resistance, more recently established agricultural populations are limited to adaptation in a more mutation-limited framework. Copyright © 2019 the Author(s). Published by PNAS.


April 21, 2020  |  

Detecting a long insertion variant in SAMD12 by SMRT sequencing: implications of long-read whole-genome sequencing for repeat expansion diseases.

Long-read sequencing technology is now capable of reading single-molecule DNA with an average read length of more than 10 kb, fully enabling the coverage of large structural variations (SVs). This advantage may pave the way for the detection of unprecedented SVs as well as repeat expansions. Pathogenic SVs of only known genes used to be selectively analyzed based on prior knowledge of target DNA sequence. The unbiased application of long-read whole-genome sequencing (WGS) for the detection of pathogenic SVs has just begun. Here, we apply PacBio SMRT sequencing in a Japanese family with benign adult familial myoclonus epilepsy (BAFME). Our SV selection of low-coverage WGS data (7×) narrowed down the candidates to only six SVs in a 7.16-Mb region of the BAFME1 locus and correctly determined an approximately 4.6-kb SAMD12 intronic repeat insertion, which causes BAFME1. These results indicate that long-read WGS is potentially useful for evaluating all of the known SVs in a genome and identifying new disease-causing SVs in combination with other genetic methods to resolve the genetic causes of currently unexplained diseases.


April 21, 2020  |  

Computational aspects underlying genome to phenome analysis in plants.

Recent advances in genomics technologies have greatly accelerated the progress in both fundamental plant science and applied breeding research. Concurrently, high-throughput plant phenotyping is becoming widely adopted in the plant community, promising to alleviate the phenotypic bottleneck. While these technological breakthroughs are significantly accelerating quantitative trait locus (QTL) and causal gene identification, challenges to enable even more sophisticated analyses remain. In particular, care needs to be taken to standardize, describe and conduct experiments robustly while relying on plant physiology expertise. In this article, we review the state of the art regarding genome assembly and the future potential of pangenomics in plant research. We also describe the necessity of standardizing and describing phenotypic studies using the Minimum Information About a Plant Phenotyping Experiment (MIAPPE) standard to enable the reuse and integration of phenotypic data. In addition, we show how deep phenotypic data might yield novel trait-trait correlations and review how to link phenotypic data to genomic data. Finally, we provide perspectives on the golden future of machine learning and its potential in linking phenotypes to genomic features. © 2018 The Authors The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.

