Menu
April 21, 2020

BjuWRR1, a CC-NB-LRR gene identified in Brassica juncea, confers resistance to white rust caused by Albugo candida.

BjuWRR1, a CNL-type R gene, was identified from an east European gene pool line of Brassica juncea and validated for conferring resistance to white rust by genetic transformation. White rust caused by the oomycete pathogen Albugo candida is a significant disease of crucifer crops including Brassica juncea (mustard), a major oilseed crop of the Indian subcontinent. Earlier, a resistance-conferring locus named AcB1-A5.1 was mapped in an east European gene pool line of B. juncea-Donskaja-IV. This line was tested along with some other lines of B. juncea (AABB), B. rapa (AA) and B. nigra (BB) for resistance to six isolates of A. candida collected from different mustard growing regions of India. Donskaja-IV was found to be completely resistant to all the tested isolates. Sequencing of a BAC spanning the locus AcB1-A5.1 showed the presence of a single CC-NB-LRR protein encoding R gene. The genomic sequence of the putative R gene with its native promoter and terminator was used for the genetic transformation of a susceptible Indian gene pool line Varuna and was found to confer complete resistance to all the isolates. This is the first white rust resistance-conferring gene described from Brassica species and has been named BjuWRR1. Allelic variants of the gene in B. juncea germplasm and orthologues in the Brassicaceae genomes were studied to understand the evolutionary dynamics of the BjuWRR1 gene.


April 21, 2020

CRISPR/Cas9-targeted enrichment and long-read sequencing of the Fuchs endothelial corneal dystrophy-associated TCF4 triplet repeat.

To demonstrate the utility of an amplification-free long-read sequencing method to characterize the Fuchs endothelial corneal dystrophy (FECD)-associated intronic TCF4 triplet repeat (CTG18.1).We applied an amplification-free method, utilizing the CRISPR/Cas9 system, in combination with PacBio single-molecule real-time (SMRT) long-read sequencing, to study CTG18.1. FECD patient samples displaying a diverse range of CTG18.1 allele lengths and zygosity status (n?=?11) were analyzed. A robust data analysis pipeline was developed to effectively filter, align, and interrogate CTG18.1-specific reads. All results were compared with conventional polymerase chain reaction (PCR)-based fragment analysis.CRISPR-guided SMRT sequencing of CTG18.1 provided accurate genotyping information for all samples and phasing was possible for 18/22 alleles sequenced. Repeat length instability was observed for all expanded (=50 repeats) phased CTG18.1 alleles analyzed. Furthermore, higher levels of repeat instability were associated with increased CTG18.1 allele length (mode length =91 repeats) indicating that expanded alleles behave dynamically.CRISPR-guided SMRT sequencing of CTG18.1 has revealed novel insights into CTG18.1 length instability. Furthermore, this study provides a framework to improve the molecular diagnostic accuracy for CTG18.1-mediated FECD, which we anticipate will become increasingly important as gene-directed therapies are developed for this common age-related and sight threatening disease.


April 21, 2020

Fam83F induces p53 stabilisation and promotes its activity.

p53 is one of the most important tumour suppressor proteins currently known. It is activated in response to DNA damage and this activation leads to proliferation arrest and cell death. The abundance and activity of p53 are tightly controlled and reductions in p53’s activity can contribute to the development of cancer. Here, we show that Fam83F increases p53 protein levels by protein stabilisation. Fam83F interacts with p53 and decreases its ubiquitination and degradation. Fam83F is induced in response to DNA damage and its overexpression also increases p53 activity in cell culture experiments and in zebrafish embryos. Downregulation of Fam83F decreases transcription of p53 target genes in response to DNA damage and increases cell proliferation, identifying Fam83F as an important regulator of the DNA damage response. Overexpression of Fam83F also enhances migration of cells harbouring mutant p53 demonstrating that it can also activate mutant forms of p53.


April 21, 2020

Schizophrenia risk variants influence multiple classes of transcripts of sorting nexin 19 (SNX19).

Genome-wide association studies (GWAS) have identified many genomic loci associated with risk for schizophrenia, but unambiguous identification of the relationship between disease-associated variants and specific genes, and in particular their effect on risk conferring transcripts, has proven difficult. To better understand the specific molecular mechanism(s) at the schizophrenia locus in 11q25, we undertook cis expression quantitative trait loci (cis-eQTL) mapping for this 2 megabase genomic region using postmortem human brain samples. To comprehensively assess the effects of genetic risk upon local expression, we evaluated multiple transcript features: genes, exons, and exon-exon junctions in multiple brain regions-dorsolateral prefrontal cortex (DLPFC), hippocampus, and caudate. Genetic risk variants strongly associated with expression of SNX19 transcript features that tag multiple rare classes of SNX19 transcripts, whereas they only weakly affected expression of an exon-exon junction that tags the majority of abundant transcripts. The most prominent class of SNX19 risk-associated transcripts is predicted to be overexpressed, defined by an exon-exon splice junction between exons 8 and 10 (junc8.10) and that is predicted to encode proteins that lack the characteristic nexin C terminal domain. Risk alleles were also associated with either increased or decreased expression of multiple additional classes of transcripts. With RACE, molecular cloning, and long read sequencing, we found a number of novel SNX19 transcripts that further define the set of potential etiological transcripts. We explored epigenetic regulation of SNX19 expression and found that DNA methylation at CpG sites near the primary transcription start site and within exon 2 partially mediate the effects of risk variants on risk-associated expression. ATAC sequencing revealed that some of the most strongly risk-associated SNPs are located within a region of open chromatin, suggesting a nearby regulatory element is involved. These findings indicate a potentially complex molecular etiology, in which risk alleles for schizophrenia generate epigenetic alterations and dysregulation of multiple classes of SNX19 transcripts.


April 21, 2020

A Highly Unusual V1 Region of Env in an Elite Controller of HIV Infection.

HIV elite controllers represent a remarkable minority of patients who maintain normal CD4+ T-cell counts and low or undetectable viral loads for decades in the absence of antiretroviral therapy. To examine the possible contribution of virus attenuation to elite control, we obtained a primary HIV-1 isolate from an elite controller who had been infected for 19?years, the last 10 of which were in the absence of antiretroviral therapy. Full-length sequencing of this isolate revealed a highly unusual V1 domain in Envelope (Env). The V1 domain in this HIV-1 strain was 49 amino acids, placing it in the top 1% of lengths among the 6,112 Env sequences in the Los Alamos National Laboratory online database. Furthermore, it included two additional N-glycosylation sites and a pair of cysteines suggestive of an extra disulfide loop. Virus with this Env retained good infectivity and replicative capacity; however, analysis of recombinant viruses suggested that other sequences in Env were adapted to accommodate the unusual V1 domain. While the long V1 domain did not confer resistance to neutralization by monoclonal antibodies of the V1/V2-glycan-dependent class, it did confer resistance to neutralization by monoclonal antibodies of the V3-glycan-dependent class. Our findings support results in the literature that suggest a role for long V1 regions in shielding HIV-1 from recognition by V3-directed broadly neutralizing antibodies. In the case of the elite controller described here, it seems likely that selective pressures from the humoral immune system were responsible for driving the highly unusual polymorphisms present in this HIV-1 Envelope.IMPORTANCE Elite controllers have long provided an avenue for researchers to reveal mechanisms underlying control of HIV-1. While the role of host genetic factors in facilitating elite control is well known, the possibility of infection by attenuated strains of HIV-1 has been much less studied. Here we describe an unusual viral feature found in an elite controller of HIV-1 infection and demonstrate its role in conferring escape from monoclonal antibodies of the V3-glycan class. Our results suggest that extreme variation may be needed by HIV-1 to escape neutralization by some antibody specificities. Copyright © 2019 Silver et al.


April 21, 2020

Mitochondrial DNA Variants in Patients with Liver Injury Due to Anti-Tuberculosis Drugs.

Hepatotoxicity is the most severe adverse effect of anti-tuberculosis therapy. Isoniazid’s metabolite hydrazine is a mitochondrial complex II inhibitor. We hypothesized that mitochondrial DNA variants are risk factors for drug-induced liver injury (DILI) due to isoniazid, rifampicin or pyrazinamide.We obtained peripheral blood from tuberculosis (TB) patients before anti-TB therapy. A total of 38 patients developed DILI due to anti-TB drugs. We selected 38 patients with TB but without DILI as controls. Next-generation sequencing detected point mutations in the mitochondrial DNA genome. DILI was defined as ALT =5 times the upper limit of normal (ULN), or ALT =3 times the ULN with total bilirubin =2 times the ULN.In 38 patients with DILI, the causative drug was isoniazid in eight, rifampicin in 14 and pyrazinamide in 16. Patients with isoniazid-induced liver injury had more variants in complex I’s NADH subunit 5 and 1 genes, more nonsynonymous mutations in NADH subunit 5, and a higher ratio of nonsynonymous to total substitutions. Patients with rifampicin- or pyrazinamide-induced liver injury had no association with mitochondrial DNA variants.Variants in complex I’s subunit 1 and 5 genes might affect respiratory chain function and predispose isoniazid-induced liver injury when exposed to hydrazine, a metabolite of isoniazid and a complex II inhibitor.


April 21, 2020

Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population.

The central goal of medical genomics is to understand the inherited basis of sequence variation that underlies human physiology, evolution, and disease. Functional association studies currently ignore millions of bases that span each centromeric region and acrocentric short arm. These regions are enriched in long arrays of tandem repeats, or satellite DNAs, that are known to vary extensively in copy number and repeat structure in the human population. Satellite sequence variation in the human genome is often so large that it is detected cytogenetically, yet due to the lack of a reference assembly and informatics tools to measure this variability, contemporary high-resolution disease association studies are unable to detect causal variants in these regions. Nevertheless, recently uncovered associations between satellite DNA variation and human disease support that these regions present a substantial and biologically important fraction of human sequence variation. Therefore, there is a pressing and unmet need to detect and incorporate this uncharacterized sequence variation into broad studies of human evolution and medical genomics. Here I discuss the current knowledge of satellite DNA variation in the human genome, focusing on centromeric satellites and their potential implications for disease.


April 21, 2020

Full-Length Multi-Barcoding: DNA Barcoding from Single Ingredient to Complex Mixtures.

DNA barcoding has been used for decades, although it has mostly been applied to somesingle-species. Traditional Chinese medicine (TCM), which is mainly used in the form ofcombination-one type of the multi-species, identification is crucial for clinical usage.Next-generation Sequencing (NGS) has been used to address this authentication issue for the pastfew years, but conventional NGS technology is hampered in application due to its short sequencingreads and systematic errors. Here, a novel method, Full-length multi-barcoding (FLMB) vialong-read sequencing, is employed for the identification of biological compositions in herbalcompound formulas in adequate and well controlled studies. By directly sequencing the full-lengthamplicons of ITS2 and psbA-trnH through single-molecule real-time (SMRT) technology, thebiological composition of a classical prescription Sheng-Mai-San (SMS) was analyzed. At the sametime, clone-dependent Sanger sequencing was carried out as a parallel control. Further, anotherformula-Sanwei-Jili-San (SJS)-was analyzed with genes of ITS2 and CO1. All the ingredients inthe samples of SMS and SJS were successfully authenticated at the species level, and 11 exogenousspecies were also checked, some of which were considered as common contaminations in theseproducts. Methodology analysis demonstrated that this method was sensitive, accurate andreliable. FLMB, a superior but feasible approach for the identification of biological complexmixture, was established and elucidated, which shows perfect interpretation for DNA barcodingthat could lead its application in multi-species mixtures.


April 21, 2020

Uncovering Missing Heritability in Rare Diseases.

The problem of ‘missing heritability’ affects both common and rare diseases hindering: discovery, diagnosis, and patient care. The ‘missing heritability’ concept has been mainly associated with common and complex diseases where promising modern technological advances, like genome-wide association studies (GWAS), were unable to uncover the complete genetic mechanism of the disease/trait. Although rare diseases (RDs) have low prevalence individually, collectively they are common. Furthermore, multi-level genetic and phenotypic complexity when combined with the individual rarity of these conditions poses an important challenge in the quest to identify causative genetic changes in RD patients. In recent years, high throughput sequencing has accelerated discovery and diagnosis in RDs. However, despite the several-fold increase (from ~10% using traditional to ~40% using genome-wide genetic testing) in finding genetic causes of these diseases in RD patients, as is the case in common diseases-the majority of RDs are also facing the ‘missing heritability’ problem. This review outlines the key role of high throughput sequencing in uncovering genetics behind RDs, with a particular focus on genome sequencing. We review current advances and challenges of sequencing technologies, bioinformatics approaches, and resources.


April 21, 2020

Impact of Chromosomal Rearrangements on the Interpretation of Lupin Karyotype Evolution.

Plant genome evolution can be very complex and challenging to describe, even within a genus. Mechanisms that underlie genome variation are complex and can include whole-genome duplications, gene duplication and/or loss, and, importantly, multiple chromosomal rearrangements. Lupins (Lupinus) diverged from other legumes approximately 60 mya. In contrast to New World lupins, Old World lupins show high variability not only for chromosome numbers (2n = 32?52), but also for the basic chromosome number (x = 5?9, 13) and genome size. The evolutionary basis that underlies the karyotype evolution in lupins remains unknown, as it has so far been impossible to identify individual chromosomes. To shed light on chromosome changes and evolution, we used comparative chromosome mapping among 11 Old World lupins, with Lupinusangustifolius as the reference species. We applied set of L.angustifolius-derived bacterial artificial chromosome clones for fluorescence in situ hybridization. We demonstrate that chromosome variations in the species analyzed might have arisen from multiple changes in chromosome structure and number. We hypothesize about lupin karyotype evolution through polyploidy and subsequent aneuploidy. Additionally, we have established a cytogenomic map of L.angustifolius along with chromosome markers that can be used for related species to further improve comparative studies of crops and wild lupins.


April 21, 2020

Harnessing long-read amplicon sequencing to uncover NRPS and Type I PKS gene sequence diversity in polar desert soils.

The severity of environmental conditions at Earth’s frigid zones present attractive opportunities for microbial biomining due to their heightened potential as reservoirs for novel secondary metabolites. Arid soil microbiomes within the Antarctic and Arctic circles are remarkably rich in Actinobacteria and Proteobacteria, bacterial phyla known to be prolific producers of natural products. Yet the diversity of secondary metabolite genes within these cold, extreme environments remain largely unknown. Here, we employed amplicon sequencing using PacBio RS II, a third generation long-read platform, to survey over 200 soils spanning twelve east Antarctic and high Arctic sites for natural product-encoding genes, specifically targeting non-ribosomal peptides (NRPS) and Type I polyketides (PKS). NRPS-encoding genes were more widespread across the Antarctic, whereas PKS genes were only recoverable from a handful of sites. Many recovered sequences were deemed novel due to their low amino acid sequence similarity to known protein sequences, particularly throughout the east Antarctic sites. Phylogenetic analysis revealed that a high proportion were most similar to antifungal and biosurfactant-type clusters. Multivariate analysis showed that soil fertility factors of carbon, nitrogen and moisture displayed significant negative relationships with natural product gene richness. Our combined results suggest that secondary metabolite production is likely to play an important physiological component of survival for microorganisms inhabiting arid, nutrient-starved soils. © FEMS 2019.


April 21, 2020

Crustacean Genome Exploration Reveals the Evolutionary Origin of White Spot Syndrome Virus.

White spot syndrome virus (WSSV) is a crustacean-infecting, double-stranded DNA virus and is the most serious viral pathogen in the global shrimp industry. WSSV is the sole recognized member of the family Nimaviridae, and the lack of genomic data on other nimaviruses has obscured the evolutionary history of WSSV. Here, we investigated the evolutionary history of WSSV by characterizing WSSV relatives hidden in host genomic data. We surveyed 14 host crustacean genomes and identified five novel nimaviral genomes. Comparative genomic analysis of Nimaviridae identified 28 “core genes” that are ubiquitously conserved in Nimaviridae; unexpected conservation of 13 uncharacterized proteins highlighted yet-unknown essential functions underlying the nimavirus replication cycle. The ancestral Nimaviridae gene set contained five baculoviral per os infectivity factor homologs and a sulfhydryl oxidase homolog, suggesting a shared phylogenetic origin of Nimaviridae and insect-associated double-stranded DNA viruses. Moreover, we show that novel gene acquisition and subsequent amplification reinforced the unique accessory gene repertoire of WSSV. Expansion of unique envelope protein and nonstructural virulence-associated genes may have been the key genomic event that made WSSV such a deadly pathogen.IMPORTANCE WSSV is the deadliest viral pathogen threatening global shrimp aquaculture. The evolutionary history of WSSV has remained a mystery, because few WSSV relatives, or nimaviruses, had been reported. Our aim was to trace the history of WSSV using the genomes of novel nimaviruses hidden in host genome data. We demonstrate that WSSV emerged from a diverse family of crustacean-infecting large DNA viruses. By comparing the genomes of WSSV and its relatives, we show that WSSV possesses an expanded set of unique host-virus interaction-related genes. This extensive gene gain may have been the key genomic event that made WSSV such a deadly pathogen. Moreover, conservation of insect-infecting virus protein homologs suggests a common phylogenetic origin of crustacean-infecting Nimaviridae and other insect-infecting DNA viruses. Our work redefines the previously poorly characterized crustacean virus family and reveals the ancient genomic events that preordained the emergence of a devastating shrimp pathogen.Copyright © 2019 American Society for Microbiology.


April 21, 2020

Adaptive archaic introgression of copy number variants and the discovery of previously unknown human genes

As they migrated out of Africa and into Europe and Asia, anatomically modern humans interbred with archaic hominins, such as Neanderthals and Denisovans. The result of this genetic introgression on the recipient populations has been of considerable interest, especially in cases of selection for specific archaic genetic variants. Hsieh et al. characterized adaptive structural variants and copy number variants that are likely targets of positive selection in Melanesians. Focusing on population-specific regions of the genome that carry duplicated genes and show an excess of amino acid replacements provides evidence for one of the mechanisms by which genetic novelty can arise and result in differentiation between human genomes.Science, this issue p. eaax2083INTRODUCTIONCharacterizing genetic variants underlying local adaptations in human populations is one of the central goals of evolutionary research. Most studies have focused on adaptive single-nucleotide variants that either arose as new beneficial mutations or were introduced after interbreeding with our now-extinct relatives, including Neanderthals and Denisovans. The adaptive role of copy number variants (CNVs), another well-known form of genomic variation generated through deletions or duplications that affect more base pairs in the genome, is less well understood, despite evidence that such mutations are subject to stronger selective pressures.RATIONALEThis study focuses on the discovery of introgressed and adaptive CNVs that have become enriched in specific human populations. We combine whole-genome CNV calling and population genetic inference methods to discover CNVs and then assess signals of selection after controlling for demographic history. We examine 266 publicly available modern human genomes from the Simons Genome Diversity Project and genomes of three ancient homininstextemdasha Denisovan, a Neanderthal from the Altai Mountains in Siberia, and a Neanderthal from Croatia. We apply long-read sequencing methods to sequence-resolve complex CNVs of interest specifically in the Melanesianstextemdashan Oceanian population distributed from Papua New Guinea to as far east as the islands of Fiji and known to harbor some of the greatest amounts of Neanderthal and Denisovan ancestry.RESULTSConsistent with the hypothesis of archaic introgression outside Africa, we find a significant excess of CNV sharing between modern non-African populations and archaic hominins (P = 0.039). Among Melanesians, we observe an enrichment of CNVs with potential signals of positive selection (n = 37 CNVs), of which 19 CNVs likely introgressed from archaic hominins. We show that Melanesian-stratified CNVs are significantly associated with signals of positive selection (P = 0.0323). Many map near or within genes associated with metabolism (e.g., ACOT1 and ACOT2), development and cell cycle or signaling (e.g., TNFRSF10D and CDK11A and CDK11B), or immune response (e.g., IFNLR1). We characterize two of the largest and most complex CNVs on chromosomes 16p11.2 and 8p21.3 that introgressed from Denisovans and Neanderthals, respectively, and are absent from most other human populations. At chromosome 16p11.2, we sequence-resolve a large duplication of >383 thousand base pairs (kbp) that originated from Denisovans and introgressed into the ancestral Melanesian population 60,000 to 170,000 years ago. This large duplication occurs at high frequency (>79%) in diverse Melanesian groups, shows signatures of positive selection, and maps adjacent to Homo sapienstextendashspecific duplications that predispose to rearrangements associated with autism. On chromosome 8p21.3, we identify a Melanesian haplotype that carries two CNVs, a ~6-kbp deletion, and a ~38-kbp duplication, with a Neanderthal origin and that introgressed into non-Africans 40,000 to 120,000 years ago. This CNV haplotype occurs at high frequency (44%) and shows signals consistent with a partial selective sweep in Melanesians. Using long-read sequencing genomic and transcriptomic data, we reconstruct the structure and complex evolutionary history for these two CNVs and discover previously undescribed duplicated genes (TNFRSF10D1, TNFRSF10D2, and NPIPB16) that show an excess of amino acid replacements consistent with the action of positive selection.CONCLUSIONOur results suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation that is absent from current reference genomes.Large adaptive-introgressed CNVs at chromosomes 8p21.3 and 16p11.2 in Melanesians.The magnifying glasses highlight structural differences between the archaic (top) and reference (bottom) genomes. Neanderthal (red) and Denisovan (blue) haplotypes encompassing large CNVs occur at high frequencies in Melanesians (44 and 79%, respectively) but are absent (black) in all non-Melanesians. These CNVs create positively selected genes (TNFRSF10D1, TNFRSF10D2, and NPIPB16) that are absent from the reference genome.Copy number variants (CNVs) are subject to stronger selective pressure than single-nucleotide variants, but their roles in archaic introgression and adaptation have not been systematically investigated. We show that stratified CNVs are significantly associated with signatures of positive selection in Melanesians and provide evidence for adaptive introgression of large CNVs at chromosomes 16p11.2 and 8p21.3 from Denisovans and Neanderthals, respectively. Using long-read sequence data, we reconstruct the structure and complex evolutionary history of these polymorphisms and show that both encode positively selected genes absent from most human populations. Our results collectively suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation.


April 21, 2020

A personalized platform identifies trametinib plus zoledronate for a patient with KRAS-mutant metastatic colorectal cancer.

Colorectal cancer remains a leading source of cancer mortality worldwide. Initial response is often followed by emergent resistance that is poorly responsive to targeted therapies, reflecting currently undruggable cancer drivers such as KRAS and overall genomic complexity. Here, we report a novel approach to developing a personalized therapy for a patient with treatment-resistant metastatic KRAS-mutant colorectal cancer. An extensive genomic analysis of the tumor’s genomic landscape identified nine key drivers. A transgenic model that altered orthologs of these nine genes in the Drosophila hindgut was developed; a robotics-based screen using this platform identified trametinib plus zoledronate as a candidate treatment combination. Treating the patient led to a significant response: Target and nontarget lesions displayed a strong partial response and remained stable for 11 months. By addressing a disease’s genomic complexity, this personalized approach may provide an alternative treatment option for recalcitrant disease such as KRAS-mutant colorectal cancer.


April 21, 2020

Long-read amplicon denoising.

Long-read next-generation amplicon sequencing shows promise for studying complete genes or genomes from complex and diverse populations. Current long-read sequencing technologies have challenging error profiles, hindering data processing and incorporation into downstream analyses. Here we consider the problem of how to reconstruct, free of sequencing error, the true sequence variants and their associated frequencies from PacBio reads. Called ‘amplicon denoising’, this problem has been extensively studied for short-read sequencing technologies, but current solutions do not always successfully generalize to long reads with high indel error rates. We introduce two methods: one that runs nearly instantly and is very accurate for medium length reads and high template coverage, and another, slower method that is more robust when reads are very long or coverage is lower. On two Mock Virus Community datasets with ground truth, each sequenced on a different PacBio instrument, and on a number of simulated datasets, we compare our two approaches to each other and to existing algorithms. We outperform all tested methods in accuracy, with competitive run times even for our slower method, successfully discriminating templates that differ by a just single nucleotide. Julia implementations of Fast Amplicon Denoising (FAD) and Robust Amplicon Denoising (RAD), and a webserver interface, are freely available. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.