Menu
April 21, 2020

Pentatricopeptide repeat poly(A) binding protein KPAF4 stabilizes mitochondrial mRNAs in Trypanosoma brucei.

In Trypanosoma brucei, most mitochondrial mRNAs undergo editing, and 3′ adenylation and uridylation. The internal sequence changes and terminal extensions are coordinated: pre-editing addition of the short (A) tail protects the edited transcript against 3′-5′ degradation, while post-editing A/U-tailing renders mRNA competent for translation. Participation of a poly(A) binding protein (PABP) in coupling of editing and 3′ modification processes has been inferred, but its identity and mechanism of action remained elusive. We report identification of KPAF4, a pentatricopeptide repeat-containing PABP which sequesters the A-tail and impedes mRNA degradation. Conversely, KPAF4 inhibits uridylation of A-tailed transcripts and, therefore, premature A/U-tailing of partially-edited mRNAs. This quality check point likely prevents translation of incompletely edited mRNAs. We also find that RNA editing substrate binding complex (RESC) mediates the interaction between the 5′ end-bound pyrophosphohydrolase MERS1 and 3′ end-associated KPAF4 to enable mRNA circularization. This event appears to be critical for edited mRNA stability.


April 21, 2020

Cichorium intybus L.?×?Cicerbita alpina Walbr.: doubled haploid chicory induction and CENH3 characterization

Intergeneric hybridization between industrial chicory (Cichorium intybus L.) and Cicerbita alpina Walbr. induces interspecific hybrids and haploid chicory plants after in vitro embryo rescue. The protocol yielded haploids in 5 out of 12 cultivars pollinated; altogether 18 haploids were regenerated from 2836 embryos, with a maximum efficiency of 1.96% haploids per cross. Obtained haploids were chromosome doubled with mitosis inhibitors trifluralin and oryzalin; exposure to 0.05 g L-1 oryzalin during one week was the most efficient treatment to regenerate doubled haploids. Inbreeding effects in vitro were limited, but the ploidy level affects morphology. Transcriptome sequencing revealed two unique copies of CENH3 in Cicerbita alpina Walbr. Comparison of CENH3.1 protein sequences of Cicerbita and Cichorium obtained through transcriptome and whole shotgun genome sequencing revealed two amino-acid substitutions at critical residues of the histone fold domain. These particular changes cause chromosome elimination and reduced centromere loading in several other species and might indicate a CENH3-dependent mechanism causing chromosome elimination of parental chromosomes during Cichorium?×?Cicerbita intergeneric hybridization. Our results provide insights in chromosome elimination and might increase the efficiency of haploid induction in Cichorium.


April 21, 2020

Characterization of a male specific region containing a candidate sex determining gene in Atlantic cod.

The genetic mechanisms determining sex in teleost fishes are highly variable and the master sex determining gene has only been identified in few species. Here we characterize a male-specific region of 9?kb on linkage group 11 in Atlantic cod (Gadus morhua) harboring a single gene named zkY for zinc knuckle on the Y chromosome. Diagnostic PCR test of phenotypically sexed males and females confirm the sex-specific nature of the Y-sequence. We identified twelve highly similar autosomal gene copies of zkY, of which eight code for proteins containing the zinc knuckle motif. 3D modeling suggests that the amino acid changes observed in six copies might influence the putative RNA-binding specificity. Cod zkY and the autosomal proteins zk1 and zk2 possess an identical zinc knuckle structure, but only the Y-specific gene zkY was expressed at high levels in the developing larvae before the onset of sex differentiation. Collectively these data suggest zkY as a candidate master masculinization gene in Atlantic cod. PCR amplification of Y-sequences in Arctic cod (Arctogadus glacialis) and Greenland cod (Gadus macrocephalus ogac) suggests that the male-specific region emerged in codfishes more than 7.5 million years ago.


April 21, 2020

A draft genome for Spatholobus suberectus.

Spatholobus suberectus Dunn (S. suberectus), which belongs to the Leguminosae, is an important medicinal plant in China. Owing to its long growth cycle and increased use in human medicine, wild resources of S. suberectus have decreased rapidly and may be on the verge of extinction. De novo assembly of the whole S. suberectus genome provides us a critical potential resource towards biosynthesis of the main bioactive components and seed development regulation mechanism of this plant. Utilizing several sequencing technologies such as Illumina HiSeq X Ten, single-molecule real-time sequencing, 10x Genomics, as well as new assembly techniques such as FALCON and chromatin interaction mapping (Hi-C), we assembled a chromosome-scale genome about 798?Mb in size. In total, 748?Mb (93.73%) of the contig sequences were anchored onto nine chromosomes with the longest scaffold being 103.57?Mb. Further annotation analyses predicted 31,634 protein-coding genes, of which 93.9% have been functionally annotated. All data generated in this study is available in public databases.


April 21, 2020

Systematic analysis of dark and camouflaged genes reveals disease-relevant genes hiding in plain sight.

The human genome contains “dark” gene regions that cannot be adequately assembled or aligned using standard short-read sequencing technologies, preventing researchers from identifying mutations within these gene regions that may be relevant to human disease. Here, we identify regions with few mappable reads that we call dark by depth, and others that have ambiguous alignment, called camouflaged. We assess how well long-read or linked-read technologies resolve these regions.Based on standard whole-genome Illumina sequencing data, we identify 36,794 dark regions in 6054 gene bodies from pathways important to human health, development, and reproduction. Of these gene bodies, 8.7% are completely dark and 35.2% are =?5% dark. We identify dark regions that are present in protein-coding exons across 748 genes. Linked-read or long-read sequencing technologies from 10x Genomics, PacBio, and Oxford Nanopore Technologies reduce dark protein-coding regions to approximately 50.5%, 35.6%, and 9.6%, respectively. We present an algorithm to resolve most camouflaged regions and apply it to the Alzheimer’s Disease Sequencing Project. We rescue a rare ten-nucleotide frameshift deletion in CR1, a top Alzheimer’s disease gene, found in disease cases but not in controls.While we could not formally assess the association of the CR1 frameshift mutation with Alzheimer’s disease due to insufficient sample-size, we believe it merits investigating in a larger cohort. There remain thousands of potentially important genomic regions overlooked by short-read sequencing that are largely resolved by long-read technologies.


April 21, 2020

Transcriptomic profiles of 33 opium poppy samples in different tissues, growth phases, and cultivars.

Opium poppy is one of the most important medicinal plants and remains the only commercial resource of morphinan-based painkillers. However, little is known about the regulatory mechanisms involved in benzylisoquinoline alkaloids (BIAs) biosynthesis in opium poppy. Herein, the full-length transcriptome dataset of opium poppy was constructed for the first time in accompanied with the 33 samples of Illumina transcriptome data from different tissues, growth phases and cultivars. The long-read sequencing produced 902,140 raw reads with 55,114 high-quality transcripts, and short-read sequencing produced 1,923,679,864 clean reads with an average Q30 rate of 93%. The high-quality transcripts were subsequently quantified using the short reads, and the expression of each unigene among different samples was calculated as reads per kilobase per million mapped reads (RPKM). These data provide a foundation for opium poppy transcriptomic analysis, which may aid in capturing splice variants and some non-coding RNAs involved in the regulation of BIAs biosynthesis. It can also be used for genome assembly and annotation which will favor in new transcript identification.


April 21, 2020

Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads.

Tandemly repeated DNA is highly mutable and causes at least 31 diseases, but it is hard to detect pathogenic repeat expansions genome-wide. Here, we report robust detection of human repeat expansions from careful alignments of long but error-prone (PacBio and nanopore) reads to a reference genome. Our method is robust to systematic sequencing errors, inexact repeats with fuzzy boundaries, and low sequencing coverage. By comparing to healthy controls, we prioritize pathogenic expansions within the top 10 out of 700,000 tandem repeats in whole genome sequencing data. This may help to elucidate the many genetic diseases whose causes remain unknown.


April 21, 2020

The complexity of the Fragaria x ananassa (octoploid) transcriptome by single-molecule long-read sequencing.

Strawberry (Fragaria x ananassa) is an allopolyploid species with diverse and complex transcripts. The regulatory mechanisms of fruit development and maturation have been extensively studied; however, little is known about the signaling mechanisms that direct this process in octoploid strawberry (Fragaria x ananassa). Here, we used long-read sequencing (LRS) technology and RNA-seq analysis to investigate the diversity and complexity of the polyploid transcriptome and differentially expressed transcripts along four successive fruit developmental stages of cultivated strawberry. We obtained a reference transcriptome with 119,897 unique full-length isoforms, including 2017 new isoforms and 2510 long noncoding RNAs. Based on the genome of the plausible progenitor (Fragaria vesca), 20,229 alternative splicing (AS) events were identified. Using this transcriptome, we found 17,485 differentially expressed transcripts during strawberry fruit development, including 527 transcription factors (TFs) belonging to 41 families. The expression profiles of all members of the auxin, ABA pathway, and anthocyanin biosynthesis gene families were also examined, and many of them were highly expressed at the ripe fruit stage, strongly indicating that the role of those genes is in the regulation of fruit ripening. We produce a high-quality reference transcriptome for octoploid strawberry, including much of the full-length transcript diversity, to help understand the regulatory mechanisms of fruit development and maturation of polyploid species, particularly via elucidation of the biochemical pathways involved in auxin, ABA, and anthocyanin biosynthesis.


April 21, 2020

Full-length transcript sequencing and comparative transcriptomic analysis to evaluate the contribution of osmotic and ionic stress components towards salinity tolerance in the roots of cultivated alfalfa (Medicago sativa L.).

Alfalfa is the most extensively cultivated forage legume. Salinity is a major environmental factor that impacts on alfalfa’s productivity. However, little is known about the molecular mechanisms underlying alfalfa responses to salinity, especially the relative contribution of the two important components of osmotic and ionic stress.In this study, we constructed the first full-length transcriptome database for alfalfa root tips under continuous NaCl and mannitol treatments for 1, 3, 6, 12, and 24?h (three biological replicates for each time points, including the control group) via PacBio Iso-Seq. This resulted in the identification of 52,787 full-length transcripts, with an average length of 2551?bp. Global transcriptional changes in the same 33 stressed samples were then analyzed via BGISEQ-500 RNA-Seq. Totals of 8861 NaCl-regulated and 8016 mannitol-regulated differentially expressed genes (DEGs) were identified. Metabolic analyses revealed that these DEGs overlapped or diverged in the cascades of molecular networks involved in signal perception, signal transduction, transcriptional regulation, and antioxidative defense. Notably, several well characterized signalling pathways, such as CDPK, MAPK, CIPK, and PYL-PP2C-SnRK2, were shown to be involved in osmotic stress, while the SOS core pathway was activated by ionic stress. Moreover, the physiological shifts of catalase and peroxidase activity, glutathione and proline content were in accordance with dynamic transcript profiles of the relevant genes, indicating that antioxidative defense system plays critical roles in response to salinity stress.Overall, our study provides evidence that the response to salinity stress in alfalfa includes both osmotic and ionic components. The key osmotic and ionic stress-related genes are candidates for future studies as potential targets to improve resistance to salinity stress via genetic engineering.


April 21, 2020

Differential retention of transposable element-derived sequences in outcrossing Arabidopsis genomes.

Transposable elements (TEs) are genomic parasites with major impacts on host genome architecture and host adaptation. A proper evaluation of their evolutionary significance has been hampered by the paucity of short scale phylogenetic comparisons between closely related species. Here, we characterized the dynamics of TE accumulation at the micro-evolutionary scale by comparing two closely related plant species, Arabidopsis lyrata and A. halleri.Joint genome annotation in these two outcrossing species confirmed that both contain two distinct populations of TEs with either ‘recent’ or ‘old’ insertion histories. Identification of rare segregating insertions suggests that diverse TE families contribute to the ongoing dynamics of TE accumulation in the two species. Orthologous TE fragments (i.e. those that have been maintained in both species), tend to be located closer to genes than those that are retained in one species only. Compared to non-orthologous TE insertions, those that are orthologous tend to produce fewer short interfering RNAs, are less heavily methylated when found within or adjacent to genes and these tend to have lower expression levels. These findings suggest that long-term retention of TE insertions reflects their frequent acquisition of adaptive roles and/or the deleterious effects of removing nearly neutral TE insertions when they are close to genes.Our results indicate a rapid evolutionary dynamics of the TE landscape in these two outcrossing species, with an important input of a diverse set of new insertions with variable propensity to resist deletion.


April 21, 2020

Construction of JRG (Japanese reference genome) with single-molecule real-time sequencing

In recent genome analyses, population-specific reference panels have indicated important. However, reference panels based on short-read sequencing data do not sufficiently cover long insertions. Therefore, the nature of long insertions has not been well documented. Here, we assembled a Japanese genome using single-molecule real-time sequencing data and characterized insertions found in the assembled genome. We identified 3691 insertions ranging from 100?bps to ~10,000?bps in the assembled genome relative to the international reference sequence (GRCh38). To validate and characterize these insertions, we mapped short-reads from 1070 Japanese individuals and 728 individuals from eight other populations to insertions integrated into GRCh38. With this result, we constructed JRGv1 (Japanese Reference Genome version 1) by integrating the 903 verified insertions, totaling 1,086,173 bases, shared by at least two Japanese individuals into GRCh38. We also constructed decoyJRGv1 by concatenating 3559 verified insertions, totaling 2,536,870 bases, shared by at least two Japanese individuals or by six other assemblies. This assembly improved the alignment ratio by 0.4% on average. These results demonstrate the importance of refining the reference assembly and creating a population-specific reference genome. JRGv1 and decoyJRGv1 are available at the JRG website.


April 21, 2020

Resource Concentration Modulates the Fate of Dissimilated Nitrogen in a Dual-Pathway Actinobacterium.

Respiratory ammonification and denitrification are two evolutionarily unrelated dissimilatory nitrogen (N) processes central to the global N cycle, the activity of which is thought to be controlled by carbon (C) to nitrate (NO3-) ratio. Here we find that Intrasporangium calvum C5, a novel dual-pathway denitrifier/respiratory ammonifier, disproportionately utilizes ammonification rather than denitrification when grown under low C concentrations, even at low C:NO3- ratios. This finding is in conflict with the paradigm that high C:NO3- ratios promote ammonification and low C:NO3- ratios promote denitrification. We find that the protein atomic composition for denitrification modules (NirK) are significantly cost minimized for C and N compared to ammonification modules (NrfA), indicating that limitation for C and N is a major evolutionary selective pressure imprinted in the architecture of these proteins. The evolutionary precedent for these findings suggests ecological importance for microbial activity as evidenced by higher growth rates when I. calvum grows predominantly using its ammonification pathway and by assimilating its end-product (ammonium) for growth under ammonium-free conditions. Genomic analysis of I. calvum further reveals a versatile ecophysiology to cope with nutrient stress and redox conditions. Metabolite and transcriptional profiles during growth indicate that enzyme modules, NrfAH and NirK, are not constitutively expressed but rather induced by nitrite production via NarG. Mechanistically, our results suggest that pathway selection is driven by intracellular redox potential (redox poise), which may be lowered when resource concentrations are low, thereby decreasing catalytic activity of upstream electron transport steps (i.e., the bc1 complex) needed for denitrification enzymes. Our work advances our understanding of the biogeochemical flexibility of N-cycling organisms, pathway evolution, and ecological food-webs.


April 21, 2020

Comprehensive analysis of full genome sequence and Bd-milRNA/target mRNAs to discover the mechanism of hypovirulence in Botryosphaeria dothidea strains on pear infection with BdCV1 and BdPV1

Pear ring rot disease, mainly caused by Botryosphaeria dothidea, is widespread in most pear and apple-growing regions. Mycoviruses are used for biocontrol, especially in fruit tree disease. BdCV1 (Botryosphaeria dothidea chrysovirus 1) and BdPV1 (Botryosphaeria dothidea partitivirus 1) influence the biological characteristics of B. dothidea strains. BdCV1 is a potential candidate for the control of fungal disease. Therefore, it is vital to explore interactions between B. dothidea and mycovirus to clarify the pathogenic mechanisms of B. dothidea and hypovirulence of B. dothidea in pear. A high-quality full-length genome sequence of the B. dothidea LW-Hubei isolate was obtained using Single Molecule Real-Time sequencing. It has high repeat sequence with 9.3% and DNA methylation existence in the genome. The 46.34?Mb genomes contained 14,091 predicted genes, which of 13,135 were annotated. B. dothidea was predicted to express 3833 secreted proteins. In bioinformatics analysis, 351 CAZy members, 552 transporters, 128 kinases, and 1096 proteins associated with plant-host interaction (PHI) were identified. RNA-silencing components including two endoribonuclease Dicer, four argonaute (Ago) and three RNA-dependent RNA polymerase (RdRp) molecules were identified and expressed in response to mycovirus infection. Horizontal transfer of the LW-C and LW-P strains indicated that BdCV1 induced host gene silencing in LW-C to suppress BdPV1 transmission. To investigate the role of RNA-silencing in B. dothidea defense, we constructed four small RNA libraries and sequenced B. dothidea micro-like RNAs (Bd-milRNAs) produced in response to BdCV1 and BdPV1 infection. Among these, 167 conserved and 68 candidate novel Bd-milRNAs were identified, of which 161 conserved and 20 novel Bd-milRNA were differentially expressed. WEGO analysis revealed involvement of the differentially expressed Bd-milRNA-targeted genes in metabolic process, catalytic activity, cell process and response to stress or stimulus. BdCV1 had a greater effect on the phenotype, virulence, conidiomata, vertical and horizontal transmission ability, and mycelia cellular structure biological characteristics of B. dothidea strains than BdPV1 and virus-free strains. The results obtained in this study indicate that mycovirus regulates biological processes in B. dothidea through the combined interaction of antiviral defense mediated by RNA-silencing and milRNA-mediated regulation of target gene mRNA expression.


October 23, 2019

Codon swapping of zinc finger nucleases confers expression in primary cells and in vivo from a single lentiviral vector.

Zinc finger nucleases (ZFNs) are promising tools for genome editing for biotechnological as well as therapeutic purposes. Delivery remains a major issue impeding targeted genome modification. Lentiviral vectors are highly efficient for delivering transgenes into cell lines, primary cells and into organs, such as the liver. However, the reverse transcription of lentiviral vectors leads to recombination of homologous sequences, as found between and within ZFN monomers.We used a codon swapping strategy to both drastically disrupt sequence identity between ZFN monomers and to reduce sequence repeats within a monomer sequence. We constructed lentiviral vectors encoding codon-swapped ZFNs or unmodified ZFNs from a single mRNA transcript. Cell lines, primary hepatocytes and newborn rats were used to evaluate the efficacy of integrative-competent (ICLV) and integrative-deficient (IDLV) lentiviral vectors to deliver ZFNs into target cells.We reduced total identity between ZFN monomers from 90.9% to 61.4% and showed that a single ICLV allowed efficient expression of functional ZFNs targeting the rat UGT1A1 gene after codon-swapping, leading to much higher ZFN activity in cell lines (up to 7-fold increase compared to unmodified ZFNs and 60% activity in C6 cells), as compared to plasmid transfection or a single ICLV encoding unmodified ZFN monomers. Off-target analysis located several active sites for the 5-finger UGT1A1-ZFNs. Furthermore, we reported for the first time successful ZFN-induced targeted DNA double-strand breaks in primary cells (hepatocytes) and in vivo (liver) after delivery of a single IDLV encoding two ZFNs.These results demonstrate that a codon-swapping approach allowed a single lentiviral vector to efficiently express ZFNs and should stimulate the use of this viral platform for ZFN-mediated genome editing of primary cells, for both ex vivo or in vivo applications.


October 23, 2019

Sites of retroviral DNA integration: From basic research to clinical applications.

One of the most crucial steps in the life cycle of a retrovirus is the integration of the viral DNA (vDNA) copy of the RNA genome into the genome of an infected host cell. Integration provides for efficient viral gene expression as well as for the segregation of viral genomes to daughter cells upon cell division. Some integrated viruses are not well expressed, and cells latently infected with human immunodeficiency virus type 1 (HIV-1) can resist the action of potent antiretroviral drugs and remain dormant for decades. Intensive research has been dedicated to understanding the catalytic mechanism of integration, as well as the viral and cellular determinants that influence integration site distribution throughout the host genome. In this review, we summarize the evolution of techniques that have been used to recover and map retroviral integration sites, from the early days that first indicated that integration could occur in multiple cellular DNA locations, to current technologies that map upwards of millions of unique integration sites from single in vitro integration reactions or cell culture infections. We further review important insights gained from the use of such mapping techniques, including the monitoring of cell clonal expansion in patients treated with retrovirus-based gene therapy vectors, or patients with acquired immune deficiency syndrome (AIDS) on suppressive antiretroviral therapy (ART). These insights span from integrase (IN) enzyme sequence preferences within target DNA (tDNA) at the sites of integration, to the roles of host cellular proteins in mediating global integration distribution, to the potential relationship between genomic location of vDNA integration site and retroviral latency.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.