Menu
April 21, 2020

SMRT sequencing of the full-length transcriptome of the Sunda pangolin (Manis javanica).

It is widely known that transcriptional diversity contributes greatly to biological regulation in eukaryotes. With the development of next-generation sequencing (NGS) technologies, several studies on RNA sequencing have considerably improved our understanding of transcriptome complexity. However, obtaining full-length (FL) transcripts remains a considerable challenge because of difficulties in short read-based assembly. In the present study, single-molecule real-time (SMRT) sequencing and NGS were combined to generate the complete and FL transcriptome of Manis javanica. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of the M. javanica genome. We obtained 45,530 high-confidence transcripts from 19,109 genic loci, of which 8014 genes have not yet been annotated within the M. javanica genome. Furthermore, we revealed 8824 long-chain noncoding RNAs (lncRNAs). A total of 30,199 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events were identified in the sequencing data. The structure and expression level of 59 digestive enzyme genes, including 13 carbohydrase genes, 28 lipase genes and 18 protease genes, were analyzed, which might provide original data for further research on M. javanica. Copyright © 2019 Elsevier B.V. All rights reserved.


April 21, 2020

The developmental dynamics of the Populus stem transcriptome.

The Populus shoot undergoes primary growth (longitudinal growth) followed by secondary growth (radial growth), which produces biomass that is an important source of energy worldwide. We adopted joint PacBio Iso-Seq and RNA-seq analysis to identify differentially expressed transcripts along a developmental gradient from the shoot apex to the fifth internode of Populus Nanlin895. We obtained 87 150 full-length transcripts, including 2081 new isoforms and 62 058 new alternatively spliced isoforms, most of which were produced by intron retention, that were used to update the Populus annotation. Among these novel isoforms, there are 1187 long non-coding RNAs and 356 fusion genes. Using this annotation, we found 15 838 differentially expressed transcripts along the shoot developmental gradient, of which 1216 were transcription factors (TFs). Only a few of these genes were reported previously. The differential expression of these TFs suggests that they may play important roles in primary and secondary growth. AP2, ARF, YABBY and GRF TFs are highly expressed in the apex, whereas NAC, bZIP, PLATZ and HSF TFs are likely to be important for secondary growth. Overall, our findings provide evidence that long-read sequencing can complement short-read sequencing for cataloguing and quantifying eukaryotic transcripts and increase our understanding of the vital and dynamic process of shoot development. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Construction of a Genomic Bacterial Artificial Chromosome (BAC) Library for the Prawn Macrobrachium rosenbergii and Initial Analysis of ZW Chromosome-Derived BAC Inserts.

Knowledge on sex determination has proven valuable for commercial production of the prawn Macrobrachium rosenbergii due to sex dimorphism of the male and female individuals. Previous studies indicated that prawn sex is determined by a ZW-ZZ chromosomal system, but no genomic information is available for the sex chromosome. Herein, we constructed a genomic bacterial artificial chromosome (BAC) library and identified the ZW-derived BAC clones for initial analysis of the sex chromosomal DNA sequence. The arrayed BAC library contains 200,448 clones with average insert size of 115.4 kb, corresponding to ~?4× coverage of the estimated 5.38 Gb genome. Based on a short female-specific marker, a Z- and a W-fragment were retrieved with the genomic walking method. Screening the BAC library using a ZW-specific marker as probe resulted in 12 positive clones. From these, a Z-derived (P331M17) and a W-derived (P122G2) BAC clones were randomly selected and sequenced by PacBio method. We report the construction of a large insert, deep-coverage, and high-quality BAC library for M. rosenbergii that provides a useful resource for positional cloning of target genes, genomic organization, and comparative genomics analysis. Our study not only confirmed the ZW/ZZ system but also discovered sex-linked genes on ZW chromosomes for the first time, contributing to a comprehensive understanding of the genomic structure of sex chromosomes in M. rosenbergii.


April 21, 2020

The major histocompatibility complex of Old World camelids: Class I and class I-related genes.

The genomic structure of the Major Histocompatibility Complex (MHC) region and variation in selected MHC class I related genes in Old World camels, Camelus bactrianus and Camelus dromedaries were studied. The overall genomic organization of the camel MHC region follows a general pattern observed in other mammalian species and individual MHC loci appear to be well conserved. Selected MHC class I genes B-67 and BL3-7 exhibited unexpectedly low variability, even when compared to other camel MHC class I related genes MR1 and MICA. Interspecific SNP and allele sharing are relatively common, and frequencies of heterozygotes are usually low. Such a low variation in a genomic region generally considered as one of the most polymorphic in vertebrate genomes is unusual. Evolutionary relationships between MHC class I related genes and their counterparts from other species seem to be rather complex. Often, they do not follow the general evolutionary history of the species concerned. Close evolutionary relationships of individual MHC class I loci between camels, humans and dogs were observed. Based on the results of this study and on our data on MHC class II genes, the extent and the pattern of polymorphism of the MHC region of Old World camelids differed from most mammalian groups studied so far. Camels thus seem to be an important model for our understanding of the role of genetic diversity in immune functions, especially in the context of unique features of their immunoglobulin and T-cell receptor genes. © 2019 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.


April 21, 2020

Mate Selection in Self-Compatible Wild Tobacco Results from Coordinated Variation in Homologous Self-Incompatibility Genes.

In flowering plants, intraspecific mate preference is frequently related to mating systems: the rejection of self pollen in self-incompatible (SI) plants that prevents inbreeding is one of the best described examples. However, in other mating systems, more nuanced patterns of pollen rejection occur. In the self-compatible (SC) Nicotiana attenuata, in which SI is not found and all crosses are compatible, certain pollen genotypes are consistently selected in mixed pollinations. However, the molecular mechanisms of this polyandrous mate selection remain unknown. Style-expressed NaS-like-RNases and pollen-expressed NaSLF-like genes, homologous to SI factors in Solanaceae, were identified and examined for a role in N. attenuata’s mate selection. A comparison of two NaS-like-RNases and six NaSLF-like genes among 26 natural accessions revealed specific combinations of co-expression and direct protein-protein interactions. To evaluate their role in mate selection, we silenced the expression of specific NaS-like-RNases and NaSLF-like proteins and conducted diagnostic binary mixed pollinations and mixed pollinations with 14 different non-self pollen donors. Styles expressing particular combinations of NaS-like-RNases selected mates from plants with corresponding NaS-like-RNase expression patterns, while styles lacking NaS-like-RNase expression were non-selective in their fertilizations, which reflected the genotype ratios of pollen mixtures deposited on the stigmas. DNA methylation could account for some of the observed variation in stylar NaS-like-RNase patterns. We conclude that the S-RNase-SLF recognition mechanism plays a central role in polyandrous mate selection in this self-compatible species. These results suggest that after the SI-SC transition, natural variation of SI homologous genes was repurposed to mediate intraspecific mate selection. Copyright © 2019 Elsevier Ltd. All rights reserved.


April 21, 2020

Development of a Molecular Marker Linked to the A4 Locus and the Structure of HD Genes in Pleurotus eryngii

Allelic differences in A and B mating-type loci are a prerequisite for the progression of mating in the genus Pleurotus eryngii; thus, the crossing is hampered by this biological barrier in inbreeding. Molecular markers linked to mating types of P. eryngii KNR2312 were investigated with randomly amplified polymorphic DNA to enhance crossing efficiency. An A4-linked sequence was identified and used to find the adjacent genomic region with the entire motif of the A locus from a contig sequenced by PacBio. The sequence-characterized amplified region marker 7-2299 distinguished A4 mating-type monokaryons from KNR2312 and other strains. A BLAST search of flanked sequences revealed that the A4 locus had a general feature consisting of the putative HD1 and HD2 genes. Both putative HD transcription factors contain a homeodomain sequence and a nuclear localization sequence; however, valid dimerization motifs were found only in the HD1 protein. The ACAAT motif, which was reported to have relevance to sex determination, was found in the intergenic region. The SCAR marker could be applicable in the classification of mating types in the P. eryngii breeding program, and the A4 locus could be the basis for a multi-allele detection marker.


April 21, 2020

The complete mitochondrial genome of the tartar Sand Boa Eryx tataricus

Eryx is a genus of snakes belonging to the family Boidae. In this study, the mitochondrial genome sequence of Eryx tataricus was generated using a PacBio RSII DNA sequencer employing the single mol- ecule, real-time sequencing technology. A maximum-likelihood (ML) phylogenetic tree of 26 snakes was re-constructed based on the 13 protein-coding genes for convincing the mitochondrial DNA sequences.


April 21, 2020

The Genome Sequence of the Anthelmintic-Susceptible New Zealand Haemonchus contortus.

Internal parasitic nematodes are a global animal health issue causing drastic losses in livestock. Here, we report a H. contortus representative draft genome to serve as a genetic resource to the scientific community and support future experimental research of molecular mechanisms in related parasites. A de novo hybrid assembly was generated from PCR-free whole genome sequence data, resulting in a chromosome-level assembly that is 465 Mb in size encoding 22,341 genes. The genome sequence presented here is consistent with the genome architecture of the existing Haemonchus species and is a valuable resource for future studies regarding population genetic structures of parasitic nematodes. Additionally, comparative pan-genomics with other species of economically important parasitic nematodes have revealed highly open genomes and strong collinearities within the phylum Nematoda. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Structural and functional characterization of an intradiol ring-cleavage dioxygenase from the polyphagous spider mite herbivore Tetranychus urticae Koch.

Genome analyses of the polyphagous spider mite herbivore Tetranychus urticae (two-spotted spider mite) revealed the presence of a set of 17 genes that code for secreted proteins belonging to the “intradiol dioxygenase-like” subgroup. Phylogenetic analyses indicate that this novel enzyme family has been acquired by horizontal gene transfer. In order to better understand the role of these proteins in T. urticae, we have structurally and functionally characterized one paralog (tetur07g02040). It was demonstrated that this protein is indeed an intradiol ring-cleavage dioxygenase, as the enzyme is able to cleave catechol between two hydroxyl-groups using atmospheric dioxygen. The enzyme was characterized functionally and structurally. The active site of the T. urticae enzyme contains an Fe3+ cofactor that is coordinated by two histidine and two tyrosine residues, an arrangement that is similar to those observed in bacterial homologs. However, the active site is significantly more solvent exposed than in bacterial proteins. Moreover, the mite enzyme is monomeric, while almost all structurally characterized bacterial homologs form oligomeric assemblies. Tetur07g02040 is not only the first spider mite dioxygenase that has been characterized at the molecular level, but is also the first structurally characterized intradiol ring-cleavage dioxygenase originating from a eukaryote.Copyright © 2018 Elsevier Ltd. All rights reserved.


April 21, 2020

Full-Length Transcriptome Analysis of the Genes Involved in Tocopherol Biosynthesis in Torreya grandis.

The seeds of Torreya grandis (Cephalotaxaceae) are rich in tocopherols, which are essential components of the human diet as a result of their function in scavenging reactive oxygen and free radicals. Different T. grandis cultivars (10 cultivars selected in this study were researched, and their information is shown in Table S1 of the Supporting Information) vary enormously in their tocopherol contents (0.28-11.98 mg/100 g). However, little is known about the molecular basis and regulatory mechanisms of tocopherol biosynthesis in T. grandis kernels. Here, we applied single-molecule real-time (SMRT) sequencing to T. grandis (X08 cultivar) for the first time and obtained a total of 97?211 full-length transcripts. We proposed the biosynthetic pathway of tocopherol and identified eight full-length transcripts encoding enzymes potentially involved in tocopherol biosynthesis in T. grandis. The results of the correlation analysis between the tocopherol content and gene expression level in the 10 selected cultivars and different kernel developmental stages of the X08 cultivar suggested that homogentisate phytyltransferase coding gene ( TgVTE2b) and ?-tocopherol methyltransferase coding gene ( TgVTE4) may be key players in tocopherol accumulation in the kernels of T. grandis. Subcellular localization assays showed that both TgVTE2b and TgVTE4 were localized to the chloroplast. We also identified candidate regulatory genes similar to WRI1 and DGAT1 in Arabidopsis that may be involved in the regulation of tocopherol biosynthesis. Our findings provide valuable genetic information for T. grandis using full-length transcriptomic analysis, elucidating the candidate genes and key regulatory genes involved in tocopherol biosynthesis. This information will be critical for further molecular-assisted screening and breeding of T. grandis genotypes with high tocopherol contents.


April 21, 2020

Midrib Sucrose Accumulation and Sugar Transporter Gene Expression in YCS-Affected Sugarcane Leaves

Sucrose accumulation and decreased photosynthesis are early symptoms of yellow canopy syndrome (YCS) in sugarcane (Saccharum spp.), and precede the visual yellowing of the leaves. To investigate broad-scale gene expression changes during YCS-onset, transcriptome analyses coupled to metabolome analyses were performed. Across leaf tissues, the greatest number of differentially expressed genes related to the chloroplast, and the metabolic processes relating to nitrogen and carbohydrates. Five genes represented 90% of the TPM (Transcripts Per Million) associated with the downregulation of transcription during YCS-onset, which included PSII D1 (PsbA). This differential expression was consistent with a feedback regulatory effect upon photosynthesis. Broad-scale gene expression analyses did not reveal a cause for leaf sugar accumulation during YCS-onset. Interestingly, the midrib showed the greatest accumulation of sugars, followed by symptomatic lamina. To investigate if phloem loading/reloading may be compromised on a gene expression level – to lead to leaf sucrose accumulation – sucrose transport-related proteins of SWEETs, Sucrose Transporters (SUTs), H+-ATPases and H+-pyrophosphatases (H+-PPases) were characterised from a sugarcane transcriptome and expression analysed. Two clusters of Type I H+-PPases, with one upregulated and the other downregulated, were evident. Although less pronounced, a similar pattern of change was observed for the H+-ATPases. The disaccharide transporting SWEETs were downregulated after visual symptoms were present, and a monosaccharide transporting SWEET upregulated preceding, as well as after, symptom development. SUT gene expression was the least responsive to YCS development. The results are consistent with a reduction of photoassimilate movement through the phloem leading to sucrose build-up in the leaf.


April 21, 2020

A siphonous macroalgal genome suggests convergent functions of homeobox genes in algae and land plants.

Genome evolution and development of unicellular, multinucleate macroalgae (siphonous algae) are poorly known, although various multicellular organisms have been studied extensively. To understand macroalgal developmental evolution, we assembled the ~26?Mb genome of a siphonous green alga, Caulerpa lentillifera, with high contiguity, containing 9,311 protein-coding genes. Molecular phylogeny using 107 nuclear genes indicates that the diversification of the class Ulvophyceae, including C. lentillifera, occurred before the split of the Chlorophyceae and Trebouxiophyceae. Compared with other green algae, the TALE superclass of homeobox genes, which expanded in land plants, shows a series of lineage-specific duplications in this siphonous macroalga. Plant hormone signalling components were also expanded in a lineage-specific manner. Expanded transport regulators, which show spatially different expression, suggest that the structural patterning strategy of a multinucleate cell depends on diversification of nuclear pore proteins. These results not only imply functional convergence of duplicated genes among green plants, but also provide insight into evolutionary roots of green plants. Based on the present results, we propose cellular and molecular mechanisms involved in the structural differentiation in the siphonous alga. © The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


April 21, 2020

Full-Length Transcriptome Sequencing and the Discovery of New Transcripts in the Unfertilized Eggs of Zebrafish (Danio rerio).

Understanding early gene expression in zebrafish embryos is a prerequisite for developmental biology research. In this study, 1,629,447 polymerase reads were obtained from the unfertilized eggs of zebrafish via full-length transcriptome sequencing using the PacBio RS II platform first. Then, 102,920 unique isoforms were obtained by correction, clustering and comparison with the zebrafish genome. 12,782 genes in the genome were captured, accounting for 39.71% of the all annotated genes. Approximately 62.27% of the 12,782 genes have been alternatively spliced. GO and KEGG annotations revealed that the unfertilized eggs primarily stored genes that participate in RNA processing and nuclear protein complex composition. According to this PacBio data that aligned with the genome, 3,970 fusion genes, 819 ncRNAs, and 84 new transcripts were predicted. Illumina RNA-seq and RT-qPCR detection found that the expression of two new transcripts, PB.5289.1 and PB.10209.1, were significantly up-regulated at the 2-cell stage and down-regulated rapidly thereafter, suggesting their involvement in minor ZGA during early embryonic development. This study indicated that the unfertilized eggs of zebrafish may have retained genes directly related to cell division and development to initiate the subsequent development in a limited space and time. On the other hand, NTRs or new transcriptome regions in the genome were discovered, which provided new clues regarding ZGA of MZT during early embryonic development in fish.Copyright © 2019 Mehjabin et al.


April 21, 2020

Liriodendron genome sheds light on angiosperm phylogeny and species-pair differentiation.

The genus Liriodendron belongs to the family Magnoliaceae, which resides within the magnoliids, an early diverging lineage of the Mesangiospermae. However, the phylogenetic relationship of magnoliids with eudicots and monocots has not been conclusively resolved and thus remains to be determined1-6. Liriodendron is a relict lineage from the Tertiary with two distinct species-one East Asian (L. chinense (Hemsley) Sargent) and one eastern North American (L. tulipifera Linn)-identified as a vicariad species pair. However, the genetic divergence and evolutionary trajectories of these species remain to be elucidated at the whole-genome level7. Here, we report the first de novo genome assembly of a plant in the Magnoliaceae, L. chinense. Phylogenetic analyses suggest that magnoliids are sister to the clade consisting of eudicots and monocots, with rapid diversification occurring in the common ancestor of these three lineages. Analyses of population genetic structure indicate that L. chinense has diverged into two lineages-the eastern and western groups-in China. While L. tulipifera in North America is genetically positioned between the two L. chinense groups, it is closer to the eastern group. This result is consistent with phenotypic observations that suggest that the eastern and western groups of China may have diverged long ago, possibly before the intercontinental differentiation between L. chinense and L. tulipifera. Genetic diversity analyses show that L. chinense has tenfold higher genetic diversity than L. tulipifera, suggesting that the complicated regions comprising east-west-orientated mountains and the Yangtze river basin (especially near 30°?N latitude) in East Asia offered more successful refugia than the south-north-orientated mountain valleys in eastern North America during the Quaternary glacial period.


April 21, 2020

Computational aspects underlying genome to phenome analysis in plants.

Recent advances in genomics technologies have greatly accelerated the progress in both fundamental plant science and applied breeding research. Concurrently, high-throughput plant phenotyping is becoming widely adopted in the plant community, promising to alleviate the phenotypic bottleneck. While these technological breakthroughs are significantly accelerating quantitative trait locus (QTL) and causal gene identification, challenges to enable even more sophisticated analyses remain. In particular, care needs to be taken to standardize, describe and conduct experiments robustly while relying on plant physiology expertise. In this article, we review the state of the art regarding genome assembly and the future potential of pangenomics in plant research. We also describe the necessity of standardizing and describing phenotypic studies using the Minimum Information About a Plant Phenotyping Experiment (MIAPPE) standard to enable the reuse and integration of phenotypic data. In addition, we show how deep phenotypic data might yield novel trait-trait correlations and review how to link phenotypic data to genomic data. Finally, we provide perspectives on the golden future of machine learning and their potential in linking phenotypes to genomic features. © 2018 The Authors The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.