Menu
April 21, 2020

Full-length mRNA sequencing in Saccharina japonica and identification of carbonic anhydrase genes

The carbonic anhydrases (CAs) are a group of enzymes that play an important role in the absorption and transportation of CO2 in Saccharina japonica. They are encoded by a superfamily of genes with seven subtypes that are unrelated in sequence but share conserved function in catalyzing the reversible conversion of CO2 and HCO3-. Here we have characterized the CA members in the transcriptome of S. japonica using Single-molecule real-time (SMRT) sequencing technology. Approximately 9830.4 megabases from 5,028,003 quality subreads were generated, and they were assembled into 326,512 full-length non-chimeric (FLNC) reads, with an average flnc read length of 2181 bp. After removing redundant sequences, 79,010 unique transcripts were obtained of which 38,039 transcripts were successfully annotated. From the full-length transcriptome, we have identified 7 full-length cDNA sequences for CA genes (4 a-CAs, 1 ß-CAs and 2 ?-CAs) and assessed for their potential functions based on phylogenetic analysis. Characterizations of CAs will provide the ground for future studies to determine the involvement of CAs in inorganic carbon absorption and transportation in S. japonica.


April 21, 2020

Genome sequence of Jatropha curcas L., a non-edible biodiesel plant, provides a resource to improve seed-related traits.

Jatropha curcas (physic nut), a non-edible oilseed crop, represents one of the most promising alternative energy sources due to its high seed oil content, rapid growth and adaptability to various environments. We report ~339 Mbp draft whole genome sequence of J. curcas var. Chai Nat using both the PacBio and Illumina sequencing platforms. We identified and categorized differentially expressed genes related to biosynthesis of lipid and toxic compound among four stages of seed development. Triacylglycerol (TAG), the major component of seed storage oil, is mainly synthesized by phospholipid:diacylglycerol acyltransferase in Jatropha, and continuous high expression of homologs of oleosin over seed development contributes to accumulation of high level of oil in kernels by preventing the breakdown of TAG. A physical cluster of genes for diterpenoid biosynthetic enzymes, including casbene synthases highly responsible for a toxic compound, phorbol ester, in seed cake, was syntenically highly conserved between Jatropha and castor bean. Transcriptomic analysis of female and male flowers revealed the up-regulation of a dozen family of TFs in female flower. Additionally, we constructed a robust species tree enabling estimation of divergence times among nine Jatropha species and five commercial crops in Malpighiales order. Our results will help researchers and breeders increase energy efficiency of this important oil seed crop by improving yield and oil content, and eliminating toxic compound in seed cake for animal feed. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Analysis of Chromosomal Numbers, Mitochondrial Genome, and Full-Length Transcriptome of Onychostoma brevibarba.

Onychostoma brevibarba is a new discovered species which is distributed in Xiang Jiang River of the middle Chang Jiang basin in Hunan Province, South China. In this study, the ploidy levels of O. brevibarba were confirmed by counting chromosomal numbers and analyzing karyotype. The complete mitochondrial genome of O. brevibarba was determined and analyzed. Besides, we firstly performed the full-length transcriptome of O. brevibarba derived from 5 different tissues using the PacBio SMRT sequencing. The result shows that O. brevibarba was a diploid with 48 chromosomes. The complete mitogenome of O. brevibarba was 16,602 bp in size and very similar (89.1-91.3%) to that of other Onychostoma species but was distinct from all congeners. The full-length transcriptome dataset of O. brevibarba comprised 120,239 unigenes. Among the unigenes, 91,542 were functionally annotated, whereas 26,794 were found to have two or more isoforms. This study could provide many new insights into cytology and molecular characteristics of O. brevibarba; it laid the foundation for further exploration of the genomic signatures of species of Onychostoma.


April 21, 2020

The Genome of Cucurbita argyrosperma (Silver-Seed Gourd) Reveals Faster Rates of Protein-Coding Gene and Long Noncoding RNA Turnover and Neofunctionalization within Cucurbita.

Whole-genome duplications are an important source of evolutionary novelties that change the mode and tempo at which genetic elements evolve within a genome. The Cucurbita genus experienced a whole-genome duplication around 30 million years ago, although the evolutionary dynamics of the coding and noncoding genes in this genus have not yet been scrutinized. Here, we analyzed the genomes of four Cucurbita species, including a newly assembled genome of Cucurbita argyrosperma, and compared the gene contents of these species with those of five other members of the Cucurbitaceae family to assess the evolutionary dynamics of protein-coding and long intergenic noncoding RNA (lincRNA) genes after the genome duplication. We report that Cucurbita genomes have a higher protein-coding gene birth-death rate compared with the genomes of the other members of the Cucurbitaceae family. C. argyrosperma gene families associated with pollination and transmembrane transport had significantly faster evolutionary rates. lincRNA families showed high levels of gene turnover throughout the phylogeny, and 67.7% of the lincRNA families in Cucurbita showed evidence of birth from the neofunctionalization of previously existing protein-coding genes. Collectively, our results suggest that the whole-genome duplication in Cucurbita resulted in faster rates of gene family evolution through the neofunctionalization of duplicated genes. Copyright © 2019 The Author. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Newly designed 16S rRNA metabarcoding primers amplify diverse and novel archaeal taxa from the environment.

High-throughput studies of microbial communities suggest that Archaea are a widespread component of microbial diversity in various ecosystems. However, proper quantification of archaeal diversity and community ecology remains limited, as sequence coverage of Archaea is usually low owing to the inability of available prokaryotic primers to efficiently amplify archaeal compared to bacterial rRNA genes. To improve identification and quantification of Archaea, we designed and validated the utility of several primer pairs to efficiently amplify archaeal 16S rRNA genes based on up-to-date reference genes. We demonstrate that several of these primer pairs amplify phylogenetically diverse Archaea with high sequencing coverage, outperforming commonly used primers. Based on comparing the resulting long 16S rRNA gene fragments with public databases from all habitats, we found several novel family- to phylum-level archaeal taxa from topsoil and surface water. Our results suggest that archaeal diversity has been largely overlooked due to the limitations of available primers, and that improved primer pairs enable to estimate archaeal diversity more accurately. © 2018 The Authors. Environmental Microbiology Reports published by Society for Applied Microbiology and John Wiley & Sons Ltd.


April 21, 2020

Next generation sequencing characterizes HLA diversity in a registry population from the Netherlands.

Next generation DNA sequencing is used to determine the HLA-A, -B, -C, -DRB1, -DRB3/4/5, and -DQB1 assignments of 1009 unrelated volunteers for the unrelated donor registry in The Netherlands. The analysis characterizes all HLA exons and introns for class I alleles; at least exons 2 to 3 for HLA-DRB1; and exons 2 to 6 for HLA-DQB1. Of the distinct alleles present, there are 229 class I and 71 class II; 36 of these alleles are novel. The majority (approximately 98%) of the cumulative allele frequency at each locus is contributed by alleles that appear three or more times. Alleles encoding protein variation outside of the antigen recognition domains are 0.6% of the class I assignments and 5.3% of the class II assignments. © 2019 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.


April 21, 2020

The role of long-term mineral and organic fertilisation treatment in changing pathogen and symbiont community composition in soil

Application of organic fertilisers to soil prevents erosion, improves fertility and may suppress certain soil-borne plant pathogens, but it is still unclear how different trophic groups of fungi and oomycetes respond to long-term fertilisation treatment. The objective of the study was to examine the effect of different fertilisation regimes on fungal and oomycete pathogen- and mycorrhizal symbiont diversity and community structure in both soil and roots, using PacBio SMRT sequencing. The field experiment included three fertilisation treatments that have been applied since 1989: nitrogen fertilisation (WOM), nitrogen fertilisation with manure amendment (FYM) and alternative organic fertilisation (AOF), each applied at five different rates. Soil samples were collected three times during the growing season, while root samples were collected during the flowering stage. There was no influence of the studied variables on soil and root pathogen richness. Contrary to our hypothesis, pathogen relative abundance in both soil and roots was significantly higher in plots with the AOF treatment. Furthermore, richness and relative abundance of arbuscular mycorrhizal (AM) fungi decreased significantly in the AOF treatment. Permutational analysis of variance (PERMANOVA) demonstrated the effect of fertilisation treatment on pathogen community composition in both soil and roots. Our findings indicate that organic fertilisers may not always benefit soil microbial community composition. Therefore, further studies are needed to understand how fertilisation affects mycorrhizal mutualists and pathogens.


April 21, 2020

Genome sequence and genetic transformation of a widely distributed and cultivated poplar.

Populus alba is widely distributed and cultivated in Europe and Asia. This species has been used for diverse studies. In this study, we assembled a de novo genome sequence of P. alba var. pyramidalis (= P. bolleana) and confirmed its high transformation efficiency and short transformation time by experiments. Through a process of hybrid genome assembly, a total of 464 M of the genome was assembled. Annotation analyses predicted 37 901 protein-coding genes. This genome is highly collinear to that of P. trichocarpa, with most genes having orthologs in the two species. We found a marked expansion of gene families related to histone and the hormone auxin but loss of disease resistance genes in P. alba if compared with the closely related P. trichocarpa. The genome sequence presented here represents a valuable resource for further molecular functional analyses of this species as a new tree model, poplar breeding practices and comparative genomic analyses across different poplars. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Patterns of non-ARD variation in more than 300 full-length HLA-DPB1 alleles.

Our understanding of sequence variation in the HLA-DPB1 gene is largely restricted to the hypervariable antigen recognition domain (ARD) encoded by exon 2. Here, we employed a redundant sequencing strategy combining long-read and short-read data to accurately phase and characterise in full length the majority of common and well-documented (CWD) DPB1 alleles as well as alleles with an observed frequency of at least 0.0006% in our predominantly European sample set. We generated 664 DPB1 sequences, comprising 279 distinct allelic variants. This allows us to present the, to date, most comprehensive analysis of the nature and extent of DPB1 sequence variation. The full-length sequence analysis revealed the existence of two highly diverged allele clades. These clades correlate with the rs9277534 A???G variant, a known expression marker located in the 3′-UTR. The two clades are fully differentiated by 174 fixed polymorphisms throughout a 3.6?kb stretch at the 3′-end of DPB1. The region upstream of this differentiation zone is characterised by increasingly shared variation between the clades. The low-expression A clade comprises 59% of the distinct allelic sequences including the three by far most frequent DPB1 alleles, DPB1*04:01, DPB1*02:01 and DPB1*04:02. Alleles in the A clade show reduced nucleotide diversity with an excess of rare variants when compared to the high-expression G clade. This pattern is consistent with a scenario of recent proliferation of A-clade alleles. The full-length characterisation of all but the most rare DPB1 alleles will benefit the application of NGS for DPB1 genotyping and provides a helpful framework for a deeper understanding of high- and low-expression alleles and their implications in the context of unrelated haematopoietic stem-cell transplantation.Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Recipients receiving better HLA-matched hematopoietic cell transplantation grafts, uncovered by a novel HLA typing method, have superior survival: A retrospective study

HLA matching at an allelic-level resolution for volunteer unrelated donor (VUD) hematopoietic cell transplanta- tion (HCT) results in improved survival and fewer post-transplant complications. Limitations in typing technolo- gies used for the hyperpolymorphic HLA genes have meant that variations outside of the antigen recognition domain (ARD) have not been previously characterized in HCT. Our aim was to explore the extent of diversity out- side of the ARD and determine the impact of this diversity on transplant outcome. Eight hundred ninety-one VUD-HCT donors and their recipients transplanted for a hematologic malignancy in the United Kingdom were ret- rospectively HLA typed at an ultra-high resolution (UHR) for HLA-A, -B, -C, -DRB1, -DQB1, and -DPB1 using next- generation sequencing technology. Matching was determined at full gene level for HLA class I and at a coding DNA sequence level for HLA class II genes. The HLA matching status changed in 29.1% of pairs after UHR HLA typ- ing. The 12/12 UHR HLA matched patients had significantly improved 5-year overall survival when compared with those believed to be 12/12 HLA matches based on their original HLA typing but were found to be mismatched after UHR HLA typing (54.8% versus 30.1%, P= .022). Survival was also significantly better in 12/12 UHR HLA- matched patients when compared with those with any degree of mismatch at this level of resolution (55.1% ver- sus 40.1%, P= .005). This study shows that better HLA matching, found when typing is done at UHR that includes exons outside of the ARD, introns, and untranslated regions, can significantly improve outcomes for recipients of a VUD-HCT for a hematologic malignancy and should be prospectively performed at donor selection.


April 21, 2020

Efficiency of PacBio long read correction by 2nd generation Illumina sequencing.

Long sequencing reads offer unprecedented opportunities in analysis and reconstruction of complex genomic regions. However, the gain in sequence length is often traded for quality. Therefore, recently several approaches have been proposed (e.g. higher sequencing coverage, hybrid assembly or sequence correction) to enhance the quality of long sequencing reads. A simple and cost-effective approach includes use of the high quality 2nd generation sequencing data to improve the quality of long reads. We designed a dedicated testing procedure and selected universal programs for long read correction, which provide as the output sequences that can be used in further genomic and transcriptomic studies. Our results show that HALC is the best choice for correction of long PacBio reads, when both, read size and quality, are the main focus of the analysis. However, the tested tools show some unexpected behaviors, including read trimming and fragmentation.Copyright © 2017 Elsevier Inc. All rights reserved.


April 21, 2020

Genetic map-guided genome assembly reveals a virulence-governing minichromosome in the lentil anthracnose pathogen Colletotrichum lentis.

Colletotrichum lentis causes anthracnose, which is a serious disease on lentil and can account for up to 70% crop loss. Two pathogenic races, 0 and 1, have been described in the C. lentis population from lentil. To unravel the genetic control of virulence, an isolate of the virulent race 0 was sequenced at 1481-fold genomic coverage. The 56.10-Mb genome assembly consists of 50 scaffolds with N50 scaffold length of 4.89 Mb. A total of 11 436 protein-coding gene models was predicted in the genome with 237 coding candidate effectors, 43 secondary metabolite biosynthetic enzymes and 229 carbohydrate-active enzymes (CAZymes), suggesting a contraction of the virulence gene repertoire in C. lentis. Scaffolds were assigned to 10 core and two minichromosomes using a population (race 0 × race 1, n = 94 progeny isolates) sequencing-based, high-density (14 312 single nucleotide polymorphisms) genetic map. Composite interval mapping revealed a single quantitative trait locus (QTL), qClVIR-11, located on minichromosome 11, explaining 85% of the variability in virulence of the C. lentis population. The QTL covers a physical distance of 0.84 Mb with 98 genes, including seven candidate effector and two secondary metabolite genes. Taken together, the study provides genetic and physical evidence for the existence of a minichromosome controlling the C. lentis virulence on lentil. © 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.


April 21, 2020

Full-length transcriptome and microRNA sequencing reveal the specific gene-regulation network of velvet antler in sika deer with extremely different velvet antler weight.

Velvet antler displays the fastest and most robust tissue proliferation in the animal world, it is a model for a complete organ development/regeneration, and alternative medicine, tonic made from velvet antler, was beneficial for human. The weight of velvet antler had high biomedical and economic value, but the related regulation mechanisms controlling velvet antler weight remain unclear. In this study, extremely heavy and light velvet antler groups were selected from a sika deer population of 100 individuals with extreme velvet antler weight. A combination of full-length transcriptome sequencing and microRNA sequencing to the proliferation zone in the tip of velvet antler was applied. A total of 55306 transcripts and 1082 microRNAs were identified. Some highly expressed genes (COL1A1, COL1A2, COL3A1, FN1, and ATP6) and microRNAs (miR-21, let-7i, and miR-27b) were highly correlated with the physiological and growth characteristics of velvet antlers. Among the 334 differentially expressed genes, we found that most of the genes were located in the developmental process, especially animal organ development process. It is exciting to see that more blood vessels were found in the growing tip of heavy velvet antler through histological observation, and GO term of blood vessel development was also significant different between two groups. The combination analysis with mRNA and microRNA data in velvet antler showed a specific regulation network involved in the development of bone, mesenchyme, cartilage, and blood vessel, and helped us clearly find out the candidate 14 genes and 6 microRNAs, which could be used for selecting significant DNA markers of velvet antler weight.


April 21, 2020

An Annotated Genome for Haliotis rufescens (Red Abalone) and Resequenced Green, Pink, Pinto, Black, and White Abalone Species.

Abalone are one of the few marine taxa where aquaculture production dominates the global market as a result of increasing demand and declining natural stocks from overexploitation and disease. To better understand abalone biology, aid in conservation efforts for endangered abalone species, and gain insight into sustainable aquaculture, we created a draft genome of the red abalone (Haliotis rufescens). The approach to this genome draft included initial assembly using raw Illumina and PacBio sequencing data with MaSuRCA, before scaffolding using sequencing data generated from Chicago library preparations with HiRise2. This assembly approach resulted in 8,371 scaffolds and total length of 1.498?Gb; the N50 was 1.895?Mb, and the longest scaffold was 13.2?Mb. Gene models were predicted, using MAKER2, from RNA-Seq data and all related expressed sequence tags and proteins from NCBI; this resulted in 57,785 genes with an average length of 8,255?bp. In addition, single nucleotide polymorphisms were called on Illumina short-sequencing reads from five other eastern Pacific abalone species: the green (H. fulgens), pink (H. corrugata), pinto (H. kamtschatkana), black (H. cracherodii), and white (H. sorenseni) abalone. Phylogenetic relationships largely follow patterns detected by previous studies based on 1,784,991 high-quality single nucleotide polymorphisms. Among the six abalone species examined, the endangered white abalone appears to harbor the lowest levels of heterozygosity. This draft genome assembly and the sequencing data provide a foundation for genome-enabled aquaculture improvement for red abalone, and for genome-guided conservation efforts for the other five species and, in particular, for the endangered white and black abalone.


April 21, 2020

In-depth analysis of the genome of Trypanosoma evansi, an etiologic agent of surra.

Trypanosoma evansi is the causative agent of the animal trypanosomiasis surra, a disease with serious economic burden worldwide. The availability of the genome of its closely related parasite Trypanosoma brucei allows us to compare their genetic and evolutionarily shared and distinct biological features. The complete genomic sequence of the T. evansi YNB strain was obtained using a combination of genomic and transcriptomic sequencing, de novo assembly, and bioinformatic analysis. The genome size of the T. evansi YNB strain was 35.2 Mb, showing 96.59% similarity in sequence and 88.97% in scaffold alignment with T. brucei. A total of 8,617 protein-coding genes, accounting for 31% of the genome, were predicted. Approximately 1,641 alternative splicing events of 820 genes were identified, with a majority mediated by intron retention, which represented a major difference in post-transcriptional regulation between T. evansi and T. brucei. Disparities in gene copy number of the variant surface glycoprotein, expression site-associated genes, microRNAs, and RNA-binding protein were clearly observed between the two parasites. The results revealed the genomic determinants of T. evansi, which encoded specific biological characteristics that distinguished them from other related trypanosome species.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.