Menu
April 21, 2020

A full-length transcriptome of Sepia esculenta using a combination of single-molecule long-read (SMRT) and Illumina sequencing

As an economically important cephalopods species, wild-caught Sepia esculenta fishery has suffered a server decline due to over-fishing and ocean environmental damage. To restore this seriously declining fishery resource, we should understand the genetic foundation and molecular mechanism of spawning, reproduction and mortal of golden cuttlefish. In this study, we generated the full-length transcriptome of S. esculenta based on the total RNA of tissue samples (brain, optic gland, nidamental gland, ovary and muscle at different developmental stages) using a combination of single-molecule real-time (SMRT) and Illumina RNA-seq technology. A total of 14.16 Gb SMRT sequencing data were assembled into 94,635 transcripts. Meanwhile, 35.15 Gb Illumina HiSeq data were assembled into 177,226 non-redundant transcripts. Then, we merged SMRT and Illumina assembled data to generate a more complete/full-length S. esculenta transcriptome with 177,951 high-quality transcripts. Based on the obtained transcriptome data, total 81,459 transcripts were annotated in at least one of seven functional databases and 49,189 nucleotide sequences of coding regions were identified. Additionally, 161,327 SSRs distributed in 64,933 transcripts were identified based on SSR analysis. This full-length and high-quality transcriptome of S. esculenta can provide an important foundation for future genomic research on growth and development, reproduction and mortal of cephalopod and further recovery of this recessionary fisheries resources.


April 21, 2020

A high-quality draft genome assembly of Sinella curviseta: A soil model organism (Collembola).

Sinella curviseta, among the most widespread springtails (Collembola) in Northern Hemisphere, has often been treated as a model organism in soil ecology and environmental toxicology. However, little information on its genetic knowledge severely hinders our understanding of its adaptations to the soil habitat. We present the largest genome assembly within Collembola using ~44.86?Gb (118X) of single-molecule real-time Pacific Bioscience Sequel sequencing. The final assembly of 599 scaffolds was ~381.46?Mb with a N50 length of 3.28?Mb, which captured 95.3% complete and 1.5% partial arthropod Benchmarking Universal Single-Copy Orthologs (n?=?1066). Transcripts and circularized mitochondrial genome were also assembled. We predicted 23,943 protein-coding genes, of which 83.88% were supported by transcriptome-based evidence and 82.49% matched protein records in UniProt. In addition, we also identified 222,501 repeats and 881 noncoding RNAs. Phylogenetic reconstructions for Collembola support Tomoceridae sistered to the remaining Entomobryomorpha with the position of Symphypleona not fully resolved. Gene family evolution analyses identified 9,898 gene families, of which 156 experienced significant expansions or contractions. Our high-quality reference genome of S. curviseta provides the genetic basis for future investigations in evolutionary biology, soil ecology, and ecotoxicology. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Analysis of Chromosomal Numbers, Mitochondrial Genome, and Full-Length Transcriptome of Onychostoma brevibarba.

Onychostoma brevibarba is a new discovered species which is distributed in Xiang Jiang River of the middle Chang Jiang basin in Hunan Province, South China. In this study, the ploidy levels of O. brevibarba were confirmed by counting chromosomal numbers and analyzing karyotype. The complete mitochondrial genome of O. brevibarba was determined and analyzed. Besides, we firstly performed the full-length transcriptome of O. brevibarba derived from 5 different tissues using the PacBio SMRT sequencing. The result shows that O. brevibarba was a diploid with 48 chromosomes. The complete mitogenome of O. brevibarba was 16,602 bp in size and very similar (89.1-91.3%) to that of other Onychostoma species but was distinct from all congeners. The full-length transcriptome dataset of O. brevibarba comprised 120,239 unigenes. Among the unigenes, 91,542 were functionally annotated, whereas 26,794 were found to have two or more isoforms. This study could provide many new insights into cytology and molecular characteristics of O. brevibarba; it laid the foundation for further exploration of the genomic signatures of species of Onychostoma.


April 21, 2020

a-Difluoromethylornithine reduces gastric carcinogenesis by causing mutations in Helicobacter pylori cagY.

Infection by Helicobacter pylori is the primary cause of gastric adenocarcinoma. The most potent H. pylori virulence factor is cytotoxin-associated gene A (CagA), which is translocated by a type 4 secretion system (T4SS) into gastric epithelial cells and activates oncogenic signaling pathways. The gene cagY encodes for a key component of the T4SS and can undergo gene rearrangements. We have shown that the cancer chemopreventive agent a-difluoromethylornithine (DFMO), known to inhibit the enzyme ornithine decarboxylase, reduces H. pylori-mediated gastric cancer incidence in Mongolian gerbils. In the present study, we questioned whether DFMO might directly affect H. pylori pathogenicity. We show that H. pylori output strains isolated from gerbils treated with DFMO exhibit reduced ability to translocate CagA in gastric epithelial cells. Further, we frequently detected genomic modifications in the middle repeat region of the cagY gene of output strains from DFMO-treated animals, which were associated with alterations in the CagY protein. Gerbils did not develop carcinoma when infected with a DFMO output strain containing rearranged cagY or the parental strain in which the wild-type cagY was replaced by cagY with DFMO-induced rearrangements. Lastly, we demonstrate that in vitro treatment of H. pylori by DFMO induces oxidative DNA damage, expression of the DNA repair enzyme MutS2, and mutations in cagY, demonstrating that DFMO directly affects genomic stability. Deletion of mutS2 abrogated the ability of DFMO to induce cagY rearrangements directly. In conclusion, DFMO-induced oxidative stress in H. pylori leads to genomic alterations and attenuates virulence.


April 21, 2020

Next generation sequencing characterizes HLA diversity in a registry population from the Netherlands.

Next generation DNA sequencing is used to determine the HLA-A, -B, -C, -DRB1, -DRB3/4/5, and -DQB1 assignments of 1009 unrelated volunteers for the unrelated donor registry in The Netherlands. The analysis characterizes all HLA exons and introns for class I alleles; at least exons 2 to 3 for HLA-DRB1; and exons 2 to 6 for HLA-DQB1. Of the distinct alleles present, there are 229 class I and 71 class II; 36 of these alleles are novel. The majority (approximately 98%) of the cumulative allele frequency at each locus is contributed by alleles that appear three or more times. Alleles encoding protein variation outside of the antigen recognition domains are 0.6% of the class I assignments and 5.3% of the class II assignments. © 2019 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.


April 21, 2020

Genomic variation and strain-specific functional adaptation in the human gut microbiome during early life.

The human gut microbiome matures towards the adult composition during the first years of life and is implicated in early immune development. Here, we investigate the effects of microbial genomic diversity on gut microbiome development using integrated early childhood data sets collected in the DIABIMMUNE study in Finland, Estonia and Russian Karelia. We show that gut microbial diversity is associated with household location and linear growth of children. Single nucleotide polymorphism- and metagenomic assembly-based strain tracking revealed large and highly dynamic microbial pangenomes, especially in the genus Bacteroides, in which we identified evidence of variability deriving from Bacteroides-targeting bacteriophages. Our analyses revealed functional consequences of strain diversity; only 10% of Finnish infants harboured Bifidobacterium longum subsp. infantis, a subspecies specialized in human milk metabolism, whereas Russian infants commonly maintained a probiotic Bifidobacterium bifidum strain in infancy. Groups of bacteria contributing to diverse, characterized metabolic pathways converged to highly subject-specific configurations over the first two years of life. This longitudinal study extends the current view of early gut microbial community assembly based on strain-level genomic variation.


April 21, 2020

Antarctic blackfin icefish genome reveals adaptations to extreme environments.

Icefishes (suborder Notothenioidei; family Channichthyidae) are the only vertebrates that lack functional haemoglobin genes and red blood cells. Here, we report a high-quality genome assembly and linkage map for the Antarctic blackfin icefish Chaenocephalus aceratus, highlighting evolved genomic features for its unique physiology. Phylogenomic analysis revealed that Antarctic fish of the teleost suborder Notothenioidei, including icefishes, diverged from the stickleback lineage about 77 million years ago and subsequently evolved cold-adapted phenotypes as the Southern Ocean cooled to sub-zero temperatures. Our results show that genes involved in protection from ice damage, including genes encoding antifreeze glycoprotein and zona pellucida proteins, are highly expanded in the icefish genome. Furthermore, genes that encode enzymes that help to control cellular redox state, including members of the sod3 and nqo1 gene families, are expanded, probably as evolutionary adaptations to the relatively high concentration of oxygen dissolved in cold Antarctic waters. In contrast, some crucial regulators of circadian homeostasis (cry and per genes) are absent from the icefish genome, suggesting compromised control of biological rhythms in the polar light environment. The availability of the icefish genome sequence will accelerate our understanding of adaptation to extreme Antarctic environments.


April 21, 2020

Different knockout genotypes of OsIAA23 in rice using CRISPR/Cas9 generating different phenotypes.

We have isolated several Osiaa23 rice mutants with different knockout genotypes, resulting in different phenotypes, which suggested that different genetic backgrounds or mutation types influence gene function. The Auxin/Indole-3-Acetic Acid (Aux/IAA) gene family performs critical roles in auxin signal transduction in plants. In rice, the gene OsIAA23 (Os06t0597000) is known to affect development of roots and shoots, but previous knockouts in OsIAA23 have been sterile and difficult for research continuously. Here, we isolate new Osiaa23 mutants using the CRISPR/Cas9 system in japonica (Wuyunjing24) and indica (Kasalath) rice, with extensive genome re-sequencing to confirm the absence of off-target effects. In Kasalath, mutants with a 13-amino acid deletion showed profoundly greater dwarfing, lateral root developmental disorder, and fertility deficiency, relative to mutants with a single amino acid deletion, demonstrating that those 13 amino acids in Kasalath are essential to gene function. In Wuyunjing24, we predicted that mutants with a single base-pair frameshift insertion would experience premature termination and strong phenotypic defects, but instead these lines exhibited negligible phenotypic difference and normal fertility. Through RNA-seq, we show here that new mosaic transcripts of OsIAA23 were produced de novo, which circumvented the premature termination and thereby preserved the wild-type phenotype. This finding is a notable demonstration in plants that mutants can mask loss of function CRISPR/Cas9 editing of the target gene through de novo changes in alternative splicing.


April 21, 2020

Varieties of immunity activities and gut contents in tilapia with seasonal changes.

We performed 16S rDNA sequencing of tilapia fecal samples to analyze changes in tilapia gut contents after cultivation of the fish in the presence of sandwich-like floating beds of Chinese medicinal herbs (5 and 10% planting-areas; 5% Polygonum cuspidatum). The interactive effects between water quality and blood and hepatic pro- and anti-inflammatory concentrations were also assessed. Our results showed that the water quality (i.e., NO3-N, NO2-N, TP removal rates) improved, and the abundance of Chloroflexi and Cyanobacteria increased. The abundance of Bacteroidetes, Verrucomicrobia, Saccharibacteria, and Actinobacteria showed both significant seasonal decreases and increases in the presence of P. cuspidatum (increases in August and decreases in July). Fish blood and hepatic IL-10 and IFN-? levels (together with fish sampled in September) significantly increased in the P. cuspidatum group sampled in August, while those of TNF-a (10% sandwich-like, P. cuspidatum), IL-1ß (P. cuspidatum), IL-8 (5% sandwich-like in September, S905S) significantly decreased. Heat shock proteins 60 and 70 levels significantly increased in the P. cuspidatum group, and complement C3 and C4 concentrations significantly increased in S905S. This study demonstrated that enhanced immunity through the regulation of pro- and anti-inflammatory proteins was sustained throughout development until harvest, particularly in fish grown with P. cuspidatum.Copyright © 2019. Published by Elsevier Ltd.


April 21, 2020

In-depth analysis of the genome of Trypanosoma evansi, an etiologic agent of surra.

Trypanosoma evansi is the causative agent of the animal trypanosomiasis surra, a disease with serious economic burden worldwide. The availability of the genome of its closely related parasite Trypanosoma brucei allows us to compare their genetic and evolutionarily shared and distinct biological features. The complete genomic sequence of the T. evansi YNB strain was obtained using a combination of genomic and transcriptomic sequencing, de novo assembly, and bioinformatic analysis. The genome size of the T. evansi YNB strain was 35.2 Mb, showing 96.59% similarity in sequence and 88.97% in scaffold alignment with T. brucei. A total of 8,617 protein-coding genes, accounting for 31% of the genome, were predicted. Approximately 1,641 alternative splicing events of 820 genes were identified, with a majority mediated by intron retention, which represented a major difference in post-transcriptional regulation between T. evansi and T. brucei. Disparities in gene copy number of the variant surface glycoprotein, expression site-associated genes, microRNAs, and RNA-binding protein were clearly observed between the two parasites. The results revealed the genomic determinants of T. evansi, which encoded specific biological characteristics that distinguished them from other related trypanosome species.


April 21, 2020

Mediterraneibacter butyricigenes sp. nov., a butyrate-producing bacterium isolated from human faeces.

A Gram-stain-positive, obligately anaerobic, non-motile, nonspore-forming, and rod-shaped bacterial strain, designated KGMB01110T, was isolated from a faecal sample of a healthy male in South Korea. Phylogenetic analysis based on 16S rRNA gene showed that strain KGMB01110T belonged to Clostridium cluster XIVa and was most closely related to Mediterraneibacter glycyrrhizinilyticus KCTC 5760T (95.9% 16S rRNA gene sequence similarity). The DNA G + C content of strain KGMB01110T based on its whole genome sequence was 44.1 mol%. The major cellular fatty acids (> 10%) of the isolate were C14:0 and C16:0. The strain KGMB01110T was positive for arginine dihydrolase, ß-galactosidase-6-phosphatase, and alkaline phosphatase. The strain KGMB01110T also produced acid from D-glucose and D-rhamnose, and hydrolyzed gelatin and aesculin. Furthermore, HPLC analysis and UV-tests of culture supernatant revealed that the strain KGMB01110T produced butyrate as the major end product of glucose fermentation. Based on the phylogenetic and phenotypic characteristics, strain KGMB01110T represent a novel species of the genus Mediterraneibacter in the family Lachnospiraceae. The type strain is KGMB01110T (= KCTC 15684T = CCUG 72830T).


April 21, 2020

RADAR-seq: A RAre DAmage and Repair sequencing method for detecting DNA damage on a genome-wide scale.

RAre DAmage and Repair sequencing (RADAR-seq) is a highly adaptable sequencing method that enables the identification and detection of rare DNA damage events for a wide variety of DNA lesions at single-molecule resolution on a genome-wide scale. In RADAR-seq, DNA lesions are replaced with a patch of modified bases that can be directly detected by Pacific Biosciences Single Molecule Real-Time (SMRT) sequencing. RADAR-seq enables dynamic detection over a wide range of DNA damage frequencies, including low physiological levels. Furthermore, without the need for DNA amplification and enrichment steps, RADAR-seq provides sequencing coverage of damaged and undamaged DNA across an entire genome. Here, we use RADAR-seq to measure the frequency and map the location of ribonucleotides in wild-type and RNaseH2-deficient E. coli and Thermococcus kodakarensis strains. Additionally, by tracking ribonucleotides incorporated during in vivo lagging strand DNA synthesis, we determined the replication initiation point in E. coli, and its relation to the origin of replication (oriC). RADAR-seq was also used to map cyclobutane pyrimidine dimers (CPDs) in Escherichia coli (E. coli) genomic DNA exposed to UV-radiation. On a broader scale, RADAR-seq can be applied to understand formation and repair of DNA damage, the correlation between DNA damage and disease initiation and progression, and complex biological pathways, including DNA replication.Copyright © 2019 The Authors. Published by Elsevier B.V. All rights reserved.


April 21, 2020

A 12-kb structural variation in progressive myoclonic epilepsy was newly identified by long-read whole-genome sequencing.

We report a family with progressive myoclonic epilepsy who underwent whole-exome sequencing but was negative for pathogenic variants. Similar clinical courses of a devastating neurodegenerative phenotype of two affected siblings were highly suggestive of a genetic etiology, which indicates that the survey of genetic variation by whole-exome sequencing was not comprehensive. To investigate the presence of a variant that remained unrecognized by standard genetic testing, PacBio long-read sequencing was performed. Structural variant (SV) detection using low-coverage (6×) whole-genome sequencing called 17,165 SVs (7,216 deletions and 9,949 insertions). Our SV selection narrowed down potential candidates to only five SVs (two deletions and three insertions) on the genes tagged with autosomal recessive phenotypes. Among them, a 12.4-kb deletion involving the CLN6 gene was the top candidate because its homozygous abnormalities cause neuronal ceroid lipofuscinosis. This deletion included the initiation codon and was found in a GC-rich region containing multiple repetitive elements. These results indicate the presence of a causal variant in a difficult-to-sequence region and suggest that such variants that remain enigmatic after the application of current whole-exome sequencing technology could be uncovered by unbiased application of long-read whole-genome sequencing.


April 21, 2020

The CF Canada-Sick Kids Program in individual CF therapy: A resource for the advancement of personalized medicine in CF.

Therapies targeting certain CFTR mutants have been approved, yet variations in clinical response highlight the need for in-vitro and genetic tools that predict patient-specific clinical outcomes. Toward this goal, the CF Canada-Sick Kids Program in Individual CF Therapy (CFIT) is generating a “first of its kind”, comprehensive resource containing patient-specific cell cultures and data from 100 CF individuals that will enable modeling of therapeutic responses.The CFIT program is generating: 1) nasal cells from drug naïve patients suitable for culture and the study of drug responses in vitro, 2) matched gene expression data obtained by sequencing the RNA from the primary nasal tissue, 3) whole genome sequencing of blood derived DNA from each of the 100 participants, 4) induced pluripotent stem cells (iPSCs) generated from each participant’s blood sample, 5) CRISPR-edited isogenic control iPSC lines and 6) prospective clinical data from patients treated with CF modulators.To date, we have recruited 57 of 100 individuals to CFIT, most of whom are homozygous for F508del (to assess in-vitro: in-vivo correlations with respect to ORKAMBI response) or heterozygous for F508del and a minimal function mutation. In addition, several donors are homozygous for rare nonsense and missense mutations. Nasal epithelial cell cultures and matched iPSC lines are available for many of these donors.This accessible resource will enable development of tools that predict individual outcomes to current and emerging modulators targeting F508del-CFTR and facilitate therapy discovery for rare CF causing mutations.Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.


April 21, 2020

Full-length transcriptome analysis of Litopenaeus vannamei reveals transcript variants involved in the innate immune system.

To better understand the immune system of shrimp, this study combined PacBio isoform sequencing (Iso-Seq) and Illumina paired-end short reads sequencing methods to discover full-length immune-related molecules of the Pacific white shrimp, Litopenaeus vannamei. A total of 72,648 nonredundant full-length transcripts (unigenes) were generated with an average length of 2545 bp from five main tissues, including the hepatopancreas, cardiac stomach, heart, muscle, and pyloric stomach. These unigenes exhibited a high annotation rate (62,164, 85.57%) when compared against NR, NT, Swiss-Prot, Pfam, GO, KEGG and COG databases. A total of 7544 putative long noncoding RNAs (lncRNAs) were detected and 1164 nonredundant full-length transcripts (449 UniTransModels) participated in the alternative splicing (AS) events. Importantly, a total of 5279 nonredundant full-length unigenes were successfully identified, which were involved in the innate immune system, including 9 immune-related processes, 19 immune-related pathways and 10 other immune-related systems. We also found wide transcript variants, which increased the number and function complexity of immune molecules; for example, toll-like receptors (TLRs) and interferon regulatory factors (IRFs). The 480 differentially expressed genes (DEGs) were significantly higher or tissue-specific expression patterns in the hepatopancreas compared with that in other four tested tissues (FDR <0.05). Furthermore, the expression levels of six selected immune-related DEGs and putative IRFs were validated using real-time PCR technology, substantiating the reliability of the PacBio Iso-seq results. In conclusion, our results provide new genetic resources of long-read full-length transcripts data and information for identifying immune-related genes, which are an invaluable transcriptomic resource as genomic reference, especially for further exploration of the innate immune and defense mechanisms of shrimp. Copyright © 2019 Elsevier Ltd. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.