Menu
April 21, 2020

Hybrid sequencing-based personal full-length transcriptomic analysis implicates proteostatic stress in metastatic ovarian cancer.

Comprehensive molecular characterization of myriad somatic alterations and aberrant gene expressions at personal level is key to precision cancer therapy, yet limited by current short-read sequencing technology, individualized catalog of complete genomic and transcriptomic features is thus far elusive. Here, we integrated second- and third-generation sequencing platforms to generate a multidimensional dataset on a patient affected by metastatic epithelial ovarian cancer. Whole-genome and hybrid transcriptome dissection captured global genetic and transcriptional variants at previously unparalleled resolution. Particularly, single-molecule mRNA sequencing identified a vast array of unannotated transcripts, novel long noncoding RNAs and gene chimeras, permitting accurate determination of transcription start, splice, polyadenylation and fusion sites. Phylogenetic and enrichment inference of isoform-level measurements implicated early functional divergence and cytosolic proteostatic stress in shaping ovarian tumorigenesis. A complementary imaging-based high-throughput drug screen was performed and subsequently validated, which consistently pinpointed proteasome inhibitors as an effective therapeutic regime by inducing protein aggregates in ovarian cancer cells. Therefore, our study suggests that clinical application of the emerging long-read full-length analysis for improving molecular diagnostics is feasible and informative. An in-depth understanding of the tumor transcriptome complexity allowed by leveraging the hybrid sequencing approach lays the basis to reveal novel and valid therapeutic vulnerabilities in advanced ovarian malignancies.


April 21, 2020

Oenococcus sicerae sp. nov., isolated from French cider.

Two Gram-stain-positive, small ellipsoidal cocci, non-motile, oxidase- and catalase-negative, and facultative anaerobic strains (UCMA15228T and UCMA17102) were isolated in France, from fermented apple juices (ciders). The 16S rRNA gene sequence was identical between the two isolates and showed 97 % similarity with respect to the closest related species Oenococcus oeni and O. kitaharae. Therefore, the two isolates were classified within the genus Oenococcus. The phylogeny based on the pheS gene sequences also confirmed the position of the new taxon. DNA-DNA hybridizations based on in silico genome-to-genome comparisons (GGDC) and Average Nucleotide Identity (ANI) values, as well as species-specific PCR, validated the novelty of the taxon. Various phenotypic characteristics such as the optimum temperature and pH for growth, the ability to metabolise sugars, the aptitude to perform the malolactic fermentation, and the resistance to ethanol and NaCl, revealed that the two strains are distinguishable from the other members of the Oenococcus genus. The combined genotypic and phenotypic data support the classification of strains UCMA15228T and UCMA17102 into a novel species of Oenococcus, for which the name O. sicerae sp. nov. is proposed. The type strain is UCMA15228T (=DSM107163T=CIRM-BIA2288T).Copyright © 2018 Elsevier GmbH. All rights reserved.


April 21, 2020

Penicillium purpurogenum Produces a Set of Endoxylanases: Identification, Heterologous Expression, and Characterization of a Fourth Xylanase, XynD, a Novel Enzyme Belonging to Glycoside Hydrolase Family 10.

The fungus Penicillium purpurogenum grows on a variety of natural carbon sources and secretes a large number of enzymes which degrade the polysaccharides present in lignocellulose. In this work, the gene coding for a novel endoxylanase has been identified in the genome of the fungus. This gene (xynd) possesses four introns. The cDNA has been expressed in Pichia pastoris and characterized. The enzyme, XynD, belongs to family 10 of the glycoside hydrolases. Mature XynD has a calculated molecular weight of 40,997. It consists of 387 amino acid residues with an N-terminal catalytic module, a linker rich in ser and thr residues, and a C-terminal family 1 carbohydrate-binding module. XynD shows the highest identity (97%) to a putative endoxylanase from Penicillium subrubescens but its highest identity to a biochemically characterized xylanase (XYND from Penicillium funiculosum) is only 68%. The enzyme has a temperature optimum of 60 °C, and it is highly stable in its pH optimum range of 6.5-8.5. XynD is the fourth biochemically characterized endoxylanase from P. purpurogenum, confirming the rich potential of this fungus for lignocellulose biodegradation. XynD, due to its wide pH optimum and stability, may be a useful enzyme in biotechnological procedures related to this biodegradation process.


April 21, 2020

PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice.

In eukaryotes, alternative splicing (AS) greatly expands the diversity of transcripts. However, it is challenging to accurately determine full-length splicing isoforms. Recently, more studies have taken advantage of Pacific Bioscience (PacBio) long-read sequencing to identify full-length transcripts. Nevertheless, the high error rate of PacBio reads seriously offsets the advantages of long reads, especially for accurately identifying splicing junctions. To best capitalize on the features of long reads, we used Illumina RNA-seq reads to improve PacBio circular consensus sequence (CCS) quality and to validate splicing patterns in the rice transcriptome. We evaluated the impact of CCS accuracy on the number and the validation rate of splicing isoforms, and integrated a comprehensive pipeline of splicing transcripts analysis by Iso-Seq and RNA-seq (STAIR) to identify the full-length multi-exon isoforms in rice seedling transcriptome (Oryza sativa L. ssp. japonica). STAIR discovered 11 733 full-length multi-exon isoforms, 6599 more than the SMRT Portal RS_IsoSeq pipeline did. Of these splicing isoforms identified, 4453 (37.9%) were missed in assembled transcripts from RNA-seq reads, and 5204 (44.4%), including 268 multi-exon long non-coding RNAs (lncRNAs), were not reported in the MSU_osa1r7 annotation. Some randomly selected unreported splicing junctions were verified by polymerase chain reaction (PCR) amplification. In addition, we investigated alternative polyadenylation (APA) events in transcripts and identified 829 major polyadenylation [poly(A)] site clusters (PACs). The analysis of splicing isoforms and APA events will facilitate the annotation of the rice genome and studies on the expression and polyadenylation of AS genes in different developmental stages or growth conditions of rice. © 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.


April 21, 2020

Fudania jinshanensis gen. nov., sp. nov., isolated from faeces of the Tibetan antelope (Pantholops hodgsonii) in China.

Two hitherto unknown bacteria (strains 313T and 352) were recovered from the faeces of Tibetan antelopes on the Tibet-Qinghai Plateau, PR China. Cells were rod-shaped and Gram-stain-positive. The optimal growth conditions were at 37?°C and pH 7. The isolates were closely related to Actinotignum sanguinis (92.6?% 16S rRNA gene sequence similarity), Arcanobacterium haemolyticum (92.5?%), Actinotignum schaalii (92.4?%), Actinobaculum massiliense (92.2?%) and Flaviflexus huanghaiensis (91.6?%). Phylogenetic analyses showed that strains 313T and 352 clustered independently in the vicinity of the genera Actinotignum, Actinobaculum and Flaviflexus, but could not be classified clearly as a member of any of these genera. Phylogenomic analysis also indicated that strains 313T and 352 formed an independent branch in the family Actinomycetaceae. The major cellular fatty acids of the strains were C16?:?0 and C18?:?1?9c. The polar lipids comprised diphosphatidylglycerol, phosphatidylinositol mannoside, phosphatidylglycerol, phosphatidylinositol and five unidentified components. The peptidoglycan contained lysine, alanine and glutamic acid. The respiratory quinone was absent. The whole-cell sugars included glucose and rhamnose. The DNA G+C?content of strain 313T was 60.6?mol%. Based on the low 16S rRNA gene sequence similarities, its taxonomic position in the phylogenetic and phylogenomic trees and its unique lipid pattern, we propose that strains 313T and 352 represent members of a novel species in a new genus, for which the name Fudania jinshanensis gen. nov., sp. nov. is proposed. The type strain is 313T (=CGMCC 4.7453T=DSM 106216T).


April 21, 2020

Alternative Splicing of the Delta-Opioid Receptor Gene Suggests Existence of New Functional Isoforms.

The delta-opioid receptor (DOPr) participates in mediating the effects of opioid analgesics. However, no selective agonists have entered clinical care despite potential to ameliorate many neurological and psychiatric disorders. In an effort to address the drug development challenges, the functional contribution of receptor isoforms created by alternative splicing of the three-exonic coding gene, OPRD1, has been overlooked. We report that the gene is transcriptionally more diverse than previously demonstrated, producing novel protein isoforms in humans and mice. We provide support for the functional relevance of splice variants through context-dependent expression profiling (tissues, disease model) and conservation of the transcriptional landscape in closely related vertebrates. The conserved alternative transcriptional events have two distinct patterns. First, cassette exon inclusions between exons 1 and 2 interrupt the reading frame, producing truncated receptor fragments comprising only the first transmembrane (TM) domain, despite the lack of exact exon orthologues between distant species. Second, a novel promoter and transcriptional start site upstream of exon 2 produces a transcript of an N-terminally truncated 6TM isoform. However, a fundamental difference in the exonic landscaping as well as translation and translation products poses limits for modelling the human DOPr receptor system in mice.


April 21, 2020

Retrospective whole-genome sequencing analysis distinguished PFGE and drug-resistance-matched retail meat and clinical Salmonella isolates.

Non-typhoidal Salmonella is a leading cause of outbreak and sporadic-associated foodborne illnesses in the United States. These infections have been associated with a range of foods, including retail meats. Traditionally, pulsed-field gel electrophoresis (PFGE) and antibiotic susceptibility testing (AST) have been used to facilitate public health investigations of Salmonella infections. However, whole-genome sequencing (WGS) has emerged as an alternative tool that can be routinely implemented. To assess its potential in enhancing integrated surveillance in Pennsylvania, USA, WGS was used to directly compare the genetic characteristics of 7 retail meat and 43 clinical historic Salmonella isolates, subdivided into 3 subsets based on PFGE and AST results, to retrospectively resolve their genetic relatedness and identify antimicrobial resistance (AMR) determinants. Single nucleotide polymorphism (SNP) analyses revealed that the retail meat isolates within S. Heidelberg, S. Typhimurium var. O5- subset 1 and S. Typhimurium var. O5- subset 2 were separated from each primary PFGE pattern-matched clinical isolate by 6-12, 41-96 and 21-81 SNPs, respectively. Fifteen resistance genes were identified across all isolates, including fosA7, a gene only recently found in a limited number of Salmonella and a =95?%?phenotype to genotype correlation was observed for all tested antimicrobials. Moreover, AMR was primarily plasmid-mediated in S. Heidelberg and S. Typhimurium var. O5- subset 2, whereas AMR was chromosomally carried in S. Typhimurium var. O5- subset 1. Similar plasmids were identified in both the retail meat and clinical isolates. Collectively, these data highlight the utility of WGS in retrospective analyses and enhancing integrated surveillance for Salmonella from multiple sources.


April 21, 2020

Genome Sequence of Bacillus Velezensis W1, A Strain with Strong Acaricidal Activity against Two-Spotted Spider Mite (Tetranychus Urticae)

Bacillus velezensis W1, isolated from two-spotted spider mites that had died naturally, is a patented strain with strong capability to cause mortality of the phytophagous mite Tetranychus urticae. The whole genome of W1 was completely sequenced with a combination of an Illumina Miseq platform (400-bp paired-end) with 2 × 250 bases and a Pacific Biosciences (PaBio) RS II Single Molecule Real Time (SMRT) sequencing platform using a 20 kb SMRTbellTM template library. Here, we report the complete genome sequence of B. velezensis W1, including one circular chromosome of 4,237,431 bp encoding 4,352 genes with GC content of 45.84%, providing insights into the genomic basis of its acaricidal activity and facilitating its application in red spider mite biocontrol.


April 21, 2020

Long-read sequencing identified intronic repeat expansions in SAMD12 from Chinese pedigrees affected with familial cortical myoclonic tremor with epilepsy.

The locus for familial cortical myoclonic tremor with epilepsy (FCMTE) has long been mapped to 8q24 in linkage studies, but the causative mutations remain unclear. Recently, expansions of intronic TTTCA and TTTTA repeat motifs within SAMD12 were found to be involved in the pathogenesis of FCMTE in Japanese pedigrees. We aim to identify the causative mutations of FCMTE in Chinese pedigrees.We performed genetic linkage analysis by microsatellite markers in a five-generation Chinese pedigree with 55 members. We also used array-comparative genomic hybridisation (CGH) and next-generation sequencing (NGS) technologies (whole-exome sequencing, capture region deep sequencing and whole-genome sequencing) to identify the causative mutations in the disease locus. Recently, we used low-coverage (~10×) long-read genome sequencing (LRS) on the PacBio Sequel and Oxford Nanopore platforms to identify the causative mutations, and used repeat-primed PCR for validation of the repeat expansions.Linkage analysis mapped the disease locus to 8q23.3-24.23. Array-CGH and NGS failed to identify causative mutations in this locus. LRS identified the intronic TTTCA and TTTTA repeat expansions in SAMD12 as the causative mutations, thus corroborating the recently published results in Japanese pedigrees.We identified the pentanucleotide repeat expansion in SAMD12 as the causative mutation in Chinese FCMTE pedigrees. Our study also suggested that LRS is an effective tool for molecular diagnosis of genetic disorders, especially for neurological diseases that cannot be positively diagnosed by conventional clinical microarray and NGS technologies. © Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.


April 21, 2020

A new variant of mcr-1 identified from an extended-spectrum ß lactamase-producing Escherichia coli.

Plasmid-mediated colistin resistance gene, mcr-1, has been widely reported almost all over the world. The product of the gene, MCR-1, is one of the members of the phosphoethanolamine transferase enzyme family, which can add phosphoethanolamine to lipid A, thus reducing affinity to polymyxins. Isolates carrying mcr-1 gene are often multidrug resistant (MDR), including co-production of MCR-1 and extended spectrum B lactamases (ESBLs) or carbapenemases, resulting in great clinical concerns.


April 21, 2020

Genetic characterization and potential molecular dissemination mechanism of tet(31) gene in Aeromonas caviae from an oxytetracycline wastewater treatment system.

Recently, the rarely reported tet(31) tetracycline resistance determinant was commonly found in Aeromonas salmonicida, Gallibacterium anatis, and Oblitimonas alkaliphila isolated from farming animals and related environment. However, its distribution in other bacteria and potential molecular dissemination mechanism in environment are still unknown. The purpose of this study was to investigate the potential mechanism underlying dissemination of tet(31) by analysing the tet(31)-carrying fragments in A. caviae strains isolated from an aerobic biofilm reactor treating oxytetracycline bearing wastewater. Twenty-three A. caviae strains were screened for the tet(31) gene by polymerase chain reaction (PCR). Three strains (two harbouring tet(31), one not) were subjected to whole genome sequencing using the PacBio RSII platform. Seventeen A. caviae strains carried the tet(31) gene and exhibited high resistance levels to oxytetracycline with minimum inhibitory concentrations (MICs) ranging from 256 to 512?mg/L. tet(31) was comprised of the transposon Tn6432 on the chromosome of A. caviae, and Tn6432 was also found in 15 additional tet(31)-positive A. caviae isolates by PCR. More important, Tn6432 was located on an integrative conjugative element (ICE)-like element, which could mediate the dissemination of the tet(31)-carrying transposon Tn6432 between bacteria. Comparative analysis demonstrated that Tn6432 homologs with the structure ISCR2-?phzF-tetR(31)-tet(31)-?glmM-sul2 were also carried by A. salmonicida, G. anatis, and O. alkaliphila, suggesting that this transposon can be transferred between species and even genera. This work provides the first report on the identification of the tet(31) gene in A. caviae, and will be helpful in exploring the dissemination mechanisms of tet(31) in water environment.Copyright © 2018. Published by Elsevier B.V.


April 21, 2020

Genome Organization and Adaptive Potential of Archetypal Organophosphate Degrading Sphingobium fuliginis ATCC 27551.

Sphingobium fuliginis ATCC 27551, previously classified as Flavobacterium sp. ATCC 27551, degrades neurotoxic organophosphate insecticides and nerve agents through the activity of a membrane-associated organophosphate hydrolase. This study was designed to determine the complete genome sequence of S. fuliginis ATCC 27551 to unravel its degradative potential and adaptability to harsh environments. The 5,414,624?bp genome with a GC content of 64.4% is distributed between two chromosomes and four plasmids and encodes 5,557 proteins. Of the four plasmids, designated as pSF1, pSF2, pSF3, and pSF4, only two (pSF1 and pSF2) are self-transmissible and contained the complete genetic repertoire for a T4SS. The other two plasmids (pSF3 and pSF4) are mobilizable and both showed the presence of an oriT and relaxase-encoding sequences. The sequence of plasmid pSF3 coincided with the previously determined sequence of pPDL2 and included an opd gene encoding organophosphate hydrolase as a part of the mobile element. About 15,455 orthologous clusters were identified from among the cumulatively annotated genes of 49 Sphingobium species. Phylogenetic analysis done using the core genome consisting of 802 orthologous clusters revealed a close relationship between S. fuliginis ATCC 27551 and bacteria capable of degradation of polyaromatic hydrocarbon compounds. Genes coding for transposases, efflux pumps conferring resistance to heavy metals, and TonR-type outer membrane receptors are selectively enriched in the genome of S. fuliginis ATCC 27551 and appear to contribute to the adaptive potential of the organism to challenging and harsh environments. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Confident phylogenetic identification of uncultured prokaryotes through long read amplicon sequencing of the 16S-ITS-23S rRNA operon.

Amplicon sequencing of the 16S rRNA gene is the predominant method to quantify microbial compositions and to discover novel lineages. However, traditional short amplicons often do not contain enough information to confidently resolve their phylogeny. Here we present a cost-effective protocol that amplifies a large part of the rRNA operon and sequences the amplicons with PacBio technology. We tested our method on a mock community and developed a read-curation pipeline that reduces the overall read error rate to 0.18%. Applying our method on four environmental samples, we captured near full-length rRNA operon amplicons from a large diversity of prokaryotes. The method operated at moderately high-throughput (22286-37,850 raw ccs reads) and generated a large amount of putative novel archaeal 23S rRNA gene sequences compared to the archaeal SILVA database. These long amplicons allowed for higher resolution during taxonomic classification by means of long (~1000 bp) 16S rRNA gene fragments and for substantially more confident phylogenies by means of combined near full-length 16S and 23S rRNA gene sequences, compared to shorter traditional amplicons (250 bp of the 16S rRNA gene). We recommend our method to those who wish to cost-effectively and confidently estimate the phylogenetic diversity of prokaryotes in environmental samples at high throughput. © 2019 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.


April 21, 2020

Long-read sequence capture of the haemoglobin gene clusters across codfish species.

Combining high-throughput sequencing with targeted sequence capture has become an attractive tool to study specific genomic regions of interest. Most studies have so far focused on the exome using short-read technology. These approaches are not designed to capture intergenic regions needed to reconstruct genomic organization, including regulatory regions and gene synteny. Here, we demonstrate the power of combining targeted sequence capture with long-read sequencing technology for comparative genomic analyses of the haemoglobin (Hb) gene clusters across eight species separated by up to 70 million years. Guided by the reference genome assembly of the Atlantic cod (Gadus morhua) together with genome information from draft assemblies of selected codfishes, we designed probes covering the two Hb gene clusters. Use of custom-made barcodes combined with PacBio RSII sequencing led to highly continuous assemblies of the LA (~100 kb) and MN (~200 kb) clusters, which include syntenic regions of coding and intergenic sequences. Our results revealed an overall conserved genomic organization of the Hb genes within this lineage, yet with several, lineage-specific gene duplications. Moreover, for some of the species examined, we identified amino acid substitutions at two sites in the Hbb1 gene as well as length polymorphisms in its regulatory region, which has previously been linked to temperature adaptation in Atlantic cod populations. This study highlights the use of targeted long-read capture as a versatile approach for comparative genomic studies by generation of a cross-species genomic resource elucidating the evolutionary history of the Hb gene family across the highly divergent group of codfishes. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.


April 21, 2020

Comparative genomic analysis of Lactobacillus mucosae LM1 identifies potential niche-specific genes and pathways for gastrointestinal adaptation.

Lactobacillus mucosae is currently of interest as putative probiotics due to their metabolic capabilities and ability to colonize host mucosal niches. L. mucosae LM1 has been studied in its functions in cell adhesion and pathogen inhibition, etc. It demonstrated unique abilities to use energy from carbohydrate and non-carbohydrate sources. Due to these functions, we report the first complete genome sequence of an L. mucosae strain, L. mucosae LM1. Analysis of the pan-genome in comparison with closely-related Lactobacillus species identified a complete glycogen metabolism pathway, as well as folate biosynthesis, complementing previous proteomic data on the LM1 strain. It also revealed common and unique niche-adaptation genes among the various L. mucosae strains. The aim of this study was to derive genomic information that would reveal the probable mechanisms underlying the probiotic effect of L. mucosae LM1, and provide a better understanding of the nature of L. mucosae sp. Copyright © 2017 Elsevier Inc. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.