Menu
April 21, 2020

Secretion of an Argonaute protein by a parasitic nematode and the evolution of its siRNA guides.

Extracellular RNA has been proposed to mediate communication between cells and organisms however relatively little is understood regarding how specific sequences are selected for export. Here, we describe a specific Argonaute protein (exWAGO) that is secreted in extracellular vesicles (EVs) released by the gastrointestinal nematode Heligmosomoides bakeri, at multiple copies per EV. Phylogenetic and gene expression analyses demonstrate exWAGO orthologues are highly conserved and abundantly expressed in related parasites but highly diverged in free-living genus Caenorhabditis. We show that the most abundant small RNAs released from the nematode parasite are not microRNAs as previously thought, but rather secondary small interfering RNAs (siRNAs) that are produced by RNA-dependent RNA Polymerases. The siRNAs that are released in EVs have distinct evolutionary properties compared to those resident in free-living or parasitic nematodes. Immunoprecipitation of exWAGO demonstrates that it specifically associates with siRNAs from transposons and newly evolved repetitive elements that are packaged in EVs and released into the host environment. Together this work demonstrates molecular and evolutionary selectivity in the small RNA sequences that are released in EVs into the host environment and identifies a novel Argonaute protein as the mediator of this. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

A 12-kb structural variation in progressive myoclonic epilepsy was newly identified by long-read whole-genome sequencing.

We report a family with progressive myoclonic epilepsy who underwent whole-exome sequencing but was negative for pathogenic variants. Similar clinical courses of a devastating neurodegenerative phenotype of two affected siblings were highly suggestive of a genetic etiology, which indicates that the survey of genetic variation by whole-exome sequencing was not comprehensive. To investigate the presence of a variant that remained unrecognized by standard genetic testing, PacBio long-read sequencing was performed. Structural variant (SV) detection using low-coverage (6×) whole-genome sequencing called 17,165 SVs (7,216 deletions and 9,949 insertions). Our SV selection narrowed down potential candidates to only five SVs (two deletions and three insertions) on the genes tagged with autosomal recessive phenotypes. Among them, a 12.4-kb deletion involving the CLN6 gene was the top candidate because its homozygous abnormalities cause neuronal ceroid lipofuscinosis. This deletion included the initiation codon and was found in a GC-rich region containing multiple repetitive elements. These results indicate the presence of a causal variant in a difficult-to-sequence region and suggest that such variants that remain enigmatic after the application of current whole-exome sequencing technology could be uncovered by unbiased application of long-read whole-genome sequencing.


April 21, 2020

Characterization of the genome of a Nocardia strain isolated from soils in the Qinghai-Tibetan Plateau that specifically degrades crude oil and of this biodegradation.

A strain of Nocardia isolated from crude oil-contaminated soils in the Qinghai-Tibetan Plateau degrades nearly all components of crude oil. This strain was identified as Nocardia soli Y48, and its growth conditions were determined. Complete genome sequencing showed that N. soli Y48 has a 7.3?Mb genome and many genes responsible for hydrocarbon degradation, biosurfactant synthesis, emulsification and other hydrocarbon degradation-related metabolisms. Analysis of the clusters of orthologous groups (COGs) and genomic islands (GIs) revealed that Y48 has undergone significant gene transfer events to adapt to changing environmental conditions (crude oil contamination). The structural features of the genome might provide a competitive edge for the survival of N. soli Y48 in oil-polluted environments and reflect the adaptation of coexisting bacteria to distinct nutritional niches.Copyright © 2018. Published by Elsevier Inc.


April 21, 2020

Characterization of an NDM-19-producing Klebsiella pneumoniae strain harboring 2 resistance plasmids from China.

Carbapenem-resistant Klebsiella pneumoniae (CRKP) has become a major cause of nosocomial infections and posed challenges on clinical treatments. The main objective of this study was to determinate the genetic characteristics of the NDM-19-producing CRKP strain SCM96. From 2015 to 2017, 18 CRKP strains were recovered from sputum samples of patients in respiratory medicine in 6 hospitals from 5 provinces and cities in China. Polymerase chain reaction results for carbapenem resistance genes detection showed strain SCM96 carried blaNDM-19. Three types of transconjugants harboring different plasmids were selected by conjugation experiment. The Whole Genome Sequencing (WGS) was performed using the PacBio RS platform. The genome size of SCM96 was 5,579,775?bp and composed of chromosomal DNA (5,398,745?bp) and 2 plasmids, IncFII type plasmid pSCM96-1 (134,869?bp) and IncX3 type plasmid pSCM96-2 (46,161?bp). SCM96 belonged to ST15 and K28. In addition to the 4 antibiotic resistance genes located in the chromosome, pSCM96-1 carried a complex resistance region containing 17 resistance genes and several mobile genetic elements (MGEs) like ?Tn6029, In4-like integron, and Tn3, and pSCM96-2 had only 1 blaNDM-19 gene. As far as we know, this was the first description of blaNDM-19 in K. pneumoniae. Up to 22 antibiotic resistance genes, several important MGEs, and transferable plasmids might increase the possibility of co-spreading of blaNDM-19 with other resistance genes.Copyright © 2018 Elsevier Inc. All rights reserved.


April 21, 2020

Fast and accurate genomic analyses using genome graphs.

The human reference genome serves as the foundation for genomics by providing a scaffold for alignment of sequencing reads, but currently only reflects a single consensus haplotype, thus impairing analysis accuracy. Here we present a graph reference genome implementation that enables read alignment across 2,800 diploid genomes encompassing 12.6 million SNPs and 4.0 million insertions and deletions (indels). The pipeline processes one whole-genome sequencing sample in 6.5?h using a system with 36?CPU cores. We show that using a graph genome reference improves read mapping sensitivity and produces a 0.5% increase in variant calling recall, with unaffected specificity. Structural variations incorporated into a graph genome can be genotyped accurately under a unified framework. Finally, we show that iterative augmentation of graph genomes yields incremental gains in variant calling accuracy. Our implementation is an important advance toward fulfilling the promise of graph genomes to radically enhance the scalability and accuracy of genomic analyses.


April 21, 2020

The CF Canada-Sick Kids Program in individual CF therapy: A resource for the advancement of personalized medicine in CF.

Therapies targeting certain CFTR mutants have been approved, yet variations in clinical response highlight the need for in-vitro and genetic tools that predict patient-specific clinical outcomes. Toward this goal, the CF Canada-Sick Kids Program in Individual CF Therapy (CFIT) is generating a “first of its kind”, comprehensive resource containing patient-specific cell cultures and data from 100 CF individuals that will enable modeling of therapeutic responses.The CFIT program is generating: 1) nasal cells from drug naïve patients suitable for culture and the study of drug responses in vitro, 2) matched gene expression data obtained by sequencing the RNA from the primary nasal tissue, 3) whole genome sequencing of blood derived DNA from each of the 100 participants, 4) induced pluripotent stem cells (iPSCs) generated from each participant’s blood sample, 5) CRISPR-edited isogenic control iPSC lines and 6) prospective clinical data from patients treated with CF modulators.To date, we have recruited 57 of 100 individuals to CFIT, most of whom are homozygous for F508del (to assess in-vitro: in-vivo correlations with respect to ORKAMBI response) or heterozygous for F508del and a minimal function mutation. In addition, several donors are homozygous for rare nonsense and missense mutations. Nasal epithelial cell cultures and matched iPSC lines are available for many of these donors.This accessible resource will enable development of tools that predict individual outcomes to current and emerging modulators targeting F508del-CFTR and facilitate therapy discovery for rare CF causing mutations.Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.


April 21, 2020

Characterization and analysis of the transcriptome in Gymnocypris selincuoensis on the Qinghai-Tibetan Plateau using single-molecule long-read sequencing and RNA-seq.

The lakes on the Qinghai-Tibet Plateau (QTP) are the largest and highest lake group in the world. Gymnocypris selincuoensis is the only cyprinid fish living in lake Selincuo, the largest lake on QTP. However, its genetic resource is still blank, limiting studies on molecular and genetic analysis. In this study, the transcriptome of G. selincuoensis was first generated by using PacBio Iso-Seq and Illumina RNA-seq. A full-length (FL) transcriptome with 75,435 transcripts was obtained by Iso-Seq with N50 length of 3,870 bp. Among all transcripts, 75,016 were annotated to public databases, 64,710 contain complete open reading frames and 2,811 were long non-coding RNAs. Based on all- vs.-all BLAST, 2,069 alternative splicing events were detected, and 80% of them were validated by reverse transcription polymerase chain reaction (RT-PCR). Tissue gene expression atlas showed that the number of detected expressed transcripts ranged from 37,397 in brain to 19,914 in muscle, with 10,488 transcripts detected in all seven tissues. Comparative genomic analysis with other cyprinid fishes identified 77 orthologous genes with potential positive selection (Ka/Ks > 0.3). A total of 56,696 perfect simple sequence repeats were identified from FL transcripts. Our results provide valuable genetic resources for further studies on adaptive evolution, gene expression and population genetics in G. selincuoensis and other congeneric fishes. © The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


April 21, 2020

Genomic analysis of the aggressive tree pathogen Ceratocystis albifundus.

The overall goal of this study was to determine whether the genome of an important plant pathogen in Africa, Ceratocystis albifundus, is structured into subgenomic compartments, and if so, to establish how these compartments are distributed across the genome. For this purpose, the publicly available genome of C. albifundus was complemented with the genome sequences for four additional isolates using the Illumina HiSeq platform. In addition, a reference genome for one of the individuals was assembled using both PacBio and Illumina HiSeq technologies. Our results showed a high degree of synteny between the five genomes, although several regions lacked detectable long-range synteny. These regions were associated with the presence of accessory genes, lower genetic similarity, variation in read-map depth, as well as transposable elements and genes associated with host-pathogen interactions (e.g. effectors and CAZymes). Such patterns are regarded as hallmarks of accelerated evolution, particularly of accessory subgenomic compartments in fungal pathogens. Our findings thus showed that the genome of C. albifundus is made-up of core and accessory subgenomic compartments, which is an important step towards characterizing its pangenome. This study also highlights the value of comparative genomics for understanding mechanisms that may underly and influence the biology and evolution of pathogens.Copyright © 2019 British Mycological Society. Published by Elsevier Ltd. All rights reserved.


April 21, 2020

rMETL: sensitive mobile element insertion detection with long read realignment.

Mobile element insertion (MEI) is a major category of structure variations (SVs). The rapid development of long read sequencing technologies provides the opportunity to detect MEIs sensitively. However, the signals of MEI implied by noisy long reads are highly complex due to the repetitiveness of mobile elements as well as the high sequencing error rates. Herein, we propose the Realignment-based Mobile Element insertion detection Tool for Long read (rMETL). Benchmarking results of simulated and real datasets demonstrate that rMETL enables to handle the complex signals to discover MEIs sensitively. It is suited to produce high-quality MEI callsets in many genomics studies.rMETL is available from https://github.com/hitbc/rMETL.Supplementary data are available at Bioinformatics online. © The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


April 21, 2020

Dynamic virulence-related regions of the plant pathogenic fungus Verticillium dahliae display enhanced sequence conservation.

Plant pathogens continuously evolve to evade host immune responses. During host colonization, many fungal pathogens secrete effectors to perturb such responses, but these in turn may become recognized by host immune receptors. To facilitate the evolution of effector repertoires, such as the elimination of recognized effectors, effector genes often reside in genomic regions that display increased plasticity, a phenomenon that is captured in the two-speed genome hypothesis. The genome of the vascular wilt fungus Verticillium dahliae displays regions with extensive presence/absence polymorphisms, so-called lineage-specific regions, that are enriched in in planta-induced putative effector genes. As expected, comparative genomics reveals differential degrees of sequence divergence between lineage-specific regions and the core genome. Unanticipated, lineage-specific regions display markedly higher sequence conservation in coding as well as noncoding regions than the core genome. We provide evidence that disqualifies horizontal transfer to explain the observed sequence conservation and conclude that sequence divergence occurs at a slower pace in lineage-specific regions of the V. dahliae genome. We hypothesize that differences in chromatin organisation may explain lower nucleotide substitution rates in the plastic, lineage-specific regions of V. dahliae. © 2019 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.


April 21, 2020

Full-length transcriptome analysis of Litopenaeus vannamei reveals transcript variants involved in the innate immune system.

To better understand the immune system of shrimp, this study combined PacBio isoform sequencing (Iso-Seq) and Illumina paired-end short reads sequencing methods to discover full-length immune-related molecules of the Pacific white shrimp, Litopenaeus vannamei. A total of 72,648 nonredundant full-length transcripts (unigenes) were generated with an average length of 2545 bp from five main tissues, including the hepatopancreas, cardiac stomach, heart, muscle, and pyloric stomach. These unigenes exhibited a high annotation rate (62,164, 85.57%) when compared against NR, NT, Swiss-Prot, Pfam, GO, KEGG and COG databases. A total of 7544 putative long noncoding RNAs (lncRNAs) were detected and 1164 nonredundant full-length transcripts (449 UniTransModels) participated in the alternative splicing (AS) events. Importantly, a total of 5279 nonredundant full-length unigenes were successfully identified, which were involved in the innate immune system, including 9 immune-related processes, 19 immune-related pathways and 10 other immune-related systems. We also found wide transcript variants, which increased the number and function complexity of immune molecules; for example, toll-like receptors (TLRs) and interferon regulatory factors (IRFs). The 480 differentially expressed genes (DEGs) were significantly higher or tissue-specific expression patterns in the hepatopancreas compared with that in other four tested tissues (FDR <0.05). Furthermore, the expression levels of six selected immune-related DEGs and putative IRFs were validated using real-time PCR technology, substantiating the reliability of the PacBio Iso-seq results. In conclusion, our results provide new genetic resources of long-read full-length transcripts data and information for identifying immune-related genes, which are an invaluable transcriptomic resource as genomic reference, especially for further exploration of the innate immune and defense mechanisms of shrimp. Copyright © 2019 Elsevier Ltd. All rights reserved.


April 21, 2020

Determining stoichiometry and kinetics of two thermophilic nitrifying communities as a crucial step in the development of thermophilic nitrogen removal.

Nitrification and denitrification, the key biological processes for thermophilic nitrogen removal, have separately been established in bioreactors at 50?°C. A well-characterized set of kinetic parameters is essential to integrate these processes while safeguarding the autotrophs performing nitrification. Knowledge on thermophilic nitrifying kinetics is restricted to isolated or highly enriched batch cultures, which do not represent bioreactor conditions. This study characterized the stoichiometry and kinetics of two thermophilic (50?°C) nitrifying communities. The most abundant ammonia oxidizing archaea (AOA) were related to the Nitrososphaera genus, clustering relatively far from known species Nitrososphaera gargensis (95.5% 16S rRNA gene sequence identity). The most abundant nitrite oxidizing bacteria (NOB) were related to Nitrospira calida (97% 16S rRNA gene sequence identity). The nitrification biomass yield was 0.20-0.24?g VSS g-1 N, resulting mainly from a high AOA yield (0.16-0.20?g VSS g-1 N), which was reflected in a high AOA abundance in the community (57-76%) compared to NOB (5-11%). Batch-wise determination of decay rates (AOA: 0.23-0.29 d-1; NOB: 0.32-0.43 d-1) rendered an overestimation compared to in situ estimations of overall decay rate (0.026-0.078 d-1). Possibly, the inactivation rate rather than the actual decay rate was determined in batch experiments. Maximum growth rates of AOA and NOB were 0.12-0.15 d-1 and 0.13-0.33 d-1 respectively. NOB were susceptible to nitrite, opening up opportunities for shortcut nitrogen removal. However, NOB had a similar growth rate and oxygen affinity (0.15-0.55?mg O2 L-1) as AOA and were resilient towards free ammonia (IC50?>?16?mg NH3-N L-1). This might complicate NOB outselection using common practices to establish shortcut nitrogen removal (SRT control; aeration control; free ammonia shocks). Overall, the obtained insights can assist in integrating thermophilic conversions and facilitate single-sludge nitrification/denitrification. Copyright © 2019 Elsevier Ltd. All rights reserved.


April 21, 2020

Marinitoga lauensis sp. nov., a novel deep-sea hydrothermal vent thermophilic anaerobic heterotroph with a prophage.

A novel moderately thermophilic, heterotrophic anaerobe, designated strain LG1T, was isolated from the Mariner deep-sea hydrothermal vent field along the Eastern Lau Spreading Center and Valu Fa Ridge. Cells of strain LG1T were motile rods, occurring singly or in pairs, 0.6µm in width and 1.2µm in length. The strain LG1T grew between 40 and 70°C (optimum 50-55°C), at a pH between 5 and 8 (optimum pH 6.5) and with 7.5-50gL-1 NaCl (optimum 30gL-1). Sulfur, cystine and thiosulfate were reduced to sulfide, and cell yield was improved in the presence of cystine. Strain LG1T was an organotroph able to use a variety of organic compounds. Phylogenetic analysis based on 16S rRNA gene sequence comparisons indicated that strain LG1T was affiliated to the genus Marinitoga within the order Petrotogales. It shared 95.34-96.31% 16S rRNA gene sequence similarity with strains of other Marinitoga species, and is most closely related to Marinitoga okinawensis. Genome analysis revealed the presence of a prophage sharing high sequence homology with the viruses MPV1, MCV1 and MCV2 hosted by Marinitoga strains. Based on the data from the phylogenetic analyses and the physiological properties of the novel isolate, we propose that strain LG1T is a representative of a novel species, for which the name Marinitoga lauensis sp. nov. is proposed; the type strain is LG1T (=DSM 106824=JCM 32613).Copyright © 2019 Elsevier GmbH. All rights reserved.


April 21, 2020

TranscriptClean: variant-aware correction of indels, mismatches and splice junctions in long-read transcripts.

Long-read, single-molecule sequencing platforms hold great potential for isoform discovery and characterization of multi-exon transcripts. However, their high error rates are an obstacle to distinguishing novel transcript isoforms from sequencing artifacts. Therefore, we developed the package TranscriptClean to correct mismatches, microindels and noncanonical splice junctions in mapped transcripts using the reference genome while preserving known variants.Our method corrects nearly all mismatches and indels present in a publically available human PacBio Iso-seq dataset, and rescues 39% of noncanonical splice junctions.All Python and R scripts used in this paper are available at https://github.com/dewyman/TranscriptClean.


April 21, 2020

Current advances in HIV vaccine preclinical studies using Macaque models.

The macaque simian or simian/human immunodeficiency virus (SIV/SHIV) challenge model has been widely used to inform and guide human vaccine trials. Substantial advances have been made recently in the application of repeated-low-dose challenge (RLD) approach to assess SIV/SHIV vaccine efficacies (VE). Some candidate HIV vaccines have shown protective effects in preclinical studies using the macaque SIV/SHIV model but the model’s true predictive value for screening potential HIV vaccine candidates needs to be evaluated further. Here, we review key parameters used in the RLD approach and discuss their relevance for evaluating VE to improve preclinical studies of candidate HIV vaccines.Crown Copyright © 2019. Published by Elsevier Ltd. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.