Validation Archives - Page 9 of 29

September 22, 2019

Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content.

Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high- or low-caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long-read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway-based analysis, 65 caffeine-associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee.© 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

September 22, 2019

Multi-platform sequencing approach reveals a novel transcriptome profile in pseudorabies virus.

Third-generation sequencing is an emerging technology that is capable of solving several problems that earlier approaches were not able to, including the identification of transcripts isoforms and overlapping transcripts. In this study, we used long-read sequencing for the analysis of pseudorabies virus (PRV) transcriptome, including Oxford Nanopore Technologies MinION, PacBio RS-II, and Illumina HiScanSQ platforms. We also used data from our previous short-read and long-read sequencing studies for the comparison of the results and in order to confirm the obtained data. Our investigations identified 19 formerly unknown putative protein-coding genes, all of which are 5′ truncated forms of earlier annotated longer PRV genes. Additionally, we detected 19 non-coding RNAs, including 5′ and 3′ truncated transcripts without in-frame ORFs, antisense RNAs, as well as RNA molecules encoded by those parts of the viral genome where no transcription had been detected before. This study has also led to the identification of three complex transcripts and 50 distinct length isoforms, including transcription start and end variants. We also detected 121 novel transcript overlaps, and two transcripts that overlap the replication origins of PRV. Furthermore,in silicoanalysis revealed 145 upstream ORFs, many of which are located on the longer 5′ isoforms of the transcripts.

September 22, 2019

Full-length transcriptome sequences of ephemeral plant Arabidopsis pumila provides insight into gene expression dynamics during continuous salt stress.

Arabidopsis pumila is native to the desert region of northwest China and it is extraordinarily well adapted to the local semi-desert saline soil, thus providing a candidate plant system for environmental adaptation and salt-tolerance gene mining. However, understanding of the salt-adaptation mechanism of this species is limited because of genomic sequences scarcity. In the present study, the transcriptome profiles of A. pumila leaf tissues treated with 250 mM NaCl for 0, 0.5, 3, 6, 12, 24 and 48 h were analyzed using a combination of second-generation sequencing (SGS) and third-generation single-molecule real-time (SMRT) sequencing.Correction of SMRT long reads by SGS short reads resulted in 59,328 transcripts. We found 8075 differentially expressed genes (DEGs) between salt-stressed tissues and controls, of which 483 were transcription factors and 1157 were transport proteins. Most DEGs were activated within 6 h of salt stress and their expression stabilized after 48 h; the number of DEGs was greatest within 12 h of salt stress. Gene annotation and functional analyses revealed that expression of genes associated with the osmotic and ionic phases rapidly and coordinately changed during the continuous salt stress in this species, and salt stress-related categories were highly enriched among these DEGs, including oxidation-reduction, transmembrane transport, transcription factor activity and ion channel activity. Orphan, MYB, HB, bHLH, C3H, PHD, bZIP, ARF and NAC TFs were most enriched in DEGs; ABCB1, CLC-A, CPK30, KEA2, KUP9, NHX1, SOS1, VHA-A and VP1 TPs were extensively up-regulated in salt-stressed samples, suggesting that they play important roles in slat tolerance. Importantly, further experimental studies identified a mitogen-activated protein kinase (MAPK) gene MAPKKK18 as continuously up-regulated throughout salt stress, suggesting its crucial role in salt tolerance. The expression patterns of the salt-responsive 24 genes resulted from quantitative real-time PCR were basically consistent with their transcript abundance changes identified by RNA-Seq.The full-length transcripts generated in this study provide a more accurate depiction of gene transcription of A. pumila. We identified potential genes involved in salt tolerance of A. pumila. These data present a genetic resource and facilitate better understanding of salt-adaptation mechanism for ephemeral plants.

September 22, 2019

Genomic insights into the acid adaptation of novel methanotrophs enriched from acidic forest soils.

Soil acidification is accelerated by anthropogenic and agricultural activities, which could significantly affect global methane cycles. However, detailed knowledge of the genomic properties of methanotrophs adapted to acidic soils remains scarce. Using metagenomic approaches, we analyzed methane-utilizing communities enriched from acidic forest soils with pH 3 and 4, and recovered near-complete genomes of proteobacterial methanotrophs. Novel methanotroph genomes designated KS32 and KS41, belonging to two representative clades of methanotrophs (Methylocystis of Alphaproteobacteria and Methylobacter of Gammaproteobacteria), were dominant. Comparative genomic analysis revealed diverse systems of membrane transporters for ensuring pH homeostasis and defense against toxic chemicals. Various potassium transporter systems, sodium/proton antiporters, and two copies of proton-translocating F1F0-type ATP synthase genes were identified, which might participate in the key pH homeostasis mechanisms in KS32. In addition, the V-type ATP synthase and urea assimilation genes might be used for pH homeostasis in KS41. Genes involved in the modification of membranes by incorporation of cyclopropane fatty acids and hopanoid lipids might be used for reducing proton influx into cells. The two methanotroph genomes possess genes for elaborate heavy metal efflux pumping systems, possibly owing to increased heavy metal toxicity in acidic conditions. Phylogenies of key genes involved in acid adaptation, methane oxidation, and antiviral defense in KS41 were incongruent with that of 16S rRNA. Thus, the detailed analysis of the genome sequences provides new insights into the ecology of methanotrophs responding to soil acidification.

September 22, 2019

Effects of low crude oil chronic exposure on the northern krill (Meganyctiphanes norvegica)

Chronic oil pollution related to gas and oil drilling activities is increasing in the sea due to the rising offshore petroleum industry activity. Among marine organisms, zooplankton play a crucial role in the marine ecosystem and therefore understanding the effects of crude oil chronic exposure on zooplankton is needed to determine the impact of oil in marine environments. The present study reports on the effect of crude oil on adult northern krill, Meganyctiphanes norvegica, collected during three seasons. Their sensitivity to oil was examined with oil concentration of 0.01 versus 0.1 mg oil L- 1 and photo-modified oil in flowing seawater maintained in the dark for 2 weeks at in situ temperature. Oil (polycyclic aromatic hydrocarbons, PAHs) entered the krill (on average, 350 and 4400 µg·kg- 1 wet weight in low and medium oil treatments respectively) and a larger fraction of the krill exhibited digestive gland pathologies (enhanced apoptosis and pathology of digestive tubules) in oil treatments (27–80%) compared to a significantly lower fraction (7–13%) in treatments that received no oil. However, 2-week oil exposure at these concentrations did not significantly decrease survivorship or impair basic functioning such as feeding and respiration rates. Similarly, there were only limited changes in the transcription of 7 selected genes from head tissue. Additionally, although there was significant seasonal variation in krill total lipid content and fatty acid composition, there was no treatment effect on both these parameters, which suggests limited oxidative stress under experimental conditions. Furthermore, there was no significant treatment effect on two direct measures of oxidative stress (MDA: malondialdehyde and AOPP: advanced oxidation protein products) in any of the seasons. Nevertheless, histology clearly revealed enhanced digestive gland pathologies in krill even at low concentrations. Although krill with such pathologies continue to survive, their accumulation of PAHs may be transferred up the food chain, impacting their predators and the wider ecosystem.

September 22, 2019

Accurate determination of bacterial abundances in human metagenomes using full-length 16S sequencing reads

DNA sequencing of PCR-amplified marker genes, especially but not limited to the 16S rRNA gene, is perhaps the most common approach for profiling microbial communities. Due to technological constraints of commonly available DNA sequencing, these approaches usually take the form of short reads sequenced from a narrow, targeted variable region, with a corresponding loss of taxonomic resolution relative to the full length marker gene. We use Pacific Biosciences single-molecule, real-time circular consensus sequencing to sequence amplicons spanning the entire length of the 16S rRNA gene. However, this sequencing technology suffers from high sequencing error rate that needs to be addressed in order to take full advantage of the longer sequence. Here, we present a method to model the sequencing error process using a generalized pair hidden Markov chain model and estimate bacterial abundances in microbial samples. We demonstrate, with simulated and real data, that our model and its associated estimation procedure are able to give accurate estimates at the species (or subspecies) level, and is more flexible than existing methods like SImple Non-Bayesian TAXonomy (SINTAX).

September 22, 2019

Full-length isoform sequencing yields a more comprehensive view of gene activity

RNA-seq has revolutionised how scientists can interrogate gene expression. But after years of performing RNA-seq studies with short-read sequencers, many have realised that there is more to be discovered.

September 22, 2019

Comparative transcriptome analysis of genes involved in Na+ transport in the leaves of halophyte Halogeton glomeratus.

Compartmentalization of Na+ into vacuoles is considered to be the most critical aspect of salt tolerance in H. glomeratus, an annual, succulent halophyte. Previous analysis of transcriptome involved in the H. glomeratus salt stress response relied on next-generation sequencing technologies that limit the capture of accurately spliced, full-length isoforms. To gain deeper insights into its salt stress response, we used the H. glomeratus Iso-Seq transcriptome database as a reference, and subsequent next-generation sequencing was subjected to various NaCl concentrations of leaves from plants revealed 115 upregulated and 87 downregulated differentially expressed isoforms (core DEIs). The majority of the core DEIs were involved in carbohydrate metabolism and energy production and conversion. In contrast, levels of known isoforms encoding Na+ transporters did not change significantly under salt stress. However, 16 core DEIs of unknown function were predicted to possess transmembrane domains, suggesting that these candidate isoforms could be involved in Na+ transport in H. glomeratus. These results suggest a potential means for identification of novel Na+ transporters, in addition to providing a foundation for further investigation of Na+ transport networks in halophytes. Copyright © 2018. Published by Elsevier B.V.

September 22, 2019

The state of play in higher eukaryote gene annotation.

A genome sequence is worthless if it cannot be deciphered; therefore, efforts to describe – or ‘annotate’ – genes began as soon as DNA sequences became available. Whereas early work focused on individual protein-coding genes, the modern genomic ocean is a complex maelstrom of alternative splicing, non-coding transcription and pseudogenes. Scientists – from clinicians to evolutionary biologists – need to navigate these waters, and this has led to the design of high-throughput, computationally driven annotation projects. The catalogues that are being produced are key resources for genome exploration, especially as they become integrated with expression, epigenomic and variation data sets. Their creation, however, remains challenging.

September 22, 2019

Targeted combinatorial alternative splicing generates brain region-specific repertoires of neurexins.

Molecular diversity of surface receptors has been hypothesized to provide a mechanism for selective synaptic connectivity. Neurexins are highly diversified receptors that drive the morphological and functional differentiation of synapses. Using a single cDNA sequencing approach, we detected 1,364 unique neurexin-a and 37 neurexin-ß mRNAs produced by alternative splicing of neurexin pre-mRNAs. This molecular diversity results from near-exhaustive combinatorial use of alternative splice insertions in Nrxn1a and Nrxn2a. By contrast, Nrxn3a exhibits several highly stereotyped exon selections that incorporate novel elements for posttranscriptional regulation of a subset of transcripts. Complexity of Nrxn1a repertoires correlates with the cellular complexity of neuronal tissues, and a specific subset of isoforms is enriched in a purified cell type. Our analysis defines the molecular diversity of a critical synaptic receptor and provides evidence that neurexin diversity is linked to cellular diversity in the nervous system. Copyright © 2014 Elsevier Inc. All rights reserved.

September 22, 2019

Bacteroides dorei dominates gut microbiome prior to autoimmunity in Finnish children at high risk for type 1 diabetes.

The incidence of the autoimmune disease, type 1 diabetes (T1D), has increased dramatically over the last half century in many developed countries and is particularly high in Finland and other Nordic countries. Along with genetic predisposition, environmental factors are thought to play a critical role in this increase. As with other autoimmune diseases, the gut microbiome is thought to play a potential role in controlling progression to T1D in children with high genetic risk, but we know little about how the gut microbiome develops in children with high genetic risk for T1D. In this study, the early development of the gut microbiomes of 76 children at high genetic risk for T1D was determined using high-throughput 16S rRNA gene sequencing. Stool samples from children born in the same hospital in Turku, Finland were collected at monthly intervals beginning at 4-6 months after birth until 2.2 years of age. Of those 76 children, 29 seroconverted to T1D-related autoimmunity (cases) including 22 who later developed T1D, the remaining 47 subjects remained healthy (controls). While several significant compositional differences in low abundant species prior to seroconversion were found, one highly abundant group composed of two closely related species, Bacteroides dorei and Bacteroides vulgatus, was significantly higher in cases compared to controls prior to seroconversion. Metagenomic sequencing of samples high in the abundance of the B. dorei/vulgatus group before seroconversion, as well as longer 16S rRNA sequencing identified this group as Bacteroides dorei. The abundance of B. dorei peaked at 7.6 months in cases, over 8 months prior to the appearance of the first islet autoantibody, suggesting that early changes in the microbiome may be useful for predicting T1D autoimmunity in genetically susceptible infants. The cause of increased B. dorei abundance in cases is not known but its timing appears to coincide with the introduction of solid food.

September 22, 2019

Molecular mechanisms of acclimatization to phosphorus starvation and recovery underlying full-length transcriptome profiling in barley (Hordeum vulgare L.).

A lack of phosphorus (P) in plants can severely constrain growth and development. Barley, one of the earliest domesticated crops, is extensively planted in poor soil around the world. To date, the molecular mechanisms of enduring low phosphorus, at the transcriptional level, in barley are still unclear. In the present study, two different barley genotypes (GN121 and GN42)-with contrasting phosphorus efficiency-were used to reveal adaptations to low phosphorus stress, at three time points, at the morphological, physiological, biochemical, and transcriptome level. GN121 growth was less affected by phosphorus starvation and recovery than that of GN42. The biomass and inorganic phosphorus concentration of GN121 and GN42 declined under the low phosphorus-induced stress and increased after recovery with normal phosphorus. However, the range of these parameters was higher in GN42 than in GN121. Subsequently, a more complete genome annotation was obtained by correcting with the data sequenced on Illumina HiSeq X 10 and PacBio RSII SMRT platform. A total of 6,182 and 5,270 differentially expressed genes (DEGs) were identified in GN121 and GN42, respectively. The majority of these DEGs were involved in phosphorus metabolism such as phospholipid degradation, hydrolysis of phosphoric enzymes, sucrose synthesis, phosphorylation/dephosphorylation and post-transcriptional regulation; expression of these genes was significantly different between GN121 and GN42. Specifically, six and seven DEGs were annotated as phosphorus transporters in roots and leaves, respectively. Furthermore, a putative model was constructed relying on key metabolic pathways related to phosphorus to illustrate the higher phosphorus efficiency of GN121 compared to GN42 under low phosphorus conditions. Results from this study provide a multi-transcriptome database and candidate genes for further study on phosphorus use efficiency (PUE).

September 22, 2019

Somatic APP gene recombination in Alzheimer’s disease and normal neurons.

The diversity and complexity of the human brain are widely assumed to be encoded within a constant genome. Somatic gene recombination, which changes germline DNA sequences to increase molecular diversity, could theoretically alter this code but has not been documented in the brain, to our knowledge. Here we describe recombination of the Alzheimer’s disease-related gene APP, which encodes amyloid precursor protein, in human neurons, occurring mosaically as thousands of variant ‘genomic cDNAs’ (gencDNAs). gencDNAs lacked introns and ranged from full-length cDNA copies of expressed, brain-specific RNA splice variants to myriad smaller forms that contained intra-exonic junctions, insertions, deletions, and/or single nucleotide variations. DNA in situ hybridization identified gencDNAs within single neurons that were distinct from wild-type loci and absent from non-neuronal cells. Mechanistic studies supported neuronal ‘retro-insertion’ of RNA to produce gencDNAs; this process involved transcription, DNA breaks, reverse transcriptase activity, and age. Neurons from individuals with sporadic Alzheimer’s disease showed increased gencDNA diversity, including eleven mutations known to be associated with familial Alzheimer’s disease that were absent from healthy neurons. Neuronal gene recombination may allow ‘recording’ of neural activity for selective ‘playback’ of preferred gene variants whose expression bypasses splicing; this has implications for cellular diversity, learning and memory, plasticity, and diseases of the human brain.

September 22, 2019

Resistance to ceftazidime-avibactam in Klebsiella pneumoniae due to porin mutations and the increased expression of KPC-3.

We reported the first clinical case of a ceftazidime-avibactam resistant KPC-3-producing Klebsiella pneumoniae (1), from a patient with no history of ceftazidime-avibactam therapy. We now present data documenting mechanisms of ceftazidime-avibactam resistance in this isolate. Whole-genome sequencing (WGS) was performed on two isolates: KP1245 (ceftazidime-avibactam MIC, 4 µg/ml; from blood on hospital day 1; referred to as isolate 1 in our previous report [1]) and KP1244 (ceftazidime-avibactam MIC, 32 µg/ml; from blood on hospital day 2; referred to as isolate 2 in our previous report [2]), using MiSeq (Illumina, San Diego, CA) and PacBio RSII (Menlo Park, CA) systems (2). The in silico multilocus sequence type (ST) was ST258. Single nucleotide polymorphism (SNP) analysis revealed 17 SNPs between KP1245 and KP1244, indicating that the isolates were related but that significant diversity existed in this patient (2). Nonsynonymous mutations are shown in Table 1; the most striking of these is in the OmpK36 porin gene. KP1244 contained a missense mutation predicted to encode a T333N mutation. Both isolates also harbored a mutation predicted to encode R191L in OmpK36 and had a nonfunctional OmpK35, due to a frameshift mutation that truncated the protein at amino acid 42, common to K. pneumoniae ST258 (3). Association between mutations in ompK36 and elevated ceftazidime-avibactam MICs has been shown previously (4). However, T333N, found in one of the ß-sheet domains of the OmpK36 subunit, has not been described in K. pneumoniae; as such, further validation is required to confirm the role of the OmpK36 mutation in this isolate’s ceftazidime-avibactam resistance phenotype.

September 22, 2019

Active microorganisms in forest soils differ from the total community yet are shaped by the same environmental factors: the influence of pH and soil moisture.

Predicting the impact of environmental change on soil microbial functions requires an understanding of how environmental factors shape microbial composition. Here, we investigated the influence of environmental factors on bacterial and fungal communities across an expanse of northern hardwood forest in Michigan, USA, which spans a 500-km regional climate gradient. We quantified soil microbial community composition using high-throughput DNA sequencing on coextracted rDNA (i.e. total community) and rRNA (i.e. active community). Within both bacteria and fungi, total and active communities were compositionally distinct from one another across the regional gradient (bacteria P = 0.01; fungi P < 0.01). Taxonomically, the active community was a subset of the total community. Compositional differences between total and active communities reflected changes in the relative abundance of dominant taxa. The composition of both the total and active microbial communities varied by site across the gradient (P < 0.01) and was shaped by differences in soil moisture, pH, SOM carboxyl content, as well as C and N concentration. Our study highlights the importance of distinguishing between metabolically active microorganisms and the total community, and emphasizes that the same environmental factors shape the total and active communities of bacteria and fungi in this ecosystem.© FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auto Tag: Validation

Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content.

Multi-platform sequencing approach reveals a novel transcriptome profile in pseudorabies virus.

Full-length transcriptome sequences of ephemeral plant Arabidopsis pumila provides insight into gene expression dynamics during continuous salt stress.

Genomic insights into the acid adaptation of novel methanotrophs enriched from acidic forest soils.

Effects of low crude oil chronic exposure on the northern krill (Meganyctiphanes norvegica)

Accurate determination of bacterial abundances in human metagenomes using full-length 16S sequencing reads

Full-length isoform sequencing yields a more comprehensive view of gene activity

Comparative transcriptome analysis of genes involved in Na+ transport in the leaves of halophyte Halogeton glomeratus.

The state of play in higher eukaryote gene annotation.

Targeted combinatorial alternative splicing generates brain region-specific repertoires of neurexins.

Bacteroides dorei dominates gut microbiome prior to autoimmunity in Finnish children at high risk for type 1 diabetes.

Molecular mechanisms of acclimatization to phosphorus starvation and recovery underlying full-length transcriptome profiling in barley (Hordeum vulgare L.).

Somatic APP gene recombination in Alzheimer’s disease and normal neurons.

Resistance to ceftazidime-avibactam in Klebsiella pneumoniae due to porin mutations and the increased expression of KPC-3.

Active microorganisms in forest soils differ from the total community yet are shaped by the same environmental factors: the influence of pH and soil moisture.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert