April 21, 2020

The Impact of cDNA Normalization on Long-Read Sequencing of a Complex Transcriptome

Normalization of cDNA is widely used to improve the coverage of rare transcripts in next-generation sequencing analyses of transcriptomes. Recently, long-read technology has emerged as a powerful tool for sequencing and constructing transcriptomes, especially for complex genomes containing highly similar transcripts and spliced transcript isoforms. Here, we analyzed the transcriptome of sugarcane, a plant with a highly polyploid genome, by PacBio isoform sequencing (Iso-Seq) of two different cDNA library preparations, with and without a normalization step. The results demonstrated that, while the two libraries included many of the same transcripts, normalization removed many longer transcripts and detected many new, generally shorter transcripts. For the same input cDNA and the same data yield, the normalized library recovered more total transcript isoforms, predicted gene families, and orthologous groups, resulting in a higher representation of the sugarcane transcriptome compared to the non-normalized library. The non-normalized library, on the other hand, included a wider transcript length range with more long transcripts above ~1.25 kb, more transcript isoforms per gene family, and more gene ontology terms per transcript. A large proportion of the unique transcripts, which comprised ~52% of the normalized library, were expressed at a lower level than the unique transcripts from the non-normalized library across the three tissue types tested (leaf, stalk, and root). About 83% of the total 5,348 predicted long noncoding transcripts were derived from the normalized library, of which ~80% were derived from the lowly expressed fraction. Functional annotation of the unique transcripts suggested that each library enriched different functional transcript fractions. This demonstrated the complementarity of the two approaches in obtaining a complete transcriptome of a complex genome at the sequencing depth used in this study.


April 21, 2020

Complete genome sequence analysis of the thermoacidophilic verrucomicrobial methanotroph “Candidatus Methylacidiphilum kamchatkense” strain Kam1 and comparison with its closest relatives.

The candidate genus “Methylacidiphilum” comprises thermoacidophilic aerobic methane oxidizers belonging to the Verrucomicrobia phylum. These are the first described non-proteobacterial aerobic methane oxidizers. The genes pmoCAB, encoding the particulate methane monooxygenase, do not originate from horizontal gene transfer from proteobacteria. Instead, “Ca. Methylacidiphilum” and the sister genus “Ca. Methylacidimicrobium” represent a novel and hitherto understudied evolutionary lineage of aerobic methane oxidizers. Obtaining and comparing the full genome sequences is an important step towards understanding the evolution and physiology of this novel group of organisms. Here we present the closed genome of “Ca. Methylacidiphilum kamchatkense” strain Kam1 and a comparison with the genomes of its two closest relatives, “Ca. Methylacidiphilum fumariolicum” strain SolV and “Ca. Methylacidiphilum infernorum” strain V4. The genome consists of a single 2.2 Mbp chromosome with 2119 predicted protein coding sequences. Genome analysis showed that the majority of the genes connected with metabolic traits described for one member of “Ca. Methylacidiphilum” are conserved between all three genomes. All three strains encode class I CRISPR-cas systems. The average nucleotide identity between “Ca. M. kamchatkense” strain Kam1 and strains SolV and V4 is ≤95%, showing that they should be regarded as separate species. Whole genome comparison revealed a high degree of synteny between the genomes of strains Kam1 and SolV. In contrast, comparison of the genomes of strains Kam1 and V4 revealed a number of rearrangements. There are large differences in the numbers of transposable elements found in the genomes of the three strains, with 12, 37 and 80 transposable elements in the genomes of strains Kam1, V4 and SolV, respectively. Genomic rearrangements and the activity of transposable elements explain much of the genomic differences between strains. For example, a type 1h uptake hydrogenase is conserved between strains Kam1 and SolV but seems to have been lost from strain V4 due to genomic rearrangements. Comparing three closed genomes of “Ca. Methylacidiphilum” spp. has given new insights into the evolution of these organisms and revealed large differences in numbers of transposable elements between strains; the activity of these elements explains much of the genomic differences between strains.


April 21, 2020

Denitrifying Bacteria Active in Woodchip Bioreactors at Low-Temperature Conditions.

Woodchip bioreactor technology removes nitrate from agricultural subsurface drainage by using denitrifying microorganisms. Although woodchip bioreactors have demonstrated success in many field locations, low water temperature can significantly limit bioreactor efficiency and performance. To improve bioreactor performance, it is important to identify the microbes responsible for nitrate removal at low-temperature conditions. Therefore, in this study, we identified and characterized denitrifiers active at low-temperature conditions by using culture-independent and -dependent approaches. Using comparative 16S rRNA (gene) analysis and culture isolation techniques, Pseudomonas spp., Polaromonas spp., and Cellulomonas spp. were identified as important bacteria responsible for denitrification in woodchip bioreactor microcosms at relatively low-temperature conditions (15°C). Genome analysis of Cellulomonas sp. strain WB94 confirmed the presence of the nitrite reductase gene nirK. Transcription levels of this nirK were significantly higher in the denitrifying microcosms than in the non-denitrifying microcosms. Strain WB94 was also capable of degrading cellulose and other complex polysaccharides. Taken together, our results suggest that Cellulomonas sp. denitrifiers could degrade woodchips to provide carbon sources and electron donors to themselves and other denitrifiers in woodchip bioreactors at low-temperature conditions. By inoculating these denitrifiers (i.e., bioaugmentation), it might be possible to increase the nitrate removal rate of woodchip bioreactors at low-temperature conditions.


April 21, 2020

Single-molecule sequencing detection of N6-methyladenine in microbial reference materials.

The DNA base modification N6-methyladenine (m6A) is involved in many pathways related to the survival of bacteria and their interactions with hosts. Nanopore sequencing offers a new, portable method to detect base modifications. Here, we show that a neural network can improve m6A detection at trained sequence contexts compared to previously published methods using deviations between measured and expected current values as each adenine travels through a pore. The model, implemented as the mCaller software package, can be extended to detect known or confirm suspected methyltransferase target motifs based on predictions of methylation at untrained contexts. We use PacBio, Oxford Nanopore, methylated DNA immunoprecipitation sequencing (MeDIP-seq), and whole-genome bisulfite sequencing data to generate and orthogonally validate methylomes for eight microbial reference species. These well-characterized microbial references can serve as controls in the development and evaluation of future methods for the identification of base modifications from single-molecule sequencing data.


April 21, 2020

A Phage-Like Plasmid Carrying blaKPC-2 Gene in Carbapenem-Resistant Pseudomonas aeruginosa.

Background: Lateral gene transfer plays a central role in the dissemination of carbapenem resistance in bacterial pathogens associated with nosocomial infections, mainly Enterobacteriaceae and Pseudomonas aeruginosa. Despite their clinical significance, the mobile genetic elements and the mechanisms of acquisition and propagation of laterally transferred genes in P. aeruginosa remain largely unknown. Objectives: The present study characterized the genetic context of blaKPC-2 in carbapenem-resistant P. aeruginosa strain BH9. Methods: Pseudomonas aeruginosa BH9 sequencing was performed using the long-read PacBio SMRT platform and the Ion Proton System. De novo assembly was carried out using the SMRT pipeline and Canu, and gene prediction and annotation were performed using Prokka and RAST. Results: Pseudomonas aeruginosa BH9 exhibited a 7.1 Mb circular chromosome. However, the blaKPC-2 gene is located in an additional contig composed of a small plasmid, pBH6 from P. aeruginosa strain BH6, and several phage-related genes. Further analysis revealed that the beginning and end of the contig contain identical sequences, supporting a circular plasmid structure. This structure spans 41,087 bp and exhibits all the Mu-like phage landmarks. In addition, 5-bp direct repeats (GGATG) flanking the pBH6 ends were found, strongly indicating integration of the Mu-like phage into the pBH6 plasmid. Mu phages are commonly found in P. aeruginosa. However, this is the first report showing their potential impact in shaping the vehicles of dissemination (e.g., plasmid pBH6) of antimicrobial resistance genes in the Pseudomonas genus. Conclusion: pBH6 captured the Mu-like Phage BH9, creating a co-integrate pBH6::Phage BH9, and this phage-plasmid complex may represent a novel case of a phage-like plasmid.
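
The circularity evidence mentioned in the abstract above (identical sequence at the beginning and end of a contig) can be illustrated with a minimal sketch. This is not the authors' pipeline; the function names, the `min_len` threshold, and the toy sequence are hypothetical and chosen only for illustration.

```python
def terminal_overlap(seq, min_len=20):
    """Return the length of the longest prefix of seq that is also its suffix.

    Only overlaps of at least min_len bases (and at most half the contig)
    are considered; 0 means no such terminal repeat was found.
    """
    max_len = len(seq) // 2
    for k in range(max_len, min_len - 1, -1):
        if seq[:k] == seq[-k:]:
            return k
    return 0


def circularize(seq, min_len=20):
    """Trim the duplicated end so the circular molecule is represented once."""
    k = terminal_overlap(seq, min_len)
    return seq[:-k] if k else seq


# Toy contig (hypothetical): an 8-base unit repeated at both ends.
unit = "ACGTTGCA"
toy_contig = unit + "GGGCCCTTTAAA" + unit
print(terminal_overlap(toy_contig, min_len=4))
print(len(circularize(toy_contig, min_len=4)))
```

Exact string matching is a simplification: real assemblies carry sequencing errors, so dedicated tools usually align the contig ends rather than compare them character by character.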


April 21, 2020

Systematic identification of intergenic long-noncoding RNAs in mouse retinas using full-length isoform sequencing.

A large number of long noncoding RNAs (lncRNAs) have been identified in the mouse genome, and increasing evidence in recent decades has revealed their crucial roles in diverse biological processes. Nevertheless, the biological roles of lncRNAs in the mouse retina remain largely unknown due to the lack of a comprehensive annotation of lncRNAs expressed in the retina. In this study, we applied a long-read sequencing strategy to unravel the transcriptomes of developing mouse retinas and identified a total of 940 intergenic lncRNAs (lincRNAs) in embryonic and neonatal retinas, about 13% of which were transcribed from unannotated gene loci. Subsequent analysis revealed that the functions of lincRNAs expressed in mouse retinas were closely related to the physiological roles of this tissue, including 90 lincRNAs that were differentially expressed after the functional loss of key regulators of retinal ganglion cell (RGC) differentiation. In situ hybridization results demonstrated the enrichment of three lincRNAs adjacent to class IV POU-homeobox genes (linc-3a, linc-3b and linc-3c) in the ganglion cell layer and indicated that they were potentially RGC-specific. In summary, this study systematically annotated the lincRNAs expressed in embryonic and neonatal mouse retinas and implied their crucial regulatory roles in retinal development, such as RGC differentiation.


April 21, 2020

Closing the Yield Gap for Cannabis: A Meta-Analysis of Factors Determining Cannabis Yield.

Until recently, the commercial production of Cannabis sativa was restricted to varieties that yielded high-quality fiber while producing low levels of the psychoactive cannabinoid tetrahydrocannabinol (THC). In the last few years, a number of jurisdictions have legalized the production of medical and/or recreational cannabis with higher levels of THC, and other jurisdictions seem poised to follow suit. Consequently, demand for industrial-scale production of high yield cannabis with consistent cannabinoid profiles is expected to increase. In this paper we highlight that currently, projected annual production of cannabis is based largely on facility size, not yield per square meter. This meta-analysis of cannabis yields reported in scientific literature aimed to identify the main factors contributing to cannabis yield per plant, per square meter, and per W of lighting electricity. In line with previous research we found that variety, plant density, light intensity and fertilization influence cannabis yield and cannabinoid content; we also identified pot size, light type and duration of the flowering period as predictors of yield and THC accumulation. We provide insight into the critical role of light intensity, quality, and photoperiod in determining cannabis yields, with particular focus on the potential for light-emitting diodes (LEDs) to improve growth and reduce energy requirements. We propose that the vast amount of genomics data currently available for cannabis can be used to better understand the effect of genotype on yield. Finally, we describe diversification that is likely to emerge in cannabis growing systems and examine the potential role of plant-growth promoting rhizobacteria (PGPR) for growth promotion, regulation of cannabinoid biosynthesis, and biocontrol.


April 21, 2020

Directed Repeats Co-occur with Few Short-Dispersed Repeats in Plastid Genome of a Spikemoss, Selaginella vardei (Selaginellaceae, Lycopodiopsida).

It is hypothesized that the highly conserved inverted repeats (IR) structure of land plant plastid genomes (plastomes) is beneficial for stabilizing plastome organization, whereas the mechanism underlying the occurrence and stability of the recently reported direct repeats (DR) structure awaits further exploration. Here we describe the DR structure of the Selaginella vardei (Selaginellaceae) plastome to elucidate the mechanism of DR occurrence and stability maintenance. The plastome of S. vardei is 121,254 bp in length and encodes 76 genes, of which 62 encode proteins, 10 encode tRNAs, and four encode rRNAs. Unexpectedly, the two identical rRNA gene regions (13,893 bp) are arranged in a direct orientation (DR), rather than inverted. Compared with the IR organization in the Isoetes flaccida (Isoetaceae, Lycopodiopsida) plastome, a ca. 50-kb trnN-trnF inversion that spans one DR copy was found in the plastome of S. vardei, which might have caused the orientation change. In addition, we find that short dispersed repeats (SDRs) are extremely rare in the plastomes of S. vardei and its closely related species S. indica. We suggest that the ca. 50-kb inversion resulted in the DR structure, and that the reduction in SDRs plays a key role in maintaining the stability of plastomes with DR structure by avoiding potential secondary recombination. We further confirmed the presence of homologous recombination between DR regions, which is able to generate subgenomes and form diverse multimers. Our study deepens the understanding of Selaginella plastomes and provides new insights into the diverse plastome structures in land plants.


April 21, 2020

Identification of Diverse Integron and Plasmid Structures Carrying a Novel Carbapenemase Among Pseudomonas Species.

A novel carbapenem-hydrolyzing beta-lactamase, called IMP-63, was identified in three clonally distinct strains of Pseudomonas aeruginosa and two strains of Pseudomonas putida isolated within a 4 year timeframe in three French hospitals. The blaIMP-63 gene that encodes this carbapenemase turned out to be located in the variable region of four integrons (In1297, In1574, In1573, and In1572) and to coexist with novel or rare gene cassettes (fosM, gcu170, gcuF1) and insertion elements (ISPsp7v, ISPa16v). All these integrons except one (In1574) were flanked by a copy of insertion sequence ISPa17 next to the orf6 putative gene, and were carried by non-conjugative plasmids (pNECK1, pROUSS1, pROUSS2, pROUE1). These plasmids exhibit unique modular structures and partial sequence homologies with plasmids previously identified in various non-fermenting environmental Gram-negative species. Lines of evidence suggest that ISPa17 promoted en bloc the transposition of IMP-63-encoding integrons on these different plasmids. As demonstrated by genotyping experiments, isolates of P. aeruginosa harboring the 28.9-kb plasmid pNECK1 and belonging to international “high-risk” clone ST308 were responsible for an outbreak in one hospital. Collectively, these data provide an insight into the complex and unpredictable routes of diffusion of some resistance determinants, here blaIMP-63, among Pseudomonas species.


April 21, 2020

A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds.

The ability to generate long sequencing reads and access long-range linkage information is revolutionizing the quality and completeness of genome assemblies. Here we use a hybrid approach that combines data from four genome sequencing and mapping technologies to generate a new genome assembly of the honeybee Apis mellifera. We first generated contigs based on PacBio sequencing libraries, which were then merged with linked-read 10x Chromium data followed by scaffolding using a BioNano optical genome map and a Hi-C chromatin interaction map, complemented by a genetic linkage map. Each of the assembly steps reduced the number of gaps and incorporated a substantial amount of additional sequence into scaffolds. The new assembly (Amel_HAv3) is significantly more contiguous and complete than the previous one (Amel_4.5), based mainly on Sanger sequencing reads. N50 of contigs is 120-fold higher (5.381 Mbp compared to 0.053 Mbp) and we anchor >98% of the sequence to chromosomes. All of the 16 chromosomes are represented as single scaffolds with an average of three sequence gaps per chromosome. The improvements are largely due to the inclusion of repetitive sequence that was unplaced in previous assemblies. In particular, our assembly is highly contiguous across centromeres and telomeres and includes hundreds of AvaI and AluI repeats associated with these features. The improved assembly will be of utility for refining gene models, studying genome function, mapping functional genetic variation, identification of structural variants, and comparative genomics.
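
For readers unfamiliar with the contig N50 statistic quoted in the abstract above, the following is a minimal sketch of the usual definition: the length of the contig at which the running total of contig lengths, sorted from longest to shortest, first reaches half of the total assembly length. The function name and the toy lengths are hypothetical and are not taken from the honeybee assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are sorted from longest to shortest; N50 is the length of the
    contig at which the cumulative sum first reaches half the total length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy contig lengths (hypothetical, arbitrary units).
toy_contigs = [80, 70, 50, 40, 30, 20]
print(n50(toy_contigs))
```

Because N50 depends only on the length distribution, it measures contiguity and says nothing about whether the contigs are assembled correctly.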


April 21, 2020

Genome of lethal Lepiota venenata and insights into the evolution of toxin-biosynthetic genes.

Genomes of lethal Amanita and Galerina mushrooms have gradually become available in the past ten years; in contrast, no genome has been available for the other known amanitin-producing genus, Lepiota. A fatal mushroom poisoning case in China led to the acquisition of fresh L. venenata fruiting bodies, from which a draft genome was obtained through PacBio and Illumina sequencing platforms. Toxin-biosynthetic MSDIN family and prolyl oligopeptidase B (POPB) genes were mined from the genome and used for phylogenetic and statistical studies to gain insights into the evolution of the biosynthetic pathway. The analysis of the genome data illustrated that only one MSDIN, named LvAMA1, exists in the genome, along with a POPB gene. No POPA homolog was identified by direct homology searching; however, one additional POP gene, named LvPOPC, was cloned and its gene structure determined. Similar to ApAMA1 in A. phalloides and GmAMA1 in G. marginata, LvAMA1 directly encodes α-amanitin. The two toxin genes were mapped to the draft genome, and their structures analyzed. Furthermore, phylogenetic and statistical analyses were conducted to study the evolutionary history of the POPB genes. Compared to our previous report, the phylogenetic trees unambiguously showed that a monophyletic POPB lineage clearly conflicted with the species phylogeny. In contrast, the phylogeny of POPA genes resembled the species phylogeny. Topology and divergence tests showed that the POPB lineage was robust and that these genes exhibited significantly shorter genetic distances than those of the house-keeping rpb2, a characteristic feature of genes with a horizontal gene transfer (HGT) background. Consistently, the same scenario applied to the only MSDIN, LvAMA1, in the genome. To the best of our knowledge, this is the first reported genome of Lepiota. The analyses of the toxin genes indicate that the cyclic peptides are synthesized through a ribosomal mechanism. The toxin genes, LvAMA1 and LvPOPB, are not in the vicinity of each other. Phylogenetic and evolutionary studies suggest that HGT is the underlying cause for the occurrence of POPB and MSDIN in Amanita, Galerina and Lepiota, which are placed in three distantly related families.


April 21, 2020

Comparative genomics reveals structural and functional features specific to the genome of a foodborne Escherichia coli O157:H7.

Escherichia coli O157:H7 (O157) has been linked to numerous foodborne disease outbreaks. The ability to rapidly sequence and analyze genomes is important for understanding epidemiology, virulence, survival, and evolution of outbreak strains. In the current study, we performed comparative genomics to determine structural and functional features of the genome of a foodborne O157 isolate NADC 6564 and infer its evolutionary relationship to other O157 strains. The chromosome of NADC 6564 contained 5466 kb compared to reference strains Sakai (5498 kb) and EDL933 (5547 kb) and shared 41 of its 43 Linear Conserved Blocks (LCB) with the reference strains. However, 18 of 41 LCB had inverse orientation in NADC 6564 compared to the reference strains. NADC 6564 shared 18 of 19 bacteriophages with reference strains except that the chromosomal positioning of some of the phages differed among these strains. The additional phage (P19) of NADC 6564 was located on a 39-kb insertion element (IE) encoding several hypothetical proteins, an integrase, transposases, transcriptional regulators, an adhesin, and a phosphoethanolamine transferase (PEA). The complete homologs of the 39-kb IE were found in E. coli PCN061 of porcine origin. The IE-encoded PEA showed low homology (32-33%) to four other PEA in NADC 6564 and PEA linked to mobilizable colistin resistance in E. coli but was highly homologous (95%) to a PEA of uropathogenic, avian pathogenic, and enteroaggregative E. coli. NADC 6564 showed slightly higher minimum inhibitory concentration of colistin compared to the reference strains. The 39-kb IE also contained dndBCDE and dptFGH operons encoding DNA S-modification and a restriction pathway, linked to oxidative stress tolerance and self-defense against foreign DNA, respectively. Evolutionary tree analysis grouped NADC 6564 with lineage I O157 strains. These results indicated that differential phage counts and different chromosomal positioning of many bacteriophages and genomic islands might have resulted in recombination events causing altered chromosomal organization in NADC 6564. Evolutionary analysis grouped NADC 6564 with lineage I strains and suggested its earlier divergence from these strains. The ability to perform S-DNA modification might affect tolerance of NADC 6564 to various stressors.


April 21, 2020

A First Study of the Virulence Potential of a Bacillus subtilis Isolate From Deep-Sea Hydrothermal Vent.

Bacillus subtilis is the best studied Gram-positive bacterium, primarily as a model of cell differentiation and industrial exploitation. To date, little is known about the virulence of B. subtilis. In this study, we examined the virulence potential of a B. subtilis strain (G7) isolated from the Iheya North hydrothermal field of Okinawa Trough. G7 is aerobic, motile, endospore-forming, and requires NaCl for growth. The genome of G7 is composed of one circular chromosome of 4,216,133 base pairs with an average GC content of 43.72%. G7 contains 4,416 coding genes, 27.5% of which could not be annotated, and the remaining 72.5% were annotated with known or predicted functions in 25 different COG categories. Ten sets of 23S, 5S, and 16S ribosomal RNA operons, 86 tRNA and 14 sRNA genes, 50 tandem repeats, 41 mini-satellites, one microsatellite, and 42 transposons were identified in G7. Compared with the genome of the B. subtilis wild type strain NCIB 3610T, the G7 genome contains many genomic translocations, inversions, and insertions, and twice the number of genomic islands (GIs), with 42.5% of GI genes encoding hypothetical proteins. G7 possesses abundant putative virulence genes associated with adhesion, invasion, dissemination, anti-phagocytosis, and intracellular survival. Experimental studies showed that G7 was able to cause mortality in fish and mice following intramuscular/intraperitoneal injection, resist the killing effect of serum complement, and replicate in mouse macrophages and fish peripheral blood leukocytes. Taken together, our study indicates that G7 is a B. subtilis isolate with unique genetic features and can be lethal to vertebrate animals once introduced into the animals by artificial means. These results provide the first insight into the potential harmfulness of deep-sea B. subtilis.


April 21, 2020

Comparative genomics and pathogenicity potential of members of the Pseudomonas syringae species complex on Prunus spp.

Diseases on Prunus spp. have been associated with a large number of phylogenetically different pathovars and species within the P. syringae species complex. Despite their economic significance, there is a severe lack of genomic information on these pathogens. The high phylogenetic diversity observed within strains causing disease on Prunus spp. in nature raised the question of whether other strains or species within the P. syringae species complex were potentially pathogenic on Prunus spp. To gain insight into the genomic potential of adaptation and virulence in Prunus spp., a total of twelve de novo whole genome sequences were generated for P. syringae pathovars and species found in association with diseases on cherry (sweet, sour and ornamental cherry) and peach. Strains sequenced in this study covered three phylogroups and four clades. These strains were screened in vitro for pathogenicity on Prunus spp. together with additional genome-sequenced strains, thus covering nine of the thirteen currently defined P. syringae phylogroups. Pathogenicity tests revealed that most of the strains caused symptoms in vitro, and comparative genomics found no obvious link between the presence of known virulence factors and the observed pathogenicity pattern. Non-pathogenic strains displayed a two to three times longer generation time when grown in rich medium. In this study, the first set of complete genomes of cherry-associated P. syringae strains as well as the draft genome of the quarantine peach pathogen P. syringae pv. persicae were generated. The obtained genomic data were matched with phenotypic data in order to determine factors related to pathogenicity to Prunus spp. Results of this study suggest that the inability to cause disease on Prunus spp. in vitro is not the result of host specialization but is rather linked to metabolic impairments of individual strains.


April 21, 2020

Complete genome sequence of a marine-sediment-derived bacterial strain Bacillus velezensis SH-B74, a cyclic lipopeptides producer and a biopesticide.

A marine-sediment-derived strain, Bacillus velezensis SH-B74, has the capacity to produce cyclic lipopeptides (CLPs), and the CLPs secreted by the strain show biological activities against various pests under both in vitro and in planta conditions; this evidence supports the use of strain SH-B74 as a biopesticide. To gain better insight into the mechanisms by which the strain controls pests, the genomic DNA of strain SH-B74 was sequenced. The results show that strain SH-B74 has a chromosome of 4,042,190 bp with a GC content of 46.5%. In addition, the strain contains a 61,634 bp plasmid, pSH-B74, with a GC content of 40.8%. Bioinformatic analysis reveals that strain SH-B74 has genes with the capacity to increase environmental adaptation, promote rhizosphere fitness, and secrete a spectrum of antibiotics, including the nonribosomal peptide synthetase (NRPS)-derived CLPs bacillopeptin, plipastatin, and surfactin. The presence of CLPs in the bacterial cultures of strain SH-B74 was further confirmed by LC-MS analysis. Thus, genome sequencing and analyses, together with chemical analysis, reveal the promising prospects of strain SH-B74 as a plant-beneficial microbe for use in agricultural practices.

