NGS Archives - Page 78 of 425

April 21, 2020

Comparative Genomics of Thiohalobacter thiocyanaticus HRh1T and Guyparkeria sp. SCN-R1, Halophilic Chemolithoautotrophic Sulfur-Oxidizing Gammaproteobacteria Capable of Using Thiocyanate as Energy Source.

The genomes of Thiohalobacter thiocyanaticus and Guyparkeria (formerly known as Halothiobacillus) sp. SCN-R1, two gammaproteobacterial halophilic sulfur-oxidizing bacteria (SOB) capable of thiocyanate oxidation via the “cyanate pathway”, have been analyzed with a particular focus on their thiocyanate-oxidizing potential and sulfur oxidation pathways. Both genomes encode homologs of the enzyme thiocyanate dehydrogenase (TcDH) that oxidizes thiocyanate via the “cyanate pathway” in members of the haloalkaliphilic SOB of the genus Thioalkalivibrio. However, despite the presence of conservative motives indicative of TcDH, the putative TcDH of the halophilic SOB have a low overall amino acid similarity to the Thioalkalivibrio enzyme, and also the surrounding genes in the TcDH locus were different. In particular, an alternative copper transport system Cus is present instead of Cop and a putative zero-valent sulfur acceptor protein gene appears just before TcDH. Moreover, in contrast to the thiocyanate-oxidizing Thioalkalivibrio species, both genomes of the halophilic SOB contained a gene encoding the enzyme cyanate hydratase. The sulfur-oxidizing pathway in the genome of Thiohalobacter includes a Fcc type of sulfide dehydrogenase, a rDsr complex/AprAB/Sat for oxidation of zero-valent sulfur to sulfate, and an incomplete Sox pathway, lacking SoxCD. The sulfur oxidation pathway reconstructed from the genome of Guyparkeria sp. SCN-R1 was more similar to that of members of the Thiomicrospira-Hydrogenovibrio group, including a Fcc type of sulfide dehydrogenase and a complete Sox complex. One of the outstanding properties of Thiohalobacter is the presence of a Na+-dependent ATP synthase, which is rarely found in aerobic Prokaryotes.Overall, the results showed that, despite an obvious difference in the general sulfur-oxidation pathways, halophilic and haloalkaliphilic SOB belonging to different genera within the Gammaproteobacteria developed a similar unique thiocyanate-degrading mechanism based on the direct oxidative attack on the sulfane atom of thiocyanate.

April 21, 2020

Study of the whole genome, methylome and transcriptome of Cordyceps militaris.

The complete genome of Cordyceps militaris was sequenced using single-molecule real-time (SMRT) sequencing technology at a coverage over 300×. The genome size was 32.57?Mb, and 14 contigs ranging from 0.35 to 4.58?Mb with an N50 of 2.86?Mb were assembled, including 4 contigs with telomeric sequences on both ends and an additional 8 contigs with telomeric sequences on either the 5′ or 3′ end. A methylome database of the genome was constructed using SMRT and m4C and m6A methylated nucleotides, and many unknown modification types were identified. The major m6A methylation motif is GA and GGAG, and the major m4C methylation motif is GC or CG/GC. In the C. militaris genome DNA, there were four types of methylated nucleotides that we confirmed using high-resolution LCMS-IT-TOF. Using PacBio Iso-Seq, a total of 31,133 complete cDNA sequences were obtained in the fruiting body. The conserved domains of the nontranscribed regions of the genome include TATA boxes, which are the initial regions of genome replication. There were 406 structural variants between the HN and CM01 strains, and there were 1,114 structural variants between the HN and ATCC strains.

April 21, 2020

Genome Features and Secondary Metabolites Biosynthetic Potential of the Class Ktedonobacteria.

The prevalence of antibiotic resistance and the decrease in novel antibiotic discovery in recent years necessitates the identification of potentially novel microbial resources to produce natural products. Ktedonobacteria, a class of deeply branched bacterial lineage in the ancient phylum Chloroflexi, are ubiquitous in terrestrial environments and characterized by their large genome size and complex life cycle. These characteristics indicate Ktedonobacteria as a potential active producer of bioactive compounds. In this study, we observed the existence of a putative “megaplasmid,” multiple copies of ribosomal RNA operons, and high ratio of hypothetical proteins with unknown functions in the class Ktedonobacteria. Furthermore, a total of 104 antiSMASH-predicted putative biosynthetic gene clusters (BGCs) for secondary metabolites with high novelty and diversity were identified in nine Ktedonobacteria genomes. Our investigation of domain composition and organization of the non-ribosomal peptide synthetase and polyketide synthase BGCs further supports the concept that class Ktedonobacteria may produce compounds structurally different from known natural products. Furthermore, screening of bioactive compounds from representative Ktedonobacteria strains resulted in the identification of broad antimicrobial activities against both Gram-positive and Gram-negative tested bacterial strains. Based on these findings, we propose the ancient, ubiquitous, and spore-forming Ktedonobacteria as a versatile and promising microbial resource for natural product discovery.

April 21, 2020

Biodegradation of naphthalene, BTEX, and aliphatic hydrocarbons by Paraburkholderia aromaticivorans BN5 isolated from petroleum-contaminated soil.

To isolate bacteria responsible for the biodegradation of naphthalene, BTEX (benzene, toluene, ethylbenzene, and o-, m-, and p-xylene), and aliphatic hydrocarbons in petroleum-contaminated soil, three enrichment cultures were established using soil extract as the medium supplemented with naphthalene, BTEX, or n-hexadecane. Community analyses showed that Paraburkholderia species were predominant in naphthalene and BTEX, but relatively minor in n-hexadecane. Paraburkholderia aromaticivorans BN5 was able to degrade naphthalene and all BTEX compounds, but not n-hexadecane. The genome of strain BN5 harbors genes encoding 29 monooxygenases including two alkane 1-monooxygenases and 54 dioxygenases, indicating that strain BN5 has versatile metabolic capabilities, for diverse organic compounds: the ability of strain BN5 to degrade short chain aliphatic hydrocarbons was verified experimentally. The biodegradation pathways of naphthalene and BTEX compounds were bioinformatically predicted and verified experimentally through the analysis of their metabolic intermediates. Some genomic features including the encoding of the biodegradation genes on a plasmid and the low sequence homologies of biodegradation-related genes suggest that biodegradation potentials of strain BN5 may have been acquired via horizontal gene transfers and/or gene duplication, resulting in enhanced ecological fitness by enabling strain BN5 to degrade all compounds including naphthalene, BTEX, and short aliphatic hydrocarbons in contaminated soil.

April 21, 2020

Urotensin-related gene transcripts mark developmental emergence of the male forebrain vocal control system in songbirds.

Songbirds communicate through learned vocalizations, using a forebrain circuit with convergent similarity to vocal-control circuitry in humans. This circuit is incomplete in female zebra finches, hence only males sing. We show that the UTS2B gene, encoding Urotensin-Related Peptide (URP), is uniquely expressed in a key pre-motor vocal nucleus (HVC), and specifically marks the neurons that form a male-specific projection that encodes timing features of learned song. UTS2B-expressing cells appear early in males, prior to projection formation, but are not observed in the female nucleus. We find no expression evidence for canonical receptors within the vocal circuit, suggesting either signalling to other brain regions via diffusion or transduction through other receptor systems. Urotensins have not previously been implicated in vocal control, but we find an annotation in Allen Human Brain Atlas of increased UTS2B expression within portions of human inferior frontal cortex implicated in human speech and singing. Thus UTS2B (URP) is a novel neural marker that may have conserved functions for vocal communication.

April 21, 2020

Sequence properties of certain GC rich avian genes, their origins and absence from genome assemblies: case studies.

More and more eukaryotic genomes are sequenced and assembled, most of them presented as a complete model in which missing chromosomal regions are filled by Ns and where a few chromosomes may be lacking. Avian genomes often contain sequences with high GC content, which has been hypothesized to be at the origin of many missing sequences in these genomes. We investigated features of these missing sequences to discover why some may not have been integrated into genomic libraries and/or sequenced.The sequences of five red jungle fowl cDNA models with high GC content were used as queries to search publicly available datasets of Illumina and Pacbio sequencing reads. These were used to reconstruct the leptin, TNFa, MRPL52, PCP2 and PET100 genes, all of which are absent from the red jungle fowl genome model. These gene sequences displayed elevated GC contents, had intron sizes that were sometimes larger than non-avian orthologues, and had non-coding regions that contained numerous tandem and inverted repeat sequences with motifs able to assemble into stable G-quadruplexes and intrastrand dyadic structures. Our results suggest that Illumina technology was unable to sequence the non-coding regions of these genes. On the other hand, PacBio technology was able to sequence these regions, but with dramatically lower efficiency than would typically be expected.High GC content was not the principal reason why numerous GC-rich regions of avian genomes are missing from genome assembly models. Instead, it is the presence of tandem repeats containing motifs capable of assembling into very stable secondary structures that is likely responsible.

April 21, 2020

Longitudinal HIV sequencing reveals reservoir expression leading to decay which is obscured by clonal expansion.

After initiating antiretroviral therapy (ART), a rapid decline in HIV viral load is followed by a long period of undetectable viremia. Viral outgrowth assay suggests the reservoir continues to decline slowly. Here, we use full-length sequencing to longitudinally study the proviral landscape of four subjects on ART to investigate the selective pressures influencing the dynamics of the treatment-resistant HIV reservoir. We find intact and defective proviruses that contain genetic elements favoring efficient protein expression decrease over time. Moreover, proviruses that lack these genetic elements, yet contain strong donor splice sequences, increase relatively to other defective proviruses, especially among clones. Our work suggests that HIV expression occurs to a significant extent during ART and results in HIV clearance, but this is obscured by the expansion of proviral clones. Paradoxically, clonal expansion may also be enhanced by HIV expression that leads to splicing between HIV donor splice sites and downstream human exons.

April 21, 2020

Getting the Entire Message: Progress in Isoform Sequencing

The advent of second-generation sequencing and its application to RNA sequencing has revolutionized the field of genomics by allowing the quantification of expression of entire genes as well as single TSS, exons and splice sites, RNA-editing sites as well as polyA-sites. However, due to the sequencing of fragments of cDNAs these methods have not given a reliable picture of complete RNA isoforms. Third-generation sequencing has filled this gap and allows end-to-end sequencing of entire RNA/cDNA molecules. This approach to transcriptomics has been a ‘niche’ technology for a couple of years but now is becoming mainstream with many different applications. Here, we review the background and progress made to date in this rapidly growing field. We start by reviewing the progressive realization that alternative splicing is omnipresent. We then focus on long-non-coding RNA isoforms and the distinct combination patterns of exons in non-coding and coding genes. We consider the implications of the recent technologies of direct RNA sequencing and single-cell isoform RNA sequencing. Finally, we discuss the parameters that define the success of long-read RNA sequencing experiments and strategies commonly used to make the most of such data.

April 21, 2020

The First Highly Contiguous Genome Assembly of Pikeperch (Sander lucioperca), an Emerging Aquaculture Species in Europe

The pikeperch (Sander lucioperca) is a fresh and brackish water Percid fish natively inhabiting the northern hemisphere. This species is emerging as a promising candidate for intensive aquaculture production in Europe. Specific traits like cannibalism, growth rate and meat quality require genomics based understanding, for an optimal husbandry and domestication process. Still, the aquaculture community is lacking an annotated genome sequence to facilitate genome-wide studies on pikeperch. Here, we report the first highly contiguous draft genome assembly of Sander lucioperca. In total, 413 and 66 giga base pairs of DNA sequencing raw data were generated with the Illumina platform and PacBio Sequel System, respectively. The PacBio data were assembled into a final assembly size of ~900 Mb covering 89% of the 1,014 Mb estimated genome size. The draft genome consisted of 1966 contigs ordered into 1,313 scaffolds. The contig and scaffold N50 lengths are 3.0 Mb and 4.9 Mb, respectively. The identified repetitive structures accounted for 39% of the genome. We utilized homologies to other ray-finned fishes, and ab initio gene prediction methods to predict 21,249 protein-coding genes in the Sander lucioperca genome, of which 88% were functionally annotated by either sequence homology or protein domains and signatures search. The assembled genome spans 97.6% and 96.3% of Vertebrate and Actinopterygii single-copy orthologs, respectively. The outstanding mapping rate (99.9%) of genomic PE-reads on the assembly suggests an accurate and nearly complete genome reconstruction. This draft genome sequence is the first genomic resource for this promising aquaculture species. It will provide an impetus for genomic-based breeding studies targeting phenotypic and performance traits of captive pikeperch.

April 21, 2020

Combining next-generation sequencing and single-molecule sequencing to explore brown plant hopper responses to contrasting genotypes of japonica rice.

The brown plant hopper (BPH), Nilaparvata lugens, is one of the major pest of rice (Oryza sativa). Plant defenses against insect herbivores have been extensively studied, but our understanding of insect responses to host plants’ resistance mechanisms is still limited. The purpose of this study is to characterize transcripts of BPH and reveal the responses of BPH insects to resistant rice at transcription level by using the advanced molecular techniques, the next-generation sequencing (NGS) and the single-molecule, real-time (SMRT) sequencing.The current study obtained 24,891 collapsed isoforms of full-length transcripts, and 20,662 were mapped to known annotated genes, including 17,175 novel transcripts. The current study also identified 915 fusion genes, 1794 novel genes, 2435 long non-coding RNAs (lncRNAs), and 20,356 alternative splicing events. Moreover, analysis of differentially expressed genes (DEGs) revealed that genes involved in metabolic and cell proliferation processes were significantly enriched in up-regulated and down-regulated sets, respectively, in BPH fed on resistant rice relative to BPH fed on susceptible wild type rice. Furthermore, the FoxO signaling pathway was involved and genes related to BPH starvation response (Nlbmm), apoptosis and autophagy (caspase 8, ATG13, BNIP3 and IAP), active oxygen elimination (catalase, MSR, ferritin) and detoxification (GST, CarE) were up-regulated in BPH responses to resistant rice.The current study provides the first demonstrations of the full diversity and complexity of the BPH transcriptome, and indicates that BPH responses to rice resistance, might be related to starvation stress responses, nutrient transformation, oxidative decomposition, and detoxification. The current result findings will facilitate further exploration of molecular mechanisms of interaction between BPH insects and host rice.

April 21, 2020

The importance of genome sequence quality to microbial comparative genomics.

The quality of microbial genome sequences has been a concern ever since the emergence of genome sequencing. The quality of the genome assemblies is dependent on the sequencing technology used and the aims for which the sequence was generated. Novel sequencing and bioinformatics technologies are not intrinsically better than the older technologies, although they are generally more efficient. In this correspondence, the importance for comparative genomics of additional manual assembly efforts over autoassembly and careful annotation is emphasized.

April 21, 2020

The Impact of cDNA Normalization on Long-Read Sequencing of a Complex Transcriptome

Normalization of cDNA is widely used to improve the coverage of rare transcripts in analysis of transcriptomes employing next-generation sequencing. Recently, long-read technology has been emerging as a powerful tool for sequencing and construction of transcriptomes, especially for complex genomes containing highly similar transcripts and transcript-spliced isoforms. Here, we analyzed the transcriptome of sugarcane, with a highly polyploidy plant genome, by PacBio isoform sequencing (Iso-Seq) of two different cDNA library preparations, with and without a normalization step. The results demonstrated that, while the two libraries included many of the same transcripts, many longer transcripts were removed and many new generally shorter transcripts were detected by normalization. For the same input cDNA and the same data yield, the normalized library recovered more total transcript isoforms, number of predicted gene families and orthologous groups, resulting in a higher representation for the sugarcane transcriptome, compared to the non-normalized library. The non-normalized library, on the other hand, included a wider transcript length range with more longer transcripts above ~1.25 kb, more transcript isoforms per gene family and gene ontology terms per transcript. A large proportion of the unique transcripts comprising ~52% of the normalized library were expressed at a lower level than the unique transcripts from the non-normalized library, across three tissue types tested including leaf, stalk and root. About 83% of the total 5,348 predicted long noncoding transcripts was derived from the normalized library, of which ~80% was derived from the lowly expressed fraction. Functional annotation of the unique transcripts suggested that each library enriched different functional transcript fractions. This demonstrated the complementation of the two approaches in obtaining a complete transcriptome of a complex genome at the sequencing depth used in this study.

April 21, 2020

Complete genome sequence analysis of the thermoacidophilic verrucomicrobial methanotroph “Candidatus Methylacidiphilum kamchatkense” strain Kam1 and comparison with its closest relatives.

The candidate genus “Methylacidiphilum” comprises thermoacidophilic aerobic methane oxidizers belonging to the Verrucomicrobia phylum. These are the first described non-proteobacterial aerobic methane oxidizers. The genes pmoCAB, encoding the particulate methane monooxygenase do not originate from horizontal gene transfer from proteobacteria. Instead, the “Ca. Methylacidiphilum” and the sister genus “Ca. Methylacidimicrobium” represent a novel and hitherto understudied evolutionary lineage of aerobic methane oxidizers. Obtaining and comparing the full genome sequences is an important step towards understanding the evolution and physiology of this novel group of organisms.Here we present the closed genome of “Ca. Methylacidiphilum kamchatkense” strain Kam1 and a comparison with the genomes of its two closest relatives “Ca. Methylacidiphilum fumariolicum” strain SolV and “Ca. Methylacidiphilum infernorum” strain V4. The genome consists of a single 2,2 Mbp chromosome with 2119 predicted protein coding sequences. Genome analysis showed that the majority of the genes connected with metabolic traits described for one member of “Ca. Methylacidiphilum” is conserved between all three genomes. All three strains encode class I CRISPR-cas systems. The average nucleotide identity between “Ca. M. kamchatkense” strain Kam1 and strains SolV and V4 is =95% showing that they should be regarded as separate species. Whole genome comparison revealed a high degree of synteny between the genomes of strains Kam1 and SolV. In contrast, comparison of the genomes of strains Kam1 and V4 revealed a number of rearrangements. There are large differences in the numbers of transposable elements found in the genomes of the three strains with 12, 37 and 80 transposable elements in the genomes of strains Kam1, V4 and SolV respectively. Genomic rearrangements and the activity of transposable elements explain much of the genomic differences between strains. For example, a type 1h uptake hydrogenase is conserved between strains Kam1 and SolV but seems to have been lost from strain V4 due to genomic rearrangements.Comparing three closed genomes of “Ca. Methylacidiphilum” spp. has given new insights into the evolution of these organisms and revealed large differences in numbers of transposable elements between strains, the activity of these explains much of the genomic differences between strains.

April 21, 2020

DiscoverY: a classifier for identifying Y chromosome sequences in male assemblies.

Although the Y chromosome plays an important role in male sex determination and fertility, it is currently understudied due to its haploid and repetitive nature. Methods to isolate Y-specific contigs from a whole-genome assembly broadly fall into two categories. The first involves retrieving Y-contigs using proportion sharing with a female, but such a strategy is prone to false positives in the absence of a high-quality, complete female reference. A second strategy uses the ratio of depth of coverage from male and female reads to select Y-contigs, but such a method requires high-depth sequencing of a female and cannot utilize existing female references.We develop a k-mer based method called DiscoverY, which combines proportion sharing with female with depth of coverage from male reads to classify contigs as Y-chromosomal. We evaluate the performance of DiscoverY on human and gorilla genomes, across different sequencing platforms including Illumina, 10X, and PacBio. In the cases where the male and female data are of high quality, DiscoverY has a high precision and recall and outperforms existing methods. For cases when a high quality female reference is not available, we quantify the effect of using draft reference or even just raw sequencing reads from a female.DiscoverY is an effective method to isolate Y-specific contigs from a whole-genome assembly. However, regions homologous to the X chromosome remain difficult to detect.

April 21, 2020

Denitrifying Bacteria Active in Woodchip Bioreactors at Low-Temperature Conditions.

Woodchip bioreactor technology removes nitrate from agricultural subsurface drainage by using denitrifying microorganisms. Although woodchip bioreactors have demonstrated success in many field locations, low water temperature can significantly limit bioreactor efficiency and performance. To improve bioreactor performance, it is important to identify the microbes responsible for nitrate removal at low temperature conditions. Therefore, in this study, we identified and characterized denitrifiers active at low-temperature conditions by using culture-independent and -dependent approaches. By comparative 16S rRNA (gene) analysis and culture isolation technique, Pseudomonas spp., Polaromonas spp., and Cellulomonas spp. were identified as being important bacteria responsible for denitrification in woodchip bioreactor microcosms at relatively low temperature conditions (15°C). Genome analysis of Cellulomonas sp. strain WB94 confirmed the presence of nitrite reductase gene nirK. Transcription levels of this nirK were significantly higher in the denitrifying microcosms than in the non-denitrifying microcosms. Strain WB94 was also capable of degrading cellulose and other complex polysaccharides. Taken together, our results suggest that Cellulomonas sp. denitrifiers could degrade woodchips to provide carbon source and electron donors to themselves and other denitrifiers in woodchip bioreactors at low-temperature conditions. By inoculating these denitrifiers (i.e., bioaugmentation), it might be possible to increase the nitrate removal rate of woodchip bioreactors at low-temperature conditions.

Asset Tag: NGS

Comparative Genomics of Thiohalobacter thiocyanaticus HRh1T and Guyparkeria sp. SCN-R1, Halophilic Chemolithoautotrophic Sulfur-Oxidizing Gammaproteobacteria Capable of Using Thiocyanate as Energy Source.

Study of the whole genome, methylome and transcriptome of Cordyceps militaris.

Genome Features and Secondary Metabolites Biosynthetic Potential of the Class Ktedonobacteria.

Biodegradation of naphthalene, BTEX, and aliphatic hydrocarbons by Paraburkholderia aromaticivorans BN5 isolated from petroleum-contaminated soil.

Urotensin-related gene transcripts mark developmental emergence of the male forebrain vocal control system in songbirds.

Sequence properties of certain GC rich avian genes, their origins and absence from genome assemblies: case studies.

Longitudinal HIV sequencing reveals reservoir expression leading to decay which is obscured by clonal expansion.

Getting the Entire Message: Progress in Isoform Sequencing

The First Highly Contiguous Genome Assembly of Pikeperch (Sander lucioperca), an Emerging Aquaculture Species in Europe

Combining next-generation sequencing and single-molecule sequencing to explore brown plant hopper responses to contrasting genotypes of japonica rice.

The importance of genome sequence quality to microbial comparative genomics.

Complete genome sequence analysis of the thermoacidophilic verrucomicrobial methanotroph “Candidatus Methylacidiphilum kamchatkense” strain Kam1 and comparison with its closest relatives.

DiscoverY: a classifier for identifying Y chromosome sequences in male assemblies.

Denitrifying Bacteria Active in Woodchip Bioreactors at Low-Temperature Conditions.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert