April 21, 2020

Long-read sequencing unveils IGH-DUX4 translocation into the silenced IGH allele in B-cell acute lymphoblastic leukemia.

IGH@ proto-oncogene translocation is a common oncogenic event in lymphoid lineage cancers such as B-ALL, lymphoma and multiple myeloma. Here, to investigate the interplay between IGH@ proto-oncogene translocation and IGH allelic exclusion, we perform long-read whole-genome and transcriptome sequencing along with epigenetic and 3D genome profiling of Nalm6, an IGH-DUX4 positive B-ALL cell line. We detect significant allelic imbalance on the wild-type over the IGH-DUX4 haplotype in expression and epigenetic data, showing that IGH-DUX4 translocation occurs on the silenced IGH allele. In vitro, this reduces the oncogenic stress of DUX4 high-level expression. Moreover, patient samples of IGH-DUX4 B-ALL have a similar expression profile and similar IGH breakpoints to Nalm6, suggesting a common mechanism to allow optimal dosage of non-toxic DUX4 expression.


April 21, 2020

Comprehensive identification of the full-length transcripts and alternative splicing related to the secondary metabolism pathways in the tea plant (Camellia sinensis).

Flavonoids, theanine and caffeine are the main secondary metabolites of the tea plant (Camellia sinensis), which account for the tea’s unique flavor quality and health benefits. The biosynthesis pathways of these metabolites have been extensively studied at the transcriptional level, but the regulatory mechanisms are still unclear. In this study, to explore the transcriptome diversity and complexity of the tea plant, PacBio Iso-Seq and RNA-seq analysis were combined to obtain full-length transcripts and to profile the changes in gene expression during leaf development. A total of 1,388,066 reads of insert (ROI) were generated with an average length of 1,762 bp, and more than 54% (755,716) of the ROIs were full-length non-chimeric (FLNC) reads. The Benchmarking Universal Single-Copy Orthologue (BUSCO) completeness was 92.7%. A total of 93,883 non-redundant transcripts were obtained, and 87,395 (93.1%) were new alternatively spliced isoforms. Meanwhile, 7,650 differentially expressed transcripts (DETs) were identified. A total of 28,980 alternative splicing (AS) events were predicted, including 1,297 differential AS (DAS) events. The transcript isoforms of the key genes involved in the flavonoid, theanine and caffeine biosynthesis pathways were characterized. Additionally, 5,777 fusion transcripts and 9,052 long non-coding RNAs (lncRNAs) were predicted. Our results revealed that AS potentially plays a crucial role in the regulation of the secondary metabolism of the tea plant. These findings enhanced our understanding of the complexity of the secondary metabolic regulation of tea plants and provided a basis for the subsequent exploration of the regulatory mechanisms of flavonoid, theanine and caffeine biosynthesis in tea plants.


April 21, 2020

Identification of a Xist silencing domain by Tiling CRISPR.

Despite essential roles played by long noncoding RNAs (lncRNAs) in development and disease, methods to determine lncRNA cis-elements are lacking. Here, we developed a screening method named “Tiling CRISPR” to identify lncRNA functional domains. Using this approach, we identified Xist A-Repeats as the silencing domain, an observation in agreement with published work, suggesting Tiling CRISPR feasibility. Mechanistic analysis suggested a novel function for Xist A-repeats in promoting Xist transcription. Overall, our method allows mapping of lncRNA functional domains in an unbiased and potentially high-throughput manner to facilitate the understanding of lncRNA functions.


April 21, 2020

Within-host evolution of Helicobacter pylori shaped by niche-specific adaptation, intragastric migrations and selective sweeps.

The human pathogen Helicobacter pylori displays extensive genetic diversity. While H. pylori is known to evolve during infection, population dynamics inside the gastric environment have not been extensively investigated. Here we obtained gastric biopsies from multiple stomach regions of 16 H. pylori-infected adults, and analyzed the genomes of 10 H. pylori isolates from each biopsy. Phylogenetic analyses suggest location-specific evolution and bacterial migration between gastric regions. Migration is significantly more frequent between the corpus and the fundus than with the antrum, suggesting that physiological differences between antral and oxyntic mucosa contribute to spatial partitioning of H. pylori populations. Associations between H. pylori gene polymorphisms and stomach niches suggest that chemotaxis, regulatory functions and outer membrane proteins contribute to specific adaptation to the antral and oxyntic mucosa. Moreover, we show that antibiotics can induce severe population bottlenecks and likely play a role in shaping the population structure of H. pylori.


April 21, 2020

Genomic Analyses Reveal Evidence of Independent Evolution, Demographic History, and Extreme Environment Adaptation of Tibetan Plateau Agaricus bisporus.

Agaricus bisporus from the Tibetan Plateau of China has high stress resistance that is valuable for breeding improvements. However, its evolutionary history, specialization, and adaptation to the extreme Tibetan Plateau environment are largely unknown. Here, we performed de novo genome sequencing of a representative Tibetan Plateau wild strain, ABM, and comparative genomic analysis with the reported European strains H97 and H39. The assembled ABM genome was 30.4 Mb in size and comprised 8,562 protein-coding genes. The ABM genome shared highly conserved syntenic blocks and a few inversions with H97 and H39. The phylogenetic tree constructed from 1,276 single-copy orthologous genes in nine fungal species showed that the Tibetan Plateau and European A. bisporus diverged ~5.5 million years ago. Population genomic analysis using genome resequencing of 29 strains revealed that the Tibetan Plateau population underwent significant differentiation from the European and American populations and evolved independently, and that global climate changes critically shaped the demographic history of the Tibetan Plateau population. Moreover, we identified key genes, related to the cell wall and membrane system and to the development and defense systems, that regulate the adaptation of A. bisporus to the harsh Tibetan Plateau environment. These findings highlight the value of genomic data in assessing the evolution and adaptation of mushrooms and will enhance future genetic improvements of A. bisporus.


April 21, 2020

Efomycins K and L From a Termite-Associated Streptomyces sp. M56 and Their Putative Biosynthetic Origin.

Two new elaiophylin derivatives, efomycins K (1) and L (2), and five known elaiophylin derivatives (3-7) were isolated from the termite-associated Streptomyces sp. M56. The structures were determined by 1D and 2D NMR and HR-ESIMS analyses and comparative CD spectroscopy. The putative gene cluster responsible for the production of the elaiophylin and efomycin derivatives was identified based on significant homology to related clusters. Phylogenetic analysis of gene cluster domains was used to provide a biosynthetic rationale for these new derivatives and to demonstrate how a single biosynthetic pathway can produce diverse structures.


April 21, 2020

The Not-so-Sterile Womb: Evidence That the Human Fetus Is Exposed to Bacteria Prior to Birth.

The human microbiome includes trillions of bacteria, many of which play a vital role in host physiology. Numerous studies have now detected bacterial DNA in first-pass meconium and amniotic fluid samples, suggesting that the human microbiome may commence in utero. However, these data have remained contentious due to underlying contamination issues. Here, we have used a previously described method for reducing contamination in microbiome workflows to determine if there is a fetal bacterial microbiome beyond the level of background contamination. We recruited 50 women undergoing non-emergency cesarean section deliveries with no evidence of intra-uterine infection and collected first-pass meconium and amniotic fluid samples. Full-length 16S rRNA gene sequencing was performed using PacBio SMRT cell technology, to allow high resolution profiling of the fetal gut and amniotic fluid bacterial microbiomes. Levels of inflammatory cytokines were measured in amniotic fluid, and levels of immunomodulatory short chain fatty acids (SCFAs) were quantified in meconium. All meconium samples and most amniotic fluid samples (36/43) contained bacterial DNA. The meconium microbiome was dominated by reads that mapped to Pelomonas puraquae. Aside from this species, the meconium microbiome was remarkably heterogeneous between patients. The amniotic fluid microbiome was more diverse and contained mainly reads that mapped to typical skin commensals, including Propionibacterium acnes and Staphylococcus spp. All meconium samples contained acetate and propionate, at ratios similar to those previously reported in infants. P. puraquae reads were inversely correlated with meconium propionate levels. Amniotic fluid cytokine levels were associated with the amniotic fluid microbiome. Our results demonstrate that bacterial DNA and SCFAs are present in utero, and have the potential to influence the developing fetal immune system.


April 21, 2020

Comparative Genomics of Thiohalobacter thiocyanaticus HRh1T and Guyparkeria sp. SCN-R1, Halophilic Chemolithoautotrophic Sulfur-Oxidizing Gammaproteobacteria Capable of Using Thiocyanate as Energy Source.

The genomes of Thiohalobacter thiocyanaticus and Guyparkeria (formerly known as Halothiobacillus) sp. SCN-R1, two gammaproteobacterial halophilic sulfur-oxidizing bacteria (SOB) capable of thiocyanate oxidation via the “cyanate pathway”, have been analyzed with a particular focus on their thiocyanate-oxidizing potential and sulfur oxidation pathways. Both genomes encode homologs of the enzyme thiocyanate dehydrogenase (TcDH) that oxidizes thiocyanate via the “cyanate pathway” in members of the haloalkaliphilic SOB of the genus Thioalkalivibrio. However, despite the presence of conserved motifs indicative of TcDH, the putative TcDHs of the halophilic SOB have low overall amino acid similarity to the Thioalkalivibrio enzyme, and the surrounding genes in the TcDH locus also differ. In particular, an alternative copper transport system, Cus, is present instead of Cop, and a putative zero-valent sulfur acceptor protein gene appears just before TcDH. Moreover, in contrast to the thiocyanate-oxidizing Thioalkalivibrio species, both genomes of the halophilic SOB contained a gene encoding the enzyme cyanate hydratase. The sulfur-oxidizing pathway in the genome of Thiohalobacter includes an Fcc type of sulfide dehydrogenase, an rDsr complex/AprAB/Sat for oxidation of zero-valent sulfur to sulfate, and an incomplete Sox pathway, lacking SoxCD. The sulfur oxidation pathway reconstructed from the genome of Guyparkeria sp. SCN-R1 was more similar to that of members of the Thiomicrospira-Hydrogenovibrio group, including an Fcc type of sulfide dehydrogenase and a complete Sox complex. One of the outstanding properties of Thiohalobacter is the presence of a Na+-dependent ATP synthase, which is rarely found in aerobic prokaryotes. Overall, the results showed that, despite an obvious difference in the general sulfur-oxidation pathways, halophilic and haloalkaliphilic SOB belonging to different genera within the Gammaproteobacteria developed a similar unique thiocyanate-degrading mechanism based on the direct oxidative attack on the sulfane atom of thiocyanate.


April 21, 2020

Genome Features and Secondary Metabolites Biosynthetic Potential of the Class Ktedonobacteria.

The prevalence of antibiotic resistance and the decrease in novel antibiotic discovery in recent years necessitate the identification of potentially novel microbial resources to produce natural products. Ktedonobacteria, a deeply branched bacterial class in the ancient phylum Chloroflexi, are ubiquitous in terrestrial environments and characterized by their large genome size and complex life cycle. These characteristics suggest that Ktedonobacteria are potentially active producers of bioactive compounds. In this study, we observed the existence of a putative “megaplasmid,” multiple copies of ribosomal RNA operons, and a high ratio of hypothetical proteins with unknown functions in the class Ktedonobacteria. Furthermore, a total of 104 antiSMASH-predicted putative biosynthetic gene clusters (BGCs) for secondary metabolites with high novelty and diversity were identified in nine Ktedonobacteria genomes. Our investigation of domain composition and organization of the non-ribosomal peptide synthetase and polyketide synthase BGCs further supports the concept that class Ktedonobacteria may produce compounds structurally different from known natural products. Furthermore, screening of bioactive compounds from representative Ktedonobacteria strains resulted in the identification of broad antimicrobial activities against both Gram-positive and Gram-negative tested bacterial strains. Based on these findings, we propose the ancient, ubiquitous, and spore-forming Ktedonobacteria as a versatile and promising microbial resource for natural product discovery.


April 21, 2020

Getting the Entire Message: Progress in Isoform Sequencing

The advent of second-generation sequencing and its application to RNA sequencing has revolutionized the field of genomics by allowing the quantification of expression of entire genes as well as of individual transcription start sites (TSSs), exons, splice sites, RNA-editing sites and polyA sites. However, because these methods sequence fragments of cDNAs, they have not given a reliable picture of complete RNA isoforms. Third-generation sequencing has filled this gap and allows end-to-end sequencing of entire RNA/cDNA molecules. This approach to transcriptomics has been a ‘niche’ technology for a couple of years but is now becoming mainstream with many different applications. Here, we review the background and progress made to date in this rapidly growing field. We start by reviewing the progressive realization that alternative splicing is omnipresent. We then focus on long-non-coding RNA isoforms and the distinct combination patterns of exons in non-coding and coding genes. We consider the implications of the recent technologies of direct RNA sequencing and single-cell isoform RNA sequencing. Finally, we discuss the parameters that define the success of long-read RNA sequencing experiments and strategies commonly used to make the most of such data.


April 21, 2020

The First Highly Contiguous Genome Assembly of Pikeperch (Sander lucioperca), an Emerging Aquaculture Species in Europe

The pikeperch (Sander lucioperca) is a fresh and brackish water Percid fish natively inhabiting the northern hemisphere. This species is emerging as a promising candidate for intensive aquaculture production in Europe. Specific traits like cannibalism, growth rate and meat quality require a genomics-based understanding for an optimal husbandry and domestication process. Still, the aquaculture community is lacking an annotated genome sequence to facilitate genome-wide studies on pikeperch. Here, we report the first highly contiguous draft genome assembly of Sander lucioperca. In total, 413 and 66 giga base pairs of DNA sequencing raw data were generated with the Illumina platform and PacBio Sequel System, respectively. The PacBio data were assembled into a final assembly size of ~900 Mb covering 89% of the 1,014 Mb estimated genome size. The draft genome consisted of 1,966 contigs ordered into 1,313 scaffolds. The contig and scaffold N50 lengths are 3.0 Mb and 4.9 Mb, respectively. The identified repetitive structures accounted for 39% of the genome. We utilized homologies to other ray-finned fishes and ab initio gene prediction methods to predict 21,249 protein-coding genes in the Sander lucioperca genome, of which 88% were functionally annotated by either sequence homology or protein domains and signatures search. The assembled genome spans 97.6% and 96.3% of Vertebrate and Actinopterygii single-copy orthologs, respectively. The outstanding mapping rate (99.9%) of genomic PE-reads on the assembly suggests an accurate and nearly complete genome reconstruction. This draft genome sequence is the first genomic resource for this promising aquaculture species. It will provide an impetus for genomic-based breeding studies targeting phenotypic and performance traits of captive pikeperch.
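
For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed from a list of contig or scaffold lengths. The function name `n50` and the toy lengths are illustrative assumptions; they are not the pikeperch data and not the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly size.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical toy assembly (arbitrary units), total size 20.
toy_contigs = [2, 3, 4, 5, 6]
print(n50(toy_contigs))
```

In the toy case the two longest sequences (6 and 5) already cover more than half of the total, so the N50 is the shorter of the two.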


April 21, 2020

Combining next-generation sequencing and single-molecule sequencing to explore brown plant hopper responses to contrasting genotypes of japonica rice.

The brown plant hopper (BPH), Nilaparvata lugens, is one of the major pests of rice (Oryza sativa). Plant defenses against insect herbivores have been extensively studied, but our understanding of insect responses to host plants’ resistance mechanisms is still limited. The purpose of this study is to characterize transcripts of BPH and reveal the responses of BPH insects to resistant rice at the transcription level by using two advanced molecular techniques, next-generation sequencing (NGS) and single-molecule, real-time (SMRT) sequencing. The current study obtained 24,891 collapsed isoforms of full-length transcripts, and 20,662 were mapped to known annotated genes, including 17,175 novel transcripts. The current study also identified 915 fusion genes, 1,794 novel genes, 2,435 long non-coding RNAs (lncRNAs), and 20,356 alternative splicing events. Moreover, analysis of differentially expressed genes (DEGs) revealed that genes involved in metabolic and cell proliferation processes were significantly enriched in up-regulated and down-regulated sets, respectively, in BPH fed on resistant rice relative to BPH fed on susceptible wild type rice. Furthermore, the FoxO signaling pathway was involved, and genes related to BPH starvation response (Nlbmm), apoptosis and autophagy (caspase 8, ATG13, BNIP3 and IAP), active oxygen elimination (catalase, MSR, ferritin) and detoxification (GST, CarE) were up-regulated in BPH responses to resistant rice. The current study provides the first demonstration of the full diversity and complexity of the BPH transcriptome, and indicates that BPH responses to rice resistance might be related to starvation stress responses, nutrient transformation, oxidative decomposition, and detoxification. The current findings will facilitate further exploration of molecular mechanisms of interaction between BPH insects and host rice.


April 21, 2020

Systematic identification of intergenic long-noncoding RNAs in mouse retinas using full-length isoform sequencing.

A large number of long noncoding RNAs (lncRNAs) have been identified in the mouse genome, and increasing evidence in recent decades has revealed their crucial roles in diverse biological processes. Nevertheless, the biological roles of lncRNAs in the mouse retina remain largely unknown due to the lack of a comprehensive annotation of lncRNAs expressed in the retina. In this study, we applied a long-read sequencing strategy to unravel the transcriptomes of developing mouse retinas and identified a total of 940 intergenic lncRNAs (lincRNAs) in embryonic and neonatal retinas, about 13% of which were transcribed from unannotated gene loci. Subsequent analysis revealed that the functions of lincRNAs expressed in mouse retinas were closely related to the physiological roles of this tissue, including 90 lincRNAs that were differentially expressed after the functional loss of key regulators of retinal ganglion cell (RGC) differentiation. In situ hybridization results demonstrated the enrichment of three lincRNAs adjacent to class IV POU-homeobox genes (linc-3a, linc-3b and linc-3c) in the ganglion cell layer and indicated they were potentially RGC-specific. In summary, this study systematically annotated the lincRNAs expressed in embryonic and neonatal mouse retinas and implied their crucial regulatory roles in retinal development such as RGC differentiation.


April 21, 2020

A genomic resource derived from the integration of genome sequences, expressed transcripts and genetic markers in ramie.

The redundancy of genomic resources, including transcripts and molecular markers, and their uncertain position in the genome have dramatically hindered the study of traits in ramie, an important natural fiber crop. We obtained a high-quality transcriptome consisting of 30,591 non-redundant transcripts using single-molecule long-read sequencing and proposed it as a universal ramie transcriptome. Additionally, 55,882 single nucleotide polymorphisms (SNPs) were identified and a high-density genetic map was developed. Based on this genetic map, 181.7 Mb of ramie genome sequences were assembled into 14 chromosomes. For the convenient use of these resources, 29,286 (~95.7%) of the transcripts and all 55,882 SNPs, along with 1,827 previously reported simple sequence repeat markers (SSRs), were mapped into the ramie genome, and 22,343 (~73.0%) transcripts, 50,154 (~89.7%) SNPs, and 1,466 (~80.3%) SSRs were assigned to a specific location in the corresponding chromosome. This is the first study to characterize the ramie transcriptome by long-read sequencing, and the substantial number of transcripts of significant length obtained will accelerate our understanding of ramie growth and development. This integration of genome sequences, expressed transcripts, and genetic markers will provide an extremely useful resource for genetic, molecular, and breeding studies of ramie.


April 21, 2020

Chromosome conformation capture resolved near complete genome assembly of broomcorn millet.

Broomcorn millet (Panicum miliaceum L.) has strong tolerance to abiotic stresses, and is probably one of the oldest crops, with its earliest cultivation dating back ~10,000 years. We report here its genome assembly through a combination of PacBio sequencing, BioNano, and Hi-C (in vivo) mapping. The 18 super scaffolds cover ~95.6% of the estimated genome (~887.8 Mb). There are 63,671 protein-coding genes annotated in this tetraploid genome. About 86.2% of the syntenic genes in foxtail millet have two homologous copies in broomcorn millet, indicating rare gene loss after tetraploidization in broomcorn millet. Phylogenetic analysis reveals that broomcorn millet and foxtail millet diverged ~13.1 million years ago (Mya), while the lineage-specific tetraploidization of broomcorn millet may have happened within ~5.91 million years. The genome is not only beneficial for the genome-assisted breeding of broomcorn millet, but also an important resource for other Panicum species.

