September 22, 2019  |  

The dynamic landscape of fission yeast meiosis alternative-splice isoforms.

Alternative splicing increases the diversity of transcriptomes and proteomes in metazoans. The extent to which alternative splicing is active and functional in unicellular organisms is less understood. Here, we exploit a single-molecule long-read sequencing technique and develop an open-source software program called SpliceHunter to characterize the transcriptome in the meiosis of fission yeast. We reveal 14,353 alternative splicing events in 17,669 novel isoforms at different stages of meiosis, including antisense and read-through transcripts. Intron retention is the major type of alternative splicing, followed by alternate “intron in exon.” Seven hundred seventy novel transcription units are detected; 53 of the predicted proteins show homology in other species and form theoretical stable structures. We report the complexity of alternative splicing along isoforms, including 683 intra-molecularly co-associated intron pairs. We compare the dynamics of novel isoforms based on the number of supporting full-length reads with those of annotated isoforms and explore the translational capacity and quality of novel isoforms. The evaluation of these factors indicates that the majority of novel isoforms are unlikely to be both condition-specific and translatable but consistent with the possibility of biologically functional novel isoforms. Moreover, the co-option of these unusual transcripts into newly born genes seems likely. Together, the results of this study highlight the diversity and dynamics at the isoform level in the sexual development of fission yeast. © 2017 Kuang et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019  |  

The small peptide world in long noncoding RNAs.

Long noncoding RNAs (lncRNAs) are a group of transcripts that are longer than 200 nucleotides (nt) without coding potential. Over the past decade, tens of thousands of novel lncRNAs have been annotated in animal and plant genomes because of advanced high-throughput RNA sequencing technologies and with the aid of coding transcript classifiers. Further, a considerable number of reports have revealed the existence of stable, functional small peptides (also known as micropeptides), translated from lncRNAs. In this review, we discuss the methods of lncRNA classification, the investigations regarding their coding potential and the functional significance of the peptides they encode.


September 22, 2019  |  

Long reads: their purpose and place.

In recent years long-read technologies have moved from being a niche and specialist field to a point of relative maturity likely to feature frequently in the genomic landscape. Analogous to next generation sequencing, the cost of sequencing using long-read technologies has materially dropped whilst the instrument throughput continues to increase. Together these changes present the prospect of sequencing large numbers of individuals with the aim of fully characterizing genomes at high resolution. In this article, we will endeavour to present an introduction to long-read technologies showing: what long reads are; how they are distinct from short reads; why long reads are useful and how they are being used. We will highlight the recent developments in this field, and the applications and potential of these technologies in medical research, and clinical diagnostics and therapeutics.


September 22, 2019  |  

Towards long-read metagenomics: complete assembly of three novel genomes from bacteria dependent on a diazotrophic cyanobacterium in a freshwater lake co-culture.

Here we report three complete bacterial genome assemblies from a PacBio shotgun metagenome of a co-culture from Upper Klamath Lake, OR. Genome annotations and culture conditions indicate these bacteria are dependent on carbon and nitrogen fixation from the cyanobacterium Aphanizomenon flos-aquae, whose genome was assembled to draft-quality. Due to their taxonomic novelty relative to previously sequenced bacteria, we have temporarily designated these bacteria as incertae sedis Hyphomonadaceae strain UKL13-1 (3,501,508 bp and 56.12% GC), incertae sedis Betaproteobacterium strain UKL13-2 (3,387,087 bp and 54.98% GC), and incertae sedis Bacteroidetes strain UKL13-3 (3,236,529 bp and 37.33% GC). Each genome consists of a single circular chromosome with no identified plasmids. When compared with binned Illumina assemblies of the same three genomes, there was ~7% discrepancy in total genome length. Gaps where Illumina assemblies broke were often due to repetitive elements. Within these missing sequences were essential genes and genes associated with a variety of functional categories. Annotated gene content reveals that both Proteobacteria are aerobic anoxygenic phototrophs, with Betaproteobacterium UKL13-2 potentially capable of phototrophic oxidation of sulfur compounds. Both proteobacterial genomes contain transporters suggesting they are scavenging fixed nitrogen from A. flos-aquae in the form of ammonium. Bacteroidetes UKL13-3 has few completely annotated biosynthetic pathways, and has a comparatively higher proportion of unannotated genes. The genomes were detected in only a few other freshwater metagenomes, suggesting that these bacteria are not ubiquitous in freshwater systems. Our results indicate that long-read sequencing is a viable method for sequencing dominant members from low-diversity microbial communities, and should be considered for environmental metagenomics when conditions meet these requirements.


September 22, 2019  |  

Improved high-quality genome assembly and annotation of Tibetan hulless barley

Background: The Tibetan hulless barley (Hordeum vulgare L. var. nudum), also called “Qingke” in Chinese and “Ne” in Tibetan, is the staple food for Tibetans and an important livestock feed on the Tibetan Plateau. Tibetan hulless barley has about 3500 years of cultivation history in China and is mainly produced in Tibet, Qinghai, Sichuan, Yunnan and other areas. In addition, Tibetan hulless barley has rich nutritional value and outstanding health effects, with contents of beta-glucan, dietary fiber, amylopectin and trace elements that are higher than in any other cereal crop. Findings: Here, we report an improved high-quality assembly of the Tibetan hulless barley genome, 4.0 Gb in size. We employed the FALCON assembly package together with scaffolding and error-correction tools, using PacBio long-read sequencing technology, and obtained contig and scaffold N50 lengths of 1.563 Mb and 4.006 Mb, respectively, making the assembly nearly two orders of magnitude more contiguous than the original Tibetan hulless barley genome. We also re-annotated the new assembly and report 61,303 stringent, confident putative protein-coding genes, of which 40,457 are HC genes. We have developed a new Tibetan hulless barley genome database (THBGD) that is user-friendly for download and use and helps to better manage information on Tibetan hulless barley genetic resources. Conclusions: The availability of the new Tibetan hulless barley genome and annotations will take the genetics of Tibetan hulless barley to a new level and will greatly simplify breeders' efforts. It will also enrich the granary of the Tibetan people. Abbreviations: BLAST, Basic Local Alignment Search Tool; BUSCO, Benchmarking Universal Single-Copy Orthologs; QV, quality value; PacBio, Pacific Biosciences; RNA-seq, RNA sequencing; NGS, next generation sequencing; TGS, third generation sequencing; THBGD, Tibetan hulless barley Genome Database.
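
The N50 statistic quoted above can be illustrated with a short sketch. This is only the conventional definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length), not the authors' pipeline; the function name `n50` and the example lengths are hypothetical.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Conventional definition (an assumption here, the abstract does not
    spell it out): sort lengths from longest to shortest and return the
    length at which the running total first reaches half of the summed
    length of all sequences.
    """
    lengths = list(lengths)
    if not lengths:
        raise ValueError("at least one sequence length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running total to avoid fractional halves.
        if 2 * running >= total:
            return length


if __name__ == "__main__":
    # Hypothetical toy assembly, lengths in base pairs.
    print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))
```

A higher N50 for the same total length means the assembly is split into fewer, longer pieces, which is the sense in which the abstract describes the new assembly as more contiguous.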


September 22, 2019  |  

Novel molecules lncRNAs, tRFs and circRNAs deciphered from next-generation sequencing/RNA sequencing: computational databases and tools.

Powerful next-generation sequencing (NGS) technologies, more specifically RNA sequencing (RNA-seq), have been pivotal toward the detection, analysis and hypothesis generation of novel biomolecules: long noncoding RNAs (lncRNAs), tRNA-derived fragments (tRFs) and circular RNAs (circRNAs). Experimental validation of the occurrence of these biomolecules inside the cell has been reported. Their differential expression and functionally important role in several cancer types as well as other diseases such as Alzheimer’s and cardiovascular diseases have garnered interest toward further studies in this research arena. In this review, starting from a brief relevant introduction to NGS and RNA-seq and the expression and role of lncRNAs, tRFs and circRNAs in cancer, we have comprehensively analyzed the current landscape of databases developed and computational software used for analysis and visualization in this emerging and highly interesting field of novel biomolecules. Our review will help end users and research investigators gain information on the existing databases and tools as well as an understanding of the specific features these offer. This will be useful for researchers in their proper usage, thereby guiding them toward novel hypothesis generation and saving the time and costs involved in extensive experimental processes for these three different novel functional RNAs. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.


September 22, 2019  |  

Transcriptome sequencing reveals thousands of novel long non-coding RNAs in B cell lymphoma.

Gene profiling of diffuse large B cell lymphoma (DLBCL) has revealed broad gene expression deregulation compared to normal B cells. While many studies have interrogated well known and annotated genes in DLBCL, none have yet performed a systematic analysis to uncover novel unannotated long non-coding RNAs (lncRNA) in DLBCL. In this study we sought to uncover these lncRNAs by examining RNA-seq data from primary DLBCL tumors and performed supporting analysis to identify potential roles of these lncRNAs in DLBCL. We performed a systematic analysis of novel lncRNAs from the poly-adenylated transcriptome of 116 primary DLBCL samples. RNA-seq data were processed using a de novo transcript assembly pipeline to discover novel lncRNAs in DLBCL. Systematic functional, mutational, cross-species, and co-expression analyses using numerous bioinformatics tools and statistical analysis were performed to characterize these novel lncRNAs. We identified 2,632 novel, multi-exonic lncRNAs expressed in more than one tumor, two-thirds of which are not expressed in normal B cells. Long read single molecule sequencing supports the splicing structure of many of these lncRNAs. More than one-third of novel lncRNAs are differentially expressed between the two major DLBCL subtypes, ABC and GCB. Novel lncRNAs are enriched at DLBCL super-enhancers, with a fraction of them conserved between human and dog lymphomas. We see transposable element (TE) overlap in the exonic regions, particularly significant in the last exon of the novel lncRNAs, suggesting potential usage of cryptic TE polyadenylation signals. We identified highly co-expressed protein coding genes for at least 88% of the novel lncRNAs. Functional enrichment analysis of co-expressed genes predicts a potential function for about half of novel lncRNAs. Finally, systematic structural analysis of candidate point mutations (SNVs) suggests that such mutations frequently stabilize lncRNA structures instead of destabilizing them. Discovery of these 2,632 novel lncRNAs in DLBCL significantly expands the lymphoma transcriptome and our analysis identifies potential roles of these lncRNAs in lymphomagenesis and/or tumor maintenance. For further studies, these novel lncRNAs also provide an abundant source of new targets for antisense oligonucleotide pharmacology, including shared targets between human and dog lymphomas.


September 22, 2019  |  

Long read reference genome-free reconstruction of a full-length transcriptome from Astragalus membranaceus reveals transcript variants involved in bioactive compound biosynthesis.

Astragalus membranaceus, also known as Huangqi in China, is one of the most widely used medicinal herbs in Traditional Chinese Medicine. Traditional Chinese Medicine formulations from Astragalus membranaceus have been used to treat a wide range of illnesses, such as cardiovascular disease, type 2 diabetes, nephritis and cancers. Pharmacological studies have shown that immunomodulating, anti-hyperglycemic, anti-inflammatory, antioxidant and antiviral activities exist in the extract of Astragalus membranaceus. Therefore, characterising the biosynthesis of bioactive compounds in Astragalus membranaceus, such as Astragalosides, Calycosin and Calycosin-7-O-ß-d-glucoside, is of particular importance for further genetic studies of Astragalus membranaceus. In this study, we reconstructed the Astragalus membranaceus full-length transcriptomes from leaf and root tissues using PacBio Iso-Seq long reads. We identified 27 975 and 22 343 full-length unique transcript models in each tissue respectively. Compared with previous studies that used short read sequencing, our reconstructed transcripts are longer, and are more likely to be full-length and include numerous transcript variants. Moreover, we also re-characterised and identified potential transcript variants of genes involved in Astragalosides, Calycosin and Calycosin-7-O-ß-d-glucoside biosynthesis. In conclusion, our study provides a practical pipeline to characterise the full-length transcriptome for species without a reference genome and a useful genomic resource for exploring the biosynthesis of active compounds in Astragalus membranaceus.


September 22, 2019  |  

Induced salt tolerance of perennial ryegrass by a novel bacterium strain from the rhizosphere of a desert shrub Haloxylon ammodendron.

Drought and soil salinity reduce agricultural output worldwide. Plant-growth-promoting rhizobacteria (PGPR) can enhance plant growth and augment plant tolerance to biotic and abiotic stresses. Haloxylon ammodendron, a C4 perennial succulent xerohalophyte shrub with excellent drought and salt tolerance, is naturally distributed in the desert area of northwest China. In our previous work, a bacterium strain numbered M30-35 was isolated from the rhizosphere of H. ammodendron in the Tengger desert, Gansu province, northwest China. In the current work, the effects of M30-35 inoculation on salt tolerance of perennial ryegrass were evaluated and its genome was sequenced to identify genes associated with plant growth promotion. Results showed that M30-35 significantly enhanced growth and salt tolerance of perennial ryegrass by increasing shoot fresh and dry weights, chlorophyll content, root volume, root activity, leaf catalase activity, soluble sugar and proline contents that contributed to reduced osmotic potential, tissue K+ content and K+/Na+ ratio, while decreasing malondialdehyde (MDA) content and relative electric conductivity (REC), especially under higher salinity. The genome of M30-35 contains 4421 protein encoding genes, 12 rRNA, 63 tRNA-encoding genes and four rRNA operons. M30-35 was initially classified as a new species in Pseudomonas and named Pseudomonas sp. M30-35. Thirty-four genes showing homology to genes associated with PGPR traits and abiotic stress tolerance were identified in the Pseudomonas sp. M30-35 genome, including 12 related to insoluble phosphorus solubilization, four to auxin biosynthesis, four to other processes of growth promotion, seven to oxidative stress alleviation, four to salt and drought tolerance and three to cold and heat tolerance. Further study is needed to clarify the correlation between these genes from M30-35 and the salt stress alleviation of inoculated plants under salt stress. Overall, our research indicated that desert shrubs appear rich in PGPRs that can help important crops tolerate abiotic stress.


September 22, 2019  |  

Genetic separation of Listeria monocytogenes causing central nervous system infections in animals.

Listeria monocytogenes is a foodborne pathogen that causes abortion, septicemia, gastroenteritis and central nervous system (CNS) infections in ruminants and humans. L. monocytogenes strains mainly belong to two distinct phylogenetic groups, named lineages I and II. In general, clinical cases in humans and animals, in particular CNS infections, are caused by lineage I strains, while most of the environmental and food strains belong to lineage II. Little is known about why lineage I is more virulent than lineage II, even though various molecular factors and mechanisms associated with pathogenesis are known. In this study, we have used a variety of whole genome sequence analyses and comparative genomic tools in order to find characteristics that distinguish lineage I from lineage II strains and CNS infection strains from non-CNS strains. We analyzed 225 strains and identified single nucleotide variants between lineages I and II, as well as differences in the gene content. Using a novel approach based on Reads Per Kilobase per Million Mapped (RPKM), we identified 167 genes predominantly absent in lineage II but present in lineage I. These genes are mostly encoding for membrane-associated proteins. Additionally, we found 77 genes that are largely absent in the non-CNS associated strains, while 39 genes are especially lacking in our defined “non-clinical” group. Based on the RPKM analysis and the metadata linked to the L. monocytogenes strains, we identified 6 genes potentially associated with CNS cases, which include a transcriptional regulator, an ABC transporter and a non-coding RNA. Although there is not a clear separation between pathogenic and non-pathogenic strains based on phylogenetic lineages, the presence of the genes identified in our study reveals potential pathogenesis traits in ruminant L. monocytogenes strains. 
Ultimately, the differences that we have found in our study will help steer future studies in understanding the virulence mechanisms of the most pathogenic L. monocytogenes strains.
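
For readers unfamiliar with the RPKM measure mentioned above, the following minimal sketch shows the standard normalization: read count scaled by gene length in kilobases and by total mapped reads in millions. The abstract does not describe how the authors applied RPKM to infer gene presence or absence, so the function name `rpkm` and the example numbers are purely illustrative assumptions.

```python
def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads Per Kilobase per Million mapped reads (standard definition).

    RPKM = reads * 1e9 / (gene length in bp * total mapped reads),
    which is the same as dividing the read count by the gene length in
    kilobases and by the library size in millions of mapped reads.
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


if __name__ == "__main__":
    # Hypothetical example: 500 reads on a 2 kb gene, 10 million mapped reads.
    print(rpkm(500, 2000, 10_000_000))
```

Because the value is normalized for both gene length and sequencing depth, a gene with an RPKM near zero in one strain and a clearly positive RPKM in another is a candidate for being absent from the first, which is the kind of comparison the abstract alludes to.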


September 22, 2019  |  

Complete genome sequence and genomic characterization of Lactobacillus acidophilus LA1 (11869BP).

Our body has natural defense systems to protect against potentially harmful microbes, including the physical and chemical barriers of the intestinal epithelium (Corfield et al., 2000). The physical barrier of the intestinal epithelium protects the host against pathogenic microbes (Anderson et al., 1993), and the intestinal mucosa coated with mucus excretes pathogens from the intestinal tract (Corfield et al., 2000).


September 22, 2019  |  

Pseudomonas orientalis F9: A potent antagonist against phytopathogens with phytotoxic effect in the apple flower.

In light of public concerns over the use of pesticides and antibiotics in plant protection and the subsequent selection for spread of resistant bacteria in the environment, it is inevitable to broaden our knowledge about viable alternatives, such as natural antagonists and their mode of action. The genus Pseudomonas is known for its metabolic versatility and genetic plasticity, encompassing pathogens as well as antagonists. We characterized strain Pseudomonas orientalis F9, an isolate from apple flowers in a Swiss orchard, and determined its antagonistic activity against several phytopathogenic bacteria, in particular Erwinia amylovora, the causal agent of fire blight. P. orientalis F9 displayed antagonistic activity against a broad suite of phytopathogenic bacteria in the in vitro tests. The promising results from this analysis led to an ex vivo assay with E. amylovora CFBP1430Rif and P. orientalis F9 infected detached apple flowers. F9 diminished the fire blight pathogen in the flowers but also revealed phytotoxic traits. The experimental results were discussed in light of the complete genome sequence of F9, which revealed the strain to carry phenazine genes. Phenazines are known to contribute to antagonistic activity of bacterial strains against soil pathogens. When tested in the cress assay with Pythium ultimum as pathogen, F9 showed results comparable to the known antagonist P. protegens CHA0.


September 22, 2019  |  

Culture-facilitated comparative genomics of the facultative symbiont Hamiltonella defensa.

Many insects host facultative, bacterial symbionts that confer conditional fitness benefits to their hosts. Hamiltonella defensa is a common facultative symbiont of aphids that provides protection against parasitoid wasps. Protection levels vary among strains of H. defensa that are also differentially infected by bacteriophages named APSEs. However, little is known about trait variation among strains because only one isolate has been fully sequenced. Generating complete genomes for facultative symbionts is hindered by relatively large genome sizes but low abundances in hosts like aphids that are very small. Here, we took advantage of methods for culturing H. defensa outside of aphids to generate complete genomes and transcriptome data for four strains of H. defensa from the pea aphid Acyrthosiphon pisum. Chosen strains also spanned the breadth of the H. defensa phylogeny and differed in strength of protection conferred against parasitoids. Results indicated that strains shared most genes with roles in nutrient acquisition, metabolism, and essential housekeeping functions. In contrast, the inventory of mobile genetic elements varied substantially, which generated strain specific differences in gene content and genome architecture. In some cases, specific traits correlated with differences in protection against parasitoids, but in others high variation between strains obscured identification of traits with likely roles in defense. Transcriptome data generated continuous distributions to genome assemblies with some genes that were highly expressed and others that were not. Single molecule real-time sequencing further identified differences in DNA methylation patterns and restriction modification systems that provide defense against phage infection.


September 22, 2019  |  

Comparative genomics of the Baltic Sea toxic cyanobacteria Nodularia spumigena UHCC 0039 and its response to varying salinity.

Salinity is an important abiotic factor controlling the distribution and abundance of Nodularia spumigena, the dominating diazotrophic and toxic phototroph, in the brackish water cyanobacterial blooms of the Baltic Sea. To expand the available genomic information for brackish water cyanobacteria, we sequenced the isolate Nodularia spumigena UHCC 0039 using an Illumina-SMRT hybrid sequencing approach, revealing a chromosome of 5,294,286 base pairs (bp) and a single plasmid of 92,326 bp. Comparative genomics in Nostocales showed pronounced genetic similarity among Nodularia spumigena strains evidencing their short evolutionary history. The studied Baltic Sea strains share similar sets of CRISPR-Cas cassettes and a higher number of insertion sequence (IS) elements compared to Nodularia spumigena CENA596 isolated from a shrimp production pond in Brazil. Nodularia spumigena UHCC 0039 proliferated similarly at three tested salinities, whereas the lack of salt inhibited its growth and triggered transcriptome remodeling, including the up-regulation of five sigma factors and the down-regulation of two other sigma factors, one of which is specific for strain UHCC 0039. Down-regulated genes additionally included a large genetic region for the synthesis of two yet unidentified natural products. Our results indicate a remarkable plasticity of the Nodularia salinity acclimation, and thus salinity strongly impacts the intensity and distribution of cyanobacterial blooms in the Baltic Sea.


September 22, 2019  |  

Repeat-driven generation of antigenic diversity in a major human pathogen, Trypanosoma cruzi

Trypanosoma cruzi, a zoonotic kinetoplastid protozoan with a complex genome, is the causative agent of American trypanosomiasis (Chagas disease). The parasite uses a highly diverse repertoire of surface molecules, with roles in cell invasion, immune evasion and pathogenesis. Thus far, the genomic regions containing these genes have been impossible to resolve and it has been impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here present an improved genome assembly of a T. cruzi clade I (TcI) strain using high coverage PacBio single molecule sequencing, together with Illumina sequencing of 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene structure reveals an unusual duality in the organisation of the parasite genome, a core genomic region syntenous with related protozoa flanked by unique and highly plastic subtelomeric regions encoding surface antigens. The presence of abundant interspersed retrotransposons in the subtelomeres suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.

