September 22, 2019

Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing.

Zea mays is an important genetic model for elucidating transcriptional networks. Uncertainties about the complete structure of mRNA transcripts limit the progress of research in this system. Here, using single-molecule sequencing technology, we produce 111,151 transcripts from 6 tissues capturing ~70% of the genes annotated in maize RefGen_v3 genome. A large proportion of transcripts (57%) represent novel, sometimes tissue-specific, isoforms of known genes and 3% correspond to novel gene loci. In other cases, the identified transcripts have improved existing gene models. Averaging across all six tissues, 90% of the splice junctions are supported by short reads from matched tissues. In addition, we identified a large number of novel long non-coding RNAs and fusion transcripts and found that DNA methylation plays an important role in generating various isoforms. Our results show that characterization of the maize B73 transcriptome is far from complete, and that maize gene expression is more complex than previously thought.


September 22, 2019

Packaging of Dinoroseobacter shibae DNA into gene transfer agent particles is not random.

Gene transfer agents (GTAs) are phage-like particles which contain a fragment of genomic DNA of the bacterial or archaeal producer and deliver this to a recipient cell. GTA gene clusters are present in the genomes of almost all marine Rhodobacteraceae (Roseobacters) and might be important contributors to horizontal gene transfer in the world’s oceans. For all organisms studied so far, no obvious evidence of sequence specificity or other nonrandom process responsible for packaging genomic DNA into GTAs has been found. Here, we show that knock-out of an autoinducer synthase gene of Dinoroseobacter shibae resulted in overproduction and release of functional GTA particles (DsGTA). Next-generation sequencing of the 4.2-kb DNA fragments isolated from DsGTAs revealed that packaging was not random. DNA from low-GC conjugative plasmids but not from high-GC chromids was excluded from packaging. Seven chromosomal regions were strongly overrepresented in DNA isolated from DsGTA. These packaging peaks lacked identifiable conserved sequence motifs that might represent recognition sites for the GTA terminase complex. Low-GC regions of the chromosome, including the origin and terminus of replication, were underrepresented in DNA isolated from DsGTAs. DNA methylation reduced packaging frequency while the level of gene expression had no influence. Chromosomal regions found to be over- and underrepresented in DsGTA-DNA were regularly spaced. We propose that a “headful” type of packaging is initiated at the sites of coverage peaks and, after linearization of the chromosomal DNA, proceeds in both directions from the initiation site. GC-content, DNA-modifications, and chromatin structure might influence at which sites GTA packaging can be initiated. © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


September 22, 2019

Identification of candidate genes at the Dp-fl locus conferring resistance against the rosy apple aphid Dysaphis plantaginea

The cultivated apple is susceptible to several pests including the rosy apple aphid (RAA; Dysaphis plantaginea Passerini), control of which is mainly based on chemical treatments. A few cases of resistance to aphids have been described in apple germplasm resources, laying the basis for the development of new resistant cultivars by breeding. The cultivar ‘Florina’ is resistant to RAA, and recently, the Dp-fl locus responsible for its resistance was mapped on linkage group 8 of the apple genome. In this paper, a chromosome walking approach was performed by using a ‘Florina’ bacterial artificial chromosome (BAC) library. The walking started from the available tightly linked molecular markers flanking the resistance region. Various walking steps were performed in order to identify the minimum tiling path of BAC clones covering the Dp-fl region from both the “resistant” and “susceptible” chromosomes of ‘Florina’. A genomic region of about 279 Kb encompassing the Dp-fl resistance locus was fully sequenced by the PacBio technology. Through the development of new polymorphic markers, the mapping interval around the resistance locus was narrowed down to a physical region of 95 Kb. The annotation of this sequence resulted in the identification of four candidate genes putatively involved in the RAA resistance response.


September 22, 2019

The genome of the Hi5 germ cell line from Trichoplusia ni, an agricultural pest and novel model for small RNA biology.

We report a draft assembly of the genome of Hi5 cells from the lepidopteran insect pest, Trichoplusia ni, assigning 90.6% of bases to one of 28 chromosomes and predicting 14,037 protein-coding genes. Chemoreception and detoxification gene families reveal T. ni-specific gene expansions that may explain its widespread distribution and rapid adaptation to insecticides. Transcriptome and small RNA data from thorax, ovary, testis, and the germline-derived Hi5 cell line show distinct expression profiles for 295 microRNA- and >393 piRNA-producing loci, as well as 39 genes encoding small RNA pathway proteins. Nearly all of the W chromosome is devoted to piRNA production, and T. ni siRNAs are not 2´-O-methylated. To enable use of Hi5 cells as a model system, we have established genome editing and single-cell cloning protocols. The T. ni genome provides insights into pest control and allows Hi5 cells to become a new tool for studying small RNAs ex vivo. © 2018, Fu et al.


September 22, 2019

Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants.

Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated. De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed linkage disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups. We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. The pattern of linkage disequilibrium within PAVs strikingly differs from that of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.


September 22, 2019

Comparative genome and methylome analysis reveals restriction/modification system diversity in the gut commensal Bifidobacterium breve.

Bifidobacterium breve represents one of the most abundant bifidobacterial species in the gastro-intestinal tract of breast-fed infants, where its presence is believed to exert beneficial effects. In the present study whole genome sequencing, employing the PacBio Single Molecule, Real-Time (SMRT) sequencing platform, combined with comparative genome analysis allowed the most extensive genetic investigation of this taxon. Our findings demonstrate that genes encoding Restriction/Modification (R/M) systems constitute a substantial part of the B. breve variable gene content (or variome). Using the methylome data generated by SMRT sequencing, combined with targeted Illumina bisulfite sequencing (BS-seq) and comparative genome analysis, we were able to detect methylation recognition motifs and assign these to identified B. breve R/M systems, where in several cases such assignments were confirmed by restriction analysis. Furthermore, we show that R/M systems typically impose a very significant barrier to genetic accessibility of B. breve strains, and that cloning of a methyltransferase-encoding gene may overcome such a barrier, thus allowing future functional investigations of members of this species.


September 22, 2019

The DNA methylome of the hyperthermoacidophilic crenarchaeon Sulfolobus acidocaldarius.

DNA methylation is the most common epigenetic modification observed in the genomic DNA (gDNA) of prokaryotes and eukaryotes. Methylated nucleobases, N6-methyl-adenine (m6A), N4-methyl-cytosine (m4C), and 5-methyl-cytosine (m5C), detected on gDNA represent the discrimination mark between self and non-self DNA when they are part of restriction-modification systems in prokaryotes (Bacteria and Archaea). In addition, m5C in Eukaryotes and m6A in Bacteria play an important role in the regulation of key cellular processes. Although archaeal genomes present modified bases as in the two other domains of life, the significance of DNA methylations as regulatory mechanisms remains largely uncharacterized in Archaea. Here, we began by investigating the DNA methylome of Sulfolobus acidocaldarius. The strategy behind this initial study entailed the use of combined digestion assays, dot blots, and genome resequencing, which utilize specific restriction enzymes, antibodies specifically raised against m6A and m5C, and single-molecule real-time (SMRT) sequencing, respectively, to identify DNA methylations occurring in exponentially growing cells. The previously identified restriction-modification system, specific to S. acidocaldarius, was confirmed by digestion assay and SMRT sequencing, while the presence of m6A was revealed by dot blot and identified on the characteristic Dam motif by SMRT sequencing. No m5C was detected by dot blot under the conditions tested. Furthermore, by comparing the distribution of both detected methylations along the genome and by analyzing DNA methylation profiles in synchronized cells, we investigated in which cellular pathways, in particular the cell cycle, this m6A methylation could be a key player. The analysis of sequencing data rejected a role for m6A methylation in another defense system and also raised new questions about a potential involvement of this modification in the regulation of other biological functions in S. acidocaldarius.


September 22, 2019

Culture-facilitated comparative genomics of the facultative symbiont Hamiltonella defensa.

Many insects host facultative, bacterial symbionts that confer conditional fitness benefits to their hosts. Hamiltonella defensa is a common facultative symbiont of aphids that provides protection against parasitoid wasps. Protection levels vary among strains of H. defensa that are also differentially infected by bacteriophages named APSEs. However, little is known about trait variation among strains because only one isolate has been fully sequenced. Generating complete genomes for facultative symbionts is hindered by relatively large genome sizes but low abundances in hosts like aphids that are very small. Here, we took advantage of methods for culturing H. defensa outside of aphids to generate complete genomes and transcriptome data for four strains of H. defensa from the pea aphid Acyrthosiphon pisum. Chosen strains also spanned the breadth of the H. defensa phylogeny and differed in strength of protection conferred against parasitoids. Results indicated that strains shared most genes with roles in nutrient acquisition, metabolism, and essential housekeeping functions. In contrast, the inventory of mobile genetic elements varied substantially, which generated strain specific differences in gene content and genome architecture. In some cases, specific traits correlated with differences in protection against parasitoids, but in others high variation between strains obscured identification of traits with likely roles in defense. Transcriptome data generated continuous distributions to genome assemblies with some genes that were highly expressed and others that were not. Single molecule real-time sequencing further identified differences in DNA methylation patterns and restriction modification systems that provide defense against phage infection.


September 22, 2019

Analyzing AbrB-knockout effects through genome and transcriptome sequencing of Bacillus licheniformis DW2.

As an industrial bacterium, Bacillus licheniformis DW2 produces bacitracin, an important antibiotic against many pathogenic microorganisms. Our previous study showed that AbrB knockout could significantly increase the production of bacitracin. Accordingly, it was meaningful to understand its genome features, the expression differences between wild-type and AbrB-knockout (ΔAbrB) strains, and the regulation of bacitracin biosynthesis. Here, we sequenced, de novo assembled and annotated its genome, and also sequenced the transcriptomes in three growth phases. The genome of DW2 contained a DNA molecule of 4,468,952 bp with 45.93% GC content and 4,717 protein coding genes. The transcriptome reads were mapped to the assembled genome, yielding 4,102 to 4,536 expressed genes across samples. We investigated transcription changes in B. licheniformis DW2 and showed that ΔAbrB caused up-regulation and down-regulation of hundreds of genes in different growth phases. We identified a complete bacitracin synthetase gene cluster, including the location and length of bacABC, bcrABC, and bacT, as well as their arrangement. The gene cluster bcrABC was significantly up-regulated in the ΔAbrB strain, which supported the hypothesis from a previous study that bcrABC transports bacitracin out of the cell to avoid self-intoxication, and was consistent with the previous experimental result that ΔAbrB could yield more bacitracin. This study provided a high quality reference genome for B. licheniformis DW2, and the transcriptome data, which depicted global alterations across two strains and three phases, offered an understanding of AbrB regulation and bacitracin biosynthesis through gene expression.


September 22, 2019

Characterizing the DNA methyltransferases of Haloferax volcanii via bioinformatics, gene deletion, and SMRT Sequencing.

DNA methyltransferases (MTases), which catalyze the methylation of adenine and cytosine bases in DNA, can occur in bacteria and archaea alongside cognate restriction endonucleases (REases) in restriction-modification (RM) systems or independently as orphan MTases. Although DNA methylation and MTases have been well-characterized in bacteria, research into archaeal MTases has been limited. A previous study examined the genomic DNA methylation patterns (methylome) of the halophilic archaeon Haloferax volcanii, a model archaeal system which can be easily manipulated in laboratory settings, via single-molecule real-time (SMRT) sequencing and deletion of a putative MTase gene (HVO_A0006). In this follow-up study, we deleted other putative MTase genes in H. volcanii and sequenced the methylomes of the resulting deletion mutants via SMRT sequencing to characterize the genes responsible for DNA methylation. The results indicate that deletion of putative RM genes HVO_0794, HVO_A0006, and HVO_A0237 in a single strain abolished methylation of the sole cytosine motif in the genome (Cm4TAG). Amino acid alignments demonstrated that HVO_0794 shares homology with characterized cytosine CTAG MTases in other organisms, indicating that this MTase is responsible for Cm4TAG methylation in H. volcanii. The CTAG motif has high density at only one of the origins of replication, and there is no relative increase in CTAG motif frequency in the genome of H. volcanii, indicating that CTAG methylation might not have effectively taken over the role of regulating DNA replication and mismatch repair in the organism as previously predicted. Deletion of the putative Type I RM operon rmeRMS (HVO_2269-2271) resulted in abolished methylation of the adenine motif in the genome (GCAm6BN6VTGC). Alignments of the MTase (HVO_2270) and site specificity subunit (HVO_2271) demonstrate homology with other characterized Type I MTases and site specificity subunits, indicating that the rmeRMS operon is responsible for adenine methylation in H. volcanii. Together with HVO_0794, these genes appear to be responsible for all detected methylation in H. volcanii, even though other putative MTases (HVO_C0040, HVO_A0079) share homology with characterized MTases in other organisms. We also report the construction of a multi-RM deletion mutant (ΔRM), with multiple RM genes deleted and with no methylation detected via SMRT sequencing, which we anticipate will be useful for future studies on DNA methylation in H. volcanii.


September 22, 2019

N4-cytosine DNA methylation regulates transcription and pathogenesis in Helicobacter pylori.

Many bacterial genomes exclusively display an N4-methyl cytosine base (m4C), whose physiological significance is not yet clear. Helicobacter pylori is a carcinogenic bacterium and the leading cause of gastric cancer in humans. Helicobacter pylori strain 26695 harbors a single m4C cytosine methyltransferase, M2.HpyAII, which recognizes the 5′ TCTTC 3′ sequence and methylates the first cytosine residue. To understand the role of m4C modification, an M2.hpyAII deletion strain was constructed. The deletion strain displayed lower adherence to host AGS cells and reduced potential to induce inflammation and apoptosis. The M2.hpyAII gene deletion strain exhibited reduced capacity for natural transformation, which was rescued in the complemented strain carrying an active copy of the M2.hpyAII gene in the genome. Genome-wide gene expression and proteomic analysis were carried out to discern the possible reasons behind the altered phenotype of the M2.hpyAII gene deletion strain. Upon the loss of m4C modification, a total of 102 genes belonging to virulence, ribosome assembly and cellular components were differentially expressed. The present study adds a functional role for the presence of m4C modification in H. pylori and provides the first evidence that the m4C signal acts as a global epigenetic regulator in H. pylori.
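
As an illustration only, the target sites of a methyltransferase with this specificity can be located by scanning both strands for the recognition sequence. The sketch below is not the authors' pipeline: the function names, the toy sequence, and the 0-based coordinate convention are assumptions made for the example. It reports the position of the first cytosine of each 5′ TCTTC 3′ occurrence, using forward-strand coordinates for hits on either strand.

```python
def revcomp(seq):
    """Reverse complement of an uppercase DNA string (A/C/G/T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def methylation_targets(genome, motif="TCTTC", offset=1):
    """Return sorted (position, strand) pairs for the targeted base.

    `offset` is the 0-based index of the methylated base within the motif
    (1 = the first cytosine of TCTTC). Positions are 0-based forward-strand
    coordinates; for '-' hits the position is the forward-strand base that
    pairs with the methylated cytosine.
    """
    sites = []
    for strand, pattern in (("+", motif), ("-", revcomp(motif))):
        start = genome.find(pattern)
        while start != -1:
            if strand == "+":
                pos = start + offset
            else:
                # The motif is read right-to-left on the forward strand.
                pos = start + len(motif) - 1 - offset
            sites.append((pos, strand))
            start = genome.find(pattern, start + 1)  # allow overlapping hits
    return sorted(sites)


# Hypothetical toy sequence with one site on each strand.
toy = "AATCTTCGGGAAGATT"
print(methylation_targets(toy))
```

On a real genome the sequence would be read from a FASTA file, and circularity of the chromosome (motifs spanning the origin) would need separate handling.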


September 22, 2019

Assimilation of cyanide and cyano-derivatives by Pseudomonas pseudoalcaligenes CECT5344: from omic approaches to biotechnological applications.

Mining, jewellery and metal-processing industries use cyanide for extracting gold and other valuable metals, generating large amounts of highly toxic wastewater. From an environmental point of view, biological treatments may be a clean alternative to the conventional physical or chemical processes used to remove cyanide and related compounds from these industrial effluents. Pseudomonas pseudoalcaligenes CECT5344 can grow under alkaline conditions using cyanide, cyanate or different nitriles as the sole nitrogen source, and is able to remove up to 12 mM total cyanide from a jewellery industry wastewater that contains both free cyanide and cyanide complexed to metals. Complete genome sequencing of this bacterium has allowed the application of transcriptomic and proteomic techniques, providing a holistic view of the cyanide biodegradation process. The complex response to cyanide by the cyanotrophic bacterium P. pseudoalcaligenes CECT5344 and the potential biotechnological applications of this model organism in the bioremediation of cyanide-containing industrial residues are reviewed.


September 22, 2019

A reference genome and methylome for the Plasmodium knowlesi A1-H.1 line.

Plasmodium knowlesi, a common parasite of macaques, is recognised as a significant cause of human malaria in Malaysia. The P. knowlesi A1-H.1 line has been adapted to continuous culture in human erythrocytes, successfully providing an in vitro model to study the parasite. We have assembled a reference genome for the PkA1-H.1 line using PacBio long-read combined with Illumina short-read sequence data. Compared with the H-strain reference, the new reference has improved genome coverage and a novel description of methylation sites. The PkA1-H.1 reference will enhance the capabilities of the in vitro model to improve the understanding of P. knowlesi infection in humans. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.


September 22, 2019

A survey of Type III restriction-modification systems reveals numerous, novel epigenetic regulators controlling phase-variable regulons; phasevarions.

Many bacteria utilize simple DNA sequence repeats as a mechanism to randomly switch genes on and off. This process is called phase variation. Several phase-variable N6-adenine DNA-methyltransferases from Type III restriction-modification systems have been reported in bacterial pathogens. Random switching of DNA methyltransferases changes the global DNA methylation pattern, leading to changes in gene expression. These epigenetic regulatory systems are called phasevarions – phase-variable regulons. The extent of these phase-variable genes in the bacterial kingdom is unknown. Here, we interrogated a database of restriction-modification systems, REBASE, by searching for all simple DNA sequence repeats in mod genes that encode Type III N6-adenine DNA-methyltransferases. We report that 17.4% of Type III mod genes (662/3805) contain simple sequence repeats. Of these, only one-fifth have been previously identified. The newly discovered examples are widely distributed and include many examples in opportunistic pathogens as well as in environmental species. In many cases, multiple phasevarions exist in one genome, with examples of up to 4 independent phasevarions in some species. We found several new types of phase-variable mod genes, including the first example of a phase-variable methyltransferase in pathogenic Escherichia coli. Phasevarions are a common epigenetic regulation contingency strategy used by both pathogenic and non-pathogenic bacteria.
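
A search for simple sequence repeats of the kind described above can be sketched with a back-referencing regular expression. This is a minimal illustration, not the method used in the survey: the function name, the unit-length limit, and the copy-number threshold are assumptions chosen for the example, and the toy gene sequence is invented.

```python
import re


def find_simple_repeats(seq, max_unit=5, min_copies=4):
    """Return (start, unit, copies) for each tandem repeat in `seq`.

    A repeat is a unit of 1..max_unit bases occurring at least `min_copies`
    times back to back. The lazy quantifier makes the regex try the shortest
    unit first. `start` is 0-based. Thresholds are illustrative assumptions.
    """
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_copies - 1))
    repeats = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        repeats.append((match.start(), unit, copies))
    return repeats


# Hypothetical gene fragment carrying six copies of an AGTC tetranucleotide.
toy_gene = "ATGA" + "AGTC" * 6 + "TTGACA"
print(find_simple_repeats(toy_gene))

# The proportion reported in the abstract, recomputed from its own counts.
print(round(100 * 662 / 3805, 1))
```

Because `finditer` returns non-overlapping matches, a repeat that overlaps an earlier hit is not reported; a production scan would also need to decide how to treat homopolymer tracts and imperfect repeats.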


September 22, 2019

Epigenetic landscape influences the liver cancer genome architecture.

The accumulation of different types of genetic alterations, such as nucleotide substitutions, structural rearrangements and viral genome integrations, together with epigenetic alterations, contributes to carcinogenesis. Here, we report a correlation between the occurrence of epigenetic features and genetic aberrations by whole-genome bisulfite, whole-genome shotgun, long-read, and virus capture sequencing of 373 liver cancers. Somatic substitutions and rearrangement breakpoints are enriched in tumor-specific hypo-methylated regions with inactive chromatin marks and actively transcribed highly methylated regions in the cancer genome. Individual mutation signatures depend on chromatin status; in particular, signatures with a higher transcriptional strand bias occur within active chromatin areas. Hepatitis B virus (HBV) integration sites are frequently detected within inactive chromatin regions in cancer cells, as a consequence of negative selection for integrations in active chromatin regions. Ultra-high structural instability and preserved unmethylation of integrated HBV genomes are observed. We conclude that both precancerous and somatic epigenetic features contribute to the cancer genome architecture.

