September 22, 2019

Identification of the DNA methyltransferases establishing the methylome of the cyanobacterium Synechocystis sp. PCC 6803.

DNA methylation in bacteria is important for defense against foreign DNA, but is also involved in DNA repair, replication, chromosome partitioning, and regulatory processes. Thus, characterization of the underlying DNA methyltransferases in genetically tractable bacteria is of paramount importance. Here, we characterized the methylome and orphan methyltransferases in the model cyanobacterium Synechocystis sp. PCC 6803. Single molecule real-time (SMRT) sequencing revealed four DNA methylation recognition sequences in addition to the previously known motif m5CGATCG, which is recognized by M.Ssp6803I. For three of the new recognition sequences, we identified the responsible methyltransferases: M.Ssp6803II, encoded by the sll0729 gene, modifies GGm4CC; M.Ssp6803III, encoded by slr1803, represents the cyanobacterial dam-like methyltransferase modifying Gm6ATC; and M.Ssp6803V, encoded by slr6095 on plasmid pSYSX, transfers methyl groups to the bipartite motif GGm6AN7TTGG/CCAm6AN7TCC. The remaining methylation recognition sequence GAm6AGGC is probably recognized by methyltransferase M.Ssp6803IV encoded by slr6050. M.Ssp6803III and M.Ssp6803IV were essential for the viability of Synechocystis, while the strains lacking M.Ssp6803I and M.Ssp6803V showed growth similar to the wild type. In contrast, growth of the Δsll0729 mutant lacking M.Ssp6803II was strongly diminished. These data provide the basis for systematic studies on the molecular mechanisms impacted by these methyltransferases.
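The recognition sequences named in this abstract can be located in a genome sequence with ordinary pattern matching. The following is a minimal sketch, not the authors' analysis: it strips the methylation marks (m5, m4, m6) from the motifs, expands N7 to seven arbitrary bases, and counts matches on one strand only. The names `MOTIFS`, `count_motif_sites` and `count_all` are hypothetical.

```python
import re

# Motifs from the abstract with methylation marks removed.
# N7 in the bipartite motif is expanded to seven arbitrary bases.
MOTIFS = {
    "CGATCG": "CGATCG",
    "GGCC": "GGCC",
    "GATC": "GATC",
    "GAAGGC": "GAAGGC",
    "GGAN7TTGG": "GGA[ACGT]{7}TTGG",
}


def count_motif_sites(sequence, pattern):
    """Count matches of pattern on the given strand, including overlapping ones."""
    # A lookahead consumes no characters, so overlapping sites are all found.
    lookahead = re.compile("(?=(" + pattern + "))")
    return sum(1 for _ in lookahead.finditer(sequence.upper()))


def count_all(sequence):
    """Return a dict mapping each motif name to its site count."""
    return {name: count_motif_sites(sequence, pat) for name, pat in MOTIFS.items()}
```

Calling `count_all` on a sequence string returns one count per motif. A real methylome analysis would also scan the reverse complement and compare the site counts with the fraction of sites detected as modified.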


September 22, 2019

Biology and genome of a newly discovered sibling species of Caenorhabditis elegans.

A ‘sibling’ species of the model organism Caenorhabditis elegans has long been sought for use in comparative analyses that would enable deep evolutionary interpretations of biological phenomena. Here, we describe the first sibling species of C. elegans, C. inopinata n. sp., isolated from fig syconia in Okinawa, Japan. We investigate the morphology, developmental processes and behaviour of C. inopinata, which differ significantly from those of C. elegans. The 123-Mb C. inopinata genome was sequenced and assembled into six nuclear chromosomes, allowing delineation of Caenorhabditis genome evolution and revealing unique characteristics, such as highly expanded transposable elements that might have contributed to the genome evolution of C. inopinata. In addition, C. inopinata exhibits massive gene losses in chemoreceptor gene families, which could be correlated with its limited habitat area. We have developed genetic and molecular techniques for C. inopinata; thus C. inopinata provides an exciting new platform for comparative evolutionary studies.


September 22, 2019

Genomic analysis of Sparus aurata reveals the evolutionary dynamics of sex-biased genes in a sequential hermaphrodite fish

Sexual dimorphism is a fascinating subject in evolutionary biology and mostly results from sex-biased expression of genes, which have been shown to evolve faster in gonochoristic species. We report here genome and sex-specific transcriptome sequencing of Sparus aurata, a sequential hermaphrodite fish. Evolutionary comparative analysis reveals that sex-biased genes in S. aurata are similar in number and function, but evolved following strikingly divergent patterns compared with gonochoristic species, showing overall slower rates because of stronger functional constraints. Fast evolution is observed only for highly ovary-biased genes due to female-specific patterns of selection that are related to the peculiar reproduction mode of S. aurata, first maturing as male, then as female. To our knowledge, these findings represent the first genome-wide analysis on sex-biased loci in a hermaphrodite vertebrate species, demonstrating how having two sexes in the same individual profoundly affects the fate of a large set of evolutionarily relevant genes.


September 22, 2019

Comparative genome analysis and evaluation of probiotic characteristics of Lactobacillus plantarum strain JDFM LP11.

In the current study, the probiotic potential of approximately 250 strains of lactic acid bacteria (LAB) isolated from piglet fecal samples was investigated; among them, Lactobacillus plantarum strain JDFM LP11 showed significant probiotic potential, with enhanced acid/bile tolerance, attachment to porcine intestinal epithelial cells (IPEC-J2), and antimicrobial activity. The genetic characteristics of strain JDFM LP11 were explored by performing whole genome sequencing (WGS) using a PacBio system. The circular draft genome has a total length of 3,206,883 bp, and a total of 3,021 coding sequences were identified. Phylogenetically, three genes, possibly related to survival and metabolic activity in the porcine host, were identified. These genes encode p60, lichenan permease IIC component, and protein TsgA, which are a putative endopeptidase, a component of the phosphotransferase system (PTS), and a major facilitator in the gut environment, respectively. Our findings suggest that understanding the functional and genetic characteristics of L. plantarum strain JDFM LP11, with its candidate genes for gut health, could provide new opportunities and insights into applications in the animal food and feed additive industries.


September 22, 2019

How long are long tandem repeats? A challenge for current methods of whole-genome sequence assembly: The case of satellites in Caenorhabditis elegans.

Repetitive genome regions have been difficult to sequence, mainly because of the comparatively small size of the fragments used in assembly. Satellites or tandem repeats are very abundant in nematodes and offer an excellent playground to evaluate different assembly methods. Here, we compare the structure of satellites found in three different assemblies of the Caenorhabditis elegans genome: the original sequence obtained by Sanger sequencing, an assembly based on PacBio technology, and an assembly using Nanopore sequencing reads. In general, satellites were found in equivalent genomic regions, but the new long-read methods (PacBio and Nanopore) tended to result in longer assembled satellites. Important differences exist between the assemblies resulting from the two long-read technologies, such as the sizes of long satellites. Our results also suggest that the lengths of some annotated genes with internal repeats which were assembled using Sanger sequencing are likely to be incorrect.
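One simple way to compare assembled satellite lengths across assemblies is to measure the longest uninterrupted array of a given repeat unit in each sequence. The sketch below assumes exact, perfect copies of the unit, which real satellites rarely are; the function name and the example sequences are hypothetical and not taken from the study.

```python
import re


def longest_tandem_array(sequence, unit):
    """Length in bases of the longest run of back-to-back exact copies of unit."""
    pattern = re.compile("(?:" + re.escape(unit) + ")+")
    return max((len(m.group(0)) for m in pattern.finditer(sequence)), default=0)


# Hypothetical fragments of the same locus from two assemblies.
assemblies = {
    "short_fragment_assembly": "AAGATGATCC",
    "long_read_assembly": "AAGATGATGATGATCC",
}
lengths = {name: longest_tandem_array(seq, "GAT") for name, seq in assemblies.items()}
```

Here `lengths` records a longer array for the hypothetical long-read fragment, which is the kind of difference the abstract describes.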


September 22, 2019

Genotype to phenotype: Diet-by-mitochondrial DNA haplotype interactions drive metabolic flexibility and organismal fitness.

Diet may be modified seasonally or by biogeographic, demographic or cultural shifts. It can differentially influence mitochondrial bioenergetics, retrograde signalling to the nuclear genome, and anterograde signalling to mitochondria. All these interactions have the potential to alter the frequencies of mtDNA haplotypes (mitotypes) in nature and may impact human health. In a model laboratory system, we fed four diets varying in Protein:Carbohydrate (P:C) ratio (1:2, 1:4, 1:8 and 1:16 P:C) to four homoplasmic Drosophila melanogaster mitotypes (nuclear genome standardised) and assayed their frequency in population cages. When fed a high protein 1:2 P:C diet, the frequency of flies harbouring Alstonville mtDNA increased. In contrast, when fed the high carbohydrate 1:16 P:C food the incidence of flies harbouring Dahomey mtDNA increased. This result, driven by differences in larval development, was generalisable to the replacement of the laboratory diet with fruits having high and low P:C ratios, perturbation of the nuclear genome and changes to the microbiome. Structural modelling and cellular assays suggested a V161L mutation in the ND4 subunit of complex I of Dahomey mtDNA was mildly deleterious, reduced mitochondrial functions, increased oxidative stress and resulted in an increase in larval development time on the 1:2 P:C diet. The 1:16 P:C diet triggered a cascade of changes in both mitotypes. In Dahomey larvae, increased feeding fuelled increased β-oxidation and the partial bypass of the complex I mutation. Conversely, Alstonville larvae upregulated genes involved with oxidative phosphorylation, increased glycogen metabolism and they were more physically active. We hypothesise that the increased physical activity diverted energy from growth and cell division and thereby slowed development. These data further question the use of mtDNA as an assumed neutral marker in evolutionary and population genetic studies. Moreover, if humans respond similarly, we posit that individuals with specific mtDNA variations may differentially metabolise carbohydrates, which has implications for a variety of diseases including cardiovascular disease, obesity, and perhaps Parkinson’s Disease.


September 22, 2019

Unexpected patterns of segregation distortion at a selfish supergene in the fire ant Solenopsis invicta.

The Sb supergene in the fire ant Solenopsis invicta determines the form of colony social organization, with colonies whose inhabitants bear the element containing multiple reproductive queens and colonies lacking it containing only a single queen. Several features of this supergene – including suppressed recombination, presence of deleterious mutations, association with a large centromere, and “green-beard” behavior – suggest that it may be a selfish genetic element that engages in transmission ratio distortion (TRD), defined as significant departures in progeny allele frequencies from Mendelian inheritance ratios. We tested this possibility by surveying segregation ratios in embryo progenies of 101 queens of the “polygyne” social form (3512 embryos) using three supergene-linked markers and twelve markers outside the supergene. Significant departures from Mendelian ratios were observed at the supergene loci in 3-5 times more progenies than expected in the absence of TRD and than found, on average, among non-supergene loci. Also, supergene loci displayed the greatest mean deviations from Mendelian ratios among all study loci, although these typically were modest. A surprising feature of the observed inter-progeny variation in TRD was that significant deviations involved not only excesses of supergene alleles but also similarly frequent excesses of the alternate alleles on the homologous chromosome. As expected given the common occurrence of such “drive reversal” in this system, alleles associated with the supergene gain no consistent transmission advantage over their alternate alleles at the population level. Finally, we observed low levels of recombination and incomplete gametic disequilibrium across the supergene, including between adjacent markers within a single inversion. Our data confirm the prediction that the Sb supergene is a selfish genetic element capable of biasing its own transmission during reproduction, yet counterselection for suppressor loci evidently has produced an evolutionary stalemate in TRD between the variant homologous haplotypes on the “social chromosome”. Evidence implicates prezygotic segregation distortion as responsible for the TRD we document, with “true” meiotic drive the most likely mechanism. Low levels of recombination and incomplete gametic disequilibrium across the supergene suggest that selection does not preserve a single uniform supergene haplotype responsible for inducing polygyny.
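A departure from Mendelian ratios in a single progeny can be assessed with an exact binomial test against the expected 1:1 segregation of a heterozygous queen's alleles. The sketch below is one common way to do this and is not necessarily the test the authors used; the function name is hypothetical.

```python
from math import comb


def binomial_two_sided_p(successes, trials, p=0.5):
    """Exact two-sided binomial p-value.

    Sums the probabilities of all outcomes that are no more likely
    than the observed count under the null proportion p.
    """
    pmf = [comb(trials, k) * p**k * (1 - p) ** (trials - k) for k in range(trials + 1)]
    observed = pmf[successes]
    # A small relative tolerance guards against floating-point ties.
    total = sum(x for x in pmf if x <= observed * (1 + 1e-9))
    return min(1.0, total)
```

For a progeny of 10 embryos in which none carries the supergene allele, `binomial_two_sided_p(0, 10)` gives 2/1024. Because many progenies are tested, the number of significant results has to be compared with the number expected by chance, which is the comparison the abstract reports.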


September 22, 2019

N6-methyladenine DNA modification in Xanthomonas oryzae pv. oryzicola genome.

DNA N6-methyladenine (6mA) modifications expand the information capacity of DNA and have long been known to exist in bacterial genomes. Xanthomonas oryzae pv. oryzicola (Xoc) is the causative agent of bacterial leaf streak, an emerging and destructive disease in rice worldwide. However, the genome-wide distribution patterns and potential functions of 6mA in Xoc are largely unknown. In this study, we analyzed the levels and global distribution patterns of 6mA modification in genomic DNA of seven Xoc strains (BLS256, BLS279, CFBP2286, CFBP7331, CFBP7341, L8 and RS105). The 6mA modification was found to be widely distributed across the seven Xoc genomes, accounting for 3.80%, 3.10%, 3.70%, 4.20%, 3.40%, 2.10%, and 3.10% of the total adenines in BLS256, BLS279, CFBP2286, CFBP7331, CFBP7341, L8, and RS105, respectively. Notably, more than 82% of 6mA sites were located within gene bodies in all seven strains. Two specific motifs for 6mA modification, ARGT and AVCG, were prevalent in all seven strains. Comparison of putative DNA methylation motifs from the seven strains reveals that Xoc has a specific DNA methylation system. Furthermore, the 6mA modification of rpfC decreased dramatically during Xoc infection, indicating an important role for 6mA in the adaptation of Xoc to its environment.


September 22, 2019

Reconstitution of eukaryotic chromosomes and manipulation of DNA N6-methyladenine alters chromatin and gene expression

DNA N6-adenine methylation (6mA) has recently been reported in diverse eukaryotes, spanning unicellular organisms to metazoans. Yet the functional significance of 6mA remains elusive due to its low abundance, difficulty of manipulation within native DNA, and lack of understanding of eukaryotic 6mA writers. Here, we report a novel DNA 6mA methyltransferase in ciliates, termed MTA1. The enzyme contains an MT-A70 domain but is phylogenetically distinct from all known RNA and DNA methyltransferases. Disruption of MTA1 in vivo leads to the genome-wide loss of 6mA in asexually growing cells and abolishment of the consensus ApT dimethylated motif. Genes exhibit subtle changes in chromatin organization or RNA expression upon loss of 6mA, depending on their starting methylation level. Mutants fail to complete the sexual cycle, which normally coincides with a peak of MTA1 expression. Thus, MTA1 functions in a developmental stage-specific manner. We determine the impact of 6mA on chromatin organization in vitro by reconstructing complete, full-length ciliate chromosomes harboring 6mA in native or ectopic positions. Using these synthetic chromosomes, we show that 6mA directly disfavors nucleosomes in vitro in a local, quantitative manner, independent of DNA sequence. Furthermore, the chromatin remodeler ACF can overcome this effect. Our study identifies a novel MT-A70 protein necessary for eukaryotic 6mA methylation and defines the impact of 6mA on chromatin organization using epigenetically defined synthetic chromosomes.


September 22, 2019

Hybrid correction of highly noisy long reads using a variable-order de Bruijn graph.

The recent rise of long-read sequencing technologies such as Pacific Biosciences and Oxford Nanopore makes it possible to solve assembly problems for larger and more complex genomes than short-read technologies allowed. However, these long reads are very noisy, reaching an error rate of around 10-15% for Pacific Biosciences, and up to 30% for Oxford Nanopore. The error correction problem has been tackled by either self-correcting the long reads, or using complementary short reads in a hybrid approach. However, even though sequencing technologies promise to lower the error rate of the long reads below 10%, it is still higher in practice, and correcting such noisy long reads remains an issue. We present HG-CoLoR, a hybrid error correction method that focuses on a seed-and-extend approach based on the alignment of the short reads to the long reads, followed by the traversal of a variable-order de Bruijn graph, built from the short reads. Our experiments show that HG-CoLoR manages to efficiently correct highly noisy long reads that display an error rate as high as 44%. When compared to other state-of-the-art long read error correction methods, our experiments also show that HG-CoLoR provides the best trade-off between runtime and quality of the results, and is the only method able to efficiently scale to eukaryotic genomes. HG-CoLoR is implemented in C++, supported on Linux platforms and freely available at https://github.com/morispi/HG-CoLoR. Supplementary data are available at Bioinformatics online.
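The idea of traversing a variable-order de Bruijn graph built from short reads can be illustrated with a toy example. The sketch below is not HG-CoLoR's implementation: it simply keeps one successor table per order k and, while extending a seed, falls back to a smaller order whenever the larger one offers no unambiguous next base. The function names and parameters are hypothetical, and `min_k` is assumed to be at least 2.

```python
from collections import defaultdict


def build_graphs(short_reads, max_k, min_k):
    """For each order k, map every (k-1)-mer to the set of bases that follow it."""
    graphs = {}
    for k in range(min_k, max_k + 1):
        successors = defaultdict(set)
        for read in short_reads:
            for i in range(len(read) - k + 1):
                kmer = read[i:i + k]
                successors[kmer[:-1]].add(kmer[-1])
        graphs[k] = successors
    return graphs


def extend(seed, graphs, max_k, min_k, steps):
    """Greedily extend seed by up to `steps` bases.

    At each step the highest order is tried first; a lower order is used
    only when the higher one has no single, unambiguous successor.
    """
    path = seed
    for _ in range(steps):
        for k in range(max_k, min_k - 1, -1):
            context = path[-(k - 1):]
            if len(context) < k - 1:
                continue  # path is still too short for this order
            candidates = graphs[k].get(context, set())
            if len(candidates) == 1:
                path += next(iter(candidates))
                break
        else:
            break  # no order could extend the path
    return path
```

With the reads `["ACGTT", "CGTTGA"]` and orders 3 to 4, extending the seed `"AC"` first uses order 3 (the seed is too short for order 4) and then continues at order 4 until no successor remains. A real corrector would explore branches rather than stop at ambiguity and would link seeds aligned to the long read.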


September 22, 2019

Biparental Inheritance of Mitochondrial DNA in Humans.

Although there has been considerable debate about whether paternal mitochondrial DNA (mtDNA) transmission may coexist with maternal transmission of mtDNA, it is generally believed that mitochondria and mtDNA are exclusively maternally inherited in humans. Here, we identified three unrelated multigeneration families with a high level of mtDNA heteroplasmy (ranging from 24 to 76%) in a total of 17 individuals. Heteroplasmy of mtDNA was independently examined by high-depth whole mtDNA sequencing analysis in our research laboratory and in two Clinical Laboratory Improvement Amendments and College of American Pathologists-accredited laboratories using multiple approaches. A comprehensive exploration of mtDNA segregation in these families shows biparental mtDNA transmission with an autosomal dominant-like inheritance mode. Our results suggest that, although the central dogma of maternal inheritance of mtDNA remains valid, there are some exceptional cases where paternal mtDNA could be passed to the offspring. Elucidating the molecular mechanism for this unusual mode of inheritance will provide new insights into how mtDNA is passed on from parent to offspring and may even lead to the development of new avenues for the therapeutic treatment of pathogenic mtDNA transmission.


September 22, 2019

N6-methyladenine DNA methylation in Japonica and Indica rice genomes and its association with gene expression, plant development, and stress responses.

N6-Methyladenine (6mA) DNA methylation has recently been implicated as a potential new epigenetic marker in eukaryotes, including the dicot model Arabidopsis thaliana. However, the conservation and divergence of 6mA distribution patterns and functions in plants remain elusive. Here we report high-quality 6mA methylomes at single-nucleotide resolution in rice based on substantially improved genome sequences of two rice cultivars, Nipponbare (Nip; Japonica) and 93-11 (Indica). Analysis of 6mA genomic distribution and its association with transcription suggests that 6mA distribution and function are rather conserved between rice and Arabidopsis. We found that 6mA levels are positively correlated with the expression of key stress-related genes, which may be responsible for the difference in stress tolerance between Nip and 93-11. Moreover, we showed that mutations in DDM1 cause defects in plant growth and decreased 6mA level. Our results reveal that 6mA is a conserved DNA modification that is positively associated with gene expression and contributes to key agronomic traits in plants. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.


September 22, 2019

MadID, a versatile approach to map protein-DNA interactions, highlights telomere-nuclear envelope contact sites in human cells.

Mapping the binding sites of DNA- or chromatin-interacting proteins is essential to understanding biological processes. DNA adenine methyltransferase identification (DamID) has emerged as a comprehensive method to map genome-wide occupancy of proteins of interest. A caveat of DamID is the specificity of Dam methyltransferase for GATC motifs that are not homogeneously distributed in the genome. Here, we developed an optimized method named MadID, using proximity labeling of DNA by the methyltransferase M.EcoGII. M.EcoGII mediates N6-adenosine methylation in any DNA sequence context, resulting in deeper and unbiased coverage of the genome. We demonstrate, using m6A-specific immunoprecipitation and deep sequencing, that MadID is a robust method to identify protein-DNA interactions at the whole-genome level. Using MadID, we revealed contact sites between human telomeres, repetitive sequences devoid of GATC sites, and the nuclear envelope. Overall, MadID opens the way to identification of binding sites in genomic regions that were largely inaccessible. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 21, 2019

Assessing genome assembly quality using the LTR Assembly Index (LAI).

Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever.
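The applicability thresholds stated in this abstract are easy to check before running LAI. The sketch below encodes those two thresholds and, as an assumption not stated in the abstract, computes an uncorrected index as the percentage of total LTR-RT sequence length contributed by intact LTR-RTs. The correction for amplification dynamics mentioned in the abstract is omitted, and the function names are hypothetical rather than part of LTR_retriever.

```python
def lai_applicable(genome_size_bp, intact_ltr_bp, total_ltr_bp):
    """Check the abstract's thresholds: intact LTR-RTs at least 0.1% and
    total LTR-RTs at least 5% of the genome size."""
    intact_fraction = intact_ltr_bp / genome_size_bp
    total_fraction = total_ltr_bp / genome_size_bp
    return intact_fraction >= 0.001 and total_fraction >= 0.05


def raw_index(intact_ltr_bp, total_ltr_bp):
    """ASSUMED uncorrected index: intact LTR-RT length as a percentage of
    total LTR-RT length. No amplification-dynamics correction is applied."""
    return 100.0 * intact_ltr_bp / total_ltr_bp
```

For a hypothetical 1 Mb assembly with 2 kb of intact and 60 kb of total LTR-RT sequence, `lai_applicable` returns `True`; with only 500 bp of intact LTR-RT sequence it returns `False`.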


September 21, 2019

Direct detection of DNA methylation during single-molecule, real-time sequencing.

We describe the direct detection of DNA methylation, without bisulfite conversion, through single-molecule, real-time (SMRT) sequencing. In SMRT sequencing, DNA polymerases catalyze the incorporation of fluorescently labeled nucleotides into complementary nucleic acid strands. The arrival times and durations of the resulting fluorescence pulses yield information about polymerase kinetics and allow direct detection of modified nucleotides in the DNA template, including N6-methyladenine, 5-methylcytosine and 5-hydroxymethylcytosine. Measurement of polymerase kinetics is an intrinsic part of SMRT sequencing and does not adversely affect determination of primary DNA sequence. The various modifications affect polymerase kinetics differently, allowing discrimination between them. We used these kinetic signatures to identify adenine methylation in genomic samples and found that, in combination with circular consensus sequencing, they can enable single-molecule identification of epigenetic modifications with base-pair resolution. This method is amenable to long read lengths and will likely enable mapping of methylation patterns in even highly repetitive genomic regions.
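The kinetic signal described here can be summarized per template position by comparing pulse timing in a native sample with an unmodified control. The sketch below assumes hypothetical per-position lists of interpulse durations and an arbitrary ratio threshold of 2.0; it illustrates the comparison only and is not the statistical procedure used by the authors or by any vendor software.

```python
from statistics import mean


def ipd_ratios(native_ipds, control_ipds):
    """Per-position ratio of mean interpulse duration, native vs. control.

    Both arguments map a template position to a list of durations;
    only positions present in both samples are compared.
    """
    return {
        pos: mean(native_ipds[pos]) / mean(control_ipds[pos])
        for pos in native_ipds
        if pos in control_ipds
    }


def flag_modified(ratios, threshold=2.0):
    """Return sorted positions whose ratio meets the (assumed) threshold."""
    return sorted(pos for pos, ratio in ratios.items() if ratio >= threshold)
```

A position where the polymerase pauses much longer on native DNA than on the control yields a high ratio and is flagged as a candidate modified base. Because different modifications affect kinetics differently, a real analysis would use more than one threshold and would test significance across many reads.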

