Menu
July 7, 2019

NOVOPlasty: de novo assembly of organelle genomes from whole genome data.

The evolution in next-generation sequencing (NGS) technology has led to the development of many different assembly algorithms, but few of them focus on assembling the organelle genomes. These genomes are used in phylogenetic studies, food identification and are the most deposited eukaryotic genomes in GenBank. Producing organelle genome assembly from whole genome sequencing (WGS) data would be the most accurate and least laborious approach, but a tool specifically designed for this task is lacking. We developed a seed-and-extend algorithm that assembles organelle genomes from whole genome sequencing (WGS) data, starting from a related or distant single seed sequence. The algorithm has been tested on several new (Gonioctena intermedia and Avicennia marina) and public (Arabidopsis thaliana and Oryza sativa) whole genome Illumina data sets where it outperforms known assemblers in assembly accuracy and coverage. In our benchmark, NOVOPlasty assembled all tested circular genomes in less than 30 min with a maximum memory requirement of 16 GB and an accuracy over 99.99%. In conclusion, NOVOPlasty is the sole de novo assembler that provides a fast and straightforward extraction of the extranuclear genomes from WGS data in one circular high quality contig. The software is open source and can be downloaded at https://github.com/ndierckx/NOVOPlasty.


July 7, 2019

Draft genome assembly and annotation of Glycyrrhiza uralensis, a medicinal legume.

Chinese liquorice/licorice (Glycyrrhiza uralensis) is a leguminous plant species whose roots and rhizomes have been widely used as a herbal medicine and natural sweetener. Whole-genome sequencing is essential for gene discovery studies and molecular breeding in liquorice. Here, we report a draft assembly of the approximately 379-Mb whole-genome sequence of strain 308-19 of G. uralensis; this assembly contains 34 445 predicted protein-coding genes. Comparative analyses suggested well-conserved genomic components and collinearity of gene loci (synteny) between the genome of liquorice and those of other legumes such as Medicago and chickpea. We observed that three genes involved in isoflavonoid biosynthesis, namely, 2-hydroxyisoflavanone synthase (CYP93C), 2,7,4′-trihydroxyisoflavanone 4′-O-methyltransferase/isoflavone 4′-O-methyltransferase (HI4OMT) and isoflavone-7-O-methyltransferase (7-IOMT) formed a cluster on the scaffold of the liquorice genome and showed conserved microsynteny with Medicago and chickpea. Based on the liquorice genome annotation, we predicted genes in the P450 and UDP-dependent glycosyltransferase (UGT) superfamilies, some of which are involved in triterpenoid saponin biosynthesis, and characterised their gene expression with the reference genome sequence. The genome sequencing and its annotations provide an essential resource for liquorice improvement through molecular breeding and the discovery of useful genes for engineering bioactive components through synthetic biology approaches.© 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.


July 7, 2019

LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences.

Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed.LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences.LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422.


July 7, 2019

Methods for genome-wide methylome profiling of Campylobacter jejuni.

Methylation has a profound role in the regulation of numerous biological processes in bacteria including virulence. The study of methylation in bacteria has greatly advanced thanks to next-generation sequencing technologies. These technologies have expedited the process of uncovering unique features of many bacterial methylomes such as characterizing previously uncharacterized methyltransferases, cataloging genome-wide DNA methylations in bacteria, identifying the frequency of methylation at particular genomic loci, and revealing regulatory roles of methylation in the biology of various bacterial species. For instance, methylation has been cited as a potential source for the pathogenicity differences observed in C. jejuni strains with syntenic genomes as seen in recent publications. Here, we describe the methodology for the use of Pacific Biosciences’ single molecule real-time (SMRT) sequencing for detecting methylation patterns in C. jejuni and bioinformatics tools to profile its methylome.


July 7, 2019

Whole genome sequencing analysis of the cutaneous pathogenic yeast Malassezia restricta and identification of the major lipase expressed on the scalp of patients with dandruff.

Malassezia species are opportunistic pathogenic fungi that are frequently associated with seborrhoeic dermatitis, including dandruff. Most Malassezia species are lipid dependent, a property that is compensated by breaking down host sebum into fatty acids by lipases. In this study, we aimed to sequence and analyse the whole genome of Malassezia restricta KCTC 27527, a clinical isolate from a Korean patient with severe dandruff, to search for lipase orthologues and identify the lipase that is the most frequently expressed on the scalp of patients with dandruff. The genome of M. restricta KCTC 27527 was sequenced using the Illumina MiSeq and PacBio platforms. Lipase orthologues were identified by comparison with known lipase genes in the genomes of Malassezia globosa and Malassezia sympodialis. The expression of the identified lipase genes was directly evaluated in swab samples from the scalps of 56 patients with dandruff. We found that, among the identified lipase-encoding genes, the gene encoding lipase homolog MRES_03670, named LIP5 in this study, was the most frequently expressed lipase in the swab samples. Our study provides an overview of the genome of a clinical isolate of M. restricta and fundamental information for elucidating the role of lipases during fungus-host interaction.© 2016 Blackwell Verlag GmbH.


July 7, 2019

The comparative landscape of duplications in Heliconius melpomene and Heliconius cydno.

Gene duplications can facilitate adaptation and may lead to interpopulation divergence, causing reproductive isolation. We used whole-genome resequencing data from 34 butterflies to detect duplications in two Heliconius species, Heliconius cydno and Heliconius melpomene. Taking advantage of three distinctive signals of duplication in short-read sequencing data, we identified 744 duplicated loci in H. cydno and H. melpomene and evaluated the accuracy of our approach using single-molecule sequencing. We have found that duplications overlap genes significantly less than expected at random in H. melpomene, consistent with the action of background selection against duplicates in functional regions of the genome. Duplicate loci that are highly differentiated between H. melpomene and H. cydno map to four different chromosomes. Four duplications were identified with a strong signal of divergent selection, including an odorant binding protein and another in close proximity with a known wing colour pattern locus that differs between the two species. Heredity advance online publication, 7 December 2016; doi:10.1038/hdy.2016.107.


July 7, 2019

Draft genome sequence of Mentha longifolia (L.) and development of resources for mint cultivar improvement.

The genus Mentha encompasses mint species cultivated for their essential oils, which are formulated into a vast array of consumer products. Desirable oil characteristics and resistance to the fungal disease Verticillium wilt are top priorities for the mint industry. However, cultivated mints have complex polyploid genomes and are sterile. Breeding efforts, therefore, require the development of genomic resources for fertile mint species. Here, we present draft de novo genome and plastome assemblies for a wilt-resistant South African accession of Mentha longifolia (L.) Huds., a diploid species ancestral to cultivated peppermint and spearmint. The 353 Mb genome contains 35 597 predicted protein-coding genes, including 292 disease resistance gene homologs, and nine genes determining essential oil characteristics. A genetic linkage map ordered 1397 genome scaffolds on 12 pseudochromosomes. More than two million simple sequence repeats were identified, which will facilitate molecular marker development. The M. longifolia genome is a valuable resource for both metabolic engineering and molecular breeding. This is exemplified by employing the genome sequence to clone and functionally characterize the promoters in a peppermint cultivar, and demonstrating the utility of a glandular trichome-specific promoter to increase expression of a biosynthetic gene, thereby modulating essential oil composition. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.


July 7, 2019

Comparative genomics of extrachromosomal elements in Bacillus thuringiensis subsp. israelensis.

Bacillus thuringiensis subsp. israelensis is one of the most important microorganisms used against mosquitoes. It was intensively studied following its discovery and became a model bacterium of the B. thuringiensis species. Those studies focused on toxin genes, aggregation-associated conjugation, linear genome phages, etc. Recent announcements of genomic sequences of different strains have not been explicitly related to the biological properties studied. We report data on plasmid content analysis of four strains using ultra-high-throughput sequencing. The strains were commercial product isolates, with their putative ancestor and type B. thuringiensis subsp. israelensis strain sequenced earlier. The assembled contigs corresponding to published and novel data were assigned to plasmids described earlier in B. thuringiensis subsp. israelensis and other B. thuringiensis strains. A new 360 kb plasmid was identified, encoding multiple transporters, also found in most of the earlier sequenced strains. Our genomic data show the presence of two toxin-coding plasmids of 128 and 100 kb instead of the reported 225 kb plasmid, a co-integrate of the former two. In two of the sequenced strains, only a 100 kb plasmid was present. Some heterogeneity exists in the small plasmid content and structure between strains. These data support the perception of active plasmid exchange among B. thuringiensis subsp. israelensis strains in nature. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.


July 7, 2019

Detection, isolation and characterization of Fusobacterium gastrosuis sp. nov. colonizing the stomach of pigs.

Nine strains of a novel Fusobacterium sp. were isolated from the stomach of 6-8 months old and adult pigs. The isolates were obligately anaerobic, although they endured 2h exposure to air. Phylogenetic analysis based on 16S rRNA and gyrase B genes demonstrated that the isolates showed high sequence similarity with Fusobacterium mortiferum, Fusobacterium ulcerans, Fusobacterium varium, Fusobacterium russii and Fusobacterium necrogenes, but formed a distinct lineage in the genus Fusobacterium. Comparative analysis of the genome of the type strain of this novel Fusobacterium sp. confirmed that it is different from other recognized Fusobacterium spp. DNA-DNA hybridization, fingerprinting and genomic %GC determination further supported the conclusion that the isolates belong to a new, distinct species. The isolates were also distinguishable from these and other Fusobacterium spp. by phenotypical characterization. The strains produced indole and exhibited proline arylamidase and glutamic acid decarboxylase activity. They did not hydrolyse esculin, did not exhibit pyroglutamic acid arylamidase, valine arylamidase, a-galactosidase, ß-galactosidase, ß-galactosidase-6-phosphate or a-glucosidase activity nor produced acid from cellobiose, glucose, lactose, mannitol, mannose, maltose, raffinose, saccharose, salicin or trehalose. The major fatty acids were C16:0 and C18:1?9c. The name Fusobacterium gastrosuis sp. nov. is proposed for the novel isolates with the type strain CDW1(T) (=DSM 101753(T)=LMG 29236(T)). We also demonstrated that Clostridium rectum and mortiferum Fusobacterium represent the same species, with nomenclatural priority for the latter. Copyright © 2016 Elsevier GmbH. All rights reserved.


July 7, 2019

Evolutionary origins of the emergent ST796 clone of vancomycin resistant Enterococcus faecium.

From early 2012, a novel clone of vancomycin resistant Enterococcus faecium (assigned the multi locus sequence type ST796) was simultaneously isolated from geographically separate hospitals in south eastern Australia and New Zealand. Here we describe the complete genome sequence of Ef_aus0233, a representative ST796 E. faecium isolate. We used PacBio single molecule real-time sequencing to establish a high quality, fully assembled genome comprising a circular chromosome of 2,888,087 bp and five plasmids. Comparison of Ef_aus0233 to other E. faecium genomes shows Ef_aus0233 is a member of the epidemic hospital-adapted lineage and has evolved from an ST555-like ancestral progenitor by the accumulation or modification of five mosaic plasmids and five putative prophage, acquisition of two cryptic genomic islands, accrued chromosomal single nucleotide polymorphisms and a 80 kb region of recombination, also gaining Tn1549 and Tn916, transposons conferring resistance to vancomycin and tetracycline respectively. The genomic dissection of this new clone presented here underscores the propensity of the hospital E. faecium lineage to change, presumably in response to the specific conditions of hospital and healthcare environments.


July 7, 2019

Comparative mitogenomic analysis of three species of periwinkles: Littorina fabalis, L. obtusata and L. saxatilis.

The flat periwinkles, Littorina fabalis and L. obtusata, offer an interesting system for local adaptation and ecological speciation studies. In order to provide genomic resources for these species, we sequenced their mitogenomes together with that of the rough periwinkle L. saxatilis by means of next-generation sequencing technologies. The three mitogenomes present the typical repertoire of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes and a putative control region. Although the latter could not be fully recovered in flat periwinkles using short-reads due to a highly repetitive fragment, in L. saxatilis this problem was overcome with additional long-reads and we were able to assemble the complete mitogenome. Both gene order and nucleotide composition are similar between the three species as well as compared to other Littorinimorpha. A large variance in divergence was observed across mitochondrial regions, with six- to ten-fold difference between the highest and the lowest divergence rates. Based on nucleotide changes on the whole molecule and assuming a molecular clock, L. fabalis and L. obtusata started to diverge around 0.8 Mya (0.4-1.1 Mya). The evolution of the mitochondrial protein-coding genes in the three Littorina species appears mainly influenced by purifying selection as revealed by phylogenetic tests based on dN/dS ratios that did not detect any evidence for positive selection, although some caution is required given the limited power of the dataset and the implemented approaches. Copyright © 2016 Elsevier B.V. All rights reserved.


July 7, 2019

Burkholderia humptydooensis sp. nov., a new species related to Burkholderia thailandensis and the fifth member of the Burkholderia pseudomallei complex.

During routine screening for Burkholderia pseudomallei from water wells in northern Australia in areas where it is endemic, Gram-negative bacteria (strains MSMB43(T), MSMB121, and MSMB122) with a similar morphology and biochemical pattern to B. pseudomallei and B. thailandensis were coisolated with B. pseudomallei on Ashdown’s selective agar. To determine the exact taxonomic position of these strains and to distinguish them from B. pseudomallei and B. thailandensis, they were subjected to a series of phenotypic and molecular analyses. Biochemical and fatty acid methyl ester analysis was unable to distinguish B. humptydooensis sp. nov. from closely related species. With matrix-assisted laser desorption ionization-time of flight analysis, all isolates grouped together in a cluster separate from other Burkholderia spp. 16S rRNA and recA sequence analyses demonstrated phylogenetic placement for B. humptydooensis sp. nov. in a novel clade within the B. pseudomallei group. Multilocus sequence typing (MLST) analysis of the three isolates in comparison with MLST data from 3,340 B. pseudomallei strains and related taxa revealed a new sequence type (ST318). Genome-to-genome distance calculations and the average nucleotide identity of all isolates to both B. thailandensis and B. pseudomallei, based on whole-genome sequences, also confirmed B. humptydooensis sp. nov. as a novel Burkholderia species within the B. pseudomallei complex. Molecular analyses clearly demonstrated that strains MSMB43(T), MSMB121, and MSMB122 belong to a novel Burkholderia species for which the name Burkholderia humptydooensis sp. nov. is proposed, with the type strain MSMB43(T) (American Type Culture Collection BAA-2767; Belgian Co-ordinated Collections of Microorganisms LMG 29471; DDBJ accession numbers CP013380 to CP013382).IMPORTANCEBurkholderia pseudomallei is a soil-dwelling bacterium and the causative agent of melioidosis. The genus Burkholderia consists of a diverse group of species, with the closest relatives of B. pseudomallei referred to as the B. pseudomallei complex. A proposed novel species, B. humptydooensis sp. nov., was isolated from a bore water sample from the Northern Territory in Australia. B. humptydooensis sp. nov. is phylogenetically distinct from B. pseudomallei and other members of the B. pseudomallei complex, making it the fifth member of this important group of bacteria. Copyright © 2017 Tuanyok et al.


July 7, 2019

Competition assays and physiological experiments of soil and phyllosphere yeasts identify Candida subhashii as a novel antagonist of filamentous fungi.

While recent advances in next generation sequencing technologies have enabled researchers to readily identify countless microbial species in soil, rhizosphere, and phyllosphere microbiomes, the biological functions of the majority of these species are unknown. Functional studies are therefore urgently needed in order to characterize the plethora of microorganisms that are being identified and to point out species that may be used for biotechnology or plant protection. Here, we used a dual culture assay and growth analyses to characterise yeasts (40 different isolates) and their antagonistic effect on 16 filamentous fungi; comprising plant pathogens, antagonists, and saprophytes.Overall, this competition screen of 640 pairwise combinations revealed a broad range of outcomes, ranging from small stimulatory effects of some yeasts up to a growth inhibition of more than 80% by individual species. On average, yeasts isolated from soil suppressed filamentous fungi more strongly than phyllosphere yeasts and the antagonistic activity was a species-/isolate-specific property and not dependent on the filamentous fungus a yeast was interacting with. The isolates with the strongest antagonistic activity were Metschnikowia pulcherrima, Hanseniaspora sp., Cyberlindnera sargentensis, Aureobasidium pullulans, Candida subhashii, and Pichia kluyveri. Among these, the soil yeasts (C. sargentensis, A. pullulans, C. subhashii) assimilated and/or oxidized more di-, tri- and tetrasaccharides and organic acids than yeasts from the phyllosphere. Only the two yeasts C. subhashii and M. pulcherrima were able to grow with N-acetyl-glucosamine as carbon source.The competition assays and physiological experiments described here identified known antagonists that have been implicated in the biological control of plant pathogenic fungi in the past, but also little characterised species such as C. subhashii. Overall, soil yeasts were more antagonistic and metabolically versatile than yeasts from the phyllosphere. Noteworthy was the strong antagonistic activity of the soil yeast C. subhashii, which had so far only been described from a clinical sample and not been studied with respect to biocontrol. Based on binary competition assays and growth analyses (e.g., on different carbon sources, growth in root exudates), C. subhashii was identified as a competitive and antagonistic soil yeast with potential as a novel biocontrol agent against plant pathogenic fungi.


July 7, 2019

Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data.

The development of long-read sequencing technologies, such as single-molecule real-time (SMRT) sequencing by PacBio, has produced a revolution in the sequencing of small genomes. Sequencing organelle genomes using PacBio long-read data is a cost effective, straightforward approach. Nevertheless, the availability of simple-to-use software to perform the assembly from raw reads is limited at present.We present Organelle-PBA, a Perl program designed specifically for the assembly of chloroplast and mitochondrial genomes. For chloroplast genomes, the program selects the chloroplast reads from a whole genome sequencing pool, maps the reads to a reference sequence from a closely related species, and then performs read correction and de novo assembly using Sprai. Organelle-PBA completes the assembly process with the additional step of scaffolding by SSPACE-LongRead. The program then detects the chloroplast inverted repeats and reassembles and re-orients the assembly based on the organelle origin of the reference. We have evaluated the performance of the software using PacBio reads from different species, read coverage, and reference genomes. Finally, we present the assembly of two novel chloroplast genomes from the species Picea glauca (Pinaceae) and Sinningia speciosa (Gesneriaceae).Organelle-PBA is an easy-to-use Perl-based software pipeline that was written specifically to assemble mitochondrial and chloroplast genomes from whole genome PacBio reads. The program is available at https://github.com/aubombarely/Organelle_PBA .


July 7, 2019

Structure and evolution of the filaggrin gene repeated region in primates

The evolutionary dynamics of repeat sequences is quite complex, with some duplicates never having differentiated from each other. Two models can explain the complex evolutionary process for repeated genes—concerted and birth-and-death, of which the latter is driven by duplications maintained by selection. Copy number variations caused by random duplications and losses in repeat regions may modulate molecular pathways and therefore affect phenotypic characteristics in a population, resulting in individuals that are able to adapt to new environments. In this study, we investigated the filaggrin gene (FLG), which codes for filaggrin—an important component of the outer layers of mammalian skin—and contains tandem repeats that exhibit copy number variation between and within species. To examine which model best fits the evolutionary pathway for the complete tandem repeats within a single exon of FLG, we determined the repeat sequences in crab-eating macaque (Macaca fascicularis), orangutan (Pongo abelii), gorilla (Gorilla gorilla), and chimpanzee (Pan troglodytes) and compared these with the sequence in human (Homo sapiens).


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.