July 7, 2019

A complete toolset for the study of Ustilago bromivora and Brachypodium sp. as a fungal-temperate grass pathosystem.

Due to their economic relevance, the study of plant pathogen interactions is of importance. However, elucidating these interactions and their underlying molecular mechanisms remains challenging since both host and pathogen need to be fully genetically accessible organisms. Here we present milestones in the establishment of a new biotrophic model pathosystem: Ustilago bromivora and Brachypodium sp. We provide a complete toolset, including an annotated fungal genome and methods for genetic manipulation of the fungus and its host plant. This toolset will enable researchers to easily study biotrophic interactions at the molecular level on both the pathogen and the host side. Moreover, our research on the fungal life cycle revealed a mating type bias phenomenon. U. bromivora harbors a haplo-lethal allele that is linked to one mating type region. As a result, the identified mating type bias strongly promotes inbreeding, which we consider to be a potential speciation driver.


July 7, 2019

Genomic analysis of phylotype I strain EP1 reveals substantial divergence from other strains in the Ralstonia solanacearum species complex.

The Ralstonia solanacearum species complex is a devastating group of phytopathogens with an unusually wide host range and broad geographical distribution. R. solanacearum isolates may differ considerably in various properties including host range and pathogenicity, but the underlying genetic bases remain vague. Here, we sequenced the genome of strain EP1, isolated from Guangdong Province of China, which belongs to phylotype I and is highly virulent to a range of solanaceous crops. Its complete genome contains a 3.95-Mb chromosome and a 2.05-Mb mega-plasmid, which is considerably bigger than reported genomes of other R. solanacearum strains. Both the chromosome and the mega-plasmid have essential house-keeping genes and many virulence genes. Comparative analysis of strain EP1 with 3 other phylotype I strains and 3 strains of phylotypes II, III and IV unveiled substantial genome rearrangements, insertions and deletions. Genome sequences are relatively conserved among the 4 phylotype I strains, but more divergent among strains of different phylotypes. Moreover, the strains exhibited considerable variations in their key virulence genes, including those encoding secretion systems and type III effectors. Our results provide valuable information for further elucidation of the genetic basis of the diverse virulence and host range of R. solanacearum species.


July 7, 2019

Complete sequence of a F33:A-:B- conjugative plasmid carrying the oqxAB, fosA3, and blaCTX-M-55 elements from a foodborne Escherichia coli strain.

This study reports the complete sequence of pE80, a conjugative IncFII plasmid recovered from an Escherichia coli strain isolated from chicken meat. This plasmid harbors multiple resistance determinants including oqxAB, fosA3, blaCTX-M-55, and blaTEM-1, and is a close variant of the recently reported p42-2 element, which was recovered from E. coli of veterinary source. Recovery of pE80 constitutes evidence that evolution or genetic re-arrangement of IncFII type plasmids residing in animal-borne organisms is an active event, which involves acquisition and integration of foreign resistance elements into the plasmid backbone. Dissemination of these plasmids may further compromise the effectiveness of current antimicrobial strategies.


July 7, 2019

Structure and dynamics underlying elementary ligand binding events in human pacemaking channels.

Although molecular recognition is crucial for cellular signaling, mechanistic studies have relied primarily on ensemble measures that average over and thereby obscure underlying steps. Single-molecule observations that resolve these steps are lacking due to diffraction-limited resolution of single fluorophores at relevant concentrations. Here, we combined zero-mode waveguides with fluorescence resonance energy transfer (FRET) to directly observe binding at individual cyclic nucleotide-binding domains (CNBDs) from human pacemaker ion channels critical for heart and brain function. Our observations resolve the dynamics of multiple distinct steps underlying cyclic nucleotide regulation: a slow initial binding step that must select a ‘receptive’ conformation followed by a ligand-induced isomerization of the CNBD. X-ray structure of the apo CNBD and atomistic simulations reveal that the isomerization involves both local and global transitions. Our approach reveals fundamental mechanisms underpinning ligand regulation of pacemaker channels, and is generally applicable to weak-binding interactions governing a broad spectrum of signaling processes.


July 7, 2019

The genome of the toluene-degrading Pseudomonas veronii strain 1YdBTEX2 and its differential gene expression in contaminated sand.

The natural restoration of soils polluted by aromatic hydrocarbons such as benzene, toluene, ethylbenzene and m- and p-xylene (BTEX) may be accelerated by inoculation of specific biodegraders (bioaugmentation). Bioaugmentation mainly involves introducing bacteria that deploy their metabolic properties and adaptation potential to survive and propagate in the contaminated environment by degrading the pollutant. In order to better understand the adaptive response of cells during a transition to contaminated material, we analyzed here the genome and short-term (1 h) changes in genome-wide gene expression of the BTEX-degrading bacterium Pseudomonas veronii 1YdBTEX2 in non-sterile soil and liquid medium, both in the presence and in the absence of toluene. We obtained a gapless genome sequence of P. veronii 1YdBTEX2 covering three individual replicons with a total size of 8 Mb, two of which are largely unrelated to currently known bacterial replicons. One-hour exposure to toluene, both in soil and liquid, triggered massive transcription (up to 208-fold induction) of multiple gene clusters, such as toluene degradation pathway(s), chemotaxis and toluene efflux pumps. This clearly underlines their key role in the adaptive response to toluene. In comparison to liquid medium, cells in soil drastically changed expression of genes involved in membrane functioning (e.g., lipid composition, lipid metabolism, cell fatty acid synthesis), osmotic stress response (e.g., polyamine or trehalose synthesis, uptake of potassium) and putrescine metabolism, highlighting the immediate response mechanisms of P. veronii 1YdBTEX2 for successful establishment in polluted soil.


July 7, 2019

Whole-genome de novo sequencing, combined with RNA-Seq analysis, reveals unique genome and physiological features of the amylolytic yeast Saccharomycopsis fibuligera and its interspecies hybrid.

Genomic studies on fungal species with hydrolytic activity have gained increased attention due to their great biotechnological potential for biomass-based biofuel production. The amylolytic yeast Saccharomycopsis fibuligera has served as a good source of enzymes and genes involved in saccharification. Despite its long history of use in food fermentation and bioethanol production, very little is known about the basic physiology and genomic features of S. fibuligera. We performed whole-genome (WG) de novo sequencing and complete assembly of S. fibuligera KJJ81 and KPH12, two isolates from wheat-based Nuruk in Korea. Intriguingly, the KJJ81 genome (~38 Mb) was revealed as a hybrid between the KPH12 genome (~18 Mb) and another unidentified genome sharing 88.1% nucleotide identity with the KPH12 genome. The seven chromosome pairs of KJJ81 subgenomes exhibit highly conserved synteny, indicating a very recent hybridization event. The phylogeny inferred from WG comparisons showed an early divergence of S. fibuligera before the separation of the CTG and Saccharomycetaceae clades in the subphylum Saccharomycotina. Reconstructed carbon and sulfur metabolic pathways, coupled with RNA-Seq analysis, suggested a marginal Crabtree effect under high glucose and activation of sulfur metabolism toward methionine biosynthesis under sulfur limitation in this yeast. Notably, the lack of sulfate assimilation genes in the S. fibuligera genome reflects a unique phenotype for Saccharomycopsis clades as natural sulfur auxotrophs. Extended gene families, including novel genes involved in saccharification and proteolysis, were identified. Moreover, comparative genome analysis of S. fibuligera ATCC 36309, an isolate from chalky rye bread in Germany, revealed that an interchromosomal translocation occurred in the KPH12 genome before the generation of the KJJ81 hybrid genome. The completely sequenced S. fibuligera genome with high-quality annotation and RNA-Seq analysis establishes an important foundation for functional inference of S. fibuligera in the degradation of fermentation mash. The gene inventory facilitates the discovery of new genes applicable to the production of novel valuable enzymes and chemicals. Moreover, as the first gapless genome assembly in the genus Saccharomycopsis including members with desirable traits for bioconversion, the unique genomic features of S. fibuligera and its hybrid will provide in-depth insights into fungal genome dynamics as evolutionary adaptation.


July 7, 2019

Chromosome assembly of large and complex genomes using multiple references

Despite the rapid development of sequencing technologies, assembly of mammalian-scale genomes into complete chromosomes remains one of the most challenging problems in bioinformatics. To help address this difficulty, we developed Ragout, a reference-assisted assembly tool that now works for large and complex genomes. Taking one or more target assemblies (generated from an NGS assembler) and one or multiple related reference genomes, Ragout infers the evolutionary relationships between the genomes and builds the final assemblies using a genome rearrangement approach. Using Ragout, we transformed NGS assemblies of 15 different Mus musculus and one Mus spretus genomes into sets of complete chromosomes, leaving less than 5% of sequence unlocalized per set. Various benchmarks, including PCR testing and realigning of long PacBio reads, suggest only a small number of structural errors in the final assemblies, comparable with direct assembly approaches. Additionally, we applied Ragout to Mus caroli and Mus pahari genomes, which exhibit karyotype-scale variations compared to other genomes from the Muridae family. Chromosome color maps confirmed most large-scale rearrangements that Ragout detected.


July 7, 2019

WhatsHap: fast and accurate read-based phasing

Read-based phasing allows the haplotype structure of a sample to be reconstructed purely from sequencing reads. While phasing is a required step for answering questions about population genetics and compound heterozygosity, and for aiding clinical decision making, accurate, usable and standards-based software has been lacking. WhatsHap is a production-ready tool for highly accurate read-based phasing. It was designed from the beginning to leverage third-generation sequencing technologies, whose long reads can span many variants and are therefore ideal for phasing. WhatsHap also works well with second-generation data, is easy to use and will phase not only SNVs, but also indels and other variants. It is unique in its ability to combine read-based with genetic phasing, which further improves accuracy if multiple related samples are provided.


July 7, 2019

MICADo – Looking for mutations in targeted PacBio cancer data: an alignment-free method.

Targeted sequencing is commonly used in clinical applications of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease-causing mutations remains a challenging problem even in such a favorable context. The difficulty is particularly salient in the case of third-generation sequencing technology, such as PacBio. We present MICADo, a de Bruijn graph based method, implemented in Python, that makes it possible to distinguish between patient-specific mutations and other alterations for targeted sequencing of a cohort of patients. MICADo analyses NGS reads for each sample within the context of the data of the whole cohort in order to capture the differences between specificities of the sample with respect to the cohort. MICADo is particularly suitable for sequencing data from highly heterogeneous samples, especially when it involves high rates of non-uniform sequencing errors. It was validated on PacBio sequencing datasets from several cohorts of patients. The comparison with two widely used available tools, namely VarScan and GATK, shows that MICADo is more accurate, especially when true mutations have frequencies close to background noise. The source code is available at http://github.com/cbib/MICADo.
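
For readers unfamiliar with the data structure named in this abstract, the following is a minimal sketch of a de Bruijn graph built from reads. It is not MICADo's implementation: the function name build_de_bruijn, the toy reads and the choice k = 4 are all assumptions made for illustration.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Hypothetical helper: map each (k-1)-mer to the set of (k-1)-mers
    that directly follow it in at least one read."""
    graph = defaultdict(set)
    for read in reads:
        # Slide a window of length k along the read.
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Each k-mer contributes one edge: prefix -> suffix.
            graph[kmer[:-1]].add(kmer[1:])
    return graph


# Toy reads (illustrative only, not real sequencing data).
reads = ["ACGTAC", "CGTACG"]
graph = build_de_bruijn(reads, k=4)
for node in sorted(graph):
    print(node, "->", sorted(graph[node]))
```

In such a graph, a variant that is present in only some reads shows up as a branch that leaves the shared path and rejoins it later, which is the general kind of structure graph-based variant detection looks for.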


July 7, 2019

Complete genome anatomy of the emerging potato pathogen Dickeya solani type strain IPO 2222(T).

Several species of the genus Dickeya provoke soft rot and blackleg diseases on a wide range of plants and crops. Dickeya solani has been identified as the causative agent of disease outbreaks on potato crops in Europe over the last decade. Here, we report the complete genome of D. solani IPO 2222(T). Using PacBio and Illumina technologies, a unique circular chromosome of 4,919,833 bp was assembled. The G + C content reaches 56% and the genomic sequence contains 4,059 predicted proteins. The ANI values calculated for D. solani IPO 2222(T) vs. other available D. solani genomes were over 99.9%, indicating a high genetic homogeneity within the D. solani species.
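
The G + C content quoted above is the percentage of bases in a sequence that are G or C. A minimal sketch of that calculation follows; the function name gc_content and the toy sequence are assumptions for illustration and are unrelated to the actual D. solani data.

```python
def gc_content(seq):
    """Hypothetical helper: percentage of G and C bases in a nucleotide string."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy sequence (illustrative only).
print(gc_content("ATGCGC"))
```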


July 7, 2019

Genome sequence of a commensal bacterium, Enterococcus faecalis CBA7120, isolated from a Korean fecal sample.

Enterococcus faecalis, the type species of the genus Enterococcus, is not only a commensal bacterium in the gastrointestinal tract of vertebrates and invertebrates, but also causes serious disease as an opportunistic pathogen. To date, genome sequences have been published for over four hundred E. faecalis strains; however, pathogenicity of these microbes remains complicated. To increase our knowledge of E. faecalis virulence factors, we isolated strain CBA7120 from the feces of an 81-year-old female from the Republic of Korea and performed a comparative genomic analysis. The genome sequence of E. faecalis CBA7120 is 3,134,087 bp in length, with a G + C content of 37.35 mol%, and is comprised of four contigs with an N50 value of 2,922,046 bp. The genome showed high similarity with other strains of E. faecalis, including OG1RF, T13, 12107 and T20, based on OrthoANI values. Strain CBA7120 contains 374 pan-genome orthologous groups (POGs) as singletons, including “Phages, Prophages, Transposable elements, Plasmids,” “Carbohydrates,” “DNA metabolism,” and “Virulence, Disease and Defense” subsystems. Genes related to multidrug resistance efflux pumps were annotated in the genome. The comparative genomic analysis of E. faecalis strains presented in this study was performed using a variety of analysis methods and will facilitate future identification of hypothetical proteins.
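
The N50 value mentioned above is a standard assembly statistic: the length of the contig at which, going from the longest contig downward, the running total first reaches at least half of the assembly length. A minimal sketch under that definition follows; the function name n50 and the toy contig lengths are assumptions for illustration and are not the CBA7120 contigs, whose individual lengths are not given in the abstract.

```python
def n50(lengths):
    """Hypothetical helper: N50 of a list of contig lengths.

    Returns the length of the contig at which the cumulative sum,
    taken from longest to shortest, first reaches half of the total.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty input


# Toy contig lengths (illustrative only).
print(n50([2, 3, 4, 5, 6]))
```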


July 7, 2019

Use of single molecule sequencing for comparative genomics of an environmental and a clinical isolate of Clostridium difficile ribotype 078.

How the pathogen Clostridium difficile might survive, evolve and be transferred between reservoirs within the natural environment is poorly understood. Some ribotypes are found both in clinical and environmental settings. Whether these strains are distinct from one another and evolve in the specific environments is not established. The possession of a highly mobile genome has contributed to the genetic diversity and ongoing evolution of C. difficile. Interpretations of genetic diversity have been limited by fragmented assemblies resulting from short-read length sequencing approaches and by a limited understanding of epigenetic regulation of diversity. To address this, single molecule real time (SMRT) sequencing was used in this study as it produces high quality genome sequences, with resolution of repeat regions (including those found in mobile elements) and can generate data to determine methylation modifications across the sequence (the methylome). Chromosomal rearrangements and ribosomal operon duplications were observed in both genomes. The rearrangements occurred at insertion sites within two mobile genetic elements (MGEs), Tn6164 and Tn6293, present only in the M120 and CD105HS27 genomes, respectively. The gene content of these two transposons differs considerably, which could impact upon horizontal gene transfer; differences include CDSs encoding methylases and a conjugative prophage only in Tn6164. To investigate mechanisms which could affect MGE transfer, the methylome, restriction modification (RM) and the CRISPR/Cas systems were characterised for each strain. Notably, the environmental isolate, CD105HS27, does not share a consensus motif for (m4)C methylation, but has one additional spacer when compared to the clinical isolate M120. These findings show key differences between the two strains in terms of their genetic capacity for MGE transfer. The carriage of horizontally transferred genes appears to have genome-wide effects based on two different methylation patterns. The CRISPR/Cas system appears active although perhaps slow to evolve. Data suggest that both mechanisms are functional and impact upon horizontal gene transfer and genome evolution within C. difficile.


July 7, 2019

The draft genome of whitefly Bemisia tabaci MEAM1, a global crop pest, provides novel insights into virus transmission, host adaptation, and insecticide resistance.

The whitefly Bemisia tabaci (Hemiptera: Aleyrodidae) is among the 100 worst invasive species in the world. As one of the most important crop pests and virus vectors, B. tabaci causes substantial crop losses and poses a serious threat to global food security. We report the 615-Mb high-quality genome sequence of B. tabaci Middle East-Asia Minor 1 (MEAM1), the first genome sequence in the Aleyrodidae family, which contains 15,664 protein-coding genes. The B. tabaci genome is highly divergent from other sequenced hemipteran genomes, sharing no detectable synteny. A number of known detoxification gene families, including cytochrome P450s and UDP-glucuronosyltransferases, are significantly expanded in B. tabaci. Other expanded gene families, including cathepsins, large clusters of tandemly duplicated B. tabaci-specific genes, and phosphatidylethanolamine-binding proteins (PEBPs), were found to be associated with virus acquisition and transmission and/or insecticide resistance, likely contributing to the global invasiveness and efficient virus transmission capacity of B. tabaci. The presence of 142 horizontally transferred genes from bacteria or fungi in the B. tabaci genome, including genes encoding hopanoid/sterol synthesis and xenobiotic detoxification enzymes that are not present in other insects, offers novel insights into the unique biological adaptations of this insect such as polyphagy and insecticide resistance. Interestingly, two adjacent bacterial pantothenate biosynthesis genes, panB and panC, have been co-transferred into B. tabaci and fused into a single gene that has acquired introns during its evolution. The B. tabaci genome contains numerous genetic novelties, including expansions in gene families associated with insecticide resistance, detoxification and virus transmission, as well as numerous horizontally transferred genes from bacteria and fungi. We believe these novelties likely have shaped B. tabaci as a highly invasive polyphagous crop pest and efficient vector of plant viruses. The genome serves as a reference for resolving the B. tabaci cryptic species complex, understanding fundamental biological novelties, and providing valuable genetic information to assist the development of novel strategies for controlling whiteflies and the viruses they transmit.


July 7, 2019

Whole genome sequence and comparative genomics of the novel Lyme borreliosis causing pathogen, Borrelia mayonii.

Borrelia mayonii, a Borrelia burgdorferi sensu lato (Bbsl) genospecies, was recently identified as a cause of Lyme borreliosis (LB) among patients from the upper midwestern United States. By microscopy and PCR, spirochete/genome loads in infected patients were estimated at 10^5 to 10^6 per milliliter of blood. Here, we present the full chromosome and plasmid sequences of two B. mayonii isolates, MN14-1420 and MN14-1539, cultured from blood of two of these patients. Whole genome sequencing and assembly was conducted using PacBio long read sequencing (Pacific Biosciences RSII instrument) followed by hierarchical genome-assembly process (HGAP). The B. mayonii genome is ~1.31 Mbp in size (26.9% average GC content) and is comprised of a linear chromosome, 8 linear and 7 circular plasmids. Consistent with its taxonomic designation as a new Bbsl genospecies, the B. mayonii linear chromosome shares only 93.83% average nucleotide identity with other genospecies. Both B. mayonii genomes contain plasmids similar to B. burgdorferi sensu stricto lp54, lp36, lp28-3, lp28-4, lp25, lp17, lp5, 5 cp32s, cp26, and cp9. The vls locus present on lp28-10 of B. mayonii MN14-1420 is remarkably long, being comprised of 24 silent vls cassettes. Genetic differences between the two B. mayonii genomes are limited and include 15 single nucleotide variations as well as 7 fewer silent vls cassettes and a lack of the lp5 plasmid in MN14-1539. Notably, 68 homologs to proteins present in B. burgdorferi sensu stricto appear to be lacking from the B. mayonii genomes. These include the complement inhibitor, CspZ (BB_H06), the fibronectin binding protein, BB_K32, as well as multiple lipoproteins and proteins of unknown function. This study shows the utility of long read sequencing for full genome assembly of Bbsl genomes, identifies putative genome regions of B. mayonii that may be linked to clinical manifestation or tissue tropism, and provides a valuable resource for pathogenicity, diagnostic and vaccine studies.


July 7, 2019

Colib’read on galaxy: a tools suite dedicated to biological information extraction from raw NGS reads

With next-generation sequencing (NGS) technologies, the life sciences face a deluge of raw data. Classical analysis processes for such data often begin with an assembly step, needing large amounts of computing resources, and potentially removing or modifying parts of the biological information contained in the data. Our approach proposes to focus directly on biological questions, by considering raw unassembled NGS data, through a suite of six command-line tools.

