Menu
September 22, 2019

Intraspecific comparative genomics of isolates of the Norway spruce pathogen (Heterobasidion parviporum) and identification of its potential virulence factors.

Heterobasidion parviporum is an economically most important fungal forest pathogen in northern Europe, causing root and butt rot disease of Norway spruce (Picea abies (L.) Karst.). The mechanisms underlying the pathogenesis and virulence of this species remain elusive. No reference genome to facilitate functional analysis is available for this species.To better understand the virulence factor at both phenotypic and genomic level, we characterized 15 H. parviporum isolates originating from different locations across Finland for virulence, vegetative growth, sporulation and saprotrophic wood decay. Wood decay capability and latitude of fungal origins exerted interactive effects on their virulence and appeared important for H. parviporum virulence. We sequenced the most virulent isolate, the first full genome sequences of H. parviporum as a reference genome, and re-sequenced the remaining 14 H. parviporum isolates. Genome-wide alignments and intrinsic polymorphism analysis showed that these isolates exhibited overall high genomic similarity with an average of at least 96% nucleotide identity when compared to the reference, yet had remarkable intra-specific level of polymorphism with a bias for CpG to TpG mutations. Reads mapping coverage analysis enabled the classification of all predicted genes into five groups and uncovered two genomic regions exclusively present in the reference with putative contribution to its higher virulence. Genes enriched for copy number variations (deletions and duplications) and nucleotide polymorphism were involved in oxidation-reduction processes and encoding domains relevant to transcription factors. Some secreted protein coding genes based on the genome-wide selection pressure, or the presence of variants were proposed as potential virulence candidates.Our study reported on the first reference genome sequence for this Norway spruce pathogen (H. parviporum). Comparative genomics analysis gave insight into the overall genomic variation among this fungal species and also facilitated the identification of several secreted protein coding genes as putative virulence factors for the further functional analysis. We also analyzed and identified phenotypic traits potentially linked to its virulence.


September 22, 2019

Primordial origin and diversification of plasmids in Lyme disease agent bacteria.

With approximately one-third of their genomes consisting of linear and circular plasmids, the Lyme disease agent cluster of species has the most complex genomes among known bacteria. We report here a comparative analysis of plasmids in eleven Borreliella (also known as Borrelia burgdorferi sensu lato) species.We sequenced the complete genomes of two B. afzelii, two B. garinii, and individual B. spielmanii, B. bissettiae, B. valaisiana and B. finlandensis isolates. These individual isolates carry between seven and sixteen plasmids, and together harbor 99 plasmids. We report here a comparative analysis of these plasmids, along with 70 additional Borreliella plasmids available in the public sequence databases. We identify only one new putative plasmid compatibility type (the 30th) among these 169 plasmid sequences, suggesting that all or nearly all such types have now been discovered. We find that the linear plasmids in the non-B. burgdorferi species have undergone the same kinds of apparently random, chaotic rearrangements mediated by non-homologous recombination that we previously discovered in B. burgdorferi. These rearrangements occurred independently in the different species lineages, and they, along with an expanded chromosomal phylogeny reported here, allow the identification of several whole plasmid transfer events among these species. Phylogenetic analyses of the plasmid partition genes show that a majority of the plasmid compatibility types arose early, most likely before separation of the Lyme agent Borreliella and relapsing fever Borrelia clades, and this, with occasional cross species plasmid transfers, has resulted in few if any species-specific or geographic region-specific Borreliella plasmid types.The primordial origin and persistent maintenance of the Borreliella plasmid types support their functional indispensability as well as evolutionary roles in facilitating genome diversity. The improved resolution of Borreliella plasmid phylogeny based on conserved partition-gene clusters will lead to better determination of gene orthology which is essential for prediction of biological function, and it will provide a basis for inferring detailed evolutionary mechanisms of Borreliella genomic variability including homologous gene and plasmid exchanges as well as non-homologous rearrangements.


September 22, 2019

Targeted long-read sequencing of a locus under long-term balancing selection in Capsella.

Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence divergence and high repeat content, which are typical characteristics for loci under strong long-term balancing selection. Studying genetic diversity at such loci therefore remains challenging. Here, we investigate the feasibility and error rates associated with targeted long-read sequencing of a locus under balancing selection. For this purpose, we generated bacterial artificial chromosomes (BACs) containing the Brassicaceae S-locus, a region under strong negative frequency-dependent selection which has previously proven difficult to assemble in its entirety using short reads. We sequence S-locus BACs with single-molecule long-read sequencing technology and conduct de novo assembly of these S-locus haplotypes. By comparing repeated assemblies resulting from independent long-read sequencing runs on the same BAC clone we do not detect any structural errors, suggesting that reliable assemblies are generated, but we estimate an indel error rate of 5.7×10-5 A similar error rate was estimated based on comparison of Illumina short-read sequences and BAC assemblies. Our results show that, until de novo assembly of multiple individuals using long-read sequencing becomes feasible, targeted long-read sequencing of loci under balancing selection is a viable option with low error rates for single nucleotide polymorphisms or structural variation. We further find that short-read sequencing is a valuable complement, allowing correction of the relatively high rate of indel errors that result from this approach. Copyright © 2018 Bachmann et al.


September 22, 2019

Predominant gut Lactobacillus murinus strain mediates anti-inflammaging effects in calorie-restricted mice.

Calorie restriction (CR), which has a potent anti-inflammaging effect, has been demonstrated to induce dramatic changes in the gut microbiota. Whether the modulated gut microbiota contributes to the attenuation of inflammation during CR is unknown, as are the members of the microbial community that may be key mediators of this process.Here, we report that a unique Lactobacillus-predominated microbial community was rapidly attained in mice within 2 weeks of CR, which decreased the levels of circulating microbial antigens and systemic inflammatory markers such as tumour necrosis factor alpha (TNF-a). Lactobacillus murinus CR147, an isolate in the most abundant operational taxonomic unit (OTU) enriched by CR, downregulated interleukin-8 production in TNF-a-stimulated Caco-2 cells and significantly increased the lifespan and the brood size of the nematode Caenorhabditis elegans. In gnotobiotic mice colonized with the gut microbiota from old mice, this strain decreased their intestinal permeability and serum endotoxin load, consequently attenuating the inflammation induced by the old microbiota.Our study demonstrated that a strain of Lactobacillus murinus was promoted in CR mice and causatively contributed to the attenuation of ageing-associated inflammation.


September 22, 2019

In vitro culture of the insect endosymbiont Spiroplasma poulsonii highlights bacterial genes involved in host-symbiont interaction.

Endosymbiotic bacteria associated with eukaryotic hosts are omnipresent in nature, particularly in insects. Studying the bacterial side of host-symbiont interactions is, however, often limited by the unculturability and genetic intractability of the symbionts. Spiroplasma poulsonii is a maternally transmitted bacterial endosymbiont that is naturally associated with several Drosophila species. S. poulsonii strongly affects its host’s physiology, for example by causing male killing or by protecting it against various parasites. Despite intense work on this model since the 1950s, attempts to cultivate endosymbiotic Spiroplasma in vitro have failed so far. Here, we developed a method to sustain the in vitro culture of S. poulsonii by optimizing a commercially accessible medium. We also provide a complete genome assembly, including the first sequence of a natural plasmid of an endosymbiotic Spiroplasma species. Last, by comparing the transcriptome of the in vitro culture to the transcriptome of bacteria extracted from the host, we identified genes putatively involved in host-symbiont interactions. This work provides new opportunities to study the physiology of endosymbiotic Spiroplasma and paves the way to dissect insect-endosymbiont interactions with two genetically tractable partners.IMPORTANCE The discovery of insect bacterial endosymbionts (maternally transmitted bacteria) has revolutionized the study of insects, suggesting novel strategies for their control. Most endosymbionts are strongly dependent on their host to survive, making them uncultivable in artificial systems and genetically intractable. Spiroplasma poulsonii is an endosymbiont of Drosophila that affects host metabolism, reproduction, and defense against parasites. By providing the first reliable culture medium that allows a long-lasting in vitro culture of Spiroplasma and by elucidating its complete genome, this work lays the foundation for the development of genetic engineering tools to dissect endosymbiosis with two partners amenable to molecular study. Furthermore, the optimization method that we describe can be used on other yet uncultivable symbionts, opening new technical opportunities in the field of host-microbes interactions. Copyright © 2018 Masson et al.


September 22, 2019

Synchronous termination of replication of the two chromosomes is an evolutionary selected feature in Vibrionaceae.

Vibrio cholerae, the causative agent of the cholera disease, is commonly used as a model organism for the study of bacteria with multipartite genomes. Its two chromosomes of different sizes initiate their DNA replication at distinct time points in the cell cycle and terminate in synchrony. In this study, the time-delayed start of Chr2 was verified in a synchronized cell population. This replication pattern suggests two possible regulation mechanisms for other Vibrio species with different sized secondary chromosomes: Either all Chr2 start DNA replication with a fixed delay after Chr1 initiation, or the timepoint at which Chr2 initiates varies such that termination of chromosomal replication occurs in synchrony. We investigated these two models and revealed that the two chromosomes of various Vibrionaceae species terminate in synchrony while Chr2-initiation timing relative to Chr1 is variable. Moreover, the sequence and function of the Chr2-triggering crtS site recently discovered in V. cholerae were found to be conserved, explaining the observed timing mechanism. Our results suggest that it is beneficial for bacterial cells with multiple chromosomes to synchronize their replication termination, potentially to optimize chromosome related processes as dimer resolution or segregation.


September 22, 2019

Two groups of cocirculating, epidemic Clostridiodes difficile strains microdiversify through different mechanisms.

Clostridiodes difficile strains from the NAPCR1/ST54 and NAP1/ST01 types have caused outbreaks despite of their notable differences in genome diversity. By comparing whole genome sequences of 32 NAPCR1/ST54 isolates and 17 NAP1/ST01 recovered from patients infected with C. difficile we assessed whether mutation, homologous recombination (r) or nonhomologous recombination (NHR) through lateral gene transfer (LGT) have differentially shaped the microdiversification of these strains. The average number of single nucleotide polymorphisms (SNPs) in coding sequences (NAPCR1/ST54?=?24; NAP1/ST01?=?19) and SNP densities (NAPCR1/ST54?=?0.54/kb; NAP1/ST01?=?0.46/kb) in the NAPCR1/ST54 and NAP1/ST01 isolates was comparable. However, the NAP1/ST01 isolates showed 3× higher average dN/dS rates (8.35) that the NAPCR1/ST54 isolates (2.62). Regarding r, whereas 31 of the NAPCR1/ST54 isolates showed 1 recombination block (3,301-8,226?bp), the NAP1/ST01 isolates showed no bases in recombination. As to NHR, the pangenome of the NAPCR1/ST54 isolates was larger (4,802 gene clusters, 26% noncore genes) and more heterogeneous (644?±?33 gene content changes) than that of the NAP1/ST01 isolates (3,829 gene clusters, ca. 6% noncore genes, 129?±?37 gene content changes). Nearly 55% of the gene content changes seen among the NAPCR1/ST54 isolates (355?±?31) were traced back to MGEs with putative genes for antimicrobial resistance and virulence factors that were only detected in single isolates or isolate clusters. Congruently, the LGT/SNP rate calculated for the NAPCR1/ST54 isolates (26.8?±?2.8) was 4× higher than the one obtained for the NAP1/ST1 isolates (6.8?±?2.0). We conclude that NHR-LGT has had a greater role in the microdiversification of the NAPCR1/ST54 strains, opposite to the NAP1/ST01 strains, where mutation is known to play a more prominent role.


September 22, 2019

Antibiotic resistance plasmids cointegrated into a megaplasmid harboring the blaOXA-427 carbapenemase gene.

OXA-427 is a new class D carbapenemase encountered in different species of Enterobacteriaceae in a Belgian hospital. To study the dispersal of this gene, we performed a comparative analysis of two plasmids containing the blaOXA-427 gene, isolated from a Klebsiella pneumoniae strain and an Enterobacter cloacae complex strain. The two IncA/C2 plasmids containing blaOXA-427 share the same backbone; in the K. pneumoniae strain, however, this plasmid is cointegrated into an IncFIb plasmid, forming a 321-kb megaplasmid with multiple multiresistance regions. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Reproducible integration of multiple sequencing datasets to form high-confidence SNP, indel, and reference calls for five human genome reference materials

Benchmark small variant calls from the Genome in a Bottle Consortium (GIAB) for the CEPH/HapMap genome NA12878 (HG001) have been used extensively for developing, optimizing, and demonstrating performance of sequencing and bioinformatics methods. Here, we develop a reproducible, cloud-based pipeline to integrate multiple sequencing datasets and form benchmark calls, enabling application to arbitrary human genomes. We use these reproducible methods to form high-confidence calls with respect to GRCh37 and GRCh38 for HG001 and 4 additional broadly-consented genomes from the Personal Genome Project that are available as NIST Reference Materials. These new genomes’ broad, open consent with few restrictions on availability of samples and data is enabling a uniquely diverse array of applications. Our new methods produce 17% more high-confidence SNPs, 176% more indels, and 12% larger regions than our previously published calls. To demonstrate that these calls can be used for accurate benchmarking, we compare other high-quality callsets to ours (e.g., Illumina Platinum Genomes), and we demonstrate that the majority of discordant calls are errors in the other callsets, We also highlight challenges in interpreting performance metrics when benchmarking against imperfect high-confidence calls. We show that benchmarking tools from the Global Alliance for Genomics and Health can be used with our calls to stratify performance metrics by variant type and genome context and elucidate strengths and weaknesses of a method.


September 22, 2019

Repeat-driven generation of antigenic diversity in a major human pathogen, Trypanosoma cruzi

Trypanosoma cruzi, a zoonotic kinetoplastid protozoan with a complex genome, is the causative agent of American trypanosomiasis (Chagas disease). The parasite uses a highly diverse repertoire of surface molecules, with roles in cell invasion, immune evasion and pathogenesis. Thus far, the genomic regions containing these genes have been impossible to resolve and it has been impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here present an improved genome assembly of a T. cruzi clade I (TcI) strain using high coverage PacBio single molecule sequencing, together with Illumina sequencing of 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene structure reveals an unusual duality in the organisation of the parasite genome, a core genomic region syntenous with related protozoa flanked by unique and highly plastic subtelomeric regions encoding surface antigens. The presence of abundant interspersed retrotransposons in the subtelomeres suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.


September 22, 2019

Dynamic evolution of a-gliadin prolamin gene family in homeologous genomes of hexaploid wheat.

Wheat Gli-2 loci encode complex groups of a-gliadin prolamins that are important for breadmaking, but also major triggers of celiac disease (CD). Elucidation of a-gliadin evolution provides knowledge to produce wheat with better end-use properties and reduced immunogenic potential. The Gli-2 loci contain a large number of tandemly duplicated genes and highly repetitive DNA, making sequence assembly of their genomic regions challenging. Here, we constructed high-quality sequences spanning the three wheat homeologous a-gliadin loci by aligning PacBio-based sequence contigs with BioNano genome maps. A total of 47 a-gliadin genes were identified with only 26 encoding intact full-length protein products. Analyses of a-gliadin loci and phylogenetic tree reconstruction indicate significant duplications of a-gliadin genes in the last ~2.5 million years after the divergence of the A, B and D genomes, supporting its rapid lineage-independent expansion in different Triticeae genomes. We showed that dramatic divergence in expression of a-gliadin genes could not be attributed to sequence variations in the promoter regions. The study also provided insights into the evolution of CD epitopes and identified a single indel event in the hexaploid wheat D genome that likely resulted in the generation of the highly toxic 33-mer CD epitope.


September 22, 2019

Genomic diversity of Taylorella equigenitalis introduced into the United States from 1978 to 2012.

Contagious equine metritis is a disease of worldwide concern in equids. The United States is considered to be free of the disease although sporadic outbreaks have occurred over the last few decades that were thought to be associated with the importation of horses. The objective of this study was to create finished, reference quality genomes that characterize the diversity of Taylorella equigenitalis isolates introduced into the USA, and identify their differences. Five isolates of T. equigenitalis associated with introductions into the USA from unique sources were sequenced using both short and long read chemistries allowing for complete assembly and annotation. These sequences were compared to previously published genomes as well as the short read sequences of the 200 isolates in the National Veterinary Services Laboratories’ diagnostic repository to identify unique regions and genes, potential virulence factors, and characterize diversity. The 5 genomes varied in size by up to 100,000 base pairs, but averaged 1.68 megabases. The majority of that diversity in size can be explained by repeat regions and 4 main regions of difference, which ranged in size from 15,000 to 45,000 base pairs. The first region of difference contained mostly hypothetical proteins, the second contained the CRISPR, the third contained primarily hemagglutinin proteins, and the fourth contained primarily segments of a type IV secretion system. As expected and previously reported, little evidence of recombination was found within these genomes. Several additional areas of interest were also observed including a mechanism for streptomycin resistance and other virulence factors. A SNP distance comparison of the T. equigenitalis isolates and Mycobacterium tuberculosis complex (MTBC) showed that relatively, T. equigenitalis was a more diverse species than the entirety of MTBC.


September 22, 2019

Microsatellite polymorphism in the endangered snail kite reveals a panmictic, low diversity population

Genetic structure and genetic diversity are key population characteristics that can inform conservation decisions, such as delineating management units or assessing potential risks for inbreeding depression. Evidence of genetic structuring or low genetic diversity in the critically endangered snail kite (Rostrhamus sociabilis plumbeus) would have implications for monitoring and planning decisions. Recent work on understanding connectivity across the snail kite range indicated that there is less dispersal between northern and southern parts of the current range, and that dispersal is shaped by individual habitat preference. We examine whether there is neutral genetic structure and the amount of genetic variation in the population by non-lethally sampling 235 nestlings from unique nests across the entire breeding range between 2013 and 2014. Data on 15 microsatellite revealed low diversity (e.g., Na?=?2.54, He?=?0.37) and range-wide panmixia based on AMOVA, Bayesian clustering, spatial autocorrelation, isolation by distance, and spatially explicit ordination analyses. Our results emphasize that long-term recovery goals and management strategies should be based on viewing snail kites as a single genetic population, despite evidence for non-random dispersal between wetlands over ecological time scales. These results also highlight the need to understand potential effects of low genetic diversity on population dynamics and viability of snail kites. More broadly, these results add to the growing evidence for potential discrepancies between dispersal and genetic patterns, emphasizing that care should be taken if using one to interpret the other, particularly for widely-ranging species.


September 22, 2019

Challenges of Francisella classification exemplified by an atypical clinical isolate.

The accumulation of sequenced Francisella strains has made it increasingly apparent that the 16S rRNA gene alone is not enough to stratify the Francisella genus into precise and clinically useful classifications. Continued whole-genome sequencing of isolates will provide a larger base of knowledge for targeted approaches with broad applicability. Additionally, examination of genomic information on a case-by-case basis will help resolve outstanding questions regarding strain stratification. We report the complete genome sequence of a clinical isolate, designated here as F. novicida-like strain TCH2015, acquired from the lymph node of a 6-year-old male. Two features were atypical for F. novicida: exhibition of functional oxidase activity and additional gene content, including proposed virulence determinants. These differences, which could potentially impact virulence and clinical diagnosis, emphasize the need for more comprehensive methods to profile Francisella isolates. This study highlights the value of whole-genome sequencing, which will lead to a more robust database of environmental and clinical genomes and inform strategies to improve detection and classification of Francisella strains. Copyright © 2017 Elsevier Inc. All rights reserved.


September 22, 2019

The sequence of the salamander.

The genome of the aquatic axolotl salamander, a native of Mexico’s lakes, has yielded some surprises, and the technique used could point the way to analysis of other organisms that have complex genomes with large numbers of sequence repeats, such as the lungfish and many species of plants.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.