Menu
July 7, 2019

Improved draft genome sequence of a monoteliosporic culture of the karnal bunt (Tilletia indica) pathogen of wheat.

Karnal bunt of wheat is an internationally quarantined fungal pathogen disease caused by Tilletia indica and affects the international commercial seed trade of wheat. We announce here the first improved draft genome assembly of a monoteliosporic culture of the Tilletia indica fungus, consisting of 787 scaffolds with an approximate total genome size of 31.83 Mbp, which is more accurate and near to complete than the previous version. Copyright © 2018 Kumar et al.


July 7, 2019

GtTR: Bayesian estimation of absolute tandem repeat copy number using sequence capture and high throughput sequencing.

Tandem repeats comprise significant proportion of the human genome including coding and regulatory regions. They are highly prone to repeat number variation and nucleotide mutation due to their repetitive and unstable nature, making them a major source of genomic variation between individuals. Despite recent advances in high throughput sequencing, analysis of tandem repeats in the context of complex diseases is still hindered by technical limitations. We report a novel targeted sequencing approach, which allows simultaneous analysis of hundreds of repeats. We developed a Bayesian algorithm, namely – GtTR – which combines information from a reference long-read dataset with a short read counting approach to genotype tandem repeats at population scale. PCR sizing analysis was used for validation.We used a PacBio long-read sequenced sample to generate a reference tandem repeat genotype dataset with on average 13% absolute deviation from PCR sizing results. Using this reference dataset GtTR generated estimates of VNTR copy number with accuracy within 95% high posterior density (HPD) intervals of 68 and 83% for capture sequence data and 200X WGS data respectively, improving to 87 and 94% with use of a PCR reference. We show that the genotype resolution increases as a function of depth, such that the median 95% HPD interval lies within 25, 14, 12 and 8% of the its midpoint copy number value for 30X, 200X WGS, 395X and 800X capture sequence data respectively. We validated nine targets by PCR sizing analysis and genotype estimates from sequencing results correlated well with PCR results.The novel genotyping approach described here presents a new cost-effective method to explore previously unrecognized class of repeat variation in GWAS studies of complex diseases at the population level. Further improvements in accuracy can be obtained by improving accuracy of the reference dataset.


July 7, 2019

HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning.

Second-generation DNA sequencing techniques generate short reads that can result in fragmented genome assemblies. Third-generation sequencing platforms mitigate this limitation by producing longer reads that span across complex and repetitive regions. However, the usefulness of such long reads is limited because of high sequencing error rates. To exploit the full potential of these longer reads, it is imperative to correct the underlying errors. We propose HECIL-Hybrid Error Correction with Iterative Learning-a hybrid error correction framework that determines a correction policy for erroneous long reads, based on optimal combinations of decision weights obtained from short read alignments. We demonstrate that HECIL outperforms state-of-the-art error correction algorithms for an overwhelming majority of evaluation metrics on diverse, real-world data sets including E. coli, S. cerevisiae, and the malaria vector mosquito A. funestus. Additionally, we provide an optional avenue of improving the performance of HECIL’s core algorithm by introducing an iterative learning paradigm that enhances the correction policy at each iteration by incorporating knowledge gathered from previous iterations via data-driven confidence metrics assigned to prior corrections.


July 7, 2019

Modular traits of the Rhizobiales root microbiota and their evolutionary relationship with symbiotic Rhizobia.

Rhizobia are a paraphyletic group of soil-borne bacteria that induce nodule organogenesis in legume roots and fix atmospheric nitrogen for plant growth. In non-leguminous plants, species from the Rhizobiales order define a core lineage of the plant microbiota, suggesting additional functional interactions with plant hosts. In this work, genome analyses of 1,314 Rhizobiales isolates along with amplicon studies of the root microbiota reveal the evolutionary history of nitrogen-fixing symbiosis in this bacterial order. Key symbiosis genes were acquired multiple times, and the most recent common ancestor could colonize roots of a broad host range. In addition, root growth promotion is a characteristic trait of Rhizobiales in Arabidopsis thaliana, whereas interference with plant immunity constitutes a separate, strain-specific phenotype of root commensal Alphaproteobacteria. Additional studies with a tripartite gnotobiotic plant system reveal that these traits operate in a modular fashion and thus might be relevant to microbial homeostasis in healthy roots. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.


July 7, 2019

Comparative analysis of core genome MLST and SNP typing within a European Salmonella serovar Enteritidis outbreak.

Multi-country outbreaks of foodborne bacterial disease present challenges in their detection, tracking, and notification. As food is increasingly distributed across borders, such outbreaks are becoming more common. This increases the need for high-resolution, accessible, and replicable isolate typing schemes. Here we evaluate a core genome multilocus typing (cgMLST) scheme for the high-resolution reproducible typing of Salmonella enterica (S. enterica) isolates, by its application to a large European outbreak of S. enterica serovar Enteritidis. This outbreak had been extensively characterised using single nucleotide polymorphism (SNP)-based approaches. The cgMLST analysis was congruent with the original SNP-based analysis, the epidemiological data, and whole genome MLST (wgMLST) analysis. Combination of the cgMLST and epidemiological data confirmed that the genetic diversity among the isolates predated the outbreak, and was likely present at the infection source. There was consequently no link between country of isolation and genetic diversity, but the cgMLST clusters were congruent with date of isolation. Furthermore, comparison with publicly available Enteritidis isolate data demonstrated that the cgMLST scheme presented is highly scalable, enabling outbreaks to be contextualised within the Salmonella genus. The cgMLST scheme is therefore shown to be a standardised and scalable typing method, which allows Salmonella outbreaks to be analysed and compared across laboratories and jurisdictions. Copyright © 2018. Published by Elsevier B.V.


July 7, 2019

First description of novel arginine catabolic mobile elements (ACMEs) types IV and V harboring a kdp operon in Staphylococcus epidermidis characterized by whole genome sequencing.

The arginine catabolic mobile element (ACME) was first described in the methicillin-resistant Staphylococcus aureus strain USA300 and is thought to facilitate survival on skin. To date three distinct ACME types have been characterized comprehensively in S. aureus and/or Staphylococcus epidermidis. Type I harbors the arc and opp3 operons encoding an arginine deaminase pathway and an oligopeptide permease ABC transporter, respectively, type II harbors the arc operon only, and type III harbors the opp3 operon only. To investigate the diversity and detailed genetic organization of ACME, whole genome sequencing (WGS) was performed on 32 ACME-harboring oro-nasal S. epidermidis isolates using MiSeq- and PacBio-based WGS platforms. In nine isolates the ACMEs lacked the opp3 operon, but harbored a complete kdp operon (kdpE/D/A/B/C) located a maximum of 2.8?kb upstream of the arc operon. The kdp operon exhibited 63% DNA sequence identity to the native S. aureus kdp operon. These findings identified a novel, previously undescribed ACME type (designated ACME IV), which could be subtyped (IVa and IVb) based on distinct 5′ flanking direct repeat sequences (DRs). Multilocus sequence typing (MLST) sequences extracted from the WGS data identified the sequence types (STs) of the isolates investigated. Four of the nine ACME IV isolates belonged to ST153, and one to ST17, a single locus variant of ST153. A tenth isolate, identified as ST5, harbored another novel ACME type (designated ACME V) containing the kdp, arc and opp3 operons and flanked by DR_F, and DR_B but lacked any internal DRs. ACME V was colocated with a staphylococcal chromosome cassette mec (SCCmec) IV element and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) in a 116.9?kb composite island. The extensive genetic diversity of ACME in S. epidermidis has been further elucidated by WGS, revealing two novel ACME types IV and V for the first time. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequences of Canadian epidemic methicillin-resistant Staphylococcus aureus strains CMRSA3 and CMRSA6.

Methicillin-resistant Staphylococcus aureus (MRSA) clonal complex 8 (CC8) sequence type 239 (ST239) represents a predominant hospital-associated MRSA sublineage present worldwide. The Canadian epidemic MRSA strains CMRSA3 and CMRSA6 are moderately virulent members of this group but are closely related to the highly virulent strain TW20. Whole-genome sequencing of CMRSA3 and CMRSA6 was conducted to identify genetic determinants associated with their virulence.


July 7, 2019

Tracing the de novo origin of protein-coding genes in yeast.

De novo genes are very important for evolutionary innovation. However, how these genes originate and spread remains largely unknown. To better understand this, we rigorously searched for de novo genes in Saccharomyces cerevisiae S288C and examined their spread and fixation in the population. Here, we identified 84 de novo genes in S. cerevisiae S288C since the divergence with their sister groups. Transcriptome and ribosome profiling data revealed at least 8 (10%) and 28 (33%) de novo genes being expressed and translated only under specific conditions, respectively. DNA microarray data, based on 2-fold change, showed that 87% of the de novo genes are regulated during various biological processes, such as nutrient utilization and sporulation. Our comparative and evolutionary analyses further revealed that some factors, including single nucleotide polymorphism (SNP)/indel mutation, high GC content, and DNA shuffling, contribute to the birth of de novo genes, while domestication and natural selection drive the spread and fixation of these genes. Finally, we also provide evidence suggesting the possible parallel origin of a de novo gene between S. cerevisiae and Saccharomyces paradoxus Together, our study provides several new insights into the origin and spread of de novo genes.IMPORTANCE Emergence of de novo genes has occurred in many lineages during evolution, but the birth, spread, and function of these genes remain unresolved. Here we have searched for de novo genes from Saccharomyces cerevisiae S288C using rigorous methods, which reduced the effects of bad annotation and genomic gaps on the identification of de novo genes. Through this analysis, we have found 84 new genes originating de novo from previously noncoding regions, 87% of which are very likely involved in various biological processes. We noticed that 10% and 33% of de novo genes were only expressed and translated under specific conditions, therefore, verification of de novo genes through transcriptome and ribosome profiling, especially from limited expression data, may underestimate the number of bona fide new genes. We further show that SNP/indel mutation, high GC content, and DNA shuffling could be involved in the birth of de novo genes, while domestication and natural selection drive the spread and fixation of these genes. Finally, we provide evidence suggesting the possible parallel origin of a new gene. Copyright © 2018 Wu and Knudson.


July 7, 2019

New reference genome sequences for 17 bacterial strains of the honey bee gut microbiota.

We sequenced the genomes of 17 strains isolated from the gut of honey bees, including strains representing the genera Lactobacillus, Bifidobacterium, Gilliamella, Snodgrassella, Frischella, and Commensalibacter. These genome sequences represent an important step forward in the development of a comprehensive reference database to aid future analysis of this emerging gut microbiota model.


July 7, 2019

Transposon insertion sequencing elucidates novel gene involvement in susceptibility and resistance to phages T4 and T7 in Escherichia coli O157.

Experiments using bacteriophage (phage) to infect bacterial strains have helped define some basic genetic concepts in microbiology, but our understanding of the complexity of bacterium-phage interactions is still limited. As the global threat of antibiotic resistance continues to increase, phage therapy has reemerged as an attractive alternative or supplement to treating antibiotic-resistant bacterial infections. Further, the long-used method of phage typing to classify bacterial strains is being replaced by molecular genetic techniques. Thus, there is a growing need for a complete understanding of the precise molecular mechanisms underpinning phage-bacterium interactions to optimize phage therapy for the clinic as well as for retrospectively interpreting phage typing data on the molecular level. In this study, a genomics-based fitness assay (TraDIS) was used to identify all host genes involved in phage susceptibility and resistance for a T4 phage infecting Shiga-toxigenic Escherichia coli O157. The TraDIS results identified both established and previously unidentified genes involved in phage infection, and a subset were confirmed by site-directed mutagenesis and phenotypic testing of 14 T4 and 2 T7 phages. For the first time, the entire sap operon was implicated in phage susceptibility and, conversely, the stringent starvation protein A gene (sspA) was shown to provide phage resistance. Identifying genes involved in phage infection and replication should facilitate the selection of bespoke phage combinations to target specific bacterial pathogens.IMPORTANCE Antibiotic resistance has diminished treatment options for many common bacterial infections. Phage therapy is an alternative option that was once popularly used across Europe to kill bacteria within humans. Phage therapy acts by using highly specific viruses (called phages) that infect and lyse certain bacterial species to treat the infection. Whole-genome sequencing has allowed modernization of the investigations into phage-bacterium interactions. Here, using E. coli O157 and T4 bacteriophage as a model, we have exploited a genome-wide fitness assay to investigate all genes involved in defining phage resistance or susceptibility. This knowledge of the genetic determinants of phage resistance and susceptibility can be used to design bespoke phage combinations targeted to specific bacterial infections for successful infection eradication. Copyright © 2018 Cowley et al.


July 7, 2019

Low-level antimicrobials in the medicinal leech select for resistant pathogens that spread to patients.

Fluoroquinolones (FQs) and ciprofloxacin (Cp) are important antimicrobials that pollute the environment in trace amounts. Although Cp has been recommended as prophylaxis for patients undergoing leech therapy to prevent infections by the leech gut symbiont Aeromonas, a puzzling rise in Cp-resistant (Cpr) Aeromonas infections has been reported. We report on the effects of subtherapeutic FQ concentrations on bacteria in an environmental reservoir, the medicinal leech, and describe the presence of multiple antibiotic resistance mutations and a gain-of-function resistance gene. We link the rise of CprAeromonas isolates to exposure of the leech microbiota to very low levels of Cp (0.01 to 0.04 µg/ml), <1/100 of the clinical resistance breakpoint for Aeromonas Using competition experiments and comparative genomics of 37 strains, we determined the mechanisms of resistance in clinical and leech-derived Aeromonas isolates, traced their origin, and determined that the presence of merely 0.01 µg/ml Cp provides a strong competitive advantage for Cpr strains. Deep-sequencing the Cpr-conferring region of gyrA enabled tracing of the mutation-harboring Aeromonas population in archived gut samples, and an increase in the frequency of the Cpr-conferring mutation in 2011 coincides with the initial reports of CprAeromonas infections in patients receiving leech therapy.IMPORTANCE The role of subtherapeutic antimicrobial contamination in selecting for resistant strains has received increasing attention and is an important clinical matter. This study describes the relationship of resistant bacteria from the medicinal leech, Hirudo verbana, with patient infections following leech therapy. While our results highlight the need for alternative antibiotic therapies, the rise of Cpr bacteria demonstrates the importance of restricting the exposure of animals to antibiotics approved for veterinary use. The shift to a more resistant community and the dispersion of Cpr-conferring mechanisms via mobile elements occurred in a natural setting due to the presence of very low levels of fluoroquinolones, revealing the challenges of controlling the spread of antibiotic-resistant bacteria and highlighting the importance of a holistic approach in the management of antibiotic use. Copyright © 2018 Beka et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.