Menu
July 7, 2019

Rifamorpholines A-E, potential antibiotics from locust-associated actinobacteria Amycolatopsis sp. Hca4.

Cultivation of locust associated rare actinobacteria, Amycolatopsis sp. HCa4, has provided five unusual macrolactams rifamorpholines A-E. Their structures were determined by interpretation of spectroscopic and crystallographic data. Rifamorpholines A-E possess an unprecedented 5/6/6/6 ring chromophore, representing a new subclass of rifamycin antibiotics. The biosynthetic pathway for compounds 1-5 involves a key 1,6-cyclization for the formation of the morpholine ring. Compounds 2 and 4 showed potent activities against methicillin-resistant Staphylococcus aureus (MRSA) with MICs of 4.0 and 8.0 µM, respectively.


July 7, 2019

Adaptation of genetically monomorphic bacteria: evolution of copper resistance through multiple horizontal gene transfers of complex and versatile mobile genetic elements.

Copper-based antimicrobial compounds are widely used to control plant bacterial pathogens. Pathogens have adapted in response to this selective pressure. Xanthomonas citri pv. citri, a major citrus pathogen causing Asiatic citrus canker, was first reported to carry plasmid-encoded copper resistance in Argentina. This phenotype was conferred by the copLAB gene system. The emergence of resistant strains has since been reported in Réunion and Martinique. Using microsatellite-based genotyping and copLAB PCR, we demonstrated that the genetic structure of the copper-resistant strains from these three regions was made up of two distant clusters and varied for the detection of copLAB amplicons. In order to investigate this pattern more closely, we sequenced six copper-resistant X. citri pv. citri strains from Argentina, Martinique and Réunion, together with reference copper-resistant Xanthomonas and Stenotrophomonas strains using long-read sequencing technology. Genes involved in copper resistance were found to be strain dependent with the novel identification in X. citri pv. citri of copABCD and a cus heavy metal efflux resistance-nodulation-division system. The genes providing the adaptive trait were part of a mobile genetic element similar to Tn3-like transposons and included in a conjugative plasmid. This indicates the system’s great versatility. The mining of all available bacterial genomes suggested that, within the bacterial community, the spread of copper resistance associated with mobile elements and their plasmid environments was primarily restricted to the Xanthomonadaceae family.© 2017 John Wiley & Sons Ltd.


July 7, 2019

Genomic exploration of individual giant ocean viruses.

Viruses are major pathogens in all biological systems. Virus propagation and downstream analysis remains a challenge, particularly in the ocean where the majority of their microbial hosts remain recalcitrant to current culturing techniques. We used a cultivation-independent approach to isolate and sequence individual viruses. The protocol uses high-speed fluorescence-activated virus sorting flow cytometry, multiple displacement amplification (MDA), and downstream genomic sequencing. We focused on ‘giant viruses’ that are readily distinguishable by flow cytometry. From a single-milliliter sample of seawater collected from off the dock at Boothbay Harbor, ME, USA, we sorted almost 700 single virus particles, and subsequently focused on a detailed genome analysis of 12. A wide diversity of viruses was identified that included Iridoviridae, extended Mimiviridae and even a taxonomically novel (unresolved) giant virus. We discovered a viral metacaspase homolog in one of our sorted virus particles and discussed its implications in rewiring host metabolism to enhance infection. In addition, we demonstrated that viral metacaspases are widespread in the ocean. We also discovered a virus that contains both a reverse transcriptase and a transposase; although highly speculative, we suggest such a genetic complement would potentially allow this virus to exploit a latency propagation mechanism. Application of single virus genomics provides a powerful opportunity to circumvent cultivation of viruses, moving directly to genomic investigation of naturally occurring viruses, with the assurance that the sequence data is virus-specific, non-chimeric and contains no cellular contamination.


July 7, 2019

Sequencing the genomic regions flanking S-linked PvGLO sequences confirms the presence of two GLO loci, one of which lies adjacent to the style-length determinant gene CYP734A50.

Primula vulgaris contains two GLOBOSA loci, one located adjacent to the style length determinant gene CYP734A50 which lies within the S -locus. Using a combination of BAC walking and PacBio sequencing, we have sequenced two substantial genomic contigs in and around the S-locus of Primula vulgaris. Using these data, we were able to demonstrate that two alleles of PvGlo (P) as well as PvGlo (T) can be present in the genome of a single plant, providing empirical evidence that these two forms of the MADS-box gene GLOBOSA are separate loci and not allelic as previously reported. We propose they should be renamed PvGLO1 and PvGLO2. BAC contigs extending from each GLOBOSA locus were identified and fully sequenced. No homologous genes were found between the contigs other than the GLOBOSA genes themselves, consistent with their identity as separate loci. Exons of the recently identified style-length determinant gene CYP734A50 were identified on one end of the contig containing PvGLO2 and these genes are adjacent in the genome, suggesting that PvGLO2 lies either within or at least very close to the S-locus. Current evidence suggests that both CYP734A50 and GLO2 are specific to the S-morph mating type and are hemizygous rather than heterozygous in the Primula genome. This finding contrasts classical models of the HSI locus, which propose that components of the S-locus are allelic, suggesting that these models may need to be reconsidered.


July 7, 2019

Niche partitioning of diverse sulfur-oxidizing bacteria at hydrothermal vents.

At deep-sea hydrothermal vents, primary production is carried out by chemolithoautotrophic microorganisms, with the oxidation of reduced sulfur compounds being a major driver for microbial carbon fixation. Dense and highly diverse assemblies of sulfur-oxidizing bacteria (SOB) are observed, yet the principles of niche differentiation between the different SOB across geochemical gradients remain poorly understood. In this study niche differentiation of the key SOB was addressed by extensive sampling of active sulfidic vents at six different hydrothermal venting sites in the Manus Basin, off Papua New Guinea. We subjected 33 diffuse fluid and water column samples and 23 samples from surfaces of chimneys, rocks and fauna to a combined analysis of 16S rRNA gene sequences, metagenomes and real-time in situ measured geochemical parameters. We found Sulfurovum Epsilonproteobacteria mainly attached to surfaces exposed to diffuse venting, while the SUP05-clade dominated the bacterioplankton in highly diluted mixtures of vent fluids and seawater. We propose that the high diversity within Sulfurimonas- and Sulfurovum-related Epsilonproteobacteria observed in this study derives from the high variation of environmental parameters such as oxygen and sulfide concentrations across small spatial and temporal scales.


July 7, 2019

IgA-coated E. coli enriched in Crohn’s disease spondyloarthritis promote TH17-dependent inflammation.

Peripheral spondyloarthritis (SpA) is a common extraintestinal manifestation in patients with active inflammatory bowel disease (IBD) characterized by inflammatory enthesitis, dactylitis, or synovitis of nonaxial joints. However, a mechanistic understanding of the link between intestinal inflammation and SpA has yet to emerge. We evaluated and functionally characterized the fecal microbiome of IBD patients with or without peripheral SpA. Coupling the sorting of immunoglobulin A (IgA)-coated microbiota with 16S ribosomal RNA-based analysis (IgA-seq) revealed a selective enrichment in IgA-coated Escherichia coli in patients with Crohn’s disease-associated SpA (CD-SpA) compared to CD alone. E. coli isolates from CD-SpA-derived IgA-coated bacteria were similar in genotype and phenotype to an adherent-invasive E. coli (AIEC) pathotype. In comparison to non-AIEC E. coli, colonization of germ-free mice with CD-SpA E. coli isolates induced T helper 17 cell (TH17) mucosal immunity, which required the virulence-associated metabolic enzyme propanediol dehydratase (pduC). Modeling the increase in mucosal and systemic TH17 immunity we observed in CD-SpA patients, colonization of interleukin-10-deficient or K/BxN mice with CD-SpA-derived E. coli lead to more severe colitis or inflammatory arthritis, respectively. Collectively, these data reveal the power of IgA-seq to identify immunoreactive resident pathosymbionts that link mucosal and systemic TH17-dependent inflammation and offer microbial and immunophenotype stratification of CD-SpA that may guide medical and biologic therapy. Copyright © 2017, American Association for the Advancement of Science.


July 7, 2019

A supervised statistical learning approach for accurate Legionella pneumophila source attribution during outbreaks.

Public health agencies are increasingly relying on genomics during Legionnaires’ disease investigations. However, the causative bacterium (Legionella pneumophila) has an unusual population structure, with extreme temporal and spatial genome sequence conservation. Furthermore, Legionnaires’ disease outbreaks can be caused by multiple L. pneumophila genotypes in a single source. These factors can confound cluster identification using standard phylogenomic methods. Here, we show that a statistical learning approach based on L. pneumophila core genome single nucleotide polymorphism (SNP) comparisons eliminates ambiguity for defining outbreak clusters and accurately predicts exposure sources for clinical cases. We illustrate the performance of our method by genome comparisons of 234 L. pneumophila isolates obtained from patients and cooling towers in Melbourne, Australia, between 1994 and 2014. This collection included one of the largest reported Legionnaires’ disease outbreaks, which involved 125 cases at an aquarium. Using only sequence data from L. pneumophila cooling tower isolates and including all core genome variation, we built a multivariate model using discriminant analysis of principal components (DAPC) to find cooling tower-specific genomic signatures and then used it to predict the origin of clinical isolates. Model assignments were 93% congruent with epidemiological data, including the aquarium Legionnaires’ disease outbreak and three other unrelated outbreak investigations. We applied the same approach to a recently described investigation of Legionnaires’ disease within a UK hospital and observed a model predictive ability of 86%. We have developed a promising means to breach L. pneumophila genetic diversity extremes and provide objective source attribution data for outbreak investigations.IMPORTANCE Microbial outbreak investigations are moving to a paradigm where whole-genome sequencing and phylogenetic trees are used to support epidemiological investigations. It is critical that outbreak source predictions are accurate, particularly for pathogens, like Legionella pneumophila, which can spread widely and rapidly via cooling system aerosols, causing Legionnaires’ disease. Here, by studying hundreds of Legionella pneumophila genomes collected over 21 years around a major Australian city, we uncovered limitations with the phylogenetic approach that could lead to a misidentification of outbreak sources. We implement instead a statistical learning technique that eliminates the ambiguity of inferring disease transmission from phylogenies. Our approach takes geolocation information and core genome variation from environmental L. pneumophila isolates to build statistical models that predict with high confidence the environmental source of clinical L. pneumophila during disease outbreaks. We show the versatility of the technique by applying it to unrelated Legionnaires’ disease outbreaks in Australia and the UK. Copyright © 2017 American Society for Microbiology.


July 7, 2019

Euglena gracilis genome and transcriptome: organelles, nuclear genome assembly strategies and initial features.

Euglena gracilis is a major component of the aquatic ecosystem and together with closely related species, is ubiquitous worldwide. Euglenoids are an important group of protists, possessing a secondarily acquired plastid and are relatives to the Kinetoplastidae, which themselves have global impact as disease agents. To understand the biology of E. gracilis, as well as to provide further insight into the evolution and origins of the Kinetoplastidae, we embarked on sequencing the nuclear genome; the plastid and mitochondrial genomes are already in the public domain. Earlier studies suggested an extensive nuclear DNA content, with likely a high degree of repetitive sequence, together with significant extrachromosomal elements. To produce a list of coding sequences we have combined transcriptome data from both published and new sources, as well as embarked on de novo sequencing using a combination of 454, Illumina paired end libraries and long PacBio reads. Preliminary analysis suggests a surprisingly large genome approaching 2 Gbp, with a highly fragmented architecture and extensive repeat composition. Over 80% of the RNAseq reads from E. gracilis maps to the assembled genome sequence, which is comparable with the well assembled genomes of T. brucei and T. cruzi. In order to achieve this level of assembly we employed multiple informatics pipelines, which are discussed here. Finally, as a preliminary view of the genome architecture, we discuss the tubulin and calmodulin genes, which highlight potential novel splicing mechanisms.


July 7, 2019

A small secreted protein in Zymoseptoria tritici is responsible for avirulence on wheat cultivars carrying the Stb6 resistance gene.

Zymoseptoria tritici is the causal agent of Septoria tritici blotch, a major pathogen of wheat globally and the most damaging pathogen of wheat in Europe. A gene-for-gene (GFG) interaction between Z. tritici and wheat cultivars carrying the Stb6 resistance gene has been postulated for many years, but the genes have not been identified. We identified AvrStb6 by combining quantitative trait locus mapping in a cross between two Swiss strains with a genome-wide association study using a natural population of c. 100 strains from France. We functionally validated AvrStb6 using ectopic transformations. AvrStb6 encodes a small, cysteine-rich, secreted protein that produces an avirulence phenotype on wheat cultivars carrying the Stb6 resistance gene. We found 16 nonsynonymous single nucleotide polymorphisms among the tested strains, indicating that AvrStb6 is evolving very rapidly. AvrStb6 is located in a highly polymorphic subtelomeric region and is surrounded by transposable elements, which may facilitate its rapid evolution to overcome Stb6 resistance. AvrStb6 is the first avirulence gene to be functionally validated in Z. tritici, contributing to our understanding of avirulence in apoplastic pathogens and the mechanisms underlying GFG interactions between Z. tritici and wheat. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.


July 7, 2019

Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula.

Third generation sequencing technologies, with sequencing reads in the tens- of kilo-bases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner.Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly.Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies.


July 7, 2019

Resolving multicopy duplications de novo using polyploid phasing

While the rise of single-molecule sequencing systems has enabled an unprecedented rise in the ability to assemble complex regions of the genome, long segmental duplications in the genome still remain a challenging frontier in assembly. Segmental duplications are at the same time both gene rich and prone to large structural rearrangements, making the resolution of their sequences important in medical and evolutionary studies. Duplicated sequences that are collapsed in mammalian de novo assemblies are rarely identical; after a sequence is duplicated, it begins to acquire paralog-specific variants. In this paper, we study the problem of resolving the variations in multicopy, long segmental duplications by developing and utilizing algorithms for polyploid phasing. We develop two algorithms: the first one is targeted at maximizing the likelihood of observing the reads given the underlying haplotypes using discrete matrix completion. The second algorithm is based on correlation clustering and exploits an assumption, which is often satisfied in these duplications, that each paralog has a sizable number of paralog-specific variants. We develop a detailed simulation methodology and demonstrate the superior performance of the proposed algorithms on an array of simulated datasets. We measure the likelihood score as well as reconstruction accuracy, i.e., what fraction of the reads are clustered correctly. In both the performance metrics, we find that our algorithms dominate existing algorithms on more than 93% of the datasets. While the discrete matrix completion performs better on likelihood score, the correlation-clustering algorithm performs better on reconstruction accuracy due to the stronger regularization inherent in the algorithm. We also show that our correlation-clustering algorithm can reconstruct on average 7.0 haplotypes in 10-copy duplication datasets whereas existing algorithms reconstruct less than one copy on average.


July 7, 2019

Chromosome end repair and genome stability in Plasmodium falciparum.

The human malaria parasite Plasmodium falciparum replicates within circulating red blood cells, where it is subjected to conditions that frequently cause DNA damage. The repair of DNA double-stranded breaks (DSBs) is thought to rely almost exclusively on homologous recombination (HR), due to a lack of efficient nonhomologous end joining. However, given that the parasite is haploid during this stage of its life cycle, the mechanisms involved in maintaining genome stability are poorly understood. Of particular interest are the subtelomeric regions of the chromosomes, which contain the majority of the multicopy variant antigen-encoding genes responsible for virulence and disease severity. Here, we show that parasites utilize a competitive balance between de novo telomere addition, also called “telomere healing,” and HR to stabilize chromosome ends. Products of both repair pathways were observed in response to DSBs that occurred spontaneously during routine in vitro culture or resulted from experimentally induced DSBs, demonstrating that both pathways are active in repairing DSBs within subtelomeric regions and that the pathway utilized was determined by the DNA sequences immediately surrounding the break. In combination, these two repair pathways enable parasites to efficiently maintain chromosome stability while also contributing to the generation of genetic diversity.IMPORTANCE Malaria is a major global health threat, causing approximately 430,000 deaths annually. This mosquito-transmitted disease is caused by Plasmodium parasites, with infection with the species Plasmodium falciparum being the most lethal. Mechanisms underlying DNA repair and maintenance of genome integrity in P. falciparum are not well understood and represent a gap in our understanding of how parasites survive the hostile environment of their vertebrate and insect hosts. Our work examines DNA repair in real time by using single-molecule real-time (SMRT) sequencing focused on the subtelomeric regions of the genome that harbor the multicopy gene families important for virulence and the maintenance of infection. We show that parasites utilize two competing molecular mechanisms to repair double-strand breaks, homologous recombination and de novo telomere addition, with the pathway used being determined by the surrounding DNA sequence. In combination, these two pathways balance the need to maintain genome stability with the selective advantage of generating antigenic diversity. Copyright © 2017 Calhoun et al.


July 7, 2019

Complete genome analysis of Thermus parvatiensis and comparative genomics of Thermus spp. provide insights into genetic variability and evolution of natural competence as strategic survival attributes.

Thermophilic environments represent an interesting niche. Among thermophiles, the genus Thermus is among the most studied genera. In this study, we have sequenced the genome of Thermus parvatiensis strain RL, a thermophile isolated from Himalayan hot water springs (temperature >96°C) using PacBio RSII SMRT technique. The small genome (2.01 Mbp) comprises a chromosome (1.87 Mbp) and a plasmid (143 Kbp), designated in this study as pTP143. Annotation revealed a high number of repair genes, a squeezed genome but containing highly plastic plasmid with transposases, integrases, mobile elements and hypothetical proteins (44%). We performed a comparative genomic study of the group Thermus with an aim of analysing the phylogenetic relatedness as well as niche specific attributes prevalent among the group. We compared the reference genome RL with 16 Thermus genomes to assess their phylogenetic relationships based on 16S rRNA gene sequences, average nucleotide identity (ANI), conserved marker genes (31 and 400), pan genome and tetranucleotide frequency. The core genome of the analyzed genomes contained 1,177 core genes and many singleton genes were detected in individual genomes, reflecting a conserved core but adaptive pan repertoire. We demonstrated the presence of metagenomic islands (chromosome:5, plasmid:5) by recruiting raw metagenomic data (from the same niche) against the genomic replicons of T. parvatiensis. We also dissected the CRISPR loci wide all genomes and found widespread presence of this system across Thermus genomes. Additionally, we performed a comparative analysis of competence loci wide Thermus genomes and found evidence for recent horizontal acquisition of the locus and continued dispersal among members reflecting that natural competence is a beneficial survival trait among Thermus members and its acquisition depicts unending evolution in order to accomplish optimal fitness.


July 7, 2019

SMRT Sequencing revealed mitogenome characteristics and mitogenome-wide DNA modification pattern in Ophiocordyceps sinensis.

Single molecule, real-time (SMRT) sequencing was used to characterize mitochondrial (mt) genome of Ophiocordyceps sinensis and to analyze the mt genome-wide pattern of epigenetic DNA modification. The complete mt genome of O. sinensis, with a size of 157,539 bp, is the fourth largest Ascomycota mt genome sequenced to date. It contained 14 conserved protein-coding genes (PCGs), 1 intronic protein rps3, 27 tRNAs and 2 rRNA subunits, which are common characteristics of the known mt genomes in Hypocreales. A phylogenetic tree inferred from 14 PCGs in Pezizomycotina fungi supports O. sinensis as most closely related to Hirsutella rhossiliensis in Ophiocordycipitaceae. A total of 36 sequence sites in rps3 were under positive selection, with dN/dS >1 in the 20 compared fungi. Among them, 16 sites were statistically significant. In addition, the mt genome-wide base modification pattern of O. sinensis was determined in this study, especially DNA methylation. The methylations were located in coding and uncoding regions of mt PCGs in O. sinensis, and might be closely related to the expression of PCGs or the binding affinity of transcription factor A to mtDNA. Consequently, these methylations may affect the enzymatic activity of oxidative phosphorylation and then the mt respiratory rate; or they may influence mt biogenesis. Therefore, methylations in the mitogenome of O. sinensis might be a genetic feature to adapt to the cold and low PO2 environment at high altitude, where O. sinensis is endemic. This is the first report on epigenetic modifications in a fungal mt genome.


July 7, 2019

Complete genome sequences of three isolates of Xanthomonas fragariae, the bacterium responsible for angular leaf spots on strawberry plants.

Xanthomonas fragariae is a worldwide-spread plant bacterial disease causing angular leaf spots, thus reducing the yield of production for strawberry fruits. Three isolates with various geographic and time origins were sequenced with long-read technology (PacBio) to generate finished genome sequences of virulent strains and observe the variability in their contents. Copyright © 2017 Gétaz et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.