Buzz off, that’s my bee!
This month’s Genome Watch explores the interactions of bee bacterial symbionts with each other and with their apian hosts.
Marine sponges are ancient metazoans that are populated by distinct and highly diverse microbial communities. In order to obtain deeper insights into the functional gene repertoire of the Mediterranean sponge Aplysina aerophoba, we combined Illumina short-read and PacBio long-read sequencing followed by un-targeted metagenomic binning. We identified a total of 37 high-quality bins representing 11 bacterial phyla and two candidate phyla. Statistical comparison of symbiont genomes with selected reference genomes revealed a significant enrichment of genes related to bacterial defense (restriction-modification systems, toxin-antitoxin systems) as well as genes involved in host colonization and extracellular matrix utilization in sponge symbionts. A within-symbionts genome comparison revealed a nutritional specialization of at least two symbiont guilds, where one appears to metabolize carnitine and the other sulfated polysaccharides, both of which are abundant molecules in the sponge extracellular matrix. A third guild of symbionts may be viewed as nutritional generalists that perform largely the same metabolic pathways but lack such extraordinary numbers of the relevant genes. This study characterizes the genomic repertoire of sponge symbionts at an unprecedented resolution and it provides greater insights into the molecular mechanisms underlying microbial-sponge symbiosis.
In Kazakhstan, traditional artisanal cheeses have a long history and are widely consumed. The unique characteristics of local artisanal cheeses are almost completely preserved. However, their microbial communities have rarely been reported. The current study first generated single-molecule, real-time (SMRT) sequencing bacterial diversity profiles of six traditional artisanal cheese samples of Kazakhstan origin, and then compared the microbiota composition of this dataset with those of cheeses originating from Belgium, the Russian Republic of Kalmykia (Kalmykia) and Italy.
Multi-pond salterns constitute an excellent model for the study of the microbial diversity and ecology of hypersaline environments, showing a wide range of salt concentrations, from seawater to salt saturation. Studies of the Santa Pola (Alicante, Spain) multi-pond solar saltern accumulated over the last 35 years include culture-dependent and culture-independent molecular methods and, more recently, metagenomics. These approaches have made it possible to determine in depth the microbial diversity of the ponds with intermediate salinities (from 10 % salts) up to salt saturation, with haloarchaea and bacteria as the two dominant groups. In this review, we describe the main results obtained using the different methodologies, the most relevant contributions for understanding the ecology of these extreme environments and the future perspectives for such studies.
Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation. The species comprising a microbial community are often difficult to deconvolute due to technical limitations inherent to most short-read sequencing technologies. Here, we leverage new advances in sequencing technology, single-molecule sequencing, to significantly improve reconstruction of a complex human skin microbial community. With this long-read technology, we were able to reconstruct and annotate a closed, high-quality genome of a previously uncharacterized skin species. We demonstrate that hybrid approaches with short-read technology are sufficiently powerful to reconstruct even single-nucleotide polymorphism level variation of species in this community. Copyright © 2016 Tsai et al.
Sequence comparison of genetic material between known and unknown organisms plays a crucial role in genomics, metagenomics and phylogenetic analysis. The emerging long-read sequencing technologies can now produce reads of tens of kilobases in length that promise a more accurate assessment of their origin. To facilitate the classification of long and short DNA sequences, we have developed a Python package that implements a new sequence classification model that we have demonstrated to improve the classification accuracy when compared with other state-of-the-art classification methods. For the purpose of validation, and to demonstrate its usefulness, we test the combined sequence similarity score classifier (CSSSCL) using three different datasets, including a metagenomic dataset composed of short reads. The package’s source code and test datasets are available under the GPLv3 license at https://github.com/oicr-ibc/cssscl. Contact: ivan.borozan@oicr.on.ca. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Capillary malformation-arteriovenous malformation (CM-AVM) is an autosomal dominant vascular disorder that is associated with inherited inactivating mutations of the RASA1 gene in the majority of cases. Characteristically, patients exhibit one or more focal cutaneous CM that may occur alone or together with AVM, arteriovenous fistulas or lymphatic vessel abnormalities. The focal nature and varying presentation of lesions has led to the hypothesis that somatic “second hit” inactivating mutations of RASA1 are necessary for disease development. In this study, we examined CM from four different CM-AVM patients for the presence of somatically acquired RASA1 mutations. All four patients were shown to possess inactivating heterozygous germline RASA1 mutations. In one of the patients, a somatic inactivating RASA1 mutation (c.1534C > T, p.Arg512*) was additionally identified in CM lesion tissue. The somatic RASA1 mutation was detected within endothelial cells specifically and was in trans with the germline RASA1 mutation. Together with the germline RASA1 mutation (c.2125C > T, p.Arg709*) in the same patient, the endothelial cell somatic RASA1 mutation likely contributed to lesion development. These studies provide the first clear evidence of the second hit model of CM-AVM pathogenesis. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Evolution has provided environmental bacteria with a plethora of genes that give resistance to antibiotic compounds. Under anthropogenic selection pressures, some of these genes are believed to be recruited over time into pathogens by horizontal gene transfer. River sediment polluted with fluoroquinolones and other drugs discharged from bulk drug production in India constitutes an environment with unprecedented, long-term antibiotic selection pressures. It is therefore plausible that previously unknown resistance genes have evolved and/or are promoted here. In order to search for novel resistance genes, we therefore analyzed such river sediments by a functional metagenomics approach. DNA fragments providing resistance to different antibiotics in E. coli were sequenced using Sanger and PacBio RSII platforms. We recaptured the majority of known antibiotic resistance genes previously identified by open shot-gun metagenomics sequencing of the same samples. In addition, seven novel resistance gene candidates (six beta-lactamases and one amikacin resistance gene) were identified. Two class A beta-lactamases, blaRSA1 and blaRSA2, were phylogenetically close to clinically important ESBLs like blaGES, blaBEL and blaL2, and were further characterized for their substrate spectra. The blaRSA1 protein, encoded as an integron gene cassette, efficiently hydrolyzed penicillins, first-generation cephalosporins and cefotaxime, while blaRSA2 was an inducible class A beta-lactamase, capable of hydrolyzing carbapenems albeit with limited efficiency, similar to the L2 beta-lactamase from Stenotrophomonas maltophilia. All detected novel genes were associated with plasmid mobilization proteins, integrons, and/or other resistance genes, suggesting a potential for mobility. This study provides insight into a resistome shaped by an exceptionally strong and long-term antibiotic selection pressure.
An improved knowledge of mobilized resistance factors in the external environment may make us better prepared for the resistance challenges that we may face in clinics in the future. Copyright © 2017 Elsevier Ltd. All rights reserved.
Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. To identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylum Bacteroidetes. IMPORTANCE Molecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.
We recently demonstrated that lymph nodes (LNs) PD-1+/T follicular helper (Tfh) cells from antiretroviral therapy (ART)-treated HIV-infected individuals were enriched in cells containing replication competent virus. However, the distribution of cells containing inducible replication competent virus has been only partially elucidated in blood memory CD4 T-cell populations including the Tfh cell counterpart circulating in blood (cTfh). In this context, we have investigated the distribution of (1) total HIV-infected cells and (2) cells containing replication competent and infectious virus within various blood and LN memory CD4 T-cell populations of conventional antiretroviral therapy (cART)-treated HIV-infected individuals. In the present study, we show that blood CXCR3-expressing memory CD4 T cells are enriched in cells containing inducible replication competent virus and contributed the most to the total pool of cells containing replication competent and infectious virus in blood. Interestingly, subsequent proviral sequence analysis did not indicate virus compartmentalization between blood and LN CD4 T-cell populations, suggesting dynamic interchanges between the two compartments. We then investigated whether the composition of blood HIV reservoir may reflect the polarization of LN CD4 T cells at the time of reservoir seeding and showed that LN PD-1+CD4 T cells of viremic untreated HIV-infected individuals expressed significantly higher levels of CXCR3 as compared to CCR4 and/or CCR6, suggesting that blood CXCR3-expressing CD4 T cells may originate from LN PD-1+CD4 T cells. Taken together, these results indicate that blood CXCR3-expressing CD4 T cells represent the major blood compartment containing inducible replication competent virus in treated aviremic HIV-infected individuals.
Mitotic recombination can result in loss of heterozygosity and chromosomal rearrangements that shape genome structure and initiate human disease. Engineered double-strand breaks (DSBs) are a potent initiator of recombination, but whether spontaneous events initiate with the breakage of one or both DNA strands remains unclear. In the current study, a crossover (CO)-specific assay was used to compare heteroduplex DNA (hetDNA) profiles, which reflect strand exchange intermediates, associated with DSB-induced versus spontaneous events in yeast. Most DSB-induced CO products had the two-sided hetDNA predicted by the canonical DSB repair model, with a switch in hetDNA position from one product to the other at the position of the break. Approximately 40% of COs, however, had hetDNA on only one side of the initiating break. This anomaly can be explained by a modified model in which there is frequent processing of an early invasion (D-loop) intermediate prior to extension of the invading end. Finally, hetDNA tracts exhibited complexities consistent with frequent expansion of the DSB into a gap, migration of strand-exchange junctions, and template switching during gap-filling reactions. hetDNA patterns in spontaneous COs isolated in either a wild-type background or in a background with elevated levels of reactive oxygen species (tsa1Δ mutant) were similar to those associated with the DSB-induced events, suggesting that DSBs are the major instigator of spontaneous mitotic recombination in yeast.
Highly mutable RNA viruses such as influenza A virus, human immunodeficiency virus and hepatitis C virus exist in infected hosts as highly heterogeneous populations of closely related genomic variants. The presence of low-frequency variants with few mutations with respect to major strains may result in immune escape, emergence of drug resistance, and an increase of virulence and infectivity. Next-generation sequencing technologies permit sampling of intra-host viral populations at extremely great depth, thus providing an opportunity to access low-frequency variants. Long read lengths offered by single-molecule sequencing technologies allow all viral variants to be sequenced in a single pass. However, high sequencing error rates limit the ability to study heterogeneous viral populations composed of rare, closely related variants. In this article, we present CliqueSNV, a novel reference-based method for reconstruction of viral variants from NGS data. It efficiently constructs an allele graph based on linkage between single nucleotide variations and identifies true viral variants by merging cliques of that graph using combinatorial optimization techniques. The new method outperforms existing methods in both accuracy and running time on experimental and simulated NGS data for titrated levels of known viral variants. For PacBio reads, it accurately reconstructs variants with frequency as low as 0.1%. For Illumina reads, it fully reconstructs main variants. The open source implementation of CliqueSNV is freely available for download at https://github.com/vyacheslav-tsivina/CliqueSNV
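The idea of linking co-occurring single nucleotide variations and reading variants off cliques can be illustrated with a toy sketch. This is not CliqueSNV's implementation: the function names, the `min_support` threshold, the assumption that reads are already aligned to the reference and of equal length, and the use of plain Bron-Kerbosch clique enumeration are all illustrative choices, and the clique-merging and optimization steps mentioned in the abstract are omitted.

```python
from collections import Counter
from itertools import combinations


def minor_alleles(reads, reference):
    """For each read, return the set of (position, base) pairs that differ from the reference."""
    return [{(i, b) for i, (b, r) in enumerate(zip(read, reference)) if b != r}
            for read in reads]


def linkage_graph(reads, reference, min_support=2):
    """Link two minor alleles when at least `min_support` reads carry both (assumed threshold)."""
    pair_counts = Counter()
    nodes = set()
    for alleles in minor_alleles(reads, reference):
        nodes |= alleles
        for a, b in combinations(sorted(alleles), 2):
            pair_counts[(a, b)] += 1
    graph = {n: set() for n in nodes}
    for (a, b), count in pair_counts.items():
        if count >= min_support:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def maximal_cliques(graph):
    """Enumerate maximal cliques with the basic Bron-Kerbosch recursion."""
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(frozenset(r))
            return
        for v in list(p):
            expand(r | {v}, p & graph[v], x & graph[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(graph), set())
    return cliques
```

In this sketch each maximal clique is a candidate set of mutations that travel together on the same reads; a pair of alleles seen together on too few reads (for example, in a single erroneous read) gets no edge and so cannot join a clique.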
The majority of Legionnaires’ disease (LD) cases are caused by Legionella pneumophila, a genetically heterogeneous species composed of at least 17 serogroups. Previously, it was demonstrated that L. pneumophila consists of three subspecies: pneumophila, fraseri and pascullei. During an LD outbreak investigation in 2012, we detected that representatives of both subspecies fraseri and pascullei colonized the same water system and that the outbreak-causing strain was a new member of the least represented subspecies pascullei. We used partial sequence based typing consensus patterns to mine an international database for additional representatives of fraseri and pascullei subspecies. As a result, we identified 46 sequence types (STs) belonging to subspecies fraseri and two STs belonging to subspecies pascullei. Moreover, a recent retrospective whole genome sequencing analysis of isolates from New York State LD clusters revealed the presence of a fourth L. pneumophila subspecies that we have termed raphaeli. This subspecies consists of 15 STs. Comparative analysis was conducted using the genomes of multiple members of all four L. pneumophila subspecies. Whereas each subspecies forms a distinct phylogenetic clade within the L. pneumophila species, they share more average nucleotide identity with each other than with other Legionella species. Unique genes for each subspecies were identified and could be used for rapid subspecies detection. Improved taxonomic classification of L. pneumophila strains may help identify environmental niches and virulence attributes associated with these genetically distinct subspecies. Published by Elsevier B.V.
Large-scale population genomic surveys are essential to explore the phenotypic diversity of natural populations. Here we report the whole-genome sequencing and phenotyping of 1,011 Saccharomyces cerevisiae isolates, which together provide an accurate evolutionary picture of the genomic variants that shape the species-wide phenotypic landscape of this yeast. Genomic analyses support a single ‘out-of-China’ origin for this species, followed by several independent domestication events. Although domesticated isolates exhibit high variation in ploidy, aneuploidy and genome content, genome evolution in wild isolates is mainly driven by the accumulation of single nucleotide polymorphisms. A common feature is the extensive loss of heterozygosity, which represents an essential source of inter-individual variation in this mainly asexual species. Most of the single nucleotide polymorphisms, including experimentally identified functional polymorphisms, are present at very low frequencies. The largest numbers of variants identified by genome-wide association are copy-number changes, which have a greater phenotypic effect than do single nucleotide polymorphisms. This resource will guide future population genomics and genotype-phenotype studies in this classic model system.
Recent studies indicate that there is selection bias for transmission of viral polymorphisms associated with higher viral fitness. Furthermore, after transmission and before a specific immune response is mounted in the recipient, the virus undergoes a number of reversions that allow an increase in its replicative capacity. These aspects, and others, affect the viral population characteristic of early acute infection. A total of 160 single gag-gene amplifications were obtained by limiting-dilution RT-PCR from plasma samples of 8 ARV-naïve patients with early acute infection (<30 days, 22 days average) and 8 ARV-naïve patients with approximately a year of infection (10 amplicons per patient). Sanger sequencing and NGS SMRT technology (Pacific Biosciences) were implemented to sequence the amplicons. Phylogenetic analysis was performed by using MEGA 6.06. HLA-I (A and B) typing was performed by the SSOP-PCR method. The chromatograms were analyzed with Sequencher 4.10. Epitope and immune-proteasomal cleavage prediction was performed with the CBS prediction server for the 30 HLA-A and -B alleles most prevalent in our population, with peptide lengths from 8 to 14 mer. Cytotoxic response prediction was performed by using the IEDB Analysis Resource. After implementing epitope prediction analysis, we identified a total of 325 possible viral epitopes present in two or more acute or chronic patients. Of these, 60.3% (n = 196) were present only in acute infection (prevalent acute epitopes) while 39.7% (n = 129) were present only in chronic infection (prevalent chronic epitopes). Within p24, the difference was equally dramatic, with 59.4% (79/133) being acute epitopes (p?0.05). This is consistent with progressive viral adaptation to the immune response over time and is further supported by the fact that cytotoxic response prediction showed that acute epitopes are more likely to generate an immune response than chronic epitopes.
Interestingly, only 27.5% of acute epitopes match the population-level consensus sequence of the virus. Our results indicate that certain non-consensus viral residues might be transmitted more frequently than consensus residues when located in immunologically relevant positions (epitopes). This observation might be relevant to the rationale behind development of an effective vaccine, based on prevalent acute epitopes, to reduce the viral reservoir and induce a functional cure of HIV infection. Copyright © 2018 Elsevier Ltd. All rights reserved.