September 22, 2019

TranSurVeyor: an improved database-free algorithm for finding non-reference transpositions in high-throughput sequencing data.

Transpositions transfer DNA segments between different loci within a genome; in particular, when a transposition is found in a sample but not in a reference genome, it is called a non-reference transposition. They are important structural variations that have clinical impact. Transpositions can be called by analyzing second-generation high-throughput sequencing datasets. Current methods follow either a database-based or a database-free approach. Database-based methods require a database of transposable elements. Some of them have good specificity; however, this approach cannot detect novel transpositions, and it requires a good database of transposable elements, which is not yet available for many species. Database-free methods perform de novo calling of transpositions, but their accuracy is low. We observe that this is due to the misalignment of the reads; since reads are short and the human genome has many repeats, false alignments create false positive predictions while missing alignments reduce the true positive rate. This paper proposes new techniques to improve database-free non-reference transposition calling: first, we propose a realignment strategy called one-end remapping that corrects the alignments of reads in interspersed repeats; second, we propose an SNV-aware filter that removes some incorrectly aligned reads. By combining these two techniques and other techniques like clustering and a positive-to-negative ratio filter, our proposed transposition caller TranSurVeyor shows at least a 3.1-fold improvement in terms of F1-score over existing database-free methods. More importantly, even though TranSurVeyor does not use databases of prior information, its performance is at least as good as existing database-based methods such as MELT, Mobster and Retroseq. We also illustrate that TranSurVeyor can discover transpositions that are not known in the current database.
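
The F1-score used in this comparison is the harmonic mean of precision and recall. The following is a minimal sketch of that calculation only; the function name and the counts are hypothetical illustrations and are not taken from the paper or from TranSurVeyor itself.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall for a set of calls.

    tp: predicted transpositions that match a true event
    fp: predicted transpositions with no matching true event
    fn: true events that were not predicted
    Returns 0.0 when there are no true positives (F1 is then 0 or undefined).
    """
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only.
caller_a = f1_score(tp=80, fp=20, fn=20)
caller_b = f1_score(tp=20, fp=60, fn=80)
print(f"caller A F1 = {caller_a:.3f}, caller B F1 = {caller_b:.3f}")
print(f"fold improvement = {caller_a / caller_b:.2f}")
```

Because F1 penalizes both false alignments (lower precision) and missing alignments (lower recall), it reflects the two failure modes the abstract attributes to read misalignment.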


September 22, 2019

Full-length extension of HLA allele sequences by HLA allele-specific hemizygous Sanger sequencing (SSBT).

The gold standard for typing at the allele level of the highly polymorphic Human Leucocyte Antigen (HLA) gene system is sequence based typing. Since sequencing strategies have mainly focused on identification of the peptide binding groove, full-length sequence information is lacking for >90% of the HLA alleles. One of the goals of the 17th IHIWS workshop is to establish full-length sequences for as many HLA alleles as possible. In our component “Extension of HLA sequences by full-length HLA allele-specific hemizygous Sanger sequencing” we have used full-length hemizygous Sanger Sequence Based Typing to achieve this goal. We selected samples of which full length sequences were not available in the IPD-IMGT/HLA database. In total we have generated the full-length sequences of 48 HLA-A, 45 -B and 31 -C alleles. For HLA-A extended alleles, 39/48 showed no intron differences compared to the first allele of the corresponding allele group, for HLA-B this was 26/45 and for HLA-C 20/31. Comparing the intron sequences to other alleles of the same allele group revealed that in 5/48 HLA-A, 16/45 HLA-B and 8/31 HLA-C alleles the intron sequence was identical to another allele of the same allele group. In the remaining 10 cases, the sequence either showed polymorphism at a conserved nucleotide or was the result of a gene conversion event. Elucidation of the full-length sequence gives insight in the polymorphic content of the alleles and facilitates the identification of its evolutionary origin. Copyright © 2018 American Society for Histocompatibility and Immunogenetics. All rights reserved.
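
The counts reported in this abstract can be cross-checked with a short tally. This is only an arithmetic sketch: the dictionary keys are hypothetical labels, and the per-locus split of the remaining cases assumes the two reported categories do not overlap, which the abstract implies but does not state.

```python
# Counts as reported in the abstract above.
alleles = {
    "HLA-A": {"extended": 48, "same_as_first_allele": 39, "same_as_other_allele": 5},
    "HLA-B": {"extended": 45, "same_as_first_allele": 26, "same_as_other_allele": 16},
    "HLA-C": {"extended": 31, "same_as_first_allele": 20, "same_as_other_allele": 8},
}


def remaining(row: dict) -> int:
    """Alleles whose introns match neither the first nor another allele of the group."""
    return row["extended"] - row["same_as_first_allele"] - row["same_as_other_allele"]


# Assumption: the two categories are mutually exclusive within each locus.
remaining_per_locus = {locus: remaining(row) for locus, row in alleles.items()}
total_extended = sum(row["extended"] for row in alleles.values())
total_remaining = sum(remaining_per_locus.values())

print(f"total extended alleles: {total_extended}")
print(f"remaining cases per locus: {remaining_per_locus}")
print(f"remaining cases overall: {total_remaining}")
```

Under that assumption the tally reproduces the 10 remaining cases mentioned in the abstract.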


September 22, 2019

Tracing back multidrug-resistant bacteria in fresh herb production: from chive to source through the irrigation water chain.

Environmental antibiotic-resistant bacteria (ARB) can be transferred to humans through foods. Fresh produce in particular is an ideal vector due to frequent raw consumption. A major contamination source of fresh produce is irrigation water. We hypothesized that water quality significantly affects loads of ARB and their diversity on fresh produce despite various other contamination sources present under agricultural practice conditions. Chive irrigated from an open-top reservoir or sterile-filtered water (control) was examined. Heterotrophic plate counts (HPC) and ARB were determined for water and chive with emphasis on Escherichia coli and Enterococcus spp. High HPC of freshly planted chive decreased over time and were significantly lower on control- vs. reservoir-irrigated chive at harvest (1.3 log (CFU/g) lower). Ciprofloxacin- and ceftazidime-resistant bacteria were significantly lower on control-irrigated chive at harvest and end of shelf life (up to 1.8 log (CFU/g) lower). Escherichia coli and Enterococcus spp. repeatedly isolated from water and chive proved resistant to up to six or four antibiotic classes (80% or 49% multidrug-resistant, respectively). Microbial source tracking identified E. coli-ST1056 along the irrigation chain and on chive. Whole-genome sequencing revealed that E. coli-ST1056 from both environments were clonal and carried the same transmissible multidrug-resistance plasmid, proving water as source of chive contamination. These findings emphasize the urgent need for guidelines concerning ARB in irrigation water and development of affordable water disinfection technologies to diminish ARB on irrigated produce.
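
The differences above are given in log10 units of CFU/g, which can be hard to read at a glance. As a rough sketch, a log10 difference converts to a fold change and a percentage reduction as shown below; the function names are hypothetical, and only the two log values are taken from the abstract.

```python
def fold_reduction(log10_difference: float) -> float:
    """Convert a log10(CFU/g) difference into a fold change in counts."""
    return 10 ** log10_difference


def percent_reduction(log10_difference: float) -> float:
    """Percentage of the original count removed by a given log10 reduction."""
    return (1 - 1 / fold_reduction(log10_difference)) * 100


# Log differences reported in the abstract (control- vs. reservoir-irrigated chive).
for label, log_diff in [("HPC at harvest", 1.3), ("resistant bacteria, maximum", 1.8)]:
    print(
        f"{label}: {log_diff} log lower = about {fold_reduction(log_diff):.0f}-fold, "
        f"{percent_reduction(log_diff):.1f}% fewer CFU/g"
    )
```

Read this way, a 1.3 log difference corresponds to roughly a 20-fold lower count and a 1.8 log difference to roughly a 63-fold lower count.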


September 22, 2019

Full gene HLA class I sequences of 79 novel and 519 mostly uncommon alleles from a large United States registry population.

HLA class I assignments were obtained at single genotype, G-level resolution from 98,855 volunteers for an unrelated donor registry in the United States. In spite of the diverse ancestry of the volunteers, over 99% of the assignments at each locus are common. Within this population, 52 novel alleles differing in exons 2 and 3 are identified and characterized. Previously reported alleles with incomplete sequences in the IPD-IMGT/HLA database (n = 519) were selected for full gene sequencing and, from this sampling, another 27 novel alleles are described. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.


September 22, 2019

An IncX1 plasmid isolated from Salmonella enterica subsp. enterica serovar Pullorum carrying blaTEM-1B, sul2, arsenic resistant operons.

We have identified an IncX1 plasmid named pQJDSal1 from Salmonella enterica subsp. enterica serovar Pullorum (S. Pullorum). The plasmid is 67,685 bp in size and has 72 putative genes. pQJDSal1 harbors a conserved IncX1-type backbone with predicted regions for conjugation, replication and partitioning, as well as a toxin/antitoxin plasmid addiction system. Two regions (A and B) that have not been previously reported in IncX1 plasmids are inserted into the backbone. Region A (10.7 kb), inserted between parA and taxD, consists of a new Tn6168-like transposon containing an arsenic resistant operon arsB2CHR and sulfonamide resistance gene sul2. Region B contains another arsenic resistant operon arsADHR, resistance gene blaTEM-1B and three transposable elements. Conjugation experiments showed that pQJDSal1 could transfer from S. Pullorum to Escherichia coli (E. coli) J53. Statistical analysis of 70 sequenced IncX1 plasmids revealed that IncX1 plasmids harbored various antibiotic resistance genes. The results highlight the importance of IncX1 plasmids in disseminating antibiotic resistance genes. Copyright © 2018. Published by Elsevier Inc.


September 22, 2019

A metabolic and genomic assessment of sugar fermentation profiles of the thermophilic Thermotogales, Fervidobacterium pennivorans.

A metabolic, genomic and proteomic assessment of Fervidobacterium pennivorans strains was undertaken to clarify the metabolic and genetic capabilities of this Thermotogales species. The type strain Ven5, originally isolated from a hot mud spa in Italy, and a newly isolated strain (DYC) from a hot spring at Ngatamariki, New Zealand, were compared for metabolic and genomic differences. The fermentation profiles of both strains on cellobiose generated similar major end products (acetate, alanine, glutamate, H2, and CO2). The vast majority of end products produced were redox neutral, and carbon balances were in the range of 95-115%. Each strain showed distinct fermentation profiles on sugar substrates. The genome of strain DYC was sequenced and shown to have high sequence similarity and synteny with the F. pennivorans Ven5 genome, suggesting they are the same species. The unique genome regions in Ven5 corresponded to genes involved in the Entner-Doudoroff pathway, confirming our observation of DYC’s inability to utilize gluconate. Genome analysis was able to elucidate pathways involved in production of the observed end-products with the exception of alanine and glutamate synthesis, which were resolved with less clarity due to poor sequence identity and missing critical enzymes within the pathway, respectively.


September 22, 2019

Functional metagenomics identifies an exosialidase with an inverting catalytic mechanism that defines a new glycoside hydrolase family (GH156).

Exosialidases are glycoside hydrolases that remove a single terminal sialic acid residue from oligosaccharides. They are widely distributed in biology, having been found in prokaryotes, eukaryotes, and certain viruses. Most characterized prokaryotic sialidases are from organisms that are pathogenic or commensal with mammals. However, in this study, we used functional metagenomic screening to seek microbial sialidases encoded by environmental DNA isolated from an extreme ecological niche, a thermal spring. Using recombinant expression of potential exosialidase candidates and a fluorogenic sialidase substrate, we discovered an exosialidase having no homology to known sialidases. Phylogenetic analysis indicated that this protein is a member of a small family of bacterial proteins of previously unknown function. Proton NMR revealed that this enzyme functions via an inverting catalytic mechanism, a biochemical property that is distinct from those of known exosialidases. This unique inverting exosialidase defines a new CAZy glycoside hydrolase family we have designated GH156. © 2018 Chuzel et al.


September 22, 2019

Phenazines in plant-beneficial Pseudomonas spp.: biosynthesis, regulation, function and genomics.

Plant-beneficial phenazine-producing Pseudomonas spp. are proficient biocontrol agents of soil-dwelling plant pathogens. Phenazines are redox-active molecules that display broad-spectrum antibiotic activity toward many fungal, bacterial and oomycete plant pathogens. Phenazine compounds also play a role in the persistence and survival of Pseudomonas spp. in the rhizosphere. This mini-review focuses on plant-beneficial phenazine-producing Pseudomonas spp. from the P. fluorescens species complex, which includes numerous well-known phenazine-producing strains of biocontrol interest. In this review the current knowledge on phenazine biosynthesis and regulation, the role played by phenazines in biocontrol and rhizosphere colonization, as well as exciting new advances in the genomics of plant-beneficial phenazine-producing Pseudomonas spp. will be discussed. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019

Genome mining of Streptomyces xinghaiensis NRRL B-24674T for the discovery of the gene cluster involved in anticomplement activities and detection of novel xiamycin analogs.

Marine actinobacterium Streptomyces xinghaiensis NRRL B-24674T has been characterized as a novel species, but thus far, its biosynthetic potential remains unexplored. In this study, the high-quality genome sequence of S. xinghaiensis NRRL B-24674T was obtained, and the production of anticomplement agents, xiamycin analogs, and siderophores was investigated by genome mining. Anticomplement compounds are valuable for combating numerous diseases caused by the abnormal activation of the human complement system. The biosynthetic gene cluster (BGC) nrps1 resembles that of complestatins, which are potent microbial-derived anticomplement agents. The identification of the nrps1 BGC revealed a core peptide that differed from that in complestatin; thus, we studied the anticomplement activity of this strain. The culture broth of S. xinghaiensis NRRL B-24674T displayed good anticomplement activity. Subsequently, the disruption of the genes in the nrps1 BGC resulted in the loss of anticomplement activity, confirming the involvement of this BGC in the biosynthesis of anticomplement agents. In addition, the mining of the BGC tep5, which resembles that of the antiviral pentacyclic indolosesquiterpene xiamycin, resulted in the discovery of nine xiamycin analogs, including three novel compounds. In addition to the BGCs responsible for desferrioxamine B, neomycin, ectoine, and carotenoid, 18 BGCs present in the genome are predicted to be novel. The results of this study unveil the potential of S. xinghaiensis as a producer of novel anticomplement agents and provide a basis for further exploration of the biosynthetic potential of S. xinghaiensis NRRL B-24674T for the discovery of novel bioactive compounds by genome mining.


September 22, 2019

Alpha- and beta-mannan utilization by marine Bacteroidetes.

Marine microscopic algae carry out about half of the global carbon dioxide fixation into organic matter. They provide organic substrates for marine microbes such as members of the Bacteroidetes that degrade algal polysaccharides using carbohydrate-active enzymes (CAZymes). In Bacteroidetes genomes CAZyme encoding genes are mostly grouped in distinct regions termed polysaccharide utilization loci (PULs). While some studies have shown involvement of PULs in the degradation of algal polysaccharides, the specific substrates are for the most part still unknown. We investigated four marine Bacteroidetes isolated from the southern North Sea that harbour putative mannan-specific PULs. These PULs are similarly organized as PULs in human gut Bacteroides that digest α- and β-mannans from yeasts and plants respectively. Using proteomics and defined growth experiments with polysaccharides as sole carbon sources we could show that the investigated marine Bacteroidetes express the predicted functional proteins required for α- and β-mannan degradation. Our data suggest that algal mannans play an as yet unknown important role in the marine carbon cycle, and that biochemical principles established for gut or terrestrial microbes also apply to marine bacteria, even though their PULs are evolutionarily distant. © 2018 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019

Genomic evidence for asymmetric introgression by sexual selection in the common wall lizard.

Strongly selected characters can be transferred from one lineage to another with limited genetic exchange, resulting in asymmetric introgression and a mosaic genome in the receiving population. However, systems are rarely sufficiently well studied to link the pattern of introgression to its underlying process. Male common wall lizards in western Italy exhibit exaggeration of a suite of sexually selected characters that make them outcompete males from a distantly related lineage that lack these characters. This results in asymmetric hybridization and adaptive introgression of the suite of characters following secondary contact. We developed genomewide markers to infer the demographic history of gene flow between different genetic lineages, identify the spread of the sexually selected syndrome, and test the prediction that introgression should be asymmetric and heterogeneous across the genome. Our results show that secondary contact was accompanied by gene flow in both directions across most of the genome, but with approximately 3% of the genome showing highly asymmetric introgression in the predicted direction. Demographic simulations reveal that this asymmetric gene flow is more recent than the initial secondary contact, and the data suggest that the exaggerated male sexual characters originated within the Italian lineage and subsequently spread throughout this lineage before eventually reaching the contact zone. These results demonstrate that sexual selection can cause a suite of characters to spread throughout both closely and distantly related lineages with limited gene flow across the genome at large. © 2018 John Wiley & Sons Ltd.


September 22, 2019

Evaluation of bacterial contamination in goat milk powder using PacBio Single Molecule Real-Time Sequencing and Droplet Digital PCR.

Goat milk powder is a nutritious and easy-to-store product that is highly favored by consumers. However, the presence of contaminating bacteria and their metabolites may significantly affect the flavor, solubility, shelf life, and safety of the product. To comprehensively and accurately understand the sanitary conditions in the goat milk powder production process and potential threats from bacterial contamination, a combination of Pacific Biosciences single molecule real-time sequencing and droplet digital PCR was used to evaluate bacterial contamination in seven goat milk powder samples from three dairies. Ten phyla, 119 genera, and 249 bacterial species were identified. Bacillus, Paenibacillus, Lactococcus, and Cronobacter were the primary genera. Bacillus cereus, Lactococcus lactis, Alkaliphilus oremlandii, and Cronobacter sakazakii were the dominant species. With droplet digital PCR, 6.3 × 10^4 copies per g of Bacillus cereus and 1.0 × 10^4 copies per g of Cronobacter spp. were quantified, which may increase the risk of food spoilage and the probability of foodborne illness and should be monitored and controlled. This study offers a new approach for evaluating bacterial contamination in goat milk powder and supplies a reference for the assessment of food safety and control of potential risk, which will be of interest to the dairy industry.


September 22, 2019

Genomic discovery of the hypsin gene and biosynthetic pathways for terpenoids in Hypsizygus marmoreus.

Hypsizygus marmoreus (Beech mushroom) is a popular ingredient in Asian cuisine. The medicinal effects of its bioactive compounds such as hypsin and hypsiziprenol have been reported, but the genetic basis or biosynthesis of these components is unknown. In this study, we sequenced a reference strain of H. marmoreus (Haemi 51,987-8). We evaluated various assembly strategies, and as a result the Allpaths and PBJelly produced the best assembly. The resulting genome was 42.7 Mbp in length and annotated with 16,627 gene models. A putative gene (Hypma_04324) encoding the antifungal and antiproliferative hypsin protein with 75% sequence identity with the previously known N-terminal sequence was identified. Carbohydrate active enzyme analysis displayed the typical feature of white-rot fungi where auxiliary activity and carbohydrate-binding modules were enriched. The genome annotation revealed four terpene synthase genes responsible for terpenoid biosynthesis. From the gene tree analysis, we identified that terpene synthase genes can be classified into six clades. Four terpene synthase genes of H. marmoreus belonged to four different groups that implies they may be involved in the synthesis of different structures of terpenes. A terpene synthase gene cluster was well-conserved in Agaricomycetes genomes, which contained known biosynthesis and regulatory genes. Genome sequence analysis of this mushroom led to the discovery of the hypsin gene. Comparative genome analysis revealed the conserved gene cluster for terpenoid biosynthesis in the genome. These discoveries will further our understanding of the biosynthesis of medicinal bioactive molecules in this edible mushroom.


September 22, 2019

Comparative genomic and methylome analysis of non-virulent D74 and virulent Nagasaki Haemophilus parasuis isolates.

Haemophilus parasuis is a respiratory pathogen of swine and the etiological agent of Glässer’s disease. H. parasuis isolates can exhibit different virulence capabilities ranging from lethal systemic disease to subclinical carriage. To identify genomic differences between phenotypically distinct strains, we obtained the closed whole-genome sequence annotation and genome-wide methylation patterns for the highly virulent Nagasaki strain and for the non-virulent D74 strain. Evaluation of the virulence-associated genes contained within the genomes of D74 and Nagasaki led to the discovery of a large number of toxin-antitoxin (TA) systems within both genomes. Five predicted hemolysins were identified as unique to Nagasaki and seven putative contact-dependent growth inhibition toxin proteins were identified only in strain D74. Assessment of all potential vtaA genes revealed thirteen present in the Nagasaki genome and three in the D74 genome. Subsequent evaluation of the predicted protein structure revealed that none of the D74 VtaA proteins contain a collagen triple helix repeat domain. Additionally, the predicted protein sequence for two D74 VtaA proteins is substantially longer than any predicted Nagasaki VtaA proteins. Fifteen methylation sequence motifs were identified in D74 and fourteen methylation sequence motifs were identified in Nagasaki using SMRT sequencing analysis. Only one of the methylation sequence motifs was observed in both strains indicative of the diversity between D74 and Nagasaki. Subsequent analysis also revealed diversity in the restriction-modification systems harbored by D74 and Nagasaki. The collective information reported in this study will aid in the development of vaccines and intervention strategies to decrease the prevalence and disease burden caused by H. parasuis.


September 22, 2019

Unraveling microbial communities associated with methylmercury production in paddy soils.

Rice consumption is now recognized as an important pathway of human exposure to the neurotoxin methylmercury (MeHg), particularly in countries where rice is a staple food. Although the discovery of a two-gene cluster hgcAB has linked Hg methylation to several phylogenetically diverse groups of anaerobic microorganisms converting inorganic mercury (Hg) to MeHg, the prevalence and diversity of Hg methylators in microbial communities of rice paddy soils remain unclear. We characterized the abundance and distribution of hgcAB genes using third-generation PacBio long-read sequencing and Illumina short-read metagenomic sequencing, in combination with quantitative PCR analyses in several mine-impacted paddy soils from southwest China. Both Illumina and PacBio sequencing analyses revealed that Hg methylating communities were dominated by iron-reducing bacteria (i.e., Geobacter) and methanogens, with a relatively low abundance of hgcA+ sulfate-reducing bacteria in the soil. A positive correlation was observed between the MeHg content in soil and the relative abundance of Geobacter carrying the hgcA gene. Phylogenetic analysis also uncovered some hgcAB sequences closely related to three novel Hg methylators, Geobacter anodireducens, Desulfuromonas sp. DDH964, and Desulfovibrio sp. J2, among which G. anodireducens was validated for its ability to methylate Hg. These findings shed new light on microbial community composition and major clades likely driving Hg methylation in rice paddy soils.

