Bioinformatics Archives - Page 209 of 267

July 7, 2019

Twelve complete reference genomes of clinical isolates in the Capnocytophaga genus.

We report here 1 near-complete genome sequence and 12 complete genome sequences for clinical Capnocytophaga isolates. Total read coverages ranged from 211× to 737×, and genome sizes ranged from 2.41 Mb to 3.10 Mb. These genomes will enable a more comprehensive taxonomic evaluation of the Capnocytophaga genus.

July 7, 2019

Mechanisms of surface antigenic variation in the human pathogenic fungus Pneumocystis jirovecii.

Microbial pathogens commonly escape the human immune system by varying surface proteins. We investigated the mechanisms used for that purpose by Pneumocystis jirovecii This uncultivable fungus is an obligate pulmonary pathogen that in immunocompromised individuals causes pneumonia, a major life-threatening infection. Long-read PacBio sequencing was used to assemble a core of subtelomeres of a single P. jirovecii strain from a bronchoalveolar lavage fluid specimen from a single patient. A total of 113 genes encoding surface proteins were identified, including 28 pseudogenes. These genes formed a subtelomeric gene superfamily, which included five families encoding adhesive glycosylphosphatidylinositol (GPI)-anchored glycoproteins and one family encoding excreted glycoproteins. Numerical analyses suggested that diversification of the glycoproteins relies on mosaic genes created by ectopic recombination and occurs only within each family. DNA motifs suggested that all genes are expressed independently, except those of the family encoding the most abundant surface glycoproteins, which are subject to mutually exclusive expression. PCR analyses showed that exchange of the expressed gene of the latter family occurs frequently, possibly favored by the location of the genes proximal to the telomere because this allows concomitant telomere exchange. Our observations suggest that (i) the P. jirovecii cell surface is made of a complex mixture of different surface proteins, with a majority of a single isoform of the most abundant glycoprotein, (ii) genetic mosaicism within each family ensures variation of the glycoproteins, and (iii) the strategy of the fungus consists of the continuous production of new subpopulations composed of cells that are antigenically different.IMPORTANCEPneumocystis jirovecii is a fungus causing severe pneumonia in immunocompromised individuals. It is the second most frequent life-threatening invasive fungal infection. We have studied the mechanisms of antigenic variation used by this pathogen to escape the human immune system, a strategy commonly used by pathogenic microorganisms. Using a new DNA sequencing technology generating long reads, we could characterize the highly repetitive gene families encoding the proteins that are present on the cellular surface of this pest. These gene families are localized in the regions close to the ends of all chromosomes, the subtelomeres. Such chromosomal localization was found to favor genetic recombinations between members of each gene family and to allow diversification of these proteins continuously over time. This pathogen seems to use a strategy of antigenic variation consisting of the continuous production of new subpopulations composed of cells that are antigenically different. Such a strategy is unique among human pathogens. Copyright © 2017 Schmid-Siegert et al.

July 7, 2019

Complete genome sequence of Streptococcus thermophilus strain B59671, which naturally produces the broad-spectrum bacteriocin thermophilin 110.

Streptococcus thermophilus strain B59671 is a Gram-positive lactic acid bacterium that naturally produces a broad-spectrum bacteriocin, thermophilin 110, and is capable of producing gamma-aminobutyric acid (GABA). The complete genome sequence for this strain contains 1,821,173 nucleotides, 1,936 predicted genes, and an average G+C content of 39.1%.

July 7, 2019

Complete genome sequence of the fruiting myxobacterium Melittangium boletus DSM 14713.

The formation of spore-filled fruiting bodies in response to starvation represents a hallmark of many members of the order Myxococcales Here, we present the complete 9.9-Mb genome of the fruiting type strain Melittangium boletus DSM 14713, the first member of this genus to have its genome sequenced. Copyright © 2017 Treuner-Lange et al.

July 7, 2019

Genomic variation and evolution of Vibrio parahaemolyticus ST36 over the course of a transcontinental epidemic expansion.

Vibrio parahaemolyticus is the leading cause of seafood-related infections with illnesses undergoing a geographic expansion. In this process of expansion, the most fundamental change has been the transition from infections caused by local strains to the surge of pandemic clonal types. Pandemic clone sequence type 3 (ST3) was the only example of transcontinental spreading until 2012, when ST36 was detected outside the region where it is endemic in the U.S. Pacific Northwest causing infections along the U.S. northeast coast and Spain. Here, we used genome-wide analyses to reconstruct the evolutionary history of the V. parahaemolyticus ST36 clone over the course of its geographic expansion during the previous 25 years. The origin of this lineage was estimated to be in ~1985. By 1995, a new variant emerged in the region and quickly replaced the old clone, which has not been detected since 2000. The new Pacific Northwest (PNW) lineage was responsible for the first cases associated with this clone outside the Pacific Northwest region. After several introductions into the northeast coast, the new PNW clone differentiated into a highly dynamic group that continues to cause illness on the northeast coast of the United States. Surprisingly, the strains detected in Europe in 2012 diverged from this ancestral group around 2000 and have conserved genetic features present only in the old PNW lineage. Recombination was identified as the major driver of diversification, with some preliminary observations suggesting a trend toward a more specialized lifestyle, which may represent a critical element in the expansion of epidemics under scenarios of coastal warming.IMPORTANCEVibrio parahaemolyticus and Vibrio cholerae represent the only two instances of pandemic expansions of human pathogens originating in the marine environment. However, while the current pandemic of V. cholerae emerged more than 50 years ago, the global expansion of V. parahaemolyticus is a recent phenomenon. These modern expansions provide an exceptional opportunity to study the evolutionary process of these pathogens at first hand and gain an understanding of the mechanisms shaping the epidemic dynamics of these diseases, in particular, the emergence, dispersal, and successful introduction in new regions facilitating global spreading of infections. In this study, we used genomic analysis to examine the evolutionary divergence that has occurred over the course of the most recent transcontinental expansion of a pathogenic Vibrio, the spreading of the V. parahaemolyticus sequence type 36 clone from the region where it is endemic on the Pacific coast of North America to the east coast of the United States and finally to the west coast of Europe.

July 7, 2019

Draft genome sequence and annotation of the apicomplexan parasite Besnoitia besnoiti.

The apicomplexan parasite Besnoitia besnoiti is the causative agent of bovine besnoitiosis that affects livestock, particularly cattle. The definitive host of B. besnoiti is unknown and its transmission only partially understood. Here, we report the first draft genome sequence, assembly, and annotation of this parasite. Copyright © 2017 Schares et al.

July 7, 2019

Complete genome sequence of a psychrotolerant denitrifying bacterium, Janthinobacterium svalbardensis PAMC 27463.

We report here the complete genome sequence of Janthinobacterium svalbardensis PAMC 27463 isolated from a freshwater lake on Barton Peninsula on King George Island, Antarctica. The genome consists of a chromosome with 6,274,078 bp which contains 5,585 genes, including 121 RNA genes. Copyright © 2017 Cho et al.

July 7, 2019

Complete genome sequence of Acetobacter tropicalis Oregon-R-modENCODE strain BDGP1, an acetic acid bacterium found in the Drosophila melanogaster gut.

Acetobacter tropicalis Oregon-R-modENCODE strain BDGP1 was isolated from Drosophila melanogaster for functional host-microbe interaction studies. The complete genome comprises a single chromosomal circle of 3,988,649 bp with a G+C content of 56% and a conjugative plasmid of 151,013 bp.

July 7, 2019

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote.© 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.

July 7, 2019

Complete genome sequencing of Arachidicoccus ginsenosidimutans sp. nov., and its application for production of minor ginsenosides by finding a novel ginsenoside-transforming beta-glucosidase

A novel bacterial strain (BS20T), which has ginsenoside-transforming ability, was whole genome sequenced for the identification of a target gene. After complete genome sequencing, phylogenetic, phenotypic and chemotaxonomic analyses, the strain BS20T (Arachidicoccus ginsenosidimutans sp. nov.) was placed within the genus Arachidicoccus of family Chitinophagaceae. The complete genome of strain BS20T comprised a circular chromosome of 4[thin space (1/6-em)]138[thin space (1/6-em)]017 bp. To find the target functional gene, 17 sets of four different glycoside hydrolases were cloned in E. coli BL21 (DE3) using the pGEX4T-1 vector and were characterized. Among these 17 sets of clones, only one, BglAg-762, exhibited ginsenoside-conversion ability. The BglAg-762 comprised 762 amino acid residues and belonged to the glycoside hydrolase family 3. The recombinant enzyme (GST-BglAg-762) was able to convert major ginsenosides Rb1 to F2 via gypenoside-XVII (Gyp-XVII), Rb2 to C-O, and Rb3, Rc, Rd, and Gyp-XVII to C-Mx1, C-Mc1, and F2, respectively. Finally, ginsenoside F2 was transformed into compound K (C-K). Besides, these pilot data demonstrate the identification of 17 sets of target/functional genes of 4 different glycoside hydrolases from a novel bacterial species via whole genome sequencing. Our results have shown that the recombinant BglAg-762 very quickly converts the major ginsenosides into minor ginsenosides, which can be used for the enhanced production of target minor ginsenosides. Furthermore, the web service of NCBI is suitable for any targeted gene identification, but based on our experimental analysis we concluded that the hypothetical protein present in NCBI should be considered as a putative or uncharacterized protein.

July 7, 2019

Complete genome sequence of Acinetobacter baumannii A1296 (ST1469) with a small plasmid harbouring the tet(39) tetracycline resistance gene.

Acinetobacter baumannii is considered an important nosocomial pathogen worldwide owing to its increasing antibiotic resistance. This study aimed to determine the complete genome sequence of A. baumannii strain A1296 and to perform a comparative analysis among A. baumannii.The complete genome sequence of A. baumannii A1296 was sequenced on two SMRT cells using P6C4 chemistry on a PacBio Single Molecule, Real-Time (SMRT) RS II instrument. The A1296 genome sequence was annotated using Prokaryotic Genome Automatic Annotation Pipeline (PGAAP), and the sequence type and resistance genes of the strain were analysed.Here we present the complete genome sequence of A. baumannii strain A1296, belonging to a novel sequence type (ST1469) and isolated from patient in China, that was sensitive to multiple antibiotics. The genome of A. baumannii A1296 was 3810701bp in length, including one circular chromosome and two plasmids. The tet(39) resistance gene was located on the small plasmid in this A. baumannii strain.The genome sequence of A. baumannii strain A1296 can be used as a reference sequence for comparative analysis aimed at elucidating the acquisition, dissemination and mobilisation of resistance genes among A. baumannii. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.

July 7, 2019

Complete genome sequence of Salmonella enterica subsp. enterica serovar Minnesota strain

Mango has been implicated as food vehicle in several Salmonella-causing foodborne outbreaks. Here, Salmonella enterica subsp. enterica serovar Minnesota was isolated from fresh mango fruit imported from Mexico in 2014. The complete genome sequence of S. Minnesota CFSAN017963 was sequenced using single-molecule real-time DNA sequencing. Distinct prophage regions, Salmonella pathogenicity islands, and fimbrial gene clusters were observed in comparative genomic analysis on S. Minnesota CFSAN017963 with other phylogenetically closely related Salmonella serovars. Core genome multilocus sequencing typing analysis of all the S. Minnesota isolates in the Genbank and Enterobase also revealed a high genomic diversity among the genomes analyzed.

July 7, 2019

Draft genomes of the fungal pathogen Phellinus noxius in Hong Kong

The fungal pathogen Phellinus noxius is the underlying cause of brown root rot, a disease with causing tree mortality globally, causing extensive damage in urban areas and crop plants. This disease currently has no cure, and despite the global epidemic, little is known about the pathogenesis and virulence of this pathogen. Using Ion Torrent PGM, Illumina MiSeq and PacBio RSII sequencing platforms with various genome assembly methods, we produced the draft genome sequences of four P. noxius strains isolated from infected trees in Hong Kong to further understand the pathogen and identify the mechanisms behind the aggressive nature and virulence of this fungus. The resulting genomes ranged from 30.8Mb to 31.8Mb in size, and of the four sequences, the YTM97 strain was chosen to produce a high-quality Hong Kong strain genome sequence, resulting in a 31Mb final assembly with 457 scaffolds, an N50 length of 275,889 bp and 96.2% genome completeness. RNA-seq of YTM97 using Illumina HiSeq400 was performed for improved gene prediction. AUGUSTUS and Genemark-ES prediction programs predicted 9,887 protein-coding genes which were annotated using GO and Pfam databases. The encoded carbohydrate active enzymes revealed large numbers of lignolytic enzymes present, comparable to those of other white-rot plant pathogens. In addition, P. noxius also possessed larger numbers of cellulose, xylan and hemicellulose degrading enzymes than other plant pathogens. Searches for virulence genes was also performed using PHI-Base and DFVF databases revealing a host of virulence-related genes and effectors. The combination of non-specific host range, unique carbohydrate active enzyme profile and large amount of putative virulence genes could explain the reasons behind the aggressive nature and increased virulence of this plant pathogen. The draft genome sequences presented here will provide references for strains found in Hong Kong. Together with emerging research, this information could be used for genetic diversity and epidemiology research on a global scale as well as expediting our efforts towards discovering the mechanisms of pathogenicity of this devastating pathogen.

July 7, 2019

Detection of complex structural variation from paired-end sequencing data

Detecting structural variants (SVs) from sequencing data is a key problem in genome analysis, but the full diversity of SVs is not captured by most methods. We introduce the Automated Reconstruction of Complex Structural Variants (ARC-SV) method, which detects a broad class of structural variants from paired-end whole genome sequencing (WGS) data. Analysis of samples from NA12878 and HuRef suggests that complex SVs are often misclassified by traditional methods. We validated our results both experimentally and by comparison to whole genome assembly and PacBio data; ARC-SV compares favorably to existing algorithms in general and gives state-of-the-art results on complex SV detection. By expanding the range of detectable SVs compared to commonly-used algorithms, ARC-SV allows additional information to be extracted from existing WGS data.

July 7, 2019

Dissemination and characteristics of a novel plasmid-encoded carbapenem-hydrolyzing class D beta-lactamase, OXA-436 from four patients involving six different hospitals in Denmark.

The diversity of OXA-48-like carbapenemases is continually expanding. In this study, we describe the dissemination and characteristics of a novel carbapenem-hydrolyzing class D carbapenemase (CHDL) named OXA-436. In total, six OXA-436-producing Enterobacteriaceae isolates including Enterobacter asburiae (n=3), Citrobacter freundii (n=2) and Klebsiella pneumoniae (n=1) were identified in four patients in the period between September 2013 and April 2015. All three species of OXA-436-producing Enterobacteriaceae were found in one patient. The amino acid sequence of OXA-436 showed 90.4-92.8% identity to other acquired OXA-48-like variants. Expression of OXA-436 in Escherichia coli and kinetic analysis of purified OXA-436 revealed an activity profile similar to OXA-48 and OXA-181 with activity against penicillins including temocillin, limited or no activity against extended-spectrum cephalosporins and activity against carbapenems. The blaOXA-436 gene was located on a conjugative ~314 kb IncHI2/IncHI2A plasmid belonging to pMLST ST1, in a region surrounded by chromosomal genes previously identified adjacent to blaOXA-genes in Shewanella spp. In conclusion, OXA-436 is a novel CHDL with similar functional properties as OXA-48-like CHDLs. The described geographical spread among different Enterobacteriaceae and plasmid location of blaOXA-436 illustrates its potential for further dissemination. Copyright © 2017 American Society for Microbiology.

Auto Tag: Bioinformatics

Twelve complete reference genomes of clinical isolates in the Capnocytophaga genus.

Mechanisms of surface antigenic variation in the human pathogenic fungus Pneumocystis jirovecii.

Complete genome sequence of Streptococcus thermophilus strain B59671, which naturally produces the broad-spectrum bacteriocin thermophilin 110.

Complete genome sequence of the fruiting myxobacterium Melittangium boletus DSM 14713.

Genomic variation and evolution of Vibrio parahaemolyticus ST36 over the course of a transcontinental epidemic expansion.

Draft genome sequence and annotation of the apicomplexan parasite Besnoitia besnoiti.

Complete genome sequence of a psychrotolerant denitrifying bacterium, Janthinobacterium svalbardensis PAMC 27463.

Complete genome sequence of Acetobacter tropicalis Oregon-R-modENCODE strain BDGP1, an acetic acid bacterium found in the Drosophila melanogaster gut.

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Complete genome sequencing of Arachidicoccus ginsenosidimutans sp. nov., and its application for production of minor ginsenosides by finding a novel ginsenoside-transforming beta-glucosidase

Complete genome sequence of Acinetobacter baumannii A1296 (ST1469) with a small plasmid harbouring the tet(39) tetracycline resistance gene.

Complete genome sequence of Salmonella enterica subsp. enterica serovar Minnesota strain

Draft genomes of the fungal pathogen Phellinus noxius in Hong Kong

Detection of complex structural variation from paired-end sequencing data

Dissemination and characteristics of a novel plasmid-encoded carbapenem-hydrolyzing class D beta-lactamase, OXA-436 from four patients involving six different hospitals in Denmark.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert