Menu
July 7, 2019

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote.© 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Genomic comparison between Staphylococcus aureus GN strains clinically isolated from a familial infection case: IS1272 transposition through a novel inverted repeat-replacing mechanism.

A bacterial insertion sequence (IS) is a mobile DNA sequence carrying only the transposase gene (tnp) that acts as a mutator to disrupt genes, alter gene expressions, and cause genomic rearrangements. “Canonical” ISs have historically been characterized by their terminal inverted repeats (IRs), which may form a stem-loop structure, and duplications of a short (non-IR) target sequence at both ends, called target site duplications (TSDs). The IS distributions and virulence potentials of Staphylococcus aureus genomes in familial infection cases are unclear. Here, we determined the complete circular genome sequences of familial strains from a Panton-Valentine leukocidin (PVL)-positive ST50/agr4 S. aureus (GN) infection of a 4-year old boy with skin abscesses. The genomes of the patient strain (GN1) and parent strain (GN3) were rich for “canonical” IS1272 with terminal IRs, both having 13 commonly-existing copies (ce-IS1272). Moreover, GN1 had a newly-inserted IS1272 (ni-IS1272) on the PVL-converting prophage, while GN3 had two copies of ni-IS1272 within the DNA helicase gene and near rot. The GN3 genome also had a small deletion. The targets of ni-IS1272 transposition were IR structures, in contrast with previous “canonical” ISs. There were no TSDs. Based on a database search, the targets for ce-IS1272 were IRs or “non-IRs”. IS1272 included a larger structure with tandem duplications of the left (IRL) side sequence; tnp included minor cases of a long fusion form and truncated form. One ce-IS1272 was associated with the segments responsible for immune evasion and drug resistance. Regarding virulence, GN1 expressed cytolytic peptides (phenol-soluble modulin a and d-hemolysin) and PVL more strongly than some other familial strains. These results suggest that IS1272 transposes through an IR-replacing mechanism, with an irreversible process unlike that of “canonical” transpositions, resulting in genomic variations, and that, among the familial strains, the patient strain has strong virulence potential based on community-associated virulence factors.


July 7, 2019

Complete genome sequencing of Arachidicoccus ginsenosidimutans sp. nov., and its application for production of minor ginsenosides by finding a novel ginsenoside-transforming beta-glucosidase

A novel bacterial strain (BS20T), which has ginsenoside-transforming ability, was whole genome sequenced for the identification of a target gene. After complete genome sequencing, phylogenetic, phenotypic and chemotaxonomic analyses, the strain BS20T (Arachidicoccus ginsenosidimutans sp. nov.) was placed within the genus Arachidicoccus of family Chitinophagaceae. The complete genome of strain BS20T comprised a circular chromosome of 4[thin space (1/6-em)]138[thin space (1/6-em)]017 bp. To find the target functional gene, 17 sets of four different glycoside hydrolases were cloned in E. coli BL21 (DE3) using the pGEX4T-1 vector and were characterized. Among these 17 sets of clones, only one, BglAg-762, exhibited ginsenoside-conversion ability. The BglAg-762 comprised 762 amino acid residues and belonged to the glycoside hydrolase family 3. The recombinant enzyme (GST-BglAg-762) was able to convert major ginsenosides Rb1 to F2 via gypenoside-XVII (Gyp-XVII), Rb2 to C-O, and Rb3, Rc, Rd, and Gyp-XVII to C-Mx1, C-Mc1, and F2, respectively. Finally, ginsenoside F2 was transformed into compound K (C-K). Besides, these pilot data demonstrate the identification of 17 sets of target/functional genes of 4 different glycoside hydrolases from a novel bacterial species via whole genome sequencing. Our results have shown that the recombinant BglAg-762 very quickly converts the major ginsenosides into minor ginsenosides, which can be used for the enhanced production of target minor ginsenosides. Furthermore, the web service of NCBI is suitable for any targeted gene identification, but based on our experimental analysis we concluded that the hypothetical protein present in NCBI should be considered as a putative or uncharacterized protein.


July 7, 2019

Complete genome sequence of Spirosoma rigui KCTC 12531 T, a bacterium isolated from fresh water from the Woopo wetland for taxonomic study

Spirosoma rigui KCTC 12531T was isolated from fresh water from the Woopo wetland, Korea. In this study, we report the complete genome sequence of a bacterium Spirosoma rigui KCTC 12531T, its complete genome sequence was obtained using the PacBio RS II platform. The genome comprised of 5,828,404 bp with the G + C content of 54.4%, the genome included 4,774 genes were predicted, among them, 4,647 genes are protein-coding genes.


July 7, 2019

Complete genome sequence of Acinetobacter baumannii A1296 (ST1469) with a small plasmid harbouring the tet(39) tetracycline resistance gene.

Acinetobacter baumannii is considered an important nosocomial pathogen worldwide owing to its increasing antibiotic resistance. This study aimed to determine the complete genome sequence of A. baumannii strain A1296 and to perform a comparative analysis among A. baumannii.The complete genome sequence of A. baumannii A1296 was sequenced on two SMRT cells using P6C4 chemistry on a PacBio Single Molecule, Real-Time (SMRT) RS II instrument. The A1296 genome sequence was annotated using Prokaryotic Genome Automatic Annotation Pipeline (PGAAP), and the sequence type and resistance genes of the strain were analysed.Here we present the complete genome sequence of A. baumannii strain A1296, belonging to a novel sequence type (ST1469) and isolated from patient in China, that was sensitive to multiple antibiotics. The genome of A. baumannii A1296 was 3810701bp in length, including one circular chromosome and two plasmids. The tet(39) resistance gene was located on the small plasmid in this A. baumannii strain.The genome sequence of A. baumannii strain A1296 can be used as a reference sequence for comparative analysis aimed at elucidating the acquisition, dissemination and mobilisation of resistance genes among A. baumannii. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.


July 7, 2019

Complete genome sequence of Salmonella enterica subsp. enterica serovar Minnesota strain

Mango has been implicated as food vehicle in several Salmonella-causing foodborne outbreaks. Here, Salmonella enterica subsp. enterica serovar Minnesota was isolated from fresh mango fruit imported from Mexico in 2014. The complete genome sequence of S. Minnesota CFSAN017963 was sequenced using single-molecule real-time DNA sequencing. Distinct prophage regions, Salmonella pathogenicity islands, and fimbrial gene clusters were observed in comparative genomic analysis on S. Minnesota CFSAN017963 with other phylogenetically closely related Salmonella serovars. Core genome multilocus sequencing typing analysis of all the S. Minnesota isolates in the Genbank and Enterobase also revealed a high genomic diversity among the genomes analyzed.


July 7, 2019

Complete genome sequence of Vibrio campbellii LMB 29 isolated from red drum with four native megaplasmids.

Vibrio spp. are the most common pathogens for animals reared in aquaculture. Vibrio campbellii, which is often involved in shrimp, fish and mollusks diseases, is widely distributed in the marine environment worldwide, but our knowledge about its pathogenesis and antimicrobial resistance is very limited. The existence of this knowledge gap is at least partially because that V. campbellii was originally classified as Vibrio harveyi, and the detailed information of its comparative genome analysis to other Vibrio spp. is currently lacking. In this study, the complete genome of a V. campbellii predominant strain, LMB29, was determined by MiSeq in conjunction with PacBio SMRT sequencing. This genome consists of two circular DNA chromosomes and four megaplasmids. Comparative genome analysis indicates that LMB29 shares a 96.66% similarity (average nucleotide identity) with the V. campbellii ATCC strain BAA-1116 based on a 75% AF (average fraction) calculations, and its functional profile is very similar to V. campbellii E1 and V. campbellii CAIM115. Both type III secretion system (T3SS) and type VI secretion system (T6SS), along with the tlh gene which encodes a thermolabile hemolysin, are present in LMB29 which may contribute to the bacterial pathogenesis. The virulence of this strain was experimental confirmed by performing a LDH assay on a fish cell infection model, and cell death was observed as early as within 3 h post infection. Thirty-seven antimicrobial resistance genes (>45% identity) were predicted in LMB29 which includes a novel rifampicin ADP ribosyltransferase, arr-9, in plasmid pLMB157. The gene arr-9 was predicted on a genomic island with horizontal transferable potentials which may facilitate the rifampicin resistance dissemination. Future researches are needed to explore the pathogenesis of V. campbellii LMB29, but the availability of this genome sequence will certainly aid as a basis for further analysis.


July 7, 2019

pSY153-MDR, a p12969-DIM-related mega plasmid carrying blaIMP-45 and armA, from clinical Pseudomonas putida.

This work characterized mega plasmid pSY153-MDR, carrying blaIMP-45 and armA, from a multidrug-resistant (MDR) Pseudomonas putida isolate from the urine of a cerebral infarction patient in China. The backbone of pSY153-MDR was closely related to Pseudomonas plasmids p12969-DIM, pOZ176, pBM413, pTTS12, and pRBL16, and could not be assigned to any of the known incompatibility groups. The accessory modules of pSY153-MDR were composed of 10 individual insertion sequence elements and two different MDR regions, and differed dramatically from the above plasmids. Fifteen non-redundant resistance markers were identified to be involved in resistance to at least eight distinct classes of antibiotics. All of these resistance genes were associated with mobile elements, and were embedded within the two MDR regions. blaIMP-45 and armA coexisted in a Tn1403-Tn1548 region, which was generated from homologous recombination of Tn1403- and Tn1548-like transposons. The second copy of armA was a component of the ISCR28-armA-?ISCR28 structure, representing a novel armA vehicle. This vehicle was located within In48, which was related to In363 and In1058. Data presented here provide a deeper insight into the evolutionary history of SY153, especially in regard to how it became extensively drug-resistant.


July 7, 2019

Complete genome sequence of Acidihalobacter prosperus strain F5, an extremely acidophilic, iron- and sulfur-oxidizing halophile with potential industrial applicability in saline water bioleaching of chalcopyrite.

Successful process development for the bioleaching of mineral ores, particularly the refractory copper sulfide ore chalcopyrite, remains a challenge in regions where freshwater is scarce and source water contains high concentrations of chloride ion. In this study, a pure isolate of Acidihalobacter prosperus strain F5 was characterized for its ability to leach base metals from sulfide ores (pyrite, chalcopyrite and pentlandite) at increasing chloride ion concentrations. F5 successfully released base metals from ores including pyrite and pentlandite at up to 30gL(-1) chloride ion and chalcopyrite up to 18gL(-1) chloride ion. In order to understand the genetic mechanisms of tolerance to high acid, saline and heavy metal stress the genome of F5 was sequenced and analysed. As well as being the first strain of Ac. prosperus to be isolated from Australia it is also the first complete genome of the Ac. prosperus species to be sequenced. The F5 genome contains genes involved in the biosynthesis of compatible solutes and genes encoding monovalent cation/proton antiporters and heavy metal transporters which could explain its abilities to tolerate high salinity, acidity and heavy metal stress. Genome analysis also confirmed the presence of genes involved in copper tolerance. The study demonstrates the potential biotechnological applicability of Ac. prosperus strain F5 for saline water bioleaching of mineral ores. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of multidrug-resistant Staphylococcus sciuri strain SNUDS-18 isolated from a farmed duck in South Korea.

This study aimed to determine the complete genome sequence of multidrug-resistant Staphylococcus sciuri strain SNUDS-18 isolated from a farmed duck in South Korea.Genomic DNA was sequenced using a PacBio RS II system. The obtained genome was annotated and antimicrobial resistance and virulence genes were identified.The sequenced genome possessed a mecA homologue (mecA1) that was almost identical to that of other oxacillin-susceptible S. sciuri strains, whereas the staphylococcal cassette chromosome mec (SCCmec) was not detected. Moreover, various antimicrobial resistance genes conferring resistance to ß-lactams, aminoglycosides, phenicols, tetracycline and macrolide-lincosamide-streptogramin B (MLSB) antimicrobials were identified.The SNUDS-18 genome and its associated genomic data will provide important insights into the biodiversity of the S. sciuri group as well as valuable information for the control of this potential pathogen. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.


July 7, 2019

Complete genome sequence of a colistin-resistant Escherichia coli strain harboring mcr-1 on an IncHI2 plasmid in the United States.

We report here the incidental detection and complete genome sequence of a urinary Escherichia coli strain harboring mcr-1 and resistant to colistin in a New York patient returning from Portugal in 2016. This strain, with sequence type 1485 (ST1485), was a non-extended-spectrum beta-lactamase (ESBL) and non-carbapenemase producer and carried the mcr-1 gene on an IncHI2 plasmid. Copyright © 2017 Gilrane et al.


July 7, 2019

Complete genome sequence analysis of Enterobacter sp. SA187, a plant multi-stress tolerance promoting endophytic bacterium

Enterobacter sp. SA187 is an endophytic bacterium that has been isolated from root nodules of the indigenous desert plant Indigofera argentea. SA187 could survive in the rhizosphere as well as in association with different plant species, and was able to provide abiotic stress tolerance to Arabidopsis thaliana. The genome sequence of SA187 was obtained by using Pacific BioScience (PacBio) single-molecule sequencing technology, with average coverage of 275X. The genome of SA187 consists of one single 4,429,597 bp chromosome, with an average 56% GC content and 4,347 predicted protein coding DNA sequences (CDS), 153 ncRNA, 7 rRNA, and 84 tRNA. Functional analysis of the SA187 genome revealed a large number of genes involved in uptake and exchange of nutrients, chemotaxis, mobilization and plant colonization. A high number of genes were also found to be involved in survival, defense against oxidative stress and production of antimicrobial compounds and toxins. Moreover, different metabolic pathways were identified that potentially contribute to plant growth promotion. The information encoded in the genome of SA187 reveals the characteristics of a dualistic lifestyle of a bacterium that can adapt to different environments and promote the growth of plants. This information provides a better understanding of the mechanisms involved in plant-microbe interaction and could be further exploited to develop SA187 as a biological agent to improve agricultural practices in marginal and arid lands.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.