Menu
July 7, 2019

Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches.

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable amount of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pdeobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which implements corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome. Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.


July 7, 2019

Complete genome sequence of Salinicoccus halodurans H3B36, isolated from the Qaidam Basin in China.

Salinicoccus halodurans H3B36 is a moderately halophilic bacterium isolated from a sediment sample of Qaidam Basin at 3.2 m vertical depth. Strain H3B36 accumulate N (a)-acetyl-a-lysine as compatible solute against salinity and heat stresses and may have potential applications in industrial biotechnology. In this study, we sequenced the genome of strain H3B36 using single molecule, real-time sequencing technology on a PacBio RS II instrument. The complete genome of strain H3B36 was 2,778,379 bp and contained 2,853 protein-coding genes, 12 rRNA genes, and 61 tRNA genes with 58 tandem repeats, six minisatellite DNA sequences, 11 genome islands, and no CRISPR repeat region. Further analysis of epigenetic modifications revealed the presence of 11,000 m4C-type modified bases, 7,545 m6A-type modified bases, and 89,064 other modified bases. The data on the genome of this strain may provide an insight into the metabolism of N (a)-acetyl-a-lysine.


July 7, 2019

Complete genome sequence of the molybdenum-resistant bacterium Bacillus subtilis strain LM 4-2.

Bacillus subtilis LM 4-2, a Gram-positive bacterium was isolated from a molybdenum mine in Luoyang city. Due to its strong resistance to molybdate and potential utilization in bioremediation of molybdate-polluted area, we describe the features of this organism, as well as its complete genome sequence and annotation. The genome was composed of a circular 4,069,266 bp chromosome with average GC content of 43.83 %, which included 4149 predicted ORFs and 116 RNA genes. Additionally, 687 transporter-coding and 116 redox protein-coding genes were identified in the strain LM 4-2 genome.


July 7, 2019

IncI1 plasmids encoding various blaCTX-Ms contributed to ceftriaxone resistance in Salmonella Enteritidis in China.

Resistance to extended spectrum ß-lactams in Salmonella, in particular serotypes such as S. Enteritidis that are frequently associated with clinical infections, is a serious public health concern. In this study, phenotypic characterization of 433 clinical S. Enteritidis strains obtained from a nationwide collection of China CDC during the period of 2005~2010 depicted an increasing trend of resistance to ceftriaxone from 2008 onwards. Seventeen (4%) of the strains were found to be resistant to ceftriaxone, 7% to ciprofloxacin and 0.7% to both ciprofloxacin and ceftriaxone. Most of the ceftriaxone-resistant S. Enteritidis strains (15/17) were genetically unrelated, and originated from Henan province. The complete sequence of an IncI1 plasmid pSE115 which belonged to a novel Sequence Type was obtained. This 87,255bp IncI1 plasmid was found to harbour a blaCTX-M-14 gene located in a novel Multidrug Resistance Region (MRR) within the tra locus. Although the majority of strains were also found to contain conjugative IncI1 plasmids of similar size to pSE115(~90kb) and harbor a variety of blaCTX-MGroup 1 and Group 9 elements, the novel MRR site at the tra locus in pSE115 was not detectable in the other IncI1 plasmids. Findings in this study show that cephalosporin resistance in S. Enteritidis strains collected in China was mainly due to dissemination of blaCTX-M-encoding IncI1 plasmids, resembling the situation in which IncI1 plasmids serve as major vectors of blaCTX-M variants in other members of Enterobacteriaceae. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Complete genome sequence of Pseudomonas aeruginosa PA1, isolated from a patient with a respiratory tract infection.

We report the 6,498,072-bp complete genome sequence of Pseudomonas aeruginosa PA1, which was isolated from a patient with a respiratory tract infection in Chongqing, People’s Republic of China. Whole-genome sequencing was performed using single-molecule real-time (SMRT) technology, and de novo assembly revealed a single contig with 396-fold sequence coverage. Copyright © 2015 Lu et al.


July 7, 2019

Next-generation sequencing and comparative analysis of sequential outbreaks caused by multidrug-resistant Acinetobacter baumannii at a large academic burn center.

Next-generation sequencing (NGS) analysis has emerged as a promising molecular epidemiological method for investigating health care-associated outbreaks. Here, we used NGS to investigate a 3-year outbreak of multidrug-resistant Acinetobacter baumannii (MDRAB) at a large academic burn center. A reference genome from the index case was generated using de novo assembly of PacBio reads. Forty-six MDRAB isolates were analyzed by pulsed-field gel electrophoresis (PFGE) and sequenced using an Illumina platform. After mapping to the index case reference genome, four samples were excluded due to low coverage, leaving 42 samples for further analysis. Multilocus sequence types (MLST) and the presence of acquired resistance genes were also determined from the sequencing data. A transmission network was inferred from genomic and epidemiological data using a Bayesian framework. Based on single-nucleotide variant (SNV) differences, this MDRAB outbreak represented three sequential outbreaks caused by distinct clones. The first and second outbreaks were caused by sequence type 2 (ST2), while the third outbreak was caused by ST79. For the second outbreak, the MLST and PFGE results were discordant. However, NGS-based SNV typing detected a recombination event and consequently enabled a more accurate phylogenetic analysis. The distribution of resistance genes varied among the three outbreaks. The first- and second-outbreak strains possessed a blaOXA-23-like group, while the third-outbreak strains harbored a blaOXA-40-like group. NGS-based analysis demonstrated the superior resolution of outbreak transmission networks for MDRAB and provided insight into the mechanisms of strain diversification between sequential outbreaks through recombination. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Complete genome sequence of Chelatococcus sp. CO-6, a crude-oil-degrading bacterium.

Chelatococcus sp. CO-6 is a crude-oil-degrading strain, which was isolated from Shengli Oilfield. However, little genetic information is known about this species. We present the complete genome sequence analysis of Chelatococcus sp. CO-6 in this study. It could provide further insight into its genetic basis for membrane transport and immune system against bacteriophage in strain CO-6. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Global insights into acetic acid resistance mechanisms and genetic stability of Acetobacter pasteurianus strains by comparative genomics.

Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains.


July 7, 2019

Complete genome sequence of Bacillus methylotrophicus JJ-D34 isolated from deonjang, a Korean traditional fermented soybean paste.

Bacillus methylotrophicus JJ-D34 showing good proteolytic and antipathogenic activities was isolated from doenjang, a Korean traditional fermented soybean paste. Here, we report the complete genome sequence of strain JJ-D34 harboring a 4,105,955bp circular chromosome encoding 4044 genes with a 46.24% G+C content, which will provide insights into the genomic basis of its effects and facilitating its application to doenjang fermentation. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of Pandoraea oxalativorans DSM 23570(T), an oxalate metabolizing soil bacterium.

Pandoraea oxalativorans DSM 23570(T) is an oxalate-degrading bacterium that was originally isolated from soil litter near to oxalate-producing plant of the genus Oxalis. Here, we report the first complete genome of P. oxalativorans DSM 23570(T) which would allow its potential biotechnological applications to be unravelled. Copyright © 2016. Published by Elsevier B.V.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.