Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arrays and generate a de novo assembly of 2.93 Gb (contig N50: 8.3 Mb, scaffold N50: 22.0 Mb, including 39.3 Mb N-bases), together with 206 Mb of alternative haplotypes. The assembly fully or partially fills 274 (28.4%) N-gaps in the reference genome GRCh38. Comparison to GRCh38 reveals 12.8 Mb of HX1-specific sequences, including 4.1 Mb that are not present in previously reported Asian genomes. Furthermore, long-read sequencing…
The large and complex hexaploid genome has greatly hindered genomics studies of common wheat (Triticum aestivum, AABBDD). Here, we investigated transcripts in developing caryopses of common wheat using the emerging single-molecule real-time (SMRT) sequencing technology PacBio RSII, and assessed the resultant data for improving common wheat genome annotation and grain transcriptome research. We obtained 197,709 full-length non-chimeric (FLNC) reads, 74.6% of which were estimated to carry a complete open reading frame. A total of 91,881 high-quality FLNC reads were identified and mapped to 16,188 chromosomal loci, corresponding to 13,162 known genes and 3,026 new genes not annotated previously. Although some FLNC reads could…
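The contig and scaffold N50 values quoted in the abstract above are a standard contiguity statistic: the length L such that sequences of length L or longer together cover at least half of the assembly. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name `n50` and the toy lengths are illustrative assumptions.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths.

    N50 is the length of the sequence at which the running total,
    taken from the longest sequence downwards, first reaches half
    of the summed length of all sequences.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example (hypothetical lengths, not HX1 data): the total is 300,
# and 80 + 70 = 150 reaches half of it, so the N50 is 70.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))
```

Because N50 depends only on the length distribution, it says nothing about correctness; a misassembled genome can still have a large N50.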
The increasing prevalence of KPC-producing Klebsiella pneumoniae strains in clinical settings has been largely attributed to dissemination of organisms of specific multilocus sequence types, such as ST258 and ST11. Compared with the ST258 clone, which is prevalent in North America and Europe, ST11 is common in China but information regarding its genetic features remains scarce. In this study, we performed detailed genetic characterization of ST11 K. pneumoniae strains by analyzing whole-genome sequences of 58 clinical strains collected from diverse geographic locations in China. The ST11 genomes were found to be highly heterogeneous and clustered into at least three major lineages…
Oxidative stresses commonly exist in natural environments, and microbes have developed a variety of defensive systems to counteract such events. Although increasing evidence has shown that high hydrostatic pressure (HHP) and low temperature (LT) induce antioxidant defense responses in cells, there is no direct evidence to prove the connection between antioxidant defense mechanisms and the adaptation of bacteria to HHP and LT. In this study, using the wild-type (WT) strain of a deep-sea bacterium, Shewanella piezotolerans WP3, as an ancestor, we obtained a mutant, OE100, with an enhanced antioxidant defense capacity by experimental evolution under H2O2 stress. Notably, OE100 exhibited…
This study aimed to characterize novel conjugative plasmids that encode transferable ciprofloxacin resistance in Salmonella. A total of 157 non-duplicated Salmonella isolates were recovered from food products, 55 of which were found to be resistant to ciprofloxacin. Interestingly, 37 of the 55 (67%) CipR Salmonella isolates did not harbor any mutations in the quinolone resistance-determining regions (QRDRs). Moreover, six Salmonella isolates were shown to carry two novel types of conjugative plasmids that could transfer the ciprofloxacin resistance phenotype to E. coli J53 (AziR). The first type belonged to the ~110 kb IncFIB type conjugative plasmid carrying qnrB-bearing and aac(6′)-Ib-cr-bearing mobile…
Salinicoccus halodurans H3B36 is a moderately halophilic bacterium isolated from a sediment sample of Qaidam Basin at 3.2 m vertical depth. Strain H3B36 accumulates Nα-acetyl-α-lysine as a compatible solute against salinity and heat stresses and may have potential applications in industrial biotechnology. In this study, we sequenced the genome of strain H3B36 using single-molecule real-time sequencing technology on a PacBio RS II instrument. The complete genome of strain H3B36 was 2,778,379 bp and contained 2,853 protein-coding genes, 12 rRNA genes, and 61 tRNA genes with 58 tandem repeats, six minisatellite DNA sequences, 11 genomic islands, and no CRISPR repeat region. Further…
Resistance to extended-spectrum β-lactams in Salmonella, in particular serotypes such as S. Enteritidis that are frequently associated with clinical infections, is a serious public health concern. In this study, phenotypic characterization of 433 clinical S. Enteritidis strains obtained from a nationwide collection of China CDC during the period from 2005 to 2010 depicted an increasing trend of resistance to ceftriaxone from 2008 onwards. Seventeen (4%) of the strains were found to be resistant to ceftriaxone, 7% to ciprofloxacin and 0.7% to both ciprofloxacin and ceftriaxone. Most of the ceftriaxone-resistant S. Enteritidis strains (15/17) were genetically unrelated, and originated from Henan province.…
Streptomyces lydicus A02 is used by industry because it has a higher natamycin-producing capacity than the reference strain S. natalensis ATCC 27448. We sequenced the complete genome of A02 using next-generation sequencing platforms, and to achieve better sequence coverage and genome assembly, we utilized single-molecule real-time (SMRT) sequencing. The assembled genome comprises a 9,307,519-bp linear chromosome with a GC content of 70.67%, and contains 8,888 predicted genes. Comparative genomics and natamycin biosynthetic gene cluster (BGC) analysis showed that BGCs are highly conserved among evolutionarily diverse strains, and they also shared closer genome evolution compared with other Streptomyces species. Forty gene…
To analyse and compare mcr-1-bearing plasmids from animal Escherichia coli isolates, and to investigate potential mechanisms underlying dissemination of mcr-1. Ninety-seven ESBL-producing E. coli strains isolated from pig farms in China were screened for the mcr-1 gene. Fifteen mcr-1-positive strains were subjected to molecular characterization and bioinformatic analysis of the mcr-1-bearing plasmids that they harboured. Three major types of mcr-1-bearing plasmids were recovered: IncX4 (~33 kb), IncI2 (~60 kb) and IncHI2 (~216-280 kb), among which the IncX4 and IncI2 plasmids were found to harbour the mcr-1 gene only, whereas multiple resistance elements including blaCTX-M, blaCMY, blaTEM, fosA, qnrS, floR and oqxAB were…
To characterize a novel virulence-resistance plasmid pSE380T carried by a Salmonella enterica serotype Enteritidis clinical strain SE380. The plasmid pSE380T was conjugated to Escherichia coli strain J53 and sequenced by PacBio RSII, followed by annotation and genetic analysis. Sequence analysis of this plasmid revealed that the entire Salmonella Enteritidis-specific virulence plasmid, pSEN, had been incorporated into an IncHI2 MDR plasmid, which comprises the cephalosporin and fosfomycin resistance determinants blaCTX-M-14 and fosA3. Based on BLAST analysis and scrutiny of insertion footprints, the insertion event was found to involve a replicative transposition process mediated by IS26, an IS element frequently detected in various…
Microsatellite expansion, such as trinucleotide repeat expansion (TRE), is known to cause a number of genetic diseases. Sanger sequencing and next-generation short-read sequencing are unable to interrogate TRE reliably. We developed a novel algorithm called RepeatHMM to estimate repeat counts from long-read sequencing data. Evaluation on simulation data, real amplicon sequencing data on two repeat expansion disorders, and whole-genome sequencing data generated by PacBio and Oxford Nanopore technologies showed superior performance over competing approaches. We concluded that long-read sequencing coupled with RepeatHMM can estimate repeat counts on microsatellites and can interrogate the “unsequenceable” genomic trinucleotide repeat disorders.
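To make the notion of a repeat count concrete, the sketch below counts the longest run of back-to-back copies of a motif (for example CAG) by exact matching. This is deliberately not the RepeatHMM algorithm: the abstract above describes a dedicated method for long-read data, whereas exact matching like this breaks down on reads containing sequencing errors. The function name `longest_tandem_run` and the toy sequences are illustrative assumptions.

```python
import re


def longest_tandem_run(sequence, motif):
    """Return the largest number of consecutive exact copies of `motif`.

    A zero-width lookahead is used so that every start position is
    tried; a plain non-overlapping scan could lock onto the wrong
    frame and under-count.
    """
    if not motif:
        raise ValueError("motif must be a non-empty string")
    motif = motif.upper()
    # (?=(...)) captures the greedy run of motif copies starting at each position.
    pattern = re.compile(f"(?=((?:{re.escape(motif)})+))")
    runs = [len(m.group(1)) // len(motif) for m in pattern.finditer(sequence.upper())]
    return max(runs, default=0)


# Toy sequence (hypothetical, error-free): three CAG copies, a spacer, then two more.
print(longest_tandem_run("TTCAGCAGCAGAACAGCAGTT", "CAG"))
```

On real long reads, insertions, deletions and substitutions interrupt the repeat tract, which is why an error-tolerant model is needed in practice.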
Bacillus vallismortis NBIF-001, a Gram-positive bacterium, was isolated from soil in Shangri-La, China. Here, we provide the complete genome sequence of this bacterium, which has a 3,929,787-bp-long genome, including 4,030 protein-coding genes and 195 RNA genes. This strain possesses a number of genes encoding virulence factors of pathogens. Copyright © 2017 Liu et al.
As a highly valued keto-carotenoid, astaxanthin is widely used in nutritional supplements and pharmaceuticals. Therefore, the demand for biosynthetic astaxanthin and improved efficiency of astaxanthin biosynthesis has driven the investigation of metabolic engineering of native astaxanthin producers and heterologous hosts. However, microbial resources for astaxanthin are limited. In this study, we found that the α-proteobacterium Sphingomonas sp. ATCC 55669 could produce astaxanthin naturally. We used whole-genome sequencing to identify the astaxanthin biosynthetic pathway using a combined PacBio-Illumina approach. The putative astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669 was predicted. For further confirmation, a high-efficiency targeted engineering carotenoid synthesis platform…
Streptomyces xiamenensis 318, a moderate halophile isolated from a mangrove sediment, produces the anti-fibrotic compound xiamenmycin. The whole genome sequence of strain 318 was obtained through long-read single-molecule real-time (SMRT) sequencing, high-throughput Illumina HiSeq and 454 pyrosequencing technologies. The assembled genome comprises a linear chromosome as a single contig of 5,961,401 bp, which is considerably smaller than other reported complete genomes of the genus Streptomyces. Based on the antiSMASH pipeline, a total of 21 gene clusters were predicted to be involved in secondary metabolism. The gene cluster responsible for the biosynthesis of xiamenmycin resides in a strain-specific 61,387-bp genomic island belonging to…
Cnaphalocrocis medinalis is a major pest of rice in South and South-East Asia. Insecticides are the major means farmers use for management. A naturally occurring baculovirus, C. medinalis granulovirus (CnmeGV), has been isolated from the larvae and has the potential for use as a microbial agent. Here, we describe the complete genome sequence of CnmeGV and compare it to other baculovirus genomes. The genome of CnmeGV is 112,060 base pairs in length and has a G+C content of 35.2%. It contains 133 putative open reading frames (ORFs) of at least 150 nucleotides. A hundred and one (101) of these ORFs are…