Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arrays and generate a de novo assembly of 2.93?Gb (contig N50: 8.3?Mb, scaffold N50: 22.0?Mb, including 39.3?Mb N-bases), together with 206?Mb of alternative haplotypes. The assembly fully or partially fills 274 (28.4%) N-gaps in the reference genome GRCh38. Comparison to GRCh38 reveals 12.8?Mb of HX1-specific sequences, including 4.1?Mb that are not present in previously reported Asian genomes. Furthermore, long-read sequencing of the transcriptome reveals novel spliced genes that are not annotated in GENCODE and are missed by short-read RNA-Seq. Our results imply that improved characterization of genome functional variation may require the use of a range of genomic technologies on diverse human populations.
Single-molecule real-time transcript sequencing facilitates common wheat genome annotation and grain transcriptome research.
The large and complex hexaploid genome has greatly hindered genomics studies of common wheat (Triticum aestivum, AABBDD). Here, we investigated transcripts in common wheat developing caryopses using the emerging single-molecule real-time (SMRT) sequencing technology PacBio RSII, and assessed the resultant data for improving common wheat genome annotation and grain transcriptome research.We obtained 197,709 full-length non-chimeric (FLNC) reads, 74.6 % of which were estimated to carry complete open reading frame. A total of 91,881 high-quality FLNC reads were identified and mapped to 16,188 chromosomal loci, corresponding to 13,162 known genes and 3026 new genes not annotated previously. Although some FLNC reads could not be unambiguously mapped to the current draft genome sequence, many of them are likely useful for studying highly similar homoeologous or paralogous loci or for improving chromosomal contig assembly in further research. The 91,881 high-quality FLNC reads represented 22,768 unique transcripts, 9591 of which were newly discovered. We found 180 transcripts each spanning two or three previously annotated adjacent loci, suggesting that they should be merged to form correct gene models. Finally, our data facilitated the identification of 6030 genes differentially regulated during caryopsis development, and full-length transcripts for 72 transcribed gluten gene members that are important for the end-use quality control of common wheat.Our work demonstrated the value of PacBio transcript sequencing for improving common wheat genome annotation through uncovering the loci and full-length transcripts not discovered previously. The resource obtained may aid further structural genomics and grain transcriptome studies of common wheat.
The increasing prevalence of KPC-producing Klebsiella pneumoniae strains in clinical settings has been largely attributed to dissemination of organisms of specific multilocus sequence types, such as ST258 and ST11. Compared with the ST258 clone, which is prevalent in North America and Europe, ST11 is common in China but information regarding its genetic features remains scarce. In this study, we performed detailed genetic characterization of ST11 K. pneumoniae strains by analyzing whole-genome sequences of 58 clinical strains collected from diverse geographic locations in China. The ST11 genomes were found to be highly heterogeneous and clustered into at least three major lineages based on the patterns of single-nucleotide polymorphisms. Exhibiting five different capsular types, these ST11 strains were found to harbor multiple resistance and virulence determinants such as the blaKPC-2 gene, which encodes carbapenemase, and the yersiniabactin-associated virulence genes irp, ybt and fyu. Moreover, genes encoding the virulence factor aerobactin and the regulator of the mucoid phenotype (rmpA) were detectable in six genomes, whereas genes encoding salmochelin were found in three genomes. In conclusion, our data indicated that carriage of a wide range of resistance and virulence genes constitutes the underlying basis of the high level of prevalence of ST11 in clinical settings. Such findings provide insight into the development of novel strategies for prevention, diagnosis and treatment of K. pneumoniae infections.
Enhancing the adaptability of the deep-sea bacterium Shewanella piezotolerans WP3 to high pressure and low temperature by experimental evolution under H2O2 stress.
Oxidative stresses commonly exist in natural environments, and microbes have developed a variety of defensive systems to counteract such events. Although increasing evidence has shown that high hydrostatic pressure (HHP) and low temperature (LT) induce antioxidant defense responses in cells, there is no direct evidence to prove the connection between antioxidant defense mechanisms and the adaptation of bacteria to HHP and LT. In this study, using the wild-type (WT) strain of a deep-sea bacterium, Shewanella piezotolerans WP3, as an ancestor, we obtained a mutant, OE100, with an enhanced antioxidant defense capacity by experimental evolution under H2O2 stress. Notably, OE100 exhibited better tolerance not only to H2O2 stress but also to HHP and LT (20 MPa and 4°C, respectively). Whole-genome sequencing identified a deletion mutation in the oxyR gene, which encodes the transcription factor that controls the oxidative stress response. Comparative transcriptome analysis showed that the genes associated with oxidative stress defense, anaerobic respiration, DNA repair, and the synthesis of flagella and bacteriophage were differentially expressed in OE100 compared with the WT at 20 MPa and 4°C. Genetic analysis of oxyR and ccpA2 indicated that the OxyR-regulated cytochrome c peroxidase CcpA2 significantly contributed to the adaptation of WP3 to HHP and LT. Taken together, these results confirmed the inherent relationship between antioxidant defense mechanisms and the adaptation of a benthic microorganism to HHP and LT.IMPORTANCE Oxidative stress exists in various niches, including the deep-sea ecosystem, which is an extreme environment with conditions of HHP and predominantly LT. Although previous studies have shown that HHP and LT induce antioxidant defense responses in cells, direct evidence to prove the connection between antioxidant defense mechanisms and the adaptation of bacteria to HHP and LT is lacking. In this work, using the deep-sea bacterium Shewanella piezotolerans WP3 as a model, we proved that enhancement of the adaptability of WP3 to HHP and LT can benefit from its antioxidant defense mechanism, which provided useful insight into the ecological roles of antioxidant genes in a benthic microorganism and contributed to an improved understanding of microbial adaptation strategies in deep-sea environments.
Identification and characterization of conjugative plasmids that encode ciprofloxacin resistance in Salmonella.
This study aimed to characterize novel conjugative plasmids that encode transferrable ciprofloxacin resistance in Salmonella In this study, 157 non-duplicated Salmonella isolates were recovered from food products, 55 out of which were found to be resistant to ciprofloxacin. Interestingly, 37 out of the 55 (67%) CipRSalmonella isolates did not harbor any mutations in the Quinolone resistance determine regions (QRDR). Interestingly, six Salmonella isolates were shown to carry two novel types of conjugative plasmids that could transfer ciprofloxacin resistance phenotype to E. coli J53 (AziR). The first type belonged to the ~110kb IncFIB type conjugative plasmid carrying qnrB-bearing and aac(6′)-Ib-cr-bearing mobile elements. Transfer of the plasmid between E. coli or Salmonella could confer CIP MIC to 1 to 2µg/ml. The second type of conjugative plasmid belonged to ~240kb IncH1/IncF plasmids carrying a single PMQR gene, qnrS Importantly, this type of conjugative ciprofloxacin resistance plasmids could be detected in clinical isolates of Salmonella Dissemination of these conjugative plasmids that confer ciprofloxaicn resistance poses serious public health impact and Salmonella infection control. Copyright © 2018 American Society for Microbiology.
Salinicoccus halodurans H3B36 is a moderately halophilic bacterium isolated from a sediment sample of Qaidam Basin at 3.2 m vertical depth. Strain H3B36 accumulate N (a)-acetyl-a-lysine as compatible solute against salinity and heat stresses and may have potential applications in industrial biotechnology. In this study, we sequenced the genome of strain H3B36 using single molecule, real-time sequencing technology on a PacBio RS II instrument. The complete genome of strain H3B36 was 2,778,379 bp and contained 2,853 protein-coding genes, 12 rRNA genes, and 61 tRNA genes with 58 tandem repeats, six minisatellite DNA sequences, 11 genome islands, and no CRISPR repeat region. Further analysis of epigenetic modifications revealed the presence of 11,000 m4C-type modified bases, 7,545 m6A-type modified bases, and 89,064 other modified bases. The data on the genome of this strain may provide an insight into the metabolism of N (a)-acetyl-a-lysine.
IncI1 plasmids encoding various blaCTX-Ms contributed to ceftriaxone resistance in Salmonella Enteritidis in China.
Resistance to extended spectrum ß-lactams in Salmonella, in particular serotypes such as S. Enteritidis that are frequently associated with clinical infections, is a serious public health concern. In this study, phenotypic characterization of 433 clinical S. Enteritidis strains obtained from a nationwide collection of China CDC during the period of 2005~2010 depicted an increasing trend of resistance to ceftriaxone from 2008 onwards. Seventeen (4%) of the strains were found to be resistant to ceftriaxone, 7% to ciprofloxacin and 0.7% to both ciprofloxacin and ceftriaxone. Most of the ceftriaxone-resistant S. Enteritidis strains (15/17) were genetically unrelated, and originated from Henan province. The complete sequence of an IncI1 plasmid pSE115 which belonged to a novel Sequence Type was obtained. This 87,255bp IncI1 plasmid was found to harbour a blaCTX-M-14 gene located in a novel Multidrug Resistance Region (MRR) within the tra locus. Although the majority of strains were also found to contain conjugative IncI1 plasmids of similar size to pSE115(~90kb) and harbor a variety of blaCTX-MGroup 1 and Group 9 elements, the novel MRR site at the tra locus in pSE115 was not detectable in the other IncI1 plasmids. Findings in this study show that cephalosporin resistance in S. Enteritidis strains collected in China was mainly due to dissemination of blaCTX-M-encoding IncI1 plasmids, resembling the situation in which IncI1 plasmids serve as major vectors of blaCTX-M variants in other members of Enterobacteriaceae. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Streptomyces lydicus A02 is used by industry because it has a higher natamycin-producing capacity than the reference strain S. natalensis ATCC 27448. We sequenced the complete genome of A02 using next-generation sequencing platforms, and to achieve better sequence coverage and genome assembly, we utilized single-molecule real-time (SMRT) sequencing. The assembled genome comprises a 9,307,519-bp linear chromosome with a GC content of 70.67%, and contained 8,888 predicted genes. Comparative genomics and natamycin biosynthetic gene cluster (BGC) analysis showed that BGC are highly conserved among evolutionarily diverse strains, and they also shared closer genome evolution compared with other Streptomyces species. Forty gene clusters were predicted to involve in the secondary metabolism of A02, and it was richly displayed in two-component signal transduction systems (TCS) in the genome, indicating a complex regulatory systems and high diversity of metabolites. Disruption of the phoP gene of the phoR-phoP TCS and nsdA gene confirmed phosphate sensitivity and global negative regulation of natamycin production. The genome sequence and analyses presented in this study provide an important molecular basis for research on natamycin production in Streptomyces, which could facilitate rational genome modification to improve the industrial use of A02.
Genetic characterization of mcr-1-bearing plasmids to depict molecular mechanisms underlying dissemination of the colistin resistance determinant.
To analyse and compare mcr-1-bearing plasmids from animal Escherichia coli isolates, and to investigate potential mechanisms underlying dissemination of mcr-1.Ninety-seven ESBL-producing E. coli strains isolated from pig farms in China were screened for the mcr-1 gene. Fifteen mcr-1-positive strains were subjected to molecular characterization and bioinformatic analysis of the mcr-1-bearing plasmids that they harboured.Three major types of mcr-1-bearing plasmids were recovered: IncX4 (~33 kb), IncI2 (~60 kb) and IncHI2 (~216-280 kb), among which the IncX4 and IncI2 plasmids were found to harbour the mcr-1 gene only, whereas multiple resistance elements including blaCTX-M, blaCMY, blaTEM, fosA, qnrS, floR and oqxAB were detected, in various combinations, alongside mcr-1 in the IncHI2 plasmids. The profiles of mcr-1-bearing plasmids in the test strains were highly variable, with coexistence of two mcr-1-bearing plasmids being common. However, the MIC of colistin was not affected by the number of mcr-1-carrying plasmids harboured. Comparative analysis of the plasmids showed that they contained an mcr-1 gene cassette with varied structures (mcr-1-orf, ISApl1-mcr-1-orf and Tn6330), with the IncHI2 type being the most active in acquiring foreign resistance genes. A novel transposon, Tn6330, with the structure ISApl1-mcr-1-orf-ISApl1 was found to be the key element mediating translocation of mcr-1 into various plasmid backbones through formation of a circular intermediate.The mcr-1 gene can be disseminated via multiple mobile elements including Tn6330, its circular intermediate and plasmids harbouring such elements. It is often co-transmitted with other resistance determinants through IncHI2 plasmids. The functional mechanism of Tn6330, a typical composite transposon harbouring mcr-1, should be further investigated.© The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: firstname.lastname@example.org.
To characterize a novel virulence-resistance plasmid pSE380T carried by a Salmonella enterica serotype Enteritidis clinical strain SE380.The plasmid pSE380T was conjugated to Escherichia coli strain J53 and sequenced by PacBio RSII, followed by subsequent annotation and genetic analysis.Sequence analysis of this plasmid revealed that the entire Salmonella Enteritidis-specific virulence plasmid, pSEN, had been incorporated into an IncHI2 MDR plasmid, which comprises the cephalosporin and fosfomycin resistance determinants blaCTX-M-14 and fosA3. Based on BLAST analysis and scrutiny of insertion footprints, the insertion event was found to involve a replicative transposition process mediated by IS26, an IS element frequently detected in various resistance plasmids. The resulting pSE380T plasmid also comprises backbone elements of IncHI2 and IncFIA plasmids, producing a rare fusion product that simultaneously encodes functional features of both, i.e. virulence, resistance and high transmissibility.This is a novel hybrid plasmid mediating MDR and virulence from a clinical Salmonella Enteritidis strain. This plasmid is likely to be transmissible amongst various serotypes of Salmonella and other Enterobacteriaceae species, rendering a wide range of bacterial pathogens resistant to cephalosporins and fosfomycin, and further enhancing their virulence potential. It will be important to monitor the spread and further evolution of this plasmid among the Enterobacteriaceae strains.© The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: email@example.com.
Microsatellite expansion, such as trinucleotide repeat expansion (TRE), is known to cause a number of genetic diseases. Sanger sequencing and next-generation short-read sequencing are unable to interrogate TRE reliably. We developed a novel algorithm called RepeatHMM to estimate repeat counts from long-read sequencing data. Evaluation on simulation data, real amplicon sequencing data on two repeat expansion disorders, and whole-genome sequencing data generated by PacBio and Oxford Nanopore technologies showed superior performance over competing approaches. We concluded that long-read sequencing coupled with RepeatHMM can estimate repeat counts on microsatellites and can interrogate the “unsequenceable” genomic trinucleotide repeat disorders.
Complete genome sequence of Bacillus vallismortis NBIF-001, a novel strain from Shangri-La, China, that has high activity against Fusarium oxysporum.
Bacillus vallismortis NBIF-001, a Gram-positive bacterium, was isolated from soil in Shangri-La, China. Here, we provide the complete genome sequence of this bacterium, which has a 3,929,787-bp-long genome, including 4,030 protein-coding genes and 195 RNA genes. This strain possesses a number of genes encoding virulence factors of pathogens. Copyright © 2017 Liu et al.
Genome mining of astaxanthin biosynthetic genes from Sphingomonas sp. ATCC 55669 for heterologous overproduction in Escherichia coli.
As a highly valued keto-carotenoid, astaxanthin is widely used in nutritional supplements and pharmaceuticals. Therefore, the demand for biosynthetic astaxanthin and improved efficiency of astaxanthin biosynthesis has driven the investigation of metabolic engineering of native astaxanthin producers and heterologous hosts. However, microbial resources for astaxanthin are limited. In this study, we found that the a-Proteobacterium Sphingomonas sp. ATCC 55669 could produce astaxanthin naturally. We used whole-genome sequencing to identify the astaxanthin biosynthetic pathway using a combined PacBio-Illumina approach. The putative astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669 was predicted. For further confirmation, a high-efficiency targeted engineering carotenoid synthesis platform was constructed in E. coli for identifying the functional roles of candidate genes. All genes involved in astaxanthin biosynthesis showed discrete distributions on the chromosome. Moreover, the overexpression of exogenous E. coli idi in Sphingomonas sp. ATCC 55669 increased astaxanthin production by 5.4-fold. This study described a new astaxanthin producer and provided more biosynthesis components for bioengineering of astaxanthin in the future. © 2015 The Authors. Biotechnology Journal published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Deciphering the streamlined genome of Streptomyces xiamenensis 318 as the producer of the anti-fibrotic drug candidate xiamenmycin.
Streptomyces xiamenensis 318, a moderate halophile isolated from a mangrove sediment, produces the anti-fibrotic compound xiamenmycin. The whole genome sequence of strain 318 was obtained through long-read single-molecule real-time (SMRT) sequencing, high-throughput Illumina HiSeq and 454 pyrosequencing technologies. The assembled genome comprises a linear chromosome as a single contig of 5,961,401-bp, which is considerably smaller than other reported complete genomes of the genus Streptomyces. Based on the antiSMASH pipeline, a total of 21?gene clusters were predicted to be involved in secondary metabolism. The gene cluster responsible for the biosynthesis of xiamenmycin resides in a strain-specific 61,387-bp genomic island belonging to the left-arm region. A core metabolic network consisting of 104 reactions that supports xiamenmycin biosynthesis was constructed to illustrate the necessary precursors derived from the central metabolic pathway. In accordance with the finding of a putative ikarugamycin gene cluster in the genome, the targeted chemical profiling of polycyclic tetramate macrolactams (PTMs) resulted in the identification of ikarugamycin. A successful genome mining for bioactive molecules with different skeletons suggests that the naturally minimized genome of S. xiamenensis 318 could be used as a blueprint for constructing a chassis cell with versatile biosynthetic capabilities for the production of secondary metabolites.
Genome of Cnaphalocrocis medinalis granulovirus, the first Crambidae-infecting betabaculovirus isolated from rice leaffolder to sequenced.
Cnaphalocrocis medinalis is a major pest of rice in South and South-East Asia. Insecticides are the major means farmers use for management. A naturally occurring baculovirus, C. medinalis granulovirus (CnmeGV), has been isolated from the larvae and this has the potential for use as microbial agent. Here, we described the complete genome sequence of CnmeGV and compared it to other baculovirus genomes. The genome of CnmeGV is 112,060 base pairs in length, has a G+C content of 35.2%. It contains 133 putative open reading frames (ORFs) of at least 150 nucleotides. A hundred and one (101) of these ORFs are homologous to other baculovirus genes including 37 baculovirus core genes. Thirty-two (32) ORFs are unique to CnmeGV with no homologues detected in the GeneBank and 53 tandem repeats (TRs) with sequence length from 25 to 551 nt intersperse throughout the genome of CnmeGV. Six (6) homologous regions (hrs) were identified interspersed throughout the genome. Hr2 contains 11 imperfect palindromes and a high content of AT sequence (about 73%). The unique ORF28 contains a coiled-coil region and a zinc finger-like domain of 4-50 residues specialized by two C2C2 zinc finger motifs that putatively bound two atoms of zinc. ORF21 encoding a chit-1 protein suggesting a horizontal gene transfer from alphabaculovirus. The putative protein presents two carbohydrate-binding module family 14 (CBM_14) domains rather than other homologues detected from betabaculovirus that only contains one chit-binding region. Gene synteny maps showed the colinearity of sequenced betabaculovirus. Phylogenetic analysis indicated that CnmeGV grouped in the betabaculovirus, with a close relation to AdorGV. The cladogram obtained in this work grouped the 17 complete GV genomes in one monophyletic clade. CnmeGV represents a new crambidae host-isolated virus species from the genus Betabaculovirus and is most closely relative of AdorGV. The analyses and information derived from this study will provide a better understanding of the pathological symptoms caused by this virus and its potential use as a microbial pesticide.