Menu
April 21, 2020

A First Study of the Virulence Potential of a Bacillus subtilis Isolate From Deep-Sea Hydrothermal Vent.

Bacillus subtilis is the best studied Gram-positive bacterium, primarily as a model of cell differentiation and industrial exploitation. To date, little is known about the virulence of B. subtilis. In this study, we examined the virulence potential of a B. subtilis strain (G7) isolated from the Iheya North hydrothermal field of Okinawa Trough. G7 is aerobic, motile, endospore-forming, and requires NaCl for growth. The genome of G7 is composed of one circular chromosome of 4,216,133 base pairs with an average GC content of 43.72%. G7 contains 4,416 coding genes, 27.5% of which could not be annotated, and the remaining 72.5% were annotated with known or predicted functions in 25 different COG categories. Ten sets of 23S, 5S, and 16S ribosomal RNA operons, 86 tRNA and 14 sRNA genes, 50 tandem repeats, 41 mini-satellites, one microsatellite, and 42 transposons were identified in G7. Comparing to the genome of the B. subtilis wild type strain NCIB 3610T, G7 genome contains many genomic translocations, inversions, and insertions, and twice the amount of genomic Islands (GIs), with 42.5% of GI genes encoding hypothetical proteins. G7 possesses abundant putative virulence genes associated with adhesion, invasion, dissemination, anti-phagocytosis, and intracellular survival. Experimental studies showed that G7 was able to cause mortality in fish and mice following intramuscular/intraperitoneal injection, resist the killing effect of serum complement, and replicate in mouse macrophages and fish peripheral blood leukocytes. Taken together, our study indicates that G7 is a B. subtilis isolate with unique genetic features and can be lethal to vertebrate animals once being introduced into the animals by artificial means. These results provide the first insight into the potential harmfulness of deep-sea B. subtilis.


April 21, 2020

New genetic context of lnu(B) composed of two multi-resistance gene clusters in clinical Streptococcus agalactiae ST-19 strains.

Clindamycin is a lincosamide antibiotic used to treat staphylococcal and streptococcal infections. Reports of clinical Streptococcus agalactiae isolates with the rare lincosamide resistance/macrolide susceptibility (LR/MS) phenotype are increasing worldwide. In this study, we characterised three clinical S. agalactiae strains with the unusual L phenotype from China.Three clinical S. agalactiae strains, Sag3, Sag27 and Sag4104, with the L phenotype were identified from 186 isolates collected from 2016 to 2018 in Shanghai, China. The MICs of clindamycin, erythromycin, tetracycline, levofloxacin, and penicillin were determined using Etest. PCR for the lnu(B) gene was conducted. Whole genome sequencing and sequence analysis were carried out to investigate the genetic context of lnu(B). Efforts to transfer lincomycin resistance by conjugation and to identify the circular form by inverse PCR were made.Sag3, Sag27, and Sag4104 were susceptible to erythromycin (MIC =0.25?mg/L) but resistant to clindamycin (MIC =1?mg/L). lnu(B) was found to be responsible for the L phenotype. lnu(B) in Sag3 and Sag27 were chromosomally located in an aadE-spw-lsa(E)-lnu(B) resistance gene cluster adjacent to an upstream 7-kb tet(L)-cat resistance gene cluster. Two resistance gene clusters were flanked by the IS6-like element, IS1216. Sag4104 only contained partial genes of aadE-spw-lsa(E)-lnu(B) resistance gene cluster and was also flanked by IS1216.These results established the presence of the L phenotype associated with lnu(B) in clinical S. agalactiae isolates in China. The lnu(B)-containing multi-resistance gene cluster possibly acts as a composite transposon flanked by IS1216 and as a vehicle for the dissemination of multidrug resistance among S. agalactiae.


April 21, 2020

Characterization of a male specific region containing a candidate sex determining gene in Atlantic cod.

The genetic mechanisms determining sex in teleost fishes are highly variable and the master sex determining gene has only been identified in few species. Here we characterize a male-specific region of 9?kb on linkage group 11 in Atlantic cod (Gadus morhua) harboring a single gene named zkY for zinc knuckle on the Y chromosome. Diagnostic PCR test of phenotypically sexed males and females confirm the sex-specific nature of the Y-sequence. We identified twelve highly similar autosomal gene copies of zkY, of which eight code for proteins containing the zinc knuckle motif. 3D modeling suggests that the amino acid changes observed in six copies might influence the putative RNA-binding specificity. Cod zkY and the autosomal proteins zk1 and zk2 possess an identical zinc knuckle structure, but only the Y-specific gene zkY was expressed at high levels in the developing larvae before the onset of sex differentiation. Collectively these data suggest zkY as a candidate master masculinization gene in Atlantic cod. PCR amplification of Y-sequences in Arctic cod (Arctogadus glacialis) and Greenland cod (Gadus macrocephalus ogac) suggests that the male-specific region emerged in codfishes more than 7.5 million years ago.


April 21, 2020

A draft genome for Spatholobus suberectus.

Spatholobus suberectus Dunn (S. suberectus), which belongs to the Leguminosae, is an important medicinal plant in China. Owing to its long growth cycle and increased use in human medicine, wild resources of S. suberectus have decreased rapidly and may be on the verge of extinction. De novo assembly of the whole S. suberectus genome provides us a critical potential resource towards biosynthesis of the main bioactive components and seed development regulation mechanism of this plant. Utilizing several sequencing technologies such as Illumina HiSeq X Ten, single-molecule real-time sequencing, 10x Genomics, as well as new assembly techniques such as FALCON and chromatin interaction mapping (Hi-C), we assembled a chromosome-scale genome about 798?Mb in size. In total, 748?Mb (93.73%) of the contig sequences were anchored onto nine chromosomes with the longest scaffold being 103.57?Mb. Further annotation analyses predicted 31,634 protein-coding genes, of which 93.9% have been functionally annotated. All data generated in this study is available in public databases.


April 21, 2020

Adaptive Strategies in a Poly-Extreme Environment: Differentiation of Vegetative Cells in Serratia ureilytica and Resistance to Extreme Conditions.

Poly-extreme terrestrial habitats are often used as analogs to extra-terrestrial environments. Understanding the adaptive strategies allowing bacteria to thrive and survive under these conditions could help in our quest for extra-terrestrial planets suitable for life and understanding how life evolved in the harsh early earth conditions. A prime example of such a survival strategy is the modification of vegetative cells into resistant resting structures. These differentiated cells are often observed in response to harsh environmental conditions. The environmental strain (strain Lr5/4) belonging to Serratia ureilytica was isolated from a geothermal spring in Lirima, Atacama Desert, Chile. The Atacama Desert is the driest habitat on Earth and furthermore, due to its high altitude, it is exposed to an increased amount of UV radiation. The geothermal spring from which the strain was isolated is oligotrophic and the temperature of 54°C exceeds mesophilic conditions (15 to 45°C). Although the vegetative cells were tolerant to various environmental insults (desiccation, extreme pH, glycerol), a modified cell type was formed in response to nutrient deprivation, UV radiation and thermal shock. Scanning (SEM) and Transmission Electron Microscopy (TEM) analyses of vegetative cells and the modified cell structures were performed. In SEM, a change toward a circular shape with reduced size was observed. These circular cells possessed what appears as extra coating layers under TEM. The resistance of the modified cells was also investigated, they were resistant to wet heat, UV radiation and desiccation, while vegetative cells did not withstand any of those conditions. A phylogenomic analysis was undertaken to investigate the presence of known genes involved in dormancy in other bacterial clades. Genes related to spore-formation in Myxococcus and Firmicutes were found in S. ureilytica Lr5/4 genome; however, these genes were not enough for a full sporulation pathway that resembles either group. Although, the molecular pathway of cell differentiation in S. ureilytica Lr5/4 is not fully defined, the identified genes may contribute to the modified phenotype in the Serratia genus. Here, we show that a modified cell structure can occur as a response to extremity in a species that was previously not known to deploy this strategy. This strategy may be widely spread in bacteria, but only expressed under poly-extreme environmental conditions.


April 21, 2020

Systematic analysis of dark and camouflaged genes reveals disease-relevant genes hiding in plain sight.

The human genome contains “dark” gene regions that cannot be adequately assembled or aligned using standard short-read sequencing technologies, preventing researchers from identifying mutations within these gene regions that may be relevant to human disease. Here, we identify regions with few mappable reads that we call dark by depth, and others that have ambiguous alignment, called camouflaged. We assess how well long-read or linked-read technologies resolve these regions.Based on standard whole-genome Illumina sequencing data, we identify 36,794 dark regions in 6054 gene bodies from pathways important to human health, development, and reproduction. Of these gene bodies, 8.7% are completely dark and 35.2% are =?5% dark. We identify dark regions that are present in protein-coding exons across 748 genes. Linked-read or long-read sequencing technologies from 10x Genomics, PacBio, and Oxford Nanopore Technologies reduce dark protein-coding regions to approximately 50.5%, 35.6%, and 9.6%, respectively. We present an algorithm to resolve most camouflaged regions and apply it to the Alzheimer’s Disease Sequencing Project. We rescue a rare ten-nucleotide frameshift deletion in CR1, a top Alzheimer’s disease gene, found in disease cases but not in controls.While we could not formally assess the association of the CR1 frameshift mutation with Alzheimer’s disease due to insufficient sample-size, we believe it merits investigating in a larger cohort. There remain thousands of potentially important genomic regions overlooked by short-read sequencing that are largely resolved by long-read technologies.


April 21, 2020

High-coverage, long-read sequencing of Han Chinese trio reference samples.

Single-molecule long-read sequencing datasets were generated for a son-father-mother trio of Han Chinese descent that is part of the Genome in a Bottle (GIAB) consortium portfolio. The dataset was generated using the Pacific Biosciences Sequel System. The son and each parent were sequenced to an average coverage of 60 and 30, respectively, with N50 subread lengths between 16 and 18?kb. Raw reads and reads aligned to both the GRCh37 and GRCh38 are available at the NCBI GIAB ftp site (ftp://ftp-trace.ncbi.nlm.nih.gov/giab/ftp/data/ChineseTrio/). The GRCh38 aligned read data are archived in NCBI SRA (SRX4739017, SRX4739121, and SRX4739122). This dataset is available for anyone to develop and evaluate long-read bioinformatics methods.


April 21, 2020

Plasmids of Shigella flexneri serotype 1c strain Y394 provide advantages to bacteria in the host.

Shigella flexneri has an extremely complex genome with a significant number of virulence traits acquired by mobile genetic elements including bacteriophages and plasmids. S. flexneri serotype 1c is an emerging etiological agent of bacillary dysentery in developing countries. In this study, the complete nucleotide sequence of two plasmids of S. flexneri serotype 1c strain Y394 was determined and analysed.The plasmid pINV-Y394 is an invasive or virulence plasmid of size 221,293?bp composed of a large number of insertion sequences (IS), virulence genes, regulatory and maintenance genes. Three hundred and twenty-eight open reading frames (ORFs) were identified in pINV-Y394, of which about a half (159 ORFs) were identified as IS elements. Ninety-seven ORFs were related to characterized genes (majority of which are associated with virulence and their regulons), and 72 ORFs were uncharacterized or hypothetical genes. The second plasmid pNV-Y394 is of size 10,866?bp and encodes genes conferring resistance against multiple antibiotics of clinical importance. The multidrug resistance gene cassette consists of tetracycline resistance gene tetA, streptomycin resistance gene strA-strB and sulfonamide-resistant dihydropteroate synthase gene sul2.These two plasmids together play a key role in the fitness of Y394 in the host environment. The findings from this study indicate that the pathogenic S. flexneri is a highly niche adaptive pathogen which is able to co-evolve with its host and respond to the selection pressure in its environment.


April 21, 2020

Characterization of an NDM-5 carbapenemase-producing Escherichia coli ST156 isolate from a poultry farm in Zhejiang, China.

The emergence of carbapenem-resistant Enterobacteriaceae strains has posed a severe threat to public health in recent years. The mobile elements carrying the New Delhi metallo-ß-lactqtamase (NDM) gene have been regarded as the major mechanism leading to the rapid increase of carbapenem-resistant Enterobacteriaceae strains isolated from clinics and animals.We describe an NDM-5-producing Escherichia coli strain, ECCRA-119 (sequence type 156 [ST156]), isolated from a poultry farm in Zhejiang, China. ECCRA-119 is a multidrug-resistant (MDR) isolate that exhibited resistance to 27 antimicrobial compounds, including imipenem and meropenem, as detected by antimicrobial susceptibility testing (AST). The complete genome sequence of the ECCRA-119 isolate was also obtained using the PacBio RS II platform. Eleven acquired resistance genes were identified in the chromosome; four were detected in plasmid pTB201, while six were detected in plasmid pTB202. Importantly, the carbapenem-resistant gene blaNDM-5 was detected in the IncX3 plasmid pTB203. In addition, seven virulence genes and one metal-resistance gene were also detected. The results of conjugation experiments and the transfer regions identification indicated that the blaNDM-5-harboring plasmid pTB203 could be transferred between E. coli strains.The results reflected the severe bacterial resistance in a poultry farm in Zhejiang province and increased our understanding of the presence and transmission of the blaNDM-5 gene.


April 21, 2020

The Anaplasma ovis genome reveals a high proportion of pseudogenes.

The genus Anaplasma is made up of organisms characterized by small genomes that are undergoing reductive evolution. Anaplasma ovis, one of the seven recognized species in this genus, is an understudied pathogen of sheep and other ruminants. This tick-borne agent is thought to induce only mild clinical disease; however, small deficits may add to larger economic impacts due to the wide geographic distribution of this pathogen.In this report we present the first complete genome sequence for A. ovis and compare the genome features with other closely related species. The 1,214,674?bp A. ovis genome encodes 933 protein coding sequences, the split operon arrangement for ribosomal RNA genes, and more pseudogenes than previously recognized for other Anaplasma species. The metabolic potential is similar to other Anaplasma species. Anaplasma ovis has a small repertoire of surface proteins and transporters. Several novel genes are identified.Analyses of these important features and significant gene families/genes with potential to be vaccine candidates are presented in a comparative context. The availability of this genome will significantly facilitate research for this pathogen.


April 21, 2020

Progression of the canonical reference malaria parasite genome from 2002-2019.

Here we describe the ways in which the sequence and annotation of the Plasmodium falciparum reference genome has changed since its publication in 2002. As the malaria species responsible for the most deaths worldwide, the richness of annotation and accuracy of the sequence are important resources for the P. falciparum research community as well as the basis for interpreting the genomes of subsequently sequenced species. At the time of publication in 2002 over 60% of predicted genes had unknown functions. As of March 2019, this number has been significantly decreased to 33%. The reduction is due to the inclusion of genes that were subsequently characterised experimentally and genes with significant similarity to others with known functions. In addition, the structural annotation of genes has been significantly refined; 27% of gene structures have been changed since 2002, comprising changes in exon-intron boundaries, addition or deletion of exons and the addition or deletion of genes. The sequence has also undergone significant improvements. In addition to the correction of a large number of single-base and insertion or deletion errors, a major miss-assembly between the subtelomeres of chromosome 7 and 8 has been corrected. As the number of sequenced isolates continues to grow rapidly, a single reference genome will not be an adequate basis for interpretating intra-species sequence diversity. We therefore describe in this publication a population reference genome of P. falciparum, called Pfref1. This reference will enable the community to map to regions that are not present in the current assembly. P. falciparum 3D7 will be continued to be maintained with ongoing curation ensuring continual improvements in annotation quality.


April 21, 2020

Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads.

Tandemly repeated DNA is highly mutable and causes at least 31 diseases, but it is hard to detect pathogenic repeat expansions genome-wide. Here, we report robust detection of human repeat expansions from careful alignments of long but error-prone (PacBio and nanopore) reads to a reference genome. Our method is robust to systematic sequencing errors, inexact repeats with fuzzy boundaries, and low sequencing coverage. By comparing to healthy controls, we prioritize pathogenic expansions within the top 10 out of 700,000 tandem repeats in whole genome sequencing data. This may help to elucidate the many genetic diseases whose causes remain unknown.


April 21, 2020

Comparative Genomic Analyses Reveal Core-Genome-Wide Genes Under Positive Selection and Major Regulatory Hubs in Outlier Strains of Pseudomonas aeruginosa.

Genomic information for outlier strains of Pseudomonas aeruginosa is exiguous when compared with classical strains. We sequenced and constructed the complete genome of an environmental strain CR1 of P. aeruginosa and performed the comparative genomic analysis. It clustered with the outlier group, hence we scaled up the analyses to understand the differences in environmental and clinical outlier strains. We identified eight new regions of genomic plasticity and a plasmid pCR1 with a VirB/D4 complex followed by trimeric auto-transporter that can induce virulence phenotype in the genome of strain CR1. Virulence genotype analysis revealed that strain CR1 lacked hemolytic phospholipase C and D, three genes for LPS biosynthesis and had reduced antibiotic resistance genes when compared with clinical strains. Genes belonging to proteases, bacterial exporters and DNA stabilization were found to be under strong positive selection, thus facilitating pathogenicity and survival of the outliers. The outliers had the complete operon for the production of vibrioferrin, a siderophore present in plant growth promoting bacteria. The competence to acquire multidrug resistance and new virulence factors makes these strains a potential threat. However, we identified major regulatory hubs that can be used as drug targets against both the classical and outlier groups.


April 21, 2020

Genome plasticity favours double chromosomal Tn4401b-blaKPC-2 transposon insertion in the Pseudomonas aeruginosa ST235 clone.

Pseudomonas aeruginosa Sequence Type 235 is a clone that possesses an extraordinary ability to acquire mobile genetic elements and has been associated with the spread of resistance genes, including genes that encode for carbapenemases. Here, we aim to characterize the genetic platforms involved in resistance dissemination in blaKPC-2-positive P. aeruginosa ST235 in Colombia.In a prospective surveillance study of infections in adult patients attended in five ICUs in five distant cities in Colombia, 58 isolates of P. aeruginosa were recovered, of which, 27 (46.6%) were resistant to carbapenems. The molecular analysis showed that 6 (22.2%) and 4 (14.8%) isolates harboured the blaVIM and blaKPC-2 genes, respectively. The four blaKPC-2-positive isolates showed a similar PFGE pulsotype and belonged to ST235. Complete genome sequencing of a representative ST235 isolate shows a unique chromosomal contig of 7097.241?bp with eight different resistance genes identified and five transposons: a Tn6162-like with ant(2?)-Ia, two Tn402-like with ant(3?)-Ia and blaOXA-2 and two Tn4401b with blaKPC-2. All transposons were inserted into the genomic islands. Interestingly, the two Tn4401b copies harbouring blaKPC-2 were adjacently inserted into a new genomic island (PAGI-17) with traces of a replicative transposition process. This double insertion was probably driven by several structural changes within the chromosomal region containing PAGI-17 in the ST235 background.This is the first report of a double Tn4401b chromosomal insertion in P. aeruginosa, just within a new genomic island (PAGI-17). This finding indicates once again the great genomic plasticity of this microorganism.


April 21, 2020

Construction of JRG (Japanese reference genome) with single-molecule real-time sequencing

In recent genome analyses, population-specific reference panels have indicated important. However, reference panels based on short-read sequencing data do not sufficiently cover long insertions. Therefore, the nature of long insertions has not been well documented. Here, we assembled a Japanese genome using single-molecule real-time sequencing data and characterized insertions found in the assembled genome. We identified 3691 insertions ranging from 100?bps to ~10,000?bps in the assembled genome relative to the international reference sequence (GRCh38). To validate and characterize these insertions, we mapped short-reads from 1070 Japanese individuals and 728 individuals from eight other populations to insertions integrated into GRCh38. With this result, we constructed JRGv1 (Japanese Reference Genome version 1) by integrating the 903 verified insertions, totaling 1,086,173 bases, shared by at least two Japanese individuals into GRCh38. We also constructed decoyJRGv1 by concatenating 3559 verified insertions, totaling 2,536,870 bases, shared by at least two Japanese individuals or by six other assemblies. This assembly improved the alignment ratio by 0.4% on average. These results demonstrate the importance of refining the reference assembly and creating a population-specific reference genome. JRGv1 and decoyJRGv1 are available at the JRG website.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.