Menu
April 21, 2020

Progression of the canonical reference malaria parasite genome from 2002-2019.

Here we describe the ways in which the sequence and annotation of the Plasmodium falciparum reference genome has changed since its publication in 2002. As the malaria species responsible for the most deaths worldwide, the richness of annotation and accuracy of the sequence are important resources for the P. falciparum research community as well as the basis for interpreting the genomes of subsequently sequenced species. At the time of publication in 2002 over 60% of predicted genes had unknown functions. As of March 2019, this number has been significantly decreased to 33%. The reduction is due to the inclusion of genes that were subsequently characterised experimentally and genes with significant similarity to others with known functions. In addition, the structural annotation of genes has been significantly refined; 27% of gene structures have been changed since 2002, comprising changes in exon-intron boundaries, addition or deletion of exons and the addition or deletion of genes. The sequence has also undergone significant improvements. In addition to the correction of a large number of single-base and insertion or deletion errors, a major miss-assembly between the subtelomeres of chromosome 7 and 8 has been corrected. As the number of sequenced isolates continues to grow rapidly, a single reference genome will not be an adequate basis for interpretating intra-species sequence diversity. We therefore describe in this publication a population reference genome of P. falciparum, called Pfref1. This reference will enable the community to map to regions that are not present in the current assembly. P. falciparum 3D7 will be continued to be maintained with ongoing curation ensuring continual improvements in annotation quality.


April 21, 2020

Structural variation of centromeric endogenous retroviruses in human populations and their impact on cutaneous T-cell lymphoma, Sézary syndrome, and HIV infection.

Human Endogenous Retroviruses type K HML-2 (HK2) are integrated into 117 or more areas of human chromosomal arms while two newly discovered HK2 proviruses, K111 and K222, spread extensively in pericentromeric regions, are the first retroviruses discovered in these areas of our genome.We use PCR and sequencing analysis to characterize pericentromeric K111 proviruses in DNA from individuals of diverse ethnicities and patients with different diseases.We found that the 5′ LTR-gag region of K111 proviruses is missing in certain individuals, creating pericentromeric instability. K111 deletion (-/- K111) is seen in about 15% of Caucasian, Asian, and Middle Eastern populations; it is missing in 2.36% of African individuals, suggesting that the -/- K111 genotype originated out of Africa. As we identified the -/-K111 genotype in Cutaneous T-cell lymphoma (CTCL) cell lines, we studied whether the -/-K111 genotype is associated with CTCL. We found a significant increase in the frequency of detection of the -/-K111 genotype in Caucasian patients with severe CTCL and/or Sézary syndrome (n?=?35, 37.14%), compared to healthy controls (n?=?160, 15.6%) [p?=?0.011]. The -/-K111 genotype was also found to vary in HIV-1 infection. Although Caucasian healthy individuals have a similar frequency of detection of the -/- K111 genotype, Caucasian HIV Long-Term Non-Progressors (LTNPs) and/or elite controllers, have significantly higher detection of the -/-K111 genotype (30.55%; n?=?36) than patients who rapidly progress to AIDS (8.5%; n?=?47) [p?=?0.0097].Our data indicate that pericentromeric instability is associated with more severe CTCL and/or Sézary syndrome in Caucasians, and appears to allow T-cells to survive lysis by HIV infection. These findings also provide new understanding of human evolution, as the -/-K111 genotype appears to have arisen out of Africa and is distributed unevenly throughout the world, possibly affecting the severity of HIV in different geographic areas.


April 21, 2020

Comparative Genomic Analyses Reveal Core-Genome-Wide Genes Under Positive Selection and Major Regulatory Hubs in Outlier Strains of Pseudomonas aeruginosa.

Genomic information for outlier strains of Pseudomonas aeruginosa is exiguous when compared with classical strains. We sequenced and constructed the complete genome of an environmental strain CR1 of P. aeruginosa and performed the comparative genomic analysis. It clustered with the outlier group, hence we scaled up the analyses to understand the differences in environmental and clinical outlier strains. We identified eight new regions of genomic plasticity and a plasmid pCR1 with a VirB/D4 complex followed by trimeric auto-transporter that can induce virulence phenotype in the genome of strain CR1. Virulence genotype analysis revealed that strain CR1 lacked hemolytic phospholipase C and D, three genes for LPS biosynthesis and had reduced antibiotic resistance genes when compared with clinical strains. Genes belonging to proteases, bacterial exporters and DNA stabilization were found to be under strong positive selection, thus facilitating pathogenicity and survival of the outliers. The outliers had the complete operon for the production of vibrioferrin, a siderophore present in plant growth promoting bacteria. The competence to acquire multidrug resistance and new virulence factors makes these strains a potential threat. However, we identified major regulatory hubs that can be used as drug targets against both the classical and outlier groups.


April 21, 2020

Complete genome sequence of the halophilic PHA-producing bacterium Halomonas sp. SF2003: insights into its biotechnological potential.

A halophilic Gram-negative eubacterium was isolated from the Iroise Sea and identified as an efficient producer of polyhydroxyalkanoates (PHA). The strain, designated SF2003, was found to belong to the Halomonas genus on the basis of 16S rRNA gene sequence similarity. Previous biochemical tests indicated that the Halomonas sp. strain SF2003 is capable of supporting various culture conditions which sometimes can be constraining for marine strains. This versatility could be of great interest for biotechnological applications. Therefore, a complete bacterial genome sequencing and de novo assembly were performed using a PacBio RSII sequencer and Hierarchical Genome Assembly Process software in order to predict Halomonas sp. SF2003 metabolisms, and to identify genes involved in PHA production and stress tolerance. This study demonstrates the complete genome sequence of Halomonas sp. SF2003 which contains a circular 4,36 Mbp chromosome, and replaces the strain in a phylogenetic tree. Genes related to PHA metabolism, carbohydrate metabolism, fatty acid metabolism and stress tolerance were identified and a comparison was made with metabolisms of relative species. Genes annotation highlighted the presence of typical genes involved in PHA biosynthesis such as phaA, phaB and phaC and enabled a preliminary analysis of their organization and characteristics. Several genes of carbohydrates and fatty acid metabolisms were also identified which provided helpful insights into both a better knowledge of the intricacies of PHA biosynthetic pathways and of production purposes. Results show the strong versatility of Halomonas sp. SF2003 to adapt to various temperatures and salinity which can subsequently be exploited for industrial applications such as PHA production.


April 21, 2020

Replicon-Based Typing of IncI-Complex Plasmids, and Comparative Genomics Analysis of IncI?/K1 Plasmids.

IncI-complex plasmids can be divided into seven subgroups IncI1, IncI2, IncI?, IncB/O, IncK1, IncK2, and IncZ. In this study, a replicon-based scheme was proposed for typing IncI-complex plasmids into four separately clustering subgroups IncI2, IncI1/B/O, IncI?/K1 and IncK2/Z, the last three of which were combined from IncI1 and IncB/O, IncI? and IncK1, and IncK2 and IncZ, respectively. Four IncI?/K1 plasmids p205880-NR2, p14E509-CTXM, p11011-CTXM and p61806-CTXM were fully sequenced and compared with IncI?/K1 reference pCT, IncI2 reference R721, IncI1/B/O reference R64 and IncK2/Z reference pO26-CRL-125. These plasmids shared conserved gene organization in the replication and conjugal transfer regions, but displaying considerable sequence diversity among different subgroups. Remarkable modular differences were observed in the maintenance and transfer leading regions. p205880-NR2 contained no resistance genes or accessory modules, while the other seven plasmids acquired one or more accessory modules, which harbored mobile elements [including unit transposons, insertion sequence (IS)-based transposition units and individual IS elements] and associated resistance markers (especially including those involved in resistance to ß-lactams, aminoglycosides, tetracyclins, phenicols, streptomycins, trimethoprims, sulphonamides, tunicamycins and erythromycins). Data presented here provided a deeper insight into diversification and evolution of IncI-complex plasmids.


April 21, 2020

Genome plasticity favours double chromosomal Tn4401b-blaKPC-2 transposon insertion in the Pseudomonas aeruginosa ST235 clone.

Pseudomonas aeruginosa Sequence Type 235 is a clone that possesses an extraordinary ability to acquire mobile genetic elements and has been associated with the spread of resistance genes, including genes that encode for carbapenemases. Here, we aim to characterize the genetic platforms involved in resistance dissemination in blaKPC-2-positive P. aeruginosa ST235 in Colombia.In a prospective surveillance study of infections in adult patients attended in five ICUs in five distant cities in Colombia, 58 isolates of P. aeruginosa were recovered, of which, 27 (46.6%) were resistant to carbapenems. The molecular analysis showed that 6 (22.2%) and 4 (14.8%) isolates harboured the blaVIM and blaKPC-2 genes, respectively. The four blaKPC-2-positive isolates showed a similar PFGE pulsotype and belonged to ST235. Complete genome sequencing of a representative ST235 isolate shows a unique chromosomal contig of 7097.241?bp with eight different resistance genes identified and five transposons: a Tn6162-like with ant(2?)-Ia, two Tn402-like with ant(3?)-Ia and blaOXA-2 and two Tn4401b with blaKPC-2. All transposons were inserted into the genomic islands. Interestingly, the two Tn4401b copies harbouring blaKPC-2 were adjacently inserted into a new genomic island (PAGI-17) with traces of a replicative transposition process. This double insertion was probably driven by several structural changes within the chromosomal region containing PAGI-17 in the ST235 background.This is the first report of a double Tn4401b chromosomal insertion in P. aeruginosa, just within a new genomic island (PAGI-17). This finding indicates once again the great genomic plasticity of this microorganism.


April 21, 2020

Complete genome sequence of Caulobacter flavus RHGG3T, a type species of the genus Caulobacter with plant growth-promoting traits and heavy metal resistance.

Caulobacter flavus RHGG3T, a novel type species in the genus Caulobacter, originally isolated from rhizosphere soil of watermelon (Citrullus lanatus), has the ability to improve the growth of watermelon seedling and tolerate heavy metals. In vitro, C. flavus RHGG3T was able to solubilize phosphate (80.56 mg L-1), produce indole-3-acetic acid (IAA) (11.58 mg L-1) and was resistant to multiple heavy metals (copper, zinc, cadmium, cobalt and lead). Inoculating watermelon with this strain increased shoot and root length by 22.1% and 43.7%, respectively, and the total number of lateral roots by 55.9% compared to non-inoculated watermelon. In this study, we present the complete genome sequence of C. flavus RHGG3T, which was comprised of a single circular chromosome of 5,659,202 bp with a G?+?C content of 69.25%. An annotation analysis revealed that the C. flavus RHGG3T genome contained 5172 coding DNA sequences, 9 rRNA and 55 tRNA genes. Genes related to plant growth promotion (PGP), such as those associated with phosphate solubilization, nitrogen fixation, IAA, phenazine, volatile compounds, spermidine and cobalamin synthesis, were found in the C. flavus RHGG3T genome. Some genes responsible for heavy metal tolerance were also identified. The genome sequence of strain RHGG3T reported here provides new insight into the molecular mechanisms underlying the promotion of plant growth and the resistance to heavy metals in C. flavus. This study will be valuable for further exploration of the biotechnological applications of strain RHGG3T in agriculture.


April 21, 2020

Genomic characterisation of the new Dickeya fangzhongdai species regrouping plant pathogens and environmental isolates.

The Dickeya genus is part of the Pectobacteriaceae family that is included in the newly described enterobacterales order. It comprises a group of aggressive soft rot pathogens with wide geographic distribution and host range. Among them, the new Dickeya fangzhongdai species groups causative agents of maceration-associated diseases that impact a wide variety of crops and ornamentals. It affects mainly monocot plants, but D. fangzhongdai strains have also been isolated from pear trees and water sources. Here, we analysed which genetic novelty exists in this new species, what are the D. fangzhongdai-specific traits and what is the intra-specific diversity.The genomes of eight D. fangzhongdai strains isolated from diverse environments were compared to 31 genomes of strains belonging to other Dickeya species. The D. fangzhongdai core genome regroups approximately 3500 common genes, including most genes that encode virulence factors and regulators characterised in the D. dadantii 3937 model strain. Only 38 genes are present in D. fangzhongdai and absent in all other Dickeyas. One of them encodes a pectate lyase of the PL10 family of polysaccharide lyases that is found only in a few bacteria from the plant environment, soil or human gut. Other D. fangzhongdai-specific genes with a known or predicted function are involved in regulation or metabolism. The intra-species diversity analysis revealed that seven of the studied D. fangzhongdai strains were grouped into two distinct clades. Each clade possesses a pool of 100-150 genes that are shared by the clade members, but absent from the other D. fangzhongdai strains and several of these genes are clustered into genomic regions. At the strain level, diversity resides mainly in the arsenal of T5SS- and T6SS-related toxin-antitoxin systems and in secondary metabolite biogenesis pathways.This study identified the genome-specific traits of the new D. fangzhongdai species and highlighted the intra-species diversity of this species. This diversity encompasses secondary metabolites biosynthetic pathways and toxins or the repertoire of genes of extrachromosomal origin. We however didn’t find any relationship between gene content and phenotypic differences or sharing of environmental habitats.


April 21, 2020

Whole Genome Sequencing and Comparative Genomics Analyses of Pandoraea sp. XY-2, a New Species Capable of Biodegrade Tetracycline.

Few bacteria are resistant to tetracycline and can even biodegrade tetracycline in the environment. In this study, we isolated a bacterium Pandoraea sp. XY-2, which could biodegrade 74% tetracycline at pH 7.0 and 30°C within 6 days. Thereafter, we determined the whole genome sequence of Pandoraea sp. XY-2 genome is a single circular chromosome of 5.06 Mb in size. Genomic annotation showed that two AA6 family members-encoding genes and nine glutathione S-transferase (GSTs)-encoding genes could be relevant to tetracycline biodegradation. In addition, the average nucleotide identities (ANI) analysis between the genomes of Pandoraea sp. XY-2 and other Pandoraea spp. revealed that Pandoraea sp. XY-2 belongs to a new species. Moreover, comparative genome analysis of 36 Pandoraea strains identified the pan and specific genes, numerous single nucleotide polymorphisms (SNPs), insertions, and deletion variations (InDels) and different syntenial relationships in the genome of Pandoraea sp. XY-2. Finally, the evolution and the origin analysis of genes related to tetracycline resistance revealed that the six tetA(48) genes and two specificgenes tetG and tetR in Pandoraea sp. XY-2 were acquired by horizontal gene transfer (HGT) events from sources related to Paraburkholderia, Burkholderia, Caballeronia, Salmonella, Vibrio, Proteobacteria, Pseudomonas, Acinetobacter, Flavimaricola, and some unidentified sources. As a new species, Pandoraea sp. XY-2 will be an excellent resource for the bioremediation of tetracycline-contaminated environment.


April 21, 2020

Direct pathway cloning of the sodorifen biosynthetic gene cluster and recombinant generation of its product in E. coli.

Serratia plymuthica WS3236 was selected for whole genome sequencing based on preliminary genetic and chemical screening indicating the presence of multiple natural product pathways. This led to the identification of a putative sodorifen biosynthetic gene cluster (BGC). The natural product sodorifen is a volatile organic compound (VOC) with an unusual polymethylated hydrocarbon bicyclic structure (C16H26) produced by selected strains of S. plymuthica. The BGC encoding sodorifen consists of four genes, two of which (sodA, sodB) are homologs of genes encoding enzymes of the non-mevalonate pathway and are thought to enhance the amounts of available farnesyl pyrophosphate (FPP), the precursor of sodorifen. Proceeding from FPP, only two enzymes are necessary to produce sodorifen: an S-adenosyl methionine dependent methyltransferase (SodC) with additional cyclisation activity and a terpene-cyclase (SodD). Previous analysis of S. plymuthica found sodorifen production titers are generally low and vary significantly among different producer strains. This precludes studies on the still elusive biological function of this structurally and biosynthetically fascinating bacterial terpene.Sequencing and mining of the S. plymuthica WS3236 genome revealed the presence of 38 BGCs according to antiSMASH analysis, including a putative sodorifen BGC. Further genome mining for sodorifen and sodorifen-like BGCs throughout bacteria was performed using SodC and SodD as queries and identified a total of 28 sod-like gene clusters. Using direct pathway cloning (DiPaC) we intercepted the 4.6 kb candidate sodorifen BGC from S. plymuthica WS3236 (sodA-D) and transformed it into Escherichia coli BL21. Heterologous expression under the control of the tetracycline inducible PtetO promoter firmly linked this BGC to sodorifen production. By utilizing this newly established expression system, we increased the production yields by approximately 26-fold when compared to the native producer. In addition, sodorifen was easily isolated in high purity by simple head-space sampling.Genome mining of all available genomes within the NCBI and JGI IMG databases led to the identification of a wealth of sod-like pathways which may be responsible for producing a range of structurally unknown sodorifen analogs. Introduction of the S. plymuthica WS3236 sodorifen BGC into the fast-growing heterologous expression host E. coli with a very low VOC background led to a significant increase in both sodorifen product yield and purity compared to the native producer. By providing a reliable, high-level production system, this study sets the stage for future investigations of the biological role and function of sodorifen and for functionally unlocking the bioinformatically identified putative sod-like pathways.


April 21, 2020

Complete Sequences of Multiple-Drug Resistant IncHI2 ST3 Plasmids in Escherichia coli of Porcine Origin in Australia

IncHI2 ST3 plasmids are known carriers of multiple antimicrobial resistance genes. Complete plasmid sequences from multiple drug resistant Escherichia coli circulating in Australian swine is however limited. Here we sequenced two related IncHI2 ST3 plasmids, pSDE-SvHI2 and pSDC-F2_12BHI2, from phylogenetically unrelated multiple-drug resistant Escherichia coli strains SvETEC (CC23:O157:H19) and F2_12B (ST93:O7:H4) from geographically disparate pig production operations in New South Wales, Australia. Unicycler was used to co-assemble short read (Illumina) and long read (PacBio SMRT) nucleotide sequence data. The plasmids encoded three drug-resistance loci, two of which carried class 1 integrons. One integron, hosting drfA12-orfF-aadA2, was within a hybrid Tn1721/21, with the second residing within a copper/silver resistance transposon, comprising part of an atypical sul3-associated structure. The third resistance locus was flanked by IS15DI and encoded neomycin resistance (neoR). An oqx-encoding transposon (quinolone resistance), similar in structure to Tn6010, was identified only in pSDC-F2_12BHI2. Both plasmids showed high sequence identity to plasmid pSTM6-275, recently described in Salmonella enterica serotype 1,4,[5],12:i:- that has risen to prominence and become endemic in Australia. IncHI2 ST3 plasmids circulating in commensal and pathogenic E. coli from Australian swine belong to a lineage of plasmids often in association with sul3 and host multiple complex antibiotic and metal resistance structures, formed in part by IS26.


April 21, 2020

CAMISIM: simulating metagenomes and microbial communities.

Shotgun metagenome data sets of microbial communities are highly diverse, not only due to the natural variation of the underlying biological systems, but also due to differences in laboratory protocols, replicate numbers, and sequencing technologies. Accordingly, to effectively assess the performance of metagenomic analysis software, a wide range of benchmark data sets are required.We describe the CAMISIM microbial community and metagenome simulator. The software can model different microbial abundance profiles, multi-sample time series, and differential abundance studies, includes real and simulated strain-level diversity, and generates second- and third-generation sequencing data from taxonomic profiles or de novo. Gold standards are created for sequence assembly, genome binning, taxonomic binning, and taxonomic profiling. CAMSIM generated the benchmark data sets of the first CAMI challenge. For two simulated multi-sample data sets of the human and mouse gut microbiomes, we observed high functional congruence to the real data. As further applications, we investigated the effect of varying evolutionary genome divergence, sequencing depth, and read error profiles on two popular metagenome assemblers, MEGAHIT, and metaSPAdes, on several thousand small data sets generated with CAMISIM.CAMISIM can simulate a wide variety of microbial communities and metagenome data sets together with standards of truth for method evaluation. All data sets and the software are freely available at https://github.com/CAMI-challenge/CAMISIM.


April 21, 2020

Complete genome sequence of bile-isolated Enterococcus avium strain 352

Background: Enterococcus avium is a Gram-positive pathogenic bacterium belonging to the family Enterobacte- riaceae. E. avium can cause bacteremia, peritonitis, and intracranial suppurative infection. However, the mechanism of its pathogenesis and its adaptation to a special niche is still unclear. Results: In this study, the E. avium strain 352 was isolated from human bile and whole genome sequencing was per- formed. The E. avium strain 352 consists of a circular 4,794,392 bp chromosome as well as an 87,705 bp plasmid. The GC content of the chromosome is 38.98%. There are 4905 and 99 protein coding sequences in the chromosome and the plasmid, respectively. The genome of the E. avium strain 352 contains number of genes reported to be associated with bile adaption, including bsh, sbcC, mutS, nifI, galU, and hupB. There are also several virulence-associated genes including esp, fss1, fss3, ecbA, bsh, lap, clpC, clpE, and clpP. Conclusions: This study demonstrates the presence of various virulence factors of the E. avium strain 352, which has the potential to cause infections. Moreover, the genes involved in bile adaption might contribute to its ability to live in bile. Further comparative genomic studies would help to elucidate the evolution of pathogenesis of E. avium.


April 21, 2020

Whole-genome sequencing of Klebsiella pneumoniae isolates to track strain progression in a single patient with recurrent urinary tract infection.

Klebsiella pneumoniae is an important uropathogen that increasingly harbors broad-spectrum antibiotic resistance determinants. Evidence suggests that some same-strain recurrences in women with frequent urinary tract infections (UTIs) may emanate from a persistent intravesicular reservoir. Our objective was to analyze K. pneumoniae isolates collected over weeks from multiple body sites of a single patient with recurrent UTI in order to track ordered strain progression across body sites, as has been employed across patients in outbreak settings. Whole-genome sequencing of 26 K. pneumoniae isolates was performed utilizing the Illumina platform. PacBio sequencing was used to create a refined reference genome of the original urinary isolate (TOP52). Sequence variation was evaluated by comparing the 26 isolate sequences to the reference genome sequence. Whole-genome sequencing of the K. pneumoniae isolates from six different body sites of this patient with recurrent UTI demonstrated 100% chromosomal sequence identity of the isolates, with only a small P2 plasmid deletion in a minority of isolates. No single nucleotide variants were detected. The complete absence of single-nucleotide variants from 26 K. pneumoniae isolates from multiple body sites collected over weeks from a patient with recurrent UTI suggests that, unlike in an outbreak situation with strains collected from numerous patients, other methods are necessary to discern strain progression within a single host over a relatively short time frame.


April 21, 2020

Genome sequencing and comparison of five Tilletia species to identify candidate genes for the detection of regulated species infecting wheat

Tilletia species cause diseases on grass hosts with some causing bunt diseases on wheat (Triticum). Two of the four species infecting wheat have restricted distributions globally and are subject to quarantine regulations to prevent their spread to new areas. Tilletia indica causes Karnal bunt and is regulated by many countries while the non-regulated T. walkeri is morphologically similar and very closely related phylogenetically, but infects ryegrass (Lolium) and not wheat. Tilletia controversa causes dwarf bunt of wheat (DB) and is also regulated by some countries, while the closely related but non-regulated species, T. caries and T. laevis, both cause common bunt of wheat (CB). Historically, diagnostic methods have relied on cryptic morphology to differentiate these species in subsamples from grain shipments. Of the DNA-based methods published so far, most have focused on sequence variation among tested strains at a single gene locus. To facilitate the development of additional molecular assays for diagnostics, we generated whole genome data for multiple strains of the two regulated wheat pathogens and their closest relatives. Depending on the species, the genomes were assembled into 907 to 4633 scaffolds ranging from 24?Mb to 30?Mb with 7842 to 9952 gene models predicted. Phylogenomic analyses confirmed the placement of Tilletia in the Exobasidiomycetes and showed that T. indica and T. walkeri were in one clade whereas T. controversa, T. caries and T. laevis grouped in a separate clade. Single copy and species-specific genes were identified by orthologous group analysis. Unique species-specific genes were identified and evaluated as suitable markers to differentiate the quarantine and non-quarantine species. After further analyses and manual inspection, primers and probes for the optimum candidate genes were designed and tested in silico, for validation in future wet-lab studies.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.