Menu
September 22, 2019

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Coenurosis, caused by the larval coenurus of the tapeworm Taenia multiceps, is a fatal central nervous system disease in both sheep and humans. Though treatment and prevention options are available, the control of coenurosis still faces presents great challenges. Here, we present a high-quality genome sequence of T. multiceps in which 240 Mb (96%) of the genome has been successfully assembled using Pacbio single-molecule real-time (SMRT) and Hi-C data with a N50 length of 44.8 Mb. In total, 49.5 Mb (20.6%) repeat sequences and 13, 013 gene models were identified. We found that Taenia spp. have an expansion of transposable elements and recent small-scale gene duplications following the divergence of Taenia from Echinococcus, but not in Echinococcus genomes, and the genes underlying environmental adaptability and dosage effect tend to be over-retained in the T. multiceps genome. Moreover, we identified several genes encoding proteins involved in proglottid formation and interactions with the host central nervous system, which may contribute to the adaption of T. multiceps to its parasitic life style. Our study not only provides insights into the biology and evolution of T. multiceps, but also identifies a set of species-specific gene targets for developing novel treatment and control tools for coenurosis.


September 22, 2019

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi displaying highly diverse clinical outcomes. In this sense, the genome sequence elucidation and comparison between strains may lead to disease understanding. Here, two new T. cruzi strains, have been sequenced, Y using Illumina and Bug2148 using PacBio, assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly stats from the new sequences show effective improvement of T. cruzi genome over the actual ones. Such as, the largest contig assembled (1.3?Mb in Bug2148) in de novo attempts and the highest mean assembly coverage (71X for Y). Our analysis reveals a new genomic expansion and greater complexity for those multi-copy gene families related to infection process and disease development, such as Trans-sialidases, Mucins and Mucin Associated Surface Proteins, among others. On one side, we demonstrate that multi-copy gene families are located near telomeric regions of the “chromosome-like” 1.3?Mb contig assembled of Bug2148, where they likely suffer high evolutive pressure. On the other hand, we identified several strain-specific single copy genes that might help to understand the differences in infectivity and physiology among strains. In summary, our results indicate that T. cruzi has a complex genomic architecture that may have promoted its evolution.


September 22, 2019

Antiviral adaptive immunity and tolerance in the mosquito Aedes aegyti

Mosquitoes spread pathogenic arboviruses while themselves tolerate infection. We here characterize an immunity pathway providing long-term antiviral protection and define how this pathway discriminates between self and non-self. Mosquitoes use viral RNAs to create viral derived cDNAs (vDNAs) central to the antiviral response. vDNA molecules are acquired through a process of reverse-transcription and recombination directed by endogenous retrotransposons. These vDNAs are thought to integrate in the host genome as endogenous viral elements (EVEs). Sequencing of pre-integrated vDNA revealed that the acquisition process exquisitely distinguishes viral from host RNA, providing one layer of self-nonself discrimination. Importantly, we show EVE-derived piRNAs have antiviral activity and are loaded onto Piwi4 to inhibit virus replication. In a second layer of self-non-self discrimination, Piwi4 preferentially loads EVE-derived piRNAs, discriminating against transposon-targeting piRNAs. Our findings define a fundamental virus-specific immunity pathway in mosquitoes that uses EVEs as a potent and specific antiviral transgenerational mechanism.


September 22, 2019

Comparative analyses of CTX prophage region of Vibrio cholerae seventh pandemic wave 1 strains isolated in Asia.

Vibrio cholerae O1 causes cholera, and cholera toxin, the principal mediator of massive diarrhea, is encoded by ctxAB in the cholera toxin (CTX) prophage. In this study, the structures of the CTX prophage region of V. cholerae strains isolated during the seventh pandemic wave 1 in Asian countries were determined and compared. Eighteen strains were categorized into eight groups by CTX prophage region-specific restriction fragment length polymorphism and PCR profiles and the structure of the region of a representative strain from each group was determined by DNA sequencing. Eight representative strains revealed eight distinct CTX prophage regions with various combinations of CTX-1, RS1 and a novel genomic island on chromosome I. CTX prophage regions carried by the wave 1 strains were diverse in structure. V. cholerae strains with an area specific CTX prophage region are believed to circulate in South-East Asian countries; additionally, multiple strains with distinct types of CTX prophage region are co-circulating in the area. Analysis of a phylogenetic tree generated by single nucleotide polymorphism differences across 2483 core genes revealed that V. cholerae strains categorized in the same group based on CTX prophage region structure were segregated in closer clusters. CTX prophage region-specific recombination events or gain and loss of genomic elements within the region may have occurred at much higher frequencies and contributed to producing a panel of CTX prophage regions with distinct structures among V. cholerae pathogenic strains in lineages with close genetic backgrounds in the early wave 1 period of the seventh cholera pandemic.© 2018 The Authors. Microbiology and Immunology published by The Societies and John Wiley & Sons Australia, Ltd.


September 22, 2019

Comparative genomics of Czech vaccine strains of Bordetella pertussis.

Bordetella pertussis is a strictly human pathogen causing the respiratory infectious disease called whooping cough or pertussis. B. pertussis adaptation to acellular pertussis vaccine pressure has been repeatedly highlighted, but recent data indicate that adaptation of circulating strains started already in the era of the whole cell pertussis vaccine (wP) use. We sequenced the genomes of five B. pertussis wP vaccine strains isolated in the former Czechoslovakia in the pre-wP (1954-1957) and early wP (1958-1965) eras, when only limited population travel into and out of the country was possible. Four isolates exhibit a similar genome organization and form a distinct phylogenetic cluster with a geographic signature. The fifth strain is rather distinct, both in genome organization and SNP-based phylogeny. Surprisingly, despite isolation of this strain before 1966, its closest sequenced relative appears to be a recent isolate from the US. On the genome content level, the five vaccine strains contained both new and already described regions of difference. One of the new regions contains duplicated genes potentially associated with transport across the membrane. The prevalence of this region in recent isolates indicates that its spread might be associated with selective advantage leading to increased strain fitness.


September 22, 2019

The sequence of a male-specific genome region containing the sex determination switch in Aedes aegypti.

Aedes aegypti is the principal vector of several important arboviruses. Among the methods of vector control to limit transmission of disease are genetic strategies that involve the release of sterile or genetically modified non-biting males, which has generated interest in manipulating mosquito sex ratios. Sex determination in Ae. aegypti is controlled by a non-recombining Y chromosome-like region called the M locus, yet characterisation of this locus has been thwarted by the repetitive nature of the genome. In 2015, an M locus gene named Nix was identified that displays the qualities of a sex determination switch.With the use of a whole-genome bacterial artificial chromosome (BAC) library, we amplified and sequenced a ~200 kb region containing the male-determining gene Nix. In this study, we show that Nix is comprised of two exons separated by a 99 kb intron primarily composed of repetitive DNA, especially transposable elements.Nix, an unusually large and highly repetitive gene, exhibits features in common with Y chromosome genes in other organisms. We speculate that the lack of recombination at the M locus has allowed the expansion of repeats in a manner characteristic of a sex-limited chromosome, in accordance with proposed models of sex chromosome evolution in insects.


September 22, 2019

Whole genome sequencing for investigations of meningococcal outbreaks in the United States: a retrospective analysis.

Although rare in the U.S., outbreaks due to Neisseria meningitidis do occur. Rapid, early outbreak detection is important for timely public health response. In this study, we characterized U.S. meningococcal isolates (N?=?201) from 15 epidemiologically defined outbreaks (2009-2015) along with temporally and geographically matched sporadic isolates using multilocus sequence typing, pulsed-field gel electrophoresis (PFGE), and six whole genome sequencing (WGS) based methods. Recombination-corrected maximum likelihood (ML) and Bayesian phylogenies were reconstructed to identify genetically related outbreak isolates. All WGS analysis methods showed high degree of agreement and distinguished isolates with similar or indistinguishable PFGE patterns, or the same strain genotype. Ten outbreaks were caused by a single strain; 5 were due to multiple strains. Five sporadic isolates were phylogenetically related to 2 outbreaks. Analysis of 9 outbreaks using timed phylogenies identified the possible origin and estimated the approximate time that the most recent common ancestor emerged for outbreaks analyzed. U.S. meningococcal outbreaks were caused by single- or multiple-strain introduction, with organizational outbreaks mainly caused by a clonal strain and community outbreaks by divergent strains. WGS can infer linkage of meningococcal cases when epidemiological links are uncertain. Accurate identification of outbreak-associated cases requires both WGS typing and epidemiological data.


September 22, 2019

Multi-population genomic analysis of malaria parasites indicates local selection and differentiation at the gdv1 locus regulating sexual development.

Parasites infect hosts in widely varying environments, encountering diverse challenges for adaptation. To identify malaria parasite genes under locally divergent selection across a large endemic region with a wide spectrum of transmission intensity, genome sequences were obtained from 284 clinical Plasmodium falciparum infections from four newly sampled locations in Senegal, The Gambia, Mali and Guinea. Combining these with previous data from seven other sites in West Africa enabled a multi-population analysis to identify discrete loci under varying local selection. A genome-wide scan showed the most exceptional geographical divergence to be at the early gametocyte gene locus gdv1 which is essential for parasite sexual development and transmission. We identified a major structural dimorphism with alternative 1.5?kb and 1.0?kb sequence deletions at different positions of the 3′-intergenic region, in tight linkage disequilibrium with the most highly differentiated single nucleotide polymorphism, one of the alleles being very frequent in Senegal and The Gambia but rare in the other locations. Long non-coding RNA transcripts were previously shown to include the entire antisense of the gdv1 coding sequence and the portion of the intergenic region with allelic deletions, suggesting adaptive regulation of parasite sexual development and transmission in response to local conditions.


September 22, 2019

Whole genome sequencing and microsatellite analysis of the Plasmodium falciparum E5 NF54 strain show that the var, rifin and stevor gene families follow Mendelian inheritance.

Plasmodium falciparum exhibits a high degree of inter-isolate genetic diversity in its variant surface antigen (VSA) families: P. falciparum erythrocyte membrane protein 1, repetitive interspersed family (RIFIN) and subtelomeric variable open reading frame (STEVOR). The role of recombination for the generation of this diversity is a subject of ongoing research. Here the genome of E5, a sibling of the 3D7 genome strain is presented. Short and long read whole genome sequencing (WGS) techniques (Ilumina, Pacific Bioscience) and a set of 84 microsatellites (MS) were employed to characterize the 3D7 and non-3D7 parts of the E5 genome. This is the first time that VSA genes in sibling parasites were analysed with long read sequencing technology.Of the 5733 E5 genes only 278 genes, mostly var and rifin/stevor genes, had no orthologues in the 3D7 genome. WGS and MS analysis revealed that chromosomal crossovers occurred at a rate of 0-3 per chromosome. var, stevor and rifin genes were inherited within the respective non-3D7 or 3D7 chromosomal context. 54 of the 84 MS PCR fragments correctly identified the respective MS as 3D7- or non-3D7 and this correlated with var and rifin/stevor gene inheritance in the adjacent chromosomal regions. E5 had 61 var and 189 rifin/stevor genes. One large non-chromosomal recombination event resulted in a new var gene on chromosome 14. The remainder of the E5 3D7-type subtelomeric and central regions were identical to 3D7.The data show that the rifin/stevor and var gene families represent the most diverse compartments of the P. falciparum genome but that the majority of var genes are inherited without alterations within their respective parental chromosomal context. Furthermore, MS genotyping with 54 MS can successfully distinguish between two sibling progeny of a natural P. falciparum cross and thus can be used to investigate identity by descent in field isolates.


September 22, 2019

Complete genomic analysis of a kingdom crossing Klebsiella variicola isolate.

Bacterial isolate X39 was isolated from a community-acquired pneumonia patient in Beijing, China. A phylogenetic tree based on rpoB genes and average nucleotide identity data confirmed that isolate X39 belonged to Klebsiella variicola. The genome of K. variicola X39 contained one circular chromosome and nine plasmids. Comparative genomic analyses with other K. variicola isolates revealed that K. variicola X39 contained the most unique genes. Of these unique genes, many were prophages and transposases. Many virulence factors were shared between K. variicola X39 and Klebsiella pneumoniae F1. The pathogenicity of K. variicola X39 was compared with that of K. pneumoniae F1 in an abdominal infection model. The results indicated that K. variicola X39 was less virulent than typical clinical K. pneumoniae F1. The genome of K. variicola X39 also contained some genes involved in plant colonization, nitrogen fixation, and defense against oxidative stress. GFP-labeled K. variicola X39 could colonize maize as an endophytic bacterium. We concluded that K. variicola X39 was a kingdom-crossing strain.


September 22, 2019

Type II restriction modification system in Ureaplasma parvum OMC-P162 strain.

Ureaplasma parvum serovar 3 strain, OMC-P162, was isolated from the human placenta of a preterm delivery at 26 weeks’ gestation. In this study, we sequenced the complete genome of OMC-P162 and compared it with other serovar 3 strains isolated from patients with different clinical conditions. Ten unique genes in OMC-P162, five of which encoded for hypothetical proteins, were identified. Of these, genes UPV_229 and UPV_230 formed an operon whose open reading frames were predicted to code for a DNA methyltransferase and a hypothetical protein, respectively. DNA modification analysis of the OMC-P162 genome identified N4-methylcytosine (m4C) and N6-methyladenine (m6A), but not 5-methylocytosine (m5C). UPV230 recombinant protein displayed endonuclease activity and recognized the CATG sequence, resulting in a blunt cut between A and T. This restriction enzyme activity was identical to that of the cultivated OMC-P162 strain, suggesting that this restriction enzyme was naturally expressed in OMC-P162. We designated this enzyme as UpaP162. Treatment of pT7Blue plasmid with recombinant protein UPV229 completely blocked UpaP162 restriction enzyme activity. These results suggest that the UPV_229 and UPV_230 genes act as a type II restriction-modification system in Ureaplasma OMC-P162.


September 22, 2019

Complete genome sequence and characterization of linezolid-resistant Enterococcus faecalis clinical isolate KUB3006 carrying a cfr(B)-transposon on its chromosome and optrA-plasmid.

Linezolid (LZD) has become one of the most important antimicrobial agents for infections caused by gram-positive bacteria, including those caused by Enterococcus species. LZD-resistant (LR) genetic features include mutations in 23S rRNA/ribosomal proteins, a plasmid-borne 23S rRNA methyltransferase gene cfr, and ribosomal protection genes (optrA and poxtA). Recently, a cfr gene variant, cfr(B), was identified in a Tn6218-like transposon (Tn) in a Clostridioides difficile isolate. Here, we isolated an LR Enterococcus faecalis clinical isolate, KUB3006, from a urine specimen of a patient with urinary tract infection during hospitalization in 2017. Comparative and whole-genome analyses were performed to characterize the genetic features and overall antimicrobial resistance genes in E. faecalis isolate KUB3006. Complete genome sequencing of KUB3006 revealed that it carried cfr(B) on a chromosomal Tn6218-like element. Surprisingly, this Tn6218-like element was almost (99%) identical to that of C. difficile Ox3196, which was isolated from a human in the UK in 2012, and to that of Enterococcus faecium 5_Efcm_HA-NL, which was isolated from a human in the Netherlands in 2012. An additional oxazolidinone and phenicol resistance gene, optrA, was also identified on a plasmid. KUB3006 is sequence type (ST) 729, suggesting that it is a minor ST that has not been reported previously and is unlikely to be a high-risk E. faecalis lineage. In summary, LR E. faecalis KUB3006 possesses a notable Tn6218-like-borne cfr(B) and a plasmid-borne optrA. This finding raises further concerns regarding the potential declining effectiveness of LZD treatment in the future.


September 22, 2019

Loss of bacitracin resistance due to a large genomic deletion among Bacillus anthracis strains.

Bacillus anthracis is a Gram-positive endospore-forming bacterial species that causes anthrax in both humans and animals. In Zambia, anthrax cases are frequently reported in both livestock and wildlife, with occasional transmission to humans, causing serious public health problems in the country. To understand the genetic diversity of B. anthracis strains in Zambia, we sequenced and compared the genomic DNA of B. anthracis strains isolated across the country. Single nucleotide polymorphisms clustered these strains into three groups. Genome sequence comparisons revealed a large deletion in strains belonging to one of the groups, possibly due to unequal crossing over between a pair of rRNA operons. The deleted genomic region included genes conferring resistance to bacitracin, and the strains with the deletion were confirmed with loss of bacitracin resistance. Similar deletions between rRNA operons were also observed in a few B. anthracis strains phylogenetically distant from Zambian strains. The structure of bacitracin resistance genes flanked by rRNA operons was conserved only in members of the Bacillus cereus group. The diversity and genomic characteristics of B. anthracis strains determined in this study would help in the development of genetic markers and treatment of anthrax in Zambia. IMPORTANCE Anthrax is caused by Bacillus anthracis, an endospore-forming soil bacterium. The genetic diversity of B. anthracis is known to be low compared with that of Bacillus species. In this study, we performed whole-genome sequencing of Zambian isolates of B. anthracis to understand the genetic diversity between closely related strains. Comparison of genomic sequences revealed that closely related strains were separated into three groups based on single nucleotide polymorphisms distributed throughout the genome. A large genomic deletion was detected in the region containing a bacitracin resistance gene cluster flanked by rRNA operons, resulting in the loss of bacitracin resistance. The structure of the deleted region, which was also conserved among species of the Bacillus cereus group, has the potential for both deletion and amplification and thus might be enabling the species to flexibly control the level of bacitracin resistance for adaptive evolution.


September 22, 2019

Characterization and genomic analyses of Pseudomonas aeruginosa podovirus TC6: establishment of genus Pa11virus.

Phages have attracted a renewed interest as alternative to chemical antibiotics. Although the number of phages is 10-fold higher than that of bacteria, the number of genomically characterized phages is far less than that of bacteria. In this study, phage TC6, a novel lytic virus of Pseudomonas aeruginosa, was isolated and characterized. TC6 consists of an icosahedral head with a diameter of approximately 54 nm and a short tail with a length of about 17 nm, which are characteristics of the family Podoviridae. TC6 can lyse 86 out of 233 clinically isolated P. aeruginosa strains, thus showing application potentials for phage therapy. The linear double-stranded genomic DNA of TC6 consisted of 49796 base pairs and was predicted to contain 71 protein-coding genes. A total of 11 TC6 structural proteins were identified by mass spectrometry. Comparative analysis revealed that the P. aeruginosa phages TC6, O4, PA11, and IME180 shared high similarity at DNA sequence and proteome levels, among which PA11 was the first phage discovered and published. Meanwhile, these phages contain 54 core genes and have very close phylogenetic relationships, which distinguish them from other known phage genera. We therefore proposed that these four phages can be classified as Pa11virus, comprising a new phage genus of Podoviridae that infects Pseudomonas spp. The results of this work promoted our understanding of phage biology, classification, and diversity.


September 22, 2019

SKA: Split Kmer Analysis Toolkit for Bacterial Genomic Epidemiology

Genome sequencing is revolutionising infectious disease epidemiology, providing a huge step forward in sensitivity and specificity over more traditional molecular typing techniques. However, the complexity of genome data often means that its analysis and interpretation requires high-performance compute infrastructure and dedicated bioinformatics support. Furthermore, current methods have limitations that can differ between analyses and are often opaque to the user, and their reliance on multiple external dependencies makes reproducibility difficult. Here I introduce SKA, a toolkit for analysis of genome sequence data from closely-related, small, haploid genomes. SKA uses split kmers to rapidly identify variation between genome sequences, making it possible to analyse hundreds of genomes on a standard home computer. Tests on publicly available simulated and real-life data show that SKA is both faster and more efficient than the gold standard methods used today while retaining similar levels of accuracy for epidemiological purposes. SKA can take raw read data or genome assemblies as input and calculate pairwise distances, create single linkage clusters and align genomes to a reference genome or using a reference-free approach. SKA requires few decisions to be made by the user, which, along with its computational efficiency, allows genome analysis to become accessible to those with only basic bioinformatics training. The limitations of SKA are also far more transparent than for current approaches, and future improvements to mitigate these limitations are possible. Overall, SKA is a powerful addition to the armoury of the genomic epidemiologist. SKA source code is available from Github (https://github.com/simonrharris/SKA).


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.