September 22, 2019

Computational tools to unmask transposable elements.

A substantial proportion of the genome of many species is derived from transposable elements (TEs). Moreover, through various self-copying mechanisms, TEs continue to proliferate in the genomes of most species. TEs have contributed numerous regulatory, transcript and protein innovations and have also been linked to disease. However, notwithstanding their demonstrated impact, many genomic studies still exclude them because their repetitive nature results in various analytical complexities. Fortunately, a growing array of methods and software tools are being developed to cater for them. This Review presents a summary of computational resources for TEs and highlights some of the challenges and remaining gaps to perform comprehensive genomic analyses that do not simply ‘mask’ repeats.


September 22, 2019

Genomic characterization reveals significant divergence within Chlorella sorokiniana (Chlorellales, Trebouxiophyceae)

Selection of highly productive algal strains is crucial for establishing economically viable biomass and bioproduct cultivation systems. Characterization of algal genomes, including understanding strain-specific differences in genome content and architecture, is a critical step in this process. Using genomic analyses, we demonstrate significant differences between three strains of Chlorella sorokiniana (strain 1228, UTEX 1230, and DOE1412). We found that unique, strain-specific genes comprise a substantial proportion of each genome, and genomic regions with >80% local nucleotide identity constitute <15% of each genome among the strains, indicating substantial strain-specific evolution. Furthermore, cataloging of meiosis and other sex-related genes in C. sorokiniana strains suggests strategic breeding could be utilized to improve biomass and bioproduct yields if a sexual cycle can be characterized. Finally, preliminary investigation of epigenetic machinery suggests the presence of potentially unique transcriptional regulation in each strain. Our data demonstrate that these three C. sorokiniana strains represent significantly different genomic content. Based on these findings, we propose individualized assessment of each strain for potential performance in cultivation systems.


September 22, 2019

TranSurVeyor: an improved database-free algorithm for finding non-reference transpositions in high-throughput sequencing data.

Transpositions transfer DNA segments between different loci within a genome; in particular, when a transposition is found in a sample but not in a reference genome, it is called a non-reference transposition. They are important structural variations that have clinical impact. Transpositions can be called by analyzing second-generation high-throughput sequencing datasets. Current methods follow either a database-based or a database-free approach. Database-based methods require a database of transposable elements. Some of them have good specificity; however, this approach cannot detect novel transpositions, and it requires a good database of transposable elements, which is not yet available for many species. Database-free methods perform de novo calling of transpositions, but their accuracy is low. We observe that this is due to the misalignment of the reads; since reads are short and the human genome has many repeats, false alignments create false positive predictions while missing alignments reduce the true positive rate. This paper proposes new techniques to improve database-free non-reference transposition calling: first, we propose a realignment strategy called one-end remapping that corrects the alignments of reads in interspersed repeats; second, we propose an SNV-aware filter that removes some incorrectly aligned reads. By combining these two techniques with other techniques, such as clustering and a positive-to-negative ratio filter, our proposed transposition caller TranSurVeyor shows at least 3.1-fold improvement in terms of F1-score over existing database-free methods. More importantly, even though TranSurVeyor does not use databases of prior information, its performance is at least as good as existing database-based methods such as MELT, Mobster and Retroseq. We also illustrate that TranSurVeyor can discover transpositions that are not known in the current database.
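
The F1-score used to compare callers above is the harmonic mean of precision and recall, so both false positive predictions (from false alignments) and missed calls (from missing alignments) lower it. The following is a minimal sketch of that metric only, not of TranSurVeyor itself; the function name `f1_score` and the example counts are hypothetical and do not come from the paper.

```python
def f1_score(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Return the F1-score (harmonic mean of precision and recall).

    Illustrative helper, not part of any published tool.
    Returns 0.0 when there are no true positives, where precision
    and recall are either zero or undefined.
    """
    if true_pos == 0:
        return 0.0
    precision = true_pos / (true_pos + false_pos)  # lowered by false calls
    recall = true_pos / (true_pos + false_neg)     # lowered by missed calls
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical counts for two callers evaluated on the same truth set.
    print(f1_score(true_pos=80, false_pos=20, false_neg=20))
    print(f1_score(true_pos=20, false_pos=60, false_neg=80))
```

Because the harmonic mean is dominated by the smaller of its two inputs, a caller cannot reach a high F1-score by improving precision alone or recall alone.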


September 22, 2019

Plasmid and chromosomal integration of four novel blaIMP-carrying transposons from Pseudomonas aeruginosa, Klebsiella pneumoniae and an Enterobacter sp.

To provide detailed genetic characterization of four novel blaIMP-carrying transposons from Pseudomonas aeruginosa, Klebsiella pneumoniae and an Enterobacter sp. P. aeruginosa 60512, K. pneumoniae 447, P. aeruginosa 12939 and Enterobacter sp. A1137 were subjected to genome sequencing. The complete nucleotide sequences of two plasmids (p60512-IMP from the 60512 isolate and p447-IMP from the 447 isolate) and two chromosomes (the 12939 and A1137 isolates) were determined, then a genomic comparison of p60512-IMP, p447-IMP and four novel blaIMP-carrying transposons (Tn6394, Tn6375, Tn6411 and Tn6397) with related sequences was performed. Transferability of the blaIMP gene and bacterial antimicrobial susceptibility were tested. Tn6394 and Tn6375 were located in p60512-IMP and p447-IMP, respectively, while Tn6411 and Tn6397 were integrated into the 12939 and A1137 chromosomes, respectively. Tn6394 was an ISPa17-based transposition unit that harboured the integron In992 (carrying blaIMP-1). In73 (carrying blaIMP-8), In73 and In992, together with the ISEcp1:IS1R-blaCTX-M-14-IS903D unit, the macAB-tolC region and the truncated aacC2-tmrB region, respectively, were integrated into the prototype transposons Tn1722, Tn1696 and Tn7, respectively, generating the Tn3-family unit transposons, Tn6375 and Tn6378, and the Tn7-family unit transposon Tn6411, respectively. Tn6397 was a large integrative and conjugative element carrying Tn6378. Complex events of transposition and homologous recombination have occurred during the original formation and further plasmid and chromosomal integration of these four transposons, promoting accumulation and spread of antimicrobial resistance genes.


September 22, 2019

Diversity of DHA-1-encoding plasmids in Klebsiella pneumoniae isolates from 16 French hospitals.

To provide new insights into the spread of plasmidic cephalosporinase DHA-1, 16 strains of Klebsiella pneumoniae and a strain of Klebsiella variicola producing DHA-1 were isolated between January 2012 and December 2013 in six regions of France and two French overseas departments and territories. Disc diffusion assays, isoelectric focusing and PCRs were used to characterize the plasmidic DHA-1 β-lactamase. Plasmid analysis was performed by the method of Kado and Liu and WGS. Virulence of the strains was studied by biofilm formation and the survival of Drosophila. The strains were of low virulence and had one to three plasmids including one of various sizes (~40 to 319 kb) mediating DHA-1. Nine strains belonged to ST11 and possessed a pKPS30-type DHA-1 plasmid of the IncR (incompatibility) group. A strain of ST307 possessed pENVA, a DHA-1 plasmid of the IncH-type group. The seven remaining plasmids were unknown. Three belonged to the IncL/M group. They were closely related and their sequences were determined. One of the four remaining strains was chosen for further investigation. This strain of ST16 had two plasmids, a pUUH239.2-related plasmid and a new DHA-1 plasmid of ~319 kb of IncHI2 type. These findings demonstrate the major role of the pKPS30-type plasmid in the spread of DHA-1 cephalosporinase in France and provide evidence of two new emerging plasmids carrying this enzyme.


September 22, 2019

Molecular epidemiology of isolates with multiple mcr plasmids from a pig farm in Great Britain: the effects of colistin withdrawal in the short and long term.

The environment, including farms, might act as a reservoir for mobile colistin resistance (mcr) genes, which has led to calls for reduction of usage in livestock of colistin, an antibiotic of last resort for humans. To establish the molecular epidemiology of mcr Enterobacteriaceae from faeces of two cohorts of pigs, where one group had initially been treated with colistin and the other not, over a 5 month period following stoppage of colistin usage on a farm in Great Britain; faecal samples were also taken at ~20 months. mcr-1 Enterobacteriaceae were isolated from positive faeces and WGS was performed; conjugation was performed on selected Escherichia coli and colistin MICs were determined. E. coli of diverse ST harbouring mcr-1 and multiple resistance genes were isolated over 5 months from both cohorts. Two STs, from treated cohorts, contained both mcr-1 and mcr-3 plasmids, with some isolates also harbouring multiple copies of mcr-1 on different plasmids. The mcr-1 plasmids grouped into four Inc types (X4, pO111, I2 and HI2), with mcr-3 found in IncP. Multiple copies of mcr plasmids did not have a noticeable effect on colistin MIC, but they could be transferred simultaneously to a Salmonella host in vitro. Neither mcr-1 nor mcr-3 was detected in samples collected ~20 months after colistin cessation. We report for the first known time on the presence in Great Britain of mcr-3 from MDR Enterobacteriaceae, which might concurrently harbour multiple copies of mcr-1 on different plasmids. However, control measures, including stoppage of colistin, can successfully mitigate long-term on-farm persistence.


September 22, 2019

An IncX1 plasmid isolated from Salmonella enterica subsp. enterica serovar Pullorum carrying blaTEM-1B, sul2, arsenic resistant operons.

We have identified an IncX1 plasmid named pQJDSal1 from Salmonella enterica subsp. enterica serovar Pullorum (S. Pullorum). The plasmid is 67,685 bp in size and has 72 putative genes. pQJDSal1 harbors a conserved IncX1-type backbone with predicted regions for conjugation, replication and partitioning, as well as a toxin/antitoxin plasmid addiction system. Two regions (A and B) that have not been previously reported in IncX1 plasmids are inserted into the backbone. Region A (10.7 kb), inserted between parA and taxD, consists of a new Tn6168-like transposon containing an arsenic resistant operon arsB2CHR and sulfonamide resistance gene sul2. Region B contains another arsenic resistant operon arsADHR, resistance gene blaTEM-1B and three transposable elements. Conjugation experiments showed that pQJDSal1 could transfer from S. Pullorum to Escherichia coli (E. coli) J53. Statistical analysis of 70 sequenced IncX1 plasmids revealed that IncX1 plasmids harbored various antibiotic resistance genes. The results highlight the importance of IncX1 plasmids in disseminating antibiotic resistance genes. Copyright © 2018. Published by Elsevier Inc.


September 22, 2019

The genomic architecture and molecular evolution of ant odorant receptors.

The massive expansions of odorant receptor (OR) genes in ant genomes are notable examples of rapid genome evolution and adaptive gene duplication. However, the molecular mechanisms leading to gene family expansion remain poorly understood, partly because available ant genomes are fragmentary. Here, we present a highly contiguous, chromosome-level assembly of the clonal raider ant genome, revealing the largest known OR repertoire in an insect. While most ant ORs originate via local tandem duplication, we also observe several cases of dispersed duplication followed by tandem duplication in the most rapidly evolving OR clades. We found that areas of unusually high transposable element density (TE islands) were depauperate in ORs in the clonal raider ant, and found no evidence for retrotransposition of ORs. However, OR loci were enriched for transposons relative to the genome as a whole, potentially facilitating tandem duplication by unequal crossing over. We also found that ant OR genes are highly AT-rich compared to other genes. In contrast, in flies, OR genes are dispersed and largely isolated within the genome, and we find that fly ORs are not AT-rich. The genomic architecture and composition of ant ORs thus show convergence with the unrelated vertebrate ORs rather than the related fly ORs. This might be related to the greater gene numbers and/or potential similarities in gene regulation between ants and vertebrates as compared to flies. © 2018 McKenzie and Kronauer; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Functional metagenomics identifies an exosialidase with an inverting catalytic mechanism that defines a new glycoside hydrolase family (GH156).

Exosialidases are glycoside hydrolases that remove a single terminal sialic acid residue from oligosaccharides. They are widely distributed in biology, having been found in prokaryotes, eukaryotes, and certain viruses. Most characterized prokaryotic sialidases are from organisms that are pathogenic or commensal with mammals. However, in this study, we used functional metagenomic screening to seek microbial sialidases encoded by environmental DNA isolated from an extreme ecological niche, a thermal spring. Using recombinant expression of potential exosialidase candidates and a fluorogenic sialidase substrate, we discovered an exosialidase having no homology to known sialidases. Phylogenetic analysis indicated that this protein is a member of a small family of bacterial proteins of previously unknown function. Proton NMR revealed that this enzyme functions via an inverting catalytic mechanism, a biochemical property that is distinct from those of known exosialidases. This unique inverting exosialidase defines a new CAZy glycoside hydrolase family we have designated GH156. © 2018 Chuzel et al.


September 22, 2019

Comparative genomic and methylome analysis of non-virulent D74 and virulent Nagasaki Haemophilus parasuis isolates.

Haemophilus parasuis is a respiratory pathogen of swine and the etiological agent of Glässer’s disease. H. parasuis isolates can exhibit different virulence capabilities ranging from lethal systemic disease to subclinical carriage. To identify genomic differences between phenotypically distinct strains, we obtained the closed whole-genome sequence annotation and genome-wide methylation patterns for the highly virulent Nagasaki strain and for the non-virulent D74 strain. Evaluation of the virulence-associated genes contained within the genomes of D74 and Nagasaki led to the discovery of a large number of toxin-antitoxin (TA) systems within both genomes. Five predicted hemolysins were identified as unique to Nagasaki and seven putative contact-dependent growth inhibition toxin proteins were identified only in strain D74. Assessment of all potential vtaA genes revealed thirteen present in the Nagasaki genome and three in the D74 genome. Subsequent evaluation of the predicted protein structure revealed that none of the D74 VtaA proteins contain a collagen triple helix repeat domain. Additionally, the predicted protein sequence for two D74 VtaA proteins is substantially longer than that of any predicted Nagasaki VtaA protein. Fifteen methylation sequence motifs were identified in D74 and fourteen methylation sequence motifs were identified in Nagasaki using SMRT sequencing analysis. Only one of the methylation sequence motifs was observed in both strains, indicative of the diversity between D74 and Nagasaki. Subsequent analysis also revealed diversity in the restriction-modification systems harbored by D74 and Nagasaki. The collective information reported in this study will aid in the development of vaccines and intervention strategies to decrease the prevalence and disease burden caused by H. parasuis.


September 22, 2019

Characterization of Streptococcus pluranimalium from a cattle with mastitis by whole genome sequencing and functional validation.

Streptococcus pluranimalium is a new member of the Streptococcus genus isolated from multiple different animal hosts. It has been identified as a pathogen associated with subclinical mastitis, valvular endocarditis and septicaemia in animals. Moreover, this bacterium has emerged as a new pathogen for human infective endocarditis and brain abscess. However, the patho-biological properties of S. pluranimalium remain virtually unknown. The aim of this study was to determine the complete genome sequence of S. pluranimalium strain TH11417 isolated from a cattle with mastitis, and to characterize its antimicrobial resistance, virulence, and carbon catabolism. The genome of S. pluranimalium TH11417, determined by single-molecule real-time (SMRT) sequencing, consists of 2,065,522 base pairs (bp) with a G + C content of 38.65%, 2,007 predicted coding sequences (CDS), 58 transfer RNA (tRNA) genes and five ribosomal RNA (rRNA) operons. It contains a novel ISSpl1 element (a member of the IS3 family) and a Φ11417.1 prophage that carries the mef(A), msr(D) and lnu(C) genes. Consistently, our antimicrobial susceptibility test confirmed that S. pluranimalium TH11417 was resistant to erythromycin and lincomycin. However, this strain did not show virulence in murine pneumonia (intranasal inoculation, 10^7 colony forming units, CFU) and sepsis (intraperitoneal inoculation, 10^7 CFU) models. Additionally, this strain is able to grow with glucose, lactose or galactose as the sole carbon source, and possesses a lactose-specific phosphoenolpyruvate-dependent phosphotransferase system (PTS). We reported the first whole genome sequence of S. pluranimalium isolated from a cattle with mastitis. It harbors a prophage carrying the mef(A), msr(D) and lnu(C) genes, and is avirulent in the murine infection model.
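
The G + C content reported above is the percentage of bases in the assembled sequence that are guanine or cytosine. As a rough sketch of how such a figure can be computed (the function name `gc_content` and the toy sequence are hypothetical, and the choice to exclude ambiguous bases such as N from the denominator is an assumption, not something stated in the abstract):

```python
def gc_content(sequence: str) -> float:
    """Return the G + C content of a DNA sequence as a percentage.

    Illustrative helper. Only unambiguous bases (A, C, G, T) are
    counted in the denominator; this handling of ambiguous bases
    is an assumption of the sketch.
    """
    seq = sequence.upper()
    counts = {base: seq.count(base) for base in "ACGT"}
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * (counts["G"] + counts["C"]) / total


if __name__ == "__main__":
    # Toy sequence, not taken from the TH11417 genome.
    print(f"{gc_content('ATGCGCATTA'):.2f}%")
```

For a real genome the same calculation would be run over the full assembly, typically read from a FASTA file rather than a string literal.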


September 22, 2019

Phenotypic and genomic comparison of Photorhabdus luminescens subsp. laumondii TT01 and a widely used rifampicin-resistant Photorhabdus luminescens laboratory strain.

Photorhabdus luminescens is an enteric bacterium, which lives in mutualistic association with soil nematodes and is highly pathogenic for a broad spectrum of insects. A complete genome sequence for the type strain P. luminescens subsp. laumondii TT01, which was originally isolated in Trinidad and Tobago, has been described earlier. Subsequently, a rifampicin-resistant P. luminescens strain has been generated with superior possibilities for experimental characterization. This strain, which is widely used in research, was described as a spontaneous rifampicin-resistant mutant of TT01 and is known as TT01-RifR. Unexpectedly, upon phenotypic comparison between the rifampicin-resistant strain and its presumed parent TT01, major differences were found with respect to bioluminescence, pigmentation, biofilm formation, haemolysis as well as growth. Therefore, we renamed the strain TT01-RifR to DJC. To unravel the genomic basis of the observed differences, we generated a complete genome sequence for strain DJC using the PacBio long read technology. As strain DJC was supposed to be a spontaneous mutant, only a few sequence differences were expected. In order to distinguish these from potential sequencing errors in the published TT01 genome, we re-sequenced a derivative of strain TT01 in parallel, also using the PacBio technology. The two TT01 genomes differed at only 30 positions. In contrast, the genome of strain DJC varied extensively from TT01, showing 13,000 point mutations, 330 frameshifts, and 220 strain-specific regions with a total length of more than 300 kb in each of the compared genomes. According to the major phenotypic and genotypic differences, the rifampicin-resistant P. luminescens strain, now named strain DJC, has to be considered as an independent isolate rather than a derivative of strain TT01. Strains TT01 and DJC both belong to P. luminescens subsp. laumondii.


September 22, 2019

Genome sequence of the potato pathogenic fungus Alternaria solani HWC-168 reveals clues for its conidiation and virulence.

Alternaria solani is a known airborne deuteromycete fungus with a polycyclic life cycle and is the causal agent of early blight, which causes significant yield losses of potato worldwide. However, the molecular mechanisms underlying the conidiation and pathogenicity remain largely unknown. We produced a high-quality genome assembly of A. solani HWC-168 that was isolated from a major potato-producing region of Northern China, which facilitated a comprehensive gene annotation, the accurate prediction of genes encoding secreted proteins and identification of conidiation-related genes. The assembled genome of A. solani HWC-168 has a genome size of 32.8 Mb and encodes 10,358 predicted genes that are highly similar to those of related Alternaria species including Alternaria arborescens and Alternaria brassicicola. We identified conidiation-related genes in the genome of A. solani HWC-168 by searching for sporulation-related homologues identified from Aspergillus nidulans. A total of 975 secreted protein-encoding genes, which might act as virulence factors, were identified in the genome of A. solani HWC-168. The predicted secretome of A. solani HWC-168 possesses 261 carbohydrate-active enzymes (CAZy), 119 proteins containing RxLx[EDQ] motif and 27 secreted proteins unique to A. solani. Our findings will facilitate the identification of conidiation- and virulence-related genes in the genome of A. solani. This will permit new insights into understanding the molecular mechanisms underlying the A. solani-potato pathosystem and will add value to the global fungal genome database.


September 22, 2019

Antibiotic-resistant indicator bacteria in irrigation water: High prevalence of extended-spectrum beta-lactamase (ESBL)-producing Escherichia coli.

Irrigation water is a major source of fresh produce contamination with undesired microorganisms including antibiotic-resistant bacteria (ARB), and contaminated fresh produce can transfer ARB to the consumer especially when consumed raw. Nevertheless, no legal guidelines exist so far regulating quality of irrigation water with respect to ARB. We therefore examined irrigation water from major vegetable growing areas for occurrence of antibiotic-resistant indicator bacteria Escherichia coli and Enterococcus spp., including extended-spectrum β-lactamase (ESBL)-producing E. coli and vancomycin-resistant Enterococcus spp. Occurrence of ARB strains was compared to total numbers of the respective species. We categorized water samples according to total numbers and found that categories with higher total E. coli or Enterococcus spp. numbers generally had an increased proportion of respective ARB-positive samples. We further detected high prevalence of ESBL-producing E. coli with eight positive samples of thirty-six (22%), while two presumptive vancomycin-resistant Enterococcus spp. were vancomycin-susceptible in confirmatory tests. In disk diffusion assays all ESBL-producing E. coli were multidrug-resistant (n = 21) and whole-genome sequencing of selected strains revealed a multitude of transmissible resistance genes (ARG), with blaCTX-M-1 (4 of 11) and blaCTX-M-15 (3 of 11) as the most frequent ESBL genes. Overall, the increased occurrence of indicator ARB with increased total indicator bacteria suggests that the latter might be a suitable estimate for presence of respective ARB strains. Finally, the high prevalence of ESBL-producing E. coli with transmissible ARG emphasizes the need to establish legal critical values and monitoring guidelines for ARB in irrigation water.


September 22, 2019

An improved genome assembly for Larimichthys crocea reveals hepcidin gene expansion with diversified regulation and function.

Larimichthys crocea (large yellow croaker) is a type of perciform fish well known for its peculiar physiological properties and economic value. Here, we constructed an improved version of the L. crocea genome assembly, which contained 26,100 protein-coding genes. Twenty-four pseudo-chromosomes of L. crocea were also reconstructed, comprising 90% of the genome assembly. This improved assembly revealed several expansions in gene families associated with olfactory detection, detoxification, and innate immunity. Specifically, six hepcidin genes (LcHamps) were identified in L. crocea, possibly resulting from lineage-specific gene duplication. All LcHamps possessed similar genomic structures and functional domains, but varied substantially with respect to expression pattern, transcriptional regulation, and biological function. LcHamp1 was associated specifically with iron metabolism, while LcHamp2s were functionally diverse, being involved in antibacterial activity, antiviral activity, and regulation of intracellular iron metabolism. This functional diversity among gene copies may have allowed L. crocea to adapt to diverse environmental conditions.

