Menu
July 7, 2019

Comparative genomics reveals specific genetic architectures in nicotine metabolism of Pseudomonassp. JY-Q.

Microbial degradation of nicotine is an important process to control nicotine residues in the aqueous environment. In this study, a high active nicotine degradation strain namedPseudomonassp. JY-Q was isolated from tobacco waste extract (TWE). This strain could completely degrade 5.0 g l-1nicotine in 24 h under optimal culture conditions, and it showed some tolerance even at higher concentrations (10.0 g l-1) of nicotine. The complete genome of JY-Q was sequenced to understand the mechanism by which JY-Q could degrade nicotine and tolerate such high nicotine concentrations. Comparative genomic analysis indicated that JY-Q degrades nicotine through putative novel mechanisms. Two candidate gene cluster duplications located separately at distant loci were predicted to be responsible for nicotine degradation. These two nicotine (Nic) degradation-related loci (AA098_21325-AA098_21340, AA098_03885-AA098_03900) exhibit nearly completely consistent gene organization and component synteny. The nicotinic acid(NA)degradation gene cluster (AA098_17770-AA098_17790) andNic-like clusters were both predicted to be flanked by mobile genetic elements (MGE). Furthermore, we analyzed the regions of genomic plasticity (RGP) in the JY-Q strain and found a dynamic genome carrying a type VI secretion system (T6SS) that promotes nicotine metabolism and tolerance based on transcriptomics and usedin silicomethods to identify the T6SS effector protein. Thus, a novel nicotine degradation mechanism was elucidated forPseudomonassp. JY-Q, suggesting its potential application in the bioremediation of nicotine-contaminated environments, such as TWEs.


July 7, 2019

Probing genomic aspects of the multi-host pathogen Clostridium perfringens reveals significant pangenome diversity, and a diverse array of virulence factors.

Clostridium perfringens is an important cause of animal and human infections, however information about the genetic makeup of this pathogenic bacterium is currently limited. In this study, we sought to understand and characterise the genomic variation, pangenomic diversity, and key virulence traits of 56 C. perfringens strains which included 51 public, and 5 newly sequenced and annotated genomes using Whole Genome Sequencing. Our investigation revealed that C. perfringens has an “open” pangenome comprising 11667 genes and 12.6% of core genes, identified as the most divergent single-species Gram-positive bacterial pangenome currently reported. Our computational analyses also defined C. perfringens phylogeny (16S rRNA gene) in relation to some 25 Clostridium species, with C. baratii and C. sardiniense determined to be the closest relatives. Profiling virulence-associated factors confirmed presence of well-characterised C. perfringens-associated exotoxins genes including a-toxin (plc), enterotoxin (cpe), and Perfringolysin O (pfo or pfoA), although interestingly there did not appear to be a close correlation with encoded toxin type and disease phenotype. Furthermore, genomic analysis indicated significant horizontal gene transfer events as defined by presence of prophage genomes, and notably absence of CRISPR defence systems in >70% (40/56) of the strains. In relation to antimicrobial resistance mechanisms, tetracycline resistance genes (tet) and anti-defensins genes (mprF) were consistently detected in silico (tet: 75%; mprF: 100%). However, pre-antibiotic era strain genomes did not encode for tet, thus implying antimicrobial selective pressures in C. perfringens evolutionary history over the past 80 years. This study provides new genomic understanding of this genetically divergent multi-host bacterium, and further expands our knowledge on this medically and veterinary important pathogen.


July 7, 2019

Tracing origins of the Salmonella Bareilly strain causing a food-borne outbreak in the United States.

Using a novel combination of whole-genome sequencing (WGS) analysis and geographic metadata, we traced the origins of Salmonella Bareilly isolates collected in 2012 during a widespread food-borne outbreak in the United States associated with scraped tuna imported from India.Using next-generation sequencing, we sequenced the complete genome of 100 Salmonella Bareilly isolates obtained from patients who consumed contaminated product, from natural sources, and from unrelated historically and geographically disparate foods. Pathogen genomes were linked to geography by projecting the phylogeny on a virtual globe and produced a transmission network.Phylogenetic analysis of WGS data revealed a common origin for outbreak strains, indicating that patients in Maryland and New York were infected from sources originating at a facility in India.These data represent the first report fully integrating WGS analysis with geographic mapping and a novel use of transmission networks. Results showed that WGS vastly improves our ability to delimit the scope and source of bacterial food-borne contamination events. Furthermore, these findings reinforce the extraordinary utility that WGS brings to global outbreak investigation as a greatly enhanced approach to protecting the human food supply chain as well as public health in general. Published by Oxford University Press for the Infectious Diseases Society of America 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.


July 7, 2019

Stability of the encoding plasmids and surface expression of CS6 differs in enterotoxigenic Escherichia coli (ETEC) encoding Different heat-stable (ST) enterotoxins (STh and STp).

Enterotoxigenic Escherichia coli (ETEC), one of the most common reasons of diarrhea among infants and children in developing countries, causes disease by expression of either or both of the enterotoxins heat-labile (LT) and heat-stable (ST; divided into human-type [STh] and porcine-type [STp] variants), and colonization factors (CFs) among which CS6 is one of the most prevalent ETEC CFs. In this study we show that ETEC isolates expressing CS6+STh have higher copy numbers of the cssABCD operon encoding CS6 than those expressing CS6+STp. Long term cultivation of up to ten over-night passages of ETEC isolates harboring CS6+STh (n = 10) or CS6+STp (n = 15) showed instability of phenotypic expression of CS6 in a majority of the CS6+STp isolates, whereas most of the CS6+STh isolates retained CS6 expression. The observed instability was a correlated with loss of genes cssA and cssD as examined by PCR. Mobilization of the CS6 plasmid from an unstable CS6+STp isolate into a laboratory E. coli strain resulted in loss of the plasmid after a single over-night passage whereas the plasmid from an CS6+STh strain was retained in the laboratory strain during 10 passages. A sequence comparison between the CS6 plasmids from a stable and an unstable ETEC isolate revealed that genes necessary for plasmid stabilization, for example pemI, pemK, stbA, stbB and parM, were not present in the unstable ETEC isolate. Our results indicate that stable retention of CS6 may in part be affected by the stability of the plasmid on which both CS6 and STp or STh are located.


July 7, 2019

Evolutionary history of the global emergence of the Escherichia coli epidemic clone ST131.

Escherichia colisequence type 131 (ST131) has emerged globally as the most predominant extraintestinal pathogenic lineage within this clinically important species, and its association with fluoroquinolone and extended-spectrum cephalosporin resistance impacts significantly on treatment. The evolutionary histories of this lineage, and of important antimicrobial resistance elements within it, remain unclearly defined. This study of the largest worldwide collection (n= 215) of sequenced ST131E. coliisolates to date demonstrates that the clonal expansion of two previously recognized antimicrobial-resistant clades, C1/H30R and C2/H30Rx, started around 25 years ago, consistent with the widespread introduction of fluoroquinolones and extended-spectrum cephalosporins in clinical medicine. These two clades appear to have emerged in the United States, with the expansion of the C2/H30Rx clade driven by the acquisition of ablaCTX-M-15-containing IncFII-like plasmid that has subsequently undergone extensive rearrangement. Several other evolutionary processes influencing the trajectory of this drug-resistant lineage are described, including sporadic acquisitions of CTX-M resistance plasmids and chromosomal integration ofblaCTX-Mwithin subclusters followed by vertical evolution. These processes are also occurring for another family of CTX-M gene variants more recently observed among ST131, theblaCTX-M-14/14-likegroup. The complexity of the evolutionary history of ST131 has important implications for antimicrobial resistance surveillance, epidemiological analysis, and control of emerging clinical lineages ofE. coli These data also highlight the global imperative to reduce specific antibiotic selection pressures and demonstrate the important and varied roles played by plasmids and other mobile genetic elements in the perpetuation of antimicrobial resistance within lineages.IMPORTANCEEscherichia coli, perennially a major bacterial pathogen, is becoming increasingly difficult to manage due to emerging resistance to all preferred antimicrobials. Resistance is concentrated within specificE. colilineages, such as sequence type 131 (ST131). Clarification of the genetic basis for clonally associated resistance is key to devising intervention strategies. We used high-resolution genomic analysis of a large global collection of ST131 isolates to define the evolutionary history of extended-spectrum beta-lactamase production in ST131. We documented diverse contributory genetic processes, including stable chromosomal integrations of resistance genes, persistence and evolution of mobile resistance elements within sublineages, and sporadic acquisition of different resistance elements. Both global distribution and regional segregation were evident. The diversity of resistance element acquisition and propagation within ST131 indicates a need for control and surveillance strategies that target both bacterial strains and mobile genetic elements. Copyright © 2016 Stoesser et al.


July 7, 2019

The Atlantic salmon genome provides insights into rediploidization.

The whole-genome duplication 80 million years ago of the common ancestor of salmonids (salmonid-specific fourth vertebrate whole-genome duplication, Ss4R) provides unique opportunities to learn about the evolutionary fate of a duplicated vertebrate genome in 70 extant lineages. Here we present a high-quality genome assembly for Atlantic salmon (Salmo salar), and show that large genomic reorganizations, coinciding with bursts of transposon-mediated repeat expansions, were crucial for the post-Ss4R rediploidization process. Comparisons of duplicate gene expression patterns across a wide range of tissues with orthologous genes from a pre-Ss4R outgroup unexpectedly demonstrate far more instances of neofunctionalization than subfunctionalization. Surprisingly, we find that genes that were retained as duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein interactions of the gene products. Finally, we demonstrate that the Atlantic salmon assembly can serve as a reference sequence for the study of other salmonids for a range of purposes.


July 7, 2019

The channel catfish genome sequence provides insights into the evolution of scale formation in teleosts.

Catfish represent 12% of teleost or 6.3% of all vertebrate species, and are of enormous economic value. Here we report a high-quality reference genome sequence of channel catfish (Ictalurus punctatus), the major aquaculture species in the US. The reference genome sequence was validated by genetic mapping of 54,000 SNPs, and annotated with 26,661 predicted protein-coding genes. Through comparative analysis of genomes and transcriptomes of scaled and scaleless fish and scale regeneration experiments, we address the genomic basis for the most striking physical characteristic of catfish, the evolutionary loss of scales and provide evidence that lack of secretory calcium-binding phosphoproteins accounts for the evolutionary loss of scales in catfish. The channel catfish reference genome sequence, along with two additional genome sequences and transcriptomes of scaled catfishes, provide crucial resources for evolutionary and biological studies. This work also demonstrates the power of comparative subtraction of candidate genes for traits of structural significance.


July 7, 2019

A hot L1 retrotransposon evades somatic repression and initiates human colorectal cancer.

Although human LINE-1 (L1) elements are actively mobilized in many cancers, a role for somatic L1 retrotransposition in tumor initiation has not been conclusively demonstrated. Here, we identify a novel somatic L1 insertion in the APC tumor suppressor gene that provided us with a unique opportunity to determine whether such insertions can actually initiate colorectal cancer (CRC), and if so, how this might occur. Our data support a model whereby a hot L1 source element on Chromosome 17 of the patient’s genome evaded somatic repression in normal colon tissues and thereby initiated CRC by mutating the APC gene. This insertion worked together with a point mutation in the second APC allele to initiate tumorigenesis through the classic two-hit CRC pathway. We also show that L1 source profiles vary considerably depending on the ancestry of an individual, and that population-specific hot L1 elements represent a novel form of cancer risk. © 2016 Scott et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads.

Motivation. The third generation sequencing (3GS) technology generates long sequences of thousands of bases. However, its current error rates are estimated in the range of 15-40%, significantly higher than those of the prevalent next generation sequencing (NGS) technologies (less than 1%). Fundamental bioinformatics tasks such as de novo genome assembly and variant calling require high-quality sequences that need to be extracted from these long but erroneous 3GS sequences. Results. We describe a versatile and efficient linear complexity consensus algorithm Sparc to facilitate de novo genome assembly. Sparc builds a sparse k-mer graph using a collection of sequences from a targeted genomic region. The heaviest path which approximates the most likely genome sequence is searched through a sparsity-induced reweighted graph as the consensus sequence. Sparc supports using NGS and 3GS data together, which leads to significant improvements in both cost efficiency and computational efficiency. Experiments with Sparc show that our algorithm can efficiently provide high-quality consensus sequences using both PacBio and Oxford Nanopore sequencing technologies. With only 30× PacBio data, Sparc can reach a consensus with error rate <0.5%. With the more challenging Oxford Nanopore data, Sparc can also achieve similar error rate when combined with NGS data. Compared with the existing approaches, Sparc calculates the consensus with higher accuracy, and uses approximately 80% less memory and time. Availability. The source code is available for download at https://github.com/yechengxi/Sparc.


July 7, 2019

Genomics-informed isolation and characterization of a symbiotic Nanoarchaeota system from a terrestrial geothermal environment.

Biological features can be inferred, based on genomic data, for many microbial lineages that remain uncultured. However, cultivation is important for characterizing an organism’s physiology and testing its genome-encoded potential. Here we use single-cell genomics to infer cultivation conditions for the isolation of an ectosymbiotic Nanoarchaeota (‘Nanopusillus acidilobi’) and its host (Acidilobus, a crenarchaeote) from a terrestrial geothermal environment. The cells of ‘Nanopusillus’ are among the smallest known cellular organisms (100-300?nm). They appear to have a complete genetic information processing machinery, but lack almost all primary biosynthetic functions as well as respiration and ATP synthesis. Genomic and proteomic comparison with its distant relative, the marine Nanoarchaeum equitans illustrate an ancient, common evolutionary history of adaptation of the Nanoarchaeota to ectosymbiosis, so far unique among the Archaea.


July 7, 2019

Distinct Salmonella enteritidis lineages associated with enterocolitis in high-income settings and invasive disease in low-income settings.

An epidemiological paradox surrounds Salmonella enterica serovar Enteritidis. In high-income settings, it has been responsible for an epidemic of poultry-associated, self-limiting enterocolitis, whereas in sub-Saharan Africa it is a major cause of invasive nontyphoidal Salmonella disease, associated with high case fatality. By whole-genome sequence analysis of 675 isolates of S. Enteritidis from 45 countries, we show the existence of a global epidemic clade and two new clades of S. Enteritidis that are geographically restricted to distinct regions of Africa. The African isolates display genomic degradation, a novel prophage repertoire, and an expanded multidrug resistance plasmid. S. Enteritidis is a further example of a Salmonella serotype that displays niche plasticity, with distinct clades that enable it to become a prominent cause of gastroenteritis in association with the industrial production of eggs and of multidrug-resistant, bloodstream-invasive infection in Africa.


July 7, 2019

DBG2OLC: Efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies.

The highly anticipated transition from next generation sequencing (NGS) to third generation sequencing (3GS) has been difficult primarily due to high error rates and excessive sequencing cost. The high error rates make the assembly of long erroneous reads of large genomes challenging because existing software solutions are often overwhelmed by error correction tasks. Here we report a hybrid assembly approach that simultaneously utilizes NGS and 3GS data to address both issues. We gain advantages from three general and basic design principles: (i) Compact representation of the long reads leads to efficient alignments. (ii) Base-level errors can be skipped; structural errors need to be detected and corrected. (iii) Structurally correct 3GS reads are assembled and polished. In our implementation, preassembled NGS contigs are used to derive the compact representation of the long reads, motivating an algorithmic conversion from a de Bruijn graph to an overlap graph, the two major assembly paradigms. Moreover, since NGS and 3GS data can compensate for each other, our hybrid assembly approach reduces both of their sequencing requirements. Experiments show that our software is able to assemble mammalian-sized genomes orders of magnitude more quickly than existing methods without consuming a lot of memory, while saving about half of the sequencing cost.


July 7, 2019

Lysosomal Cathepsin A plays a significant role in the processing of endogenous bioactive peptides.

Lysosomal serine carboxypeptidase Cathepsin A (CTSA) is a multifunctional enzyme with distinct protective and catalytic function. CTSA present in the lysosomal multienzyme complex to facilitate the correct lysosomal routing, stability and activation of with beta-galactosidase and alpha-neuraminidase. Beside CTSA has role in inactivation of bioactive peptides including bradykinin, substances P, oxytocin, angiotensin I and endothelin-I by cleavage of 1 or 2 amino acid(s) from C-terminal ends. In this study, we aimed to elucidate the regulatory role of CTSA on bioactive peptides in knock-in mice model of CTSA(S190A) . We investigated the level of bradykinin, substances P, oxytocin, angiotensin I and endothelin-I in the kidney, liver, lung, brain and serum from CTSA(S190A) mouse model at 3- and 6-months of age. Our results suggest CTSA selectively contributes to processing of bioactive peptides in different tissues from CTSA(S190A) mice compared to age matched WT mice.


July 7, 2019

Genome sequences of Ralstonia insidiosa type strain ATCC 49129 and strain FC1138, a strong biofilm producer isolated from a fresh-cut produce-processing plant.

Ralstonia insidiosa is an opportunistic pathogen and a strong biofilm producer. Here, we present the complete genome sequences of R. insidiosa FC1138 and ATCC 49129. Both strains have two circular chromosomes of approximately 3.9 and 1.9 Mb and a 50-kb plasmid. ATCC 49129 also possesses a megaplasmid of approximately 318 kb. Copyright © 2016 Xu et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.