Menu
July 7, 2019

Study of mesophilic Aeromonas salmonicida A527 strain sheds light on the species’ lifestyles and taxonomic dilemma.

The Gram-negative bacterium Aeromonas salmonicida contains five subspecies: salmonicida, smithia, achromogenes, masoucida and pectinolytica. Pectinolytica is a mesophilic subspecies with the ability to thrive at a wide range of temperatures, including 37°C, while the four other subspecies are psychrophilic, restricted to lower temperatures. The psychrophilic subspecies are known to infect a wide range of fishes. However, there is no evidence of pathogenicity for the mesophilic subspecies pectinolytica. Study of the differences between the mesophilic and psychrophilic subspecies is hampered by the lack of completely sequenced and closed genomes from the mesophilic subspecies. A previous study reported that insertion sequences, which can induce genomic rearrangements at temperatures around 25°C, could be one of the determinants explaining the differences in lifestyle (mesophilic or psychrophilic) between the subspecies. In this study, the genome of mesophilic strain A527 of A. salmonicida was sequenced, closed and analyzed to investigate the mesophilic-psychrophilic discrepancy. This reference genome supports the hypothesis that insertion sequences are major determinants of the lifestyle differences between the A. salmonicida subspecies. Moreover, the phylogenetic analysis performed to position strain A527 within the taxonomy raises an issue regarding the intraspecies structure of A. salmonicida.© FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Genome-wide epigenetic studies in chicken: A review

Over the years, farmed birds have been selected on various performance traits mainly through genetic selection. However, many studies have shown that genetics may not be the sole contributor to phenotypic plasticity. Gene expression programs can be influenced by environmentally induced epigenetic changes that may alter the phenotypes of the developing animals. Recently, high-throughput sequencing techniques became sufficiently affordable thanks to technological advances to study whole epigenetic landscapes in model plants and animals. In birds, a growing number of studies recently took advantage of these techniques to gain insights into the epigenetic mechanisms of gene regulation in processes such as immunity or environmental adaptation. Here, we review the current gain of knowledge on the chicken epigenome made possible by recent advances in high-throughput sequencing techniques by focusing on the two most studied epigenetic modifications, DNA methylation and histone post-translational modifications. We discuss and provide insights about designing and performing analyses to further explore avian epigenomes. A better understanding of the molecular mechanisms underlying the epigenetic regulation of gene expression in relation to bird phenotypes may provide new knowledge and markers that should undoubtedly contribute to a sustainable poultry production.


July 7, 2019

Evolutionary context of non-sorbitol-fermenting Shiga toxin-producing Escherichia coli O55:H7.

In July 2014, an outbreak of Shiga toxin-producing Escherichia coli (STEC) O55:H7 in England involved 31 patients, 13 (42%) of whom had hemolytic uremic syndrome. Isolates were sequenced, and the sequences were compared with publicly available sequences of E. coli O55:H7 and O157:H7. A core-genome phylogeny of the evolutionary history of the STEC O55:H7 outbreak strain revealed that the most parsimonious model was a progenitor enteropathogenic O55:H7 sorbitol-fermenting strain, lysogenized by a Shiga toxin (Stx) 2a-encoding phage, followed by loss of the ability to ferment sorbitol because of a non-sense mutation in srlA. The parallel, convergent evolutionary histories of STEC O157:H7 and STEC O55:H7 may indicate a common driver in the evolutionary process. Because emergence of STEC O157:H7 as a clinically significant pathogen was associated with acquisition of the Stx2a-encoding phage, the emergence of STEC O55:H7 harboring the stx2a gene is of public health concern.


July 7, 2019

An update on bioinformatics resources for plant genomics research

Next-generation sequencing and traditional Sanger sequencing methods are of great significance in unraveling the complexity of plant genomes. These are constantly generating heaps of sequence data to be analyzed, annotated and stored. This has created a revolutionary demand for bioinformatics tools and software that can perform these functions. A large number of potentially useful bioinformatics tools and plant genome databases are created that have greatly simplified the analysis and storage of vast amounts of sequence data. The information garnered using the available bioinformatics methods have greatly helped in understanding the plant genome structure. Despite the availability of a good number of such tools, the information pouring from single gene-sequencing, and various whole-genome sequencing projects is overwhelming; thus, further innovations and improved methods are needed to sift through this sequence data, and assemble genomes. The current review focuses on diverse bioinformatics approaches and methods developed to systematically analyze and store plant sequence data. Finally, it outlines the bottlenecks in plant genome analysis, and some possible solutions that could be utilized to overcome the problems associated with plant genome analysis.


July 7, 2019

Genome sequencing brought Gossypium biology research into a new era.

The first sequenced diploid cotton genome was published in 2012 by the group led by the Institute of Cotton Research, Chinese Academy of Agricultural Sciences. Cotton genomics research subsequently entered a period of rapid development. The accumulating data have provided new insights into the evolution and domestication of cotton, the development of important agronomic traits, and strategies for improving cotton quality and production.


July 7, 2019

Bi-level error correction for PacBio long reads.

The latest sequencing technologies such as the Pacific Biosciences (PacBio) and Oxford Nanopore machines can generate long reads at the length of thousands of nucleic bases which is much longer than the reads at the length of hundreds generated by Illumina machines. However, these long reads are prone to much higher error rates, for example 15%, making downstream analysis and applications very difficult. Error correction is a process to improve the quality of sequencing data. Hybrid correction strategies have been recently proposed to combine Illumina reads of low error rates to fix sequencing errors in the noisy long reads with good performance. In this paper, we propose a new method named Bicolor, a bi-level framework of hybrid error correction for further improving the quality of PacBio long reads. At the first level, our method uses a de Bruijn graph-based error correction idea to search paths in pairs of solid -mers iteratively with an increasing length of -mer. At the second level, we combine the processed results under different parameters from the first level. In particular, a multiple sequence alignment algorithm is used to align those similar long reads, followed by a voting algorithm which determines the final base at each position of the reads. We compare the superior performance of Bicolor with three state-of-the-art methods on three real data sets. Results demonstrate that Bicolor always achieves the highest identity ratio. Bicolor also achieves a higher alignment ratio () and a higher number of aligned reads than the current methods on two data sets. On the third data set, our method is closely competitive to the current methods in terms of number of aligned reads and genome coverage. The C++ source codes of our algorithm are freely available at https://github.com/yuansliu/Bicolor.


July 7, 2019

Ultraaccurate genome sequencing and haplotyping of single human cells.

Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10-8and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.


July 7, 2019

A blaOXA-181-harbouring multi-resistant ST147 Klebsiella pneumoniae isolate from Pakistan that represent an intermediate stage towards pan-drug resistance.

Carbapenem resistant Klebsiella pneumoniae (CR-KP) infections are an ever-increasing global issue, especially in the Indian subcontinent. Here we report genetic insight into a blaOXA-181 harbouring Klebsiella pneumoniae, belonging to the pandemic lineage ST147, that represents an intermediate stage towards pan-drug resistance. The CR-KP isolate DA48896 was isolated from a patient from Pakistan and was susceptible only to tigecycline and colistin. It harboured blaOXA-181 and was assigned to sequence type ST147. Analysis from whole genome sequencing revealed a very high sequence similarity to the previously sequenced pan-resistant K. pneumoniae isolate MS6671 from the United Arab Emirates. The two isolates are very closely related with only 46 chromosomal nucleotide differences, 14 indels and differences in plasmid content. Both carry a substantial number of plasmid-borne and chromosomally encoded resistance determinants. Interestingly, the two differences in susceptibility between the isolates could be attributed to DA48896 lacking an insertion of blaOXA-181 into the mgrB gene that results in colistin resistance in MS6671 and SNPs affecting AcrAB efflux pump expression likely to result in tigecycline resistance. These differences between the otherwise very similar isolates indicate that strong selection has occurred for resistance towards these last-resort drugs and illustrates the trajectory of resistance evolution of OXA-181-producing versions of the ST147 international risk clone.


July 7, 2019

Characterization of Fusobacterium varium Fv113-g1 isolated from a patient with ulcerative colitis based on complete genome sequence and transcriptome analysis.

Fusobacterium spp. present in the oral and gut flora is carcinogenic and is associated with the risk of pancreatic and colorectal cancers. Fusobacterium spp. is also implicated in a broad spectrum of human pathologies, including Crohn’s disease and ulcerative colitis (UC). Here we report the complete genome sequence of Fusobacterium varium Fv113-g1 (genome size, 3.96 Mb) isolated from a patient with UC. Comparative genome analyses totally suggested that Fv113-g1 is basically assigned as F. varium, in particular, it could be reclassified as notable F. varium subsp. similar to F. ulcerans because of partial shared orthologs. Compared with the genome sequences of F. varium ATCC 27725 (genome size, 3.30 Mb) and other strains of Fusobacterium spp., Fv113-g1 possesses many accessary pan-genome sequences with noteworthy multiple virulence factors, including 44 autotransporters (type V secretion system, T5SS) and 13 Fusobacterium adhesion (FadA) paralogs involved in potential mucosal inflammation. Indeed, transcriptome analysis demonstrated that Fv113-g1-specific accessary genes, such as multiple T5SS and fadA paralogs, showed notably increased expression with D-MEM cultivation than with brain heart infusion broth. This implied that growth condition may enhance the expression of such potential virulence factors, leading to remarkable survival against other gut microorganisms and to the pathogenicity to human intestinal epithelium.


July 7, 2019

Complete genome sequence and comparative genomics of the golden pompano (Trachinotus ovatus) pathogen, Vibrio harveyistrain QT520.

Vibrio harveyi is a Gram-negative, halophilic bacterium that is an opportunistic pathogen of commercially farmed marine vertebrate species. To understand the pathogenicity of this species, the genome of V. harveyi QT520 was analyzed and compared to that of other strains. The results showed the genome of QT520 has two unique circular chromosomes and three endogenous plasmids, totaling 6,070,846 bp with a 45% GC content, 5,701 predicted ORFs, 134 tRNAs and 37 rRNAs. Common virulence factors, including ACF, IlpA, OmpU, Flagellin, Cya, Hemolysin and MARTX, were detected in the genome, which are likely responsible for the virulence of QT520. The results of genomes comparisons with strains ATCC 33843 (392 (MAV)) and ATCC 43516 showed that greater numbers genes associated with types I, II, III, IV and VI secretion systems were detected in QT520 than in other strains, suggesting that QT520 is a highly virulent strain. In addition, three plasmids were only observed in the complete genome sequence of strain QT520. In plasmid p1 of QT520, specific virulence factors (cyaB, hlyB and rtxA) were identified, suggesting that the pathogenicity of this strain is plasmid-associated. Phylogenetic analysis of 12 complete Vibrio sp. genomes using ANI values, core genes and MLST revealed that QT520 was most closely related to ATCC 33843 (392 (MAV)) and ATCC 43516, suggesting that QT520 belongs to the species V. harveyi. This report is the first to describe the complete genome sequence of a V. harveyi strain isolated from an outbreak in a fish species in China. In addition, to the best of our knowledge, this report is the first to compare the V. harveyi genomes of several strains. The results of this study will expand our understanding of the genome, genetic characteristics, and virulence factors of V. harveyi, setting the stage for studies of pathogenesis, diagnostics, and disease prevention.


July 7, 2019

On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data.

To benchmark algorithms for automated plasmid sequence reconstruction from short-read sequencing data, we selected 42 publicly available complete bacterial genome sequences spanning 12 genera, containing 148 plasmids. We predicted plasmids from short-read data with four programs (PlasmidSPAdes, Recycler, cBar and PlasmidFinder) and compared the outcome to the reference sequences. PlasmidSPAdes reconstructs plasmids based on coverage differences in the assembly graph. It reconstructed most of the reference plasmids (recall=0.82), but approximately a quarter of the predicted plasmid contigs were false positives (precision=0.75). PlasmidSPAdes merged 84?% of the predictions from genomes with multiple plasmids into a single bin. Recycler searches the assembly graph for sub-graphs corresponding to circular sequences and correctly predicted small plasmids, but failed with long plasmids (recall=0.12, precision=0.30). cBar, which applies pentamer frequency analysis to detect plasmid-derived contigs, showed a recall and precision of 0.76 and 0.62, respectively. However, cBar categorizes contigs as plasmid-derived and does not bin the different plasmids. PlasmidFinder, which searches for replicons, had the highest precision (1.0), but was restricted by the contents of its database and the contig length obtained fromde novoassembly (recall=0.36). PlasmidSPAdes and Recycler detected putative small plasmids (<10?kbp), which were also predicted as plasmids by cBar, but were absent in the original assembly. This study shows that it is possible to automatically predict small plasmids. Prediction of large plasmids (>50?kbp) containing repeated sequences remains challenging and limits the high-throughput analysis of plasmids from short-read whole-genome sequencing data.


July 7, 2019

Genomic characterization of a local epidemic Pseudomonas aeruginosa reveals specific features of the widespread clone ST395.

Pseudomonas aeruginosa is a ubiquitous opportunistic pathogen with several clones being frequently associated with outbreaks in hospital settings. ST395 is among these so-called ‘international’ clones. We aimed here to define the biological features that could have helped the implantation and spread of the clone ST395 in hospital settings. The complete genome of a multidrug resistant index isolate (DHS01) of a large hospital outbreak was analysed. We identified DHS01-specific genetic elements, among which were identified those shared with a panel of six independent ST395 isolates responsible for outbreaks in other hospitals. DHS01 has the fifth largest chromosome of the species (7.1 Mbp), with most of its 1555 accessory genes borne by either genomic islands (GIs,n=48) or integrative and conjugative elements (ICEs,n=5). DHS01 is multidrug resistant mostly due to chromosomal mutations. It displayed signatures of adaptation to chronic infection in part due to the loss of a 131 kbp chromosomal fragment. Four GIs were specific to the clone ST395 and contained genes involved in metabolism (GI-4), in virulence (GI-6) and in resistance to copper (GI-7). GI-7 harboured an array of six copper transporters and was shared with non-pathogenicPseudomonassp. retrieved from copper-contaminated environments. Copper resistance was confirmed phenotypically in all other ST395 isolates and possibly accounted for the spreading capability of the clone in hospital outbreaks, where water networks have been incriminated. This suggests that genes transferred from copper-polluted environments may have favoured the implantation and spread of the international cloneP. aeruginosaST395 in hospital settings.


July 7, 2019

Feathermoss and epiphytic Nostoc cooperate differently: expanding the spectrum of plant-cyanobacteria symbiosis.

Dinitrogen (N2)-fixation by cyanobacteria in symbiosis with feathermosses is the primary pathway of biological nitrogen (N) input into boreal forests. Despite its significance, little is known about the cyanobacterial gene repertoire and regulatory rewiring needed for the establishment and maintenance of the symbiosis. To determine gene acquisitions and regulatory changes allowing cyanobacteria to form and maintain this symbiosis, we compared genomically closely related symbiotic-competent and -incompetent Nostoc strains using a proteogenomics approach and an experimental set up allowing for controlled chemical and physical contact between partners. Thirty-two gene families were found only in the genomes of symbiotic strains, including some never before associated with cyanobacterial symbiosis. We identified conserved orthologs that were differentially expressed in symbiotic strains, including protein families involved in chemotaxis and motility, NO regulation, sulfate/phosphate transport, and glycosyl-modifying and oxidative stress-mediating exoenzymes. The physical moss-cyanobacteria epiphytic symbiosis is distinct from other cyanobacteria-plant symbioses, with Nostoc retaining motility, and lacking modulation of N2-fixation, photosynthesis, GS-GOGAT cycle and heterocyst formation. The results expand our knowledge base of plant-cyanobacterial symbioses, provide a model of information and material exchange in this ecologically significant symbiosis, and suggest new currencies, namely nitric oxide and aliphatic sulfonates, may be involved in establishing and maintaining the cyanobacteria-feathermoss symbiosis.


July 7, 2019

A 3-way hybrid approach to generate a new high-quality chimpanzee reference genome (Pan_tro_3.0).

The chimpanzee is arguably the most important species for the study of human origins. A key resource for these studies is a high-quality reference genome assembly; however, as with most mammalian genomes, the current iteration of the chimpanzee reference genome assembly is highly fragmented. In the current iteration of the chimpanzee reference genome assembly (Pan_tro_2.1.4), the sequence is scattered across more then 183 000 contigs, incorporating more than 159 000 gaps, with a genome-wide contig N50 of 51 Kbp. In this work, we produce an extensive and diverse array of sequencing datasets to rapidly assemble a new chimpanzee reference that surpasses previous iterations in bases represented and organized in large scaffolds. To this end, we show substantial improvements over the current release of the chimpanzee genome (Pan_tro_2.1.4) by several metrics, such as increased contiguity by >750% and 300% on contigs and scaffolds, respectively, and closure of 77% of gaps in the Pan_tro_2.1.4 assembly gaps spanning >850 Kbp of the novel coding sequence based on RNASeq data. We further report more than 2700 genes that had putatively erroneous frame-shift predictions to human in Pan_tro_2.1.4 and show a substantial increase in the annotation of repetitive elements. We apply a simple 3-way hybrid approach to considerably improve the reference genome assembly for the chimpanzee, providing a valuable resource for the study of human origins. Furthermore, we produce extensive sequencing datasets that are all derived from the same cell line, generating a broad non-human benchmark dataset.© The Author 2017. Published by Oxford University Press.


July 7, 2019

Trajectories and drivers of genome evolution in surface-associated marine Phaeobacter.

The extent of genome divergence and the evolutionary events leading to speciation of marine bacteria have mostly been studied for (locally) abundant, free-living groups. The genus Phaeobacter is found on different marine surfaces, seems to occupy geographically disjunct habitats, and is involved in different biotic interactions, and was therefore targeted in the present study. The analysis of the chromosomes of 32 closely related but geographically spread Phaeobacter strains revealed an exceptionally large, highly syntenic core genome. The flexible gene pool is constantly but slightly expanding across all Phaeobacter lineages. The horizontally transferred genes mostly originated from bacteria of the Roseobacter group and horizontal transfer most likely was mediated by gene transfer agents. No evidence for geographic isolation and habitat specificity of the different phylogenomic Phaeobacter clades was detected based on the sources of isolation. In contrast, the functional gene repertoire and physiological traits of different phylogenomic Phaeobacter clades were sufficiently distinct to suggest an adaptation to an associated lifestyle with algae, to additional nutrient sources, or toxic heavy metals. Our study reveals that the evolutionary trajectories of surface-associated marine bacteria can differ significantly from free-living marine bacteria or marine generalists.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.