HGAP Archives - Page 31 of 122

September 21, 2019

Functional analysis of the first complete genome sequence of a multidrug resistant sequence type 2 Staphylococcus epidermidis.

Staphylococcus epidermidis is a significant opportunistic pathogen of humans. The ST2 lineage is frequently multidrug resistant and accounts for most of the clinical disease worldwide. However, there are no publicly available, closed ST2 genomes, and pathogenesis studies have not focused on these strains. We report the complete genome and methylome of BPH0662, a multidrug-resistant, hospital-adapted ST2 S. epidermidis, describe the correlation between resistome and phenotype, and demonstrate its relationship to publicly available, international ST2 isolates. Furthermore, we delineate the methylome determined by the two type I restriction modification systems present in BPH0662 through heterologous expression in Escherichia coli, allowing the assignment of each system to its corresponding target recognition motif. As the first complete ST2 S. epidermidis genome, BPH0662 provides a valuable reference for future genomic studies of this clinically relevant lineage. Defining the methylome and constructing these E. coli hosts provides the foundation for the development of molecular tools to bypass restriction modification systems in this lineage, which has hitherto proven intractable.

September 21, 2019

Multiple genome sequences of important beer-spoiling lactic acid bacteria.

Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. Copyright © 2016 Geissler et al.

September 21, 2019

Recent advances in bioinformatics for fish genomics

In the past few years, we have contributed efforts to ~1/5 of the reported fish genomes. Based on our related experience, here we outline recent advances in bioinformatics for fish genomics, with an emphasis on development of software for genome assembly, genome annotation and evolutionary analysis. This review will be helpful for newcomers to genome analysis of both animals and plants. In the past decade, whole genome sequences of approximately 50 fish species have been reported [1]. We have been involved in ~1/5 of these international works from 2014 to 2017, such as mudskippers (2014) [2], Chinese large yellow croaker [3], Chinese barbel fishes [4], Asian arowana [5,6], Channel catfish [7], seahorses [8], Japanese flounder [9], Chinese clearhead icefish [10] and Northern snakehead [11]. We are also in charge of the China Aquatic 10-100-1,000 Genomics Program [12], in which ~100 fish genomes are sequencing targets for the next 3 to 5 years. Based on our previous experience with fish genomic studies, here we outline recent advances in related bioinformatics for fish genomics to share with public readers. Since the basic informatics includes genome assembly, genome annotation and evolutionary analysis, we discuss them one by one in this order.

September 21, 2019

Characterization of multi-drug resistant Enterococcus faecalis isolated from cephalic recording chambers in research macaques (Macaca spp.).

Nonhuman primates are commonly used for cognitive neuroscience research and often surgically implanted with cephalic recording chambers for electrophysiological recording. Aerobic bacterial cultures from 25 macaques identified 72 bacterial isolates, including 15 Enterococcus faecalis isolates. The E. faecalis isolates displayed multi-drug resistant phenotypes, with resistance to ciprofloxacin, enrofloxacin, trimethoprim-sulfamethoxazole, tetracycline, chloramphenicol, bacitracin, and erythromycin, as well as high-level aminoglycoside resistance. Multi-locus sequence typing showed that most belonged to two E. faecalis sequence types (ST): ST 4 and ST 55. The genomes of three representative isolates were sequenced to identify genes encoding antimicrobial resistances and other traits. Antimicrobial resistance genes identified included aac(6′)-aph(2″), aph(3′)-III, str, ant(6)-Ia, tetM, tetS, tetL, ermB, bcrABR, cat, and dfrG, and polymorphisms in parC (S80I) and gyrA (S83I) were observed. These isolates also harbored virulence factors including the cytolysin toxin genes in ST 4 isolates, as well as multiple biofilm-associated genes (esp, agg, ace, SrtA, gelE, ebpABC), hyaluronidases (hylA, hylB), and other survival genes (ElrA, tpx). Crystal violet biofilm assays confirmed that ST 4 isolates produced more biofilm than ST 55 isolates. The abundance of antimicrobial resistance and virulence factor genes in the ST 4 isolates likely relates to the loss of CRISPR-cas. This macaque colony represents a unique model for studying E. faecalis infection associated with indwelling devices, and provides an opportunity to understand the basis of persistence of this pathogen in a healthcare setting.

September 21, 2019

Decreased fitness and virulence in ST10 Escherichia coli harboring blaNDM-5 and mcr-1 against a ST4981 strain with blaNDM-5.

Although coexistence of blaNDM-5 and mcr-1 in Escherichia coli has been reported, little is known about the fitness and virulence of such strains. Three carbapenem-resistant Escherichia coli (GZ1, GZ2, and GZ3) successively isolated from one patient in 2015 were investigated for microbiological fitness and virulence. GZ1 and GZ2 were also resistant to colistin. To verify the association between plasmids and fitness, growth kinetics of the transconjugants were performed. We also analyzed genomic sequences of GZ2 and GZ3 using PacBio sequencing. GZ1 and GZ2 (ST10) co-harbored blaNDM-5 and mcr-1, while GZ3 (ST4981) carried only blaNDM-5. GZ3 demonstrated significantly more rapid growth (P < 0.001) and overgrew GZ2 with a competitive index of 1.0157 (4 h) and 2.5207 (24 h). Increased resistance to serum killing and mice mortality was also identified in GZ3. While GZ2 had four plasmids (IncI2, IncX3, IncHI2, IncFII), GZ3 possessed one plasmid (IncFII). The genetic contexts of blaNDM-5 in GZ2 and GZ3 were identical but inserted into different backbones, IncX3 (102,512 bp) and IncFII (91,451 bp), respectively. The growth was not statistically different between the transconjugants with mcr-1 or blaNDM-5 plasmid and recipient (P = 0.6238). Whole genome sequence analysis revealed that 28 virulence genes were specific to GZ3, potentially contributing to increased virulence of GZ3. Decreased fitness and virulence in a mcr-1 and blaNDM-5 co-harboring ST10 E. coli was found alongside a ST4981 strain with only blaNDM-5. Acquisition of mcr-1 or blaNDM-5 plasmid did not lead to considerable fitness costs, indicating the potential for dissemination of mcr-1 and blaNDM-5 in Enterobacteriaceae.

September 21, 2019

A distinct and genetically diverse lineage of the hybrid fungal pathogen Verticillium longisporum population causes stem striping in British oilseed rape.

Population genetic structures illustrate evolutionary trajectories of organisms adapting to differential environmental conditions. Verticillium stem striping disease on oilseed rape was mainly observed in continental Europe, but has recently emerged in the United Kingdom. The disease is caused by the hybrid fungal species Verticillium longisporum that originates from at least three separate hybridization events, yet hybrids between Verticillium progenitor species A1 and D1 are mainly responsible for Verticillium stem striping. We reveal a hitherto undescribed dichotomy within V. longisporum lineage A1/D1 that correlates with the geographic distribution of the isolates with an ‘A1/D1 West’ and an ‘A1/D1 East’ cluster. Genome comparison between representatives of the A1/D1 West and East clusters excluded population distinctiveness through separate hybridization events. Remarkably, the A1/D1 West population, which is genetically more diverse than the entire A1/D1 East cluster, caused the sudden emergence of Verticillium stem striping in the UK, whereas in continental Europe Verticillium stem striping is predominantly caused by the more genetically uniform A1/D1 East population. The observed genetic diversity of the A1/D1 West population argues against a recent introduction of the pathogen into the UK, but rather suggests that the pathogen was previously established in the UK and remained latent or unnoticed as an oilseed rape pathogen until recently. © 2017 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.

September 21, 2019

PacBio assembly of a Plasmodium knowlesi genome sequence with Hi-C correction and manual annotation of the SICAvar gene family.

Plasmodium knowlesi has risen in importance as a zoonotic parasite that has been causing regular episodes of malaria throughout South East Asia. The P. knowlesi genome sequence generated in 2008 highlighted and confirmed many similarities and differences in Plasmodium species, including a global view of several multigene families, such as the large SICAvar multigene family encoding the variant antigens known as the schizont-infected cell agglutination proteins. However, repetitive DNA sequences are the bane of any genome project, and this and other Plasmodium genome projects have not been immune to the gaps, rearrangements and other pitfalls created by these genomic features. Today, long-read PacBio and chromatin conformation technologies are overcoming such obstacles. Here, based on the use of these technologies, we present a highly refined de novo P. knowlesi genome sequence of the Pk1(A+) clone. This sequence and annotation, referred to as the ‘MaHPIC Pk genome sequence’, includes manual annotation of the SICAvar gene family with 136 full-length members categorized as type I or II. This sequence provides a framework that will permit a better understanding of the SICAvar repertoire, selective pressures acting on this gene family and mechanisms of antigenic variation in this species and other pathogens.

September 21, 2019

The kinetoplastid-infecting Bodo saltans virus (BsV), a window into the most abundant giant viruses in the sea.

Giant viruses are ecologically important players in aquatic ecosystems that have challenged concepts of what constitutes a virus. Herein, we present the giant Bodo saltans virus (BsV), the first characterized representative of the most abundant group of giant viruses in ocean metagenomes, and the first isolate of a klosneuvirus, a subgroup of the Mimiviridae proposed from metagenomic data. BsV infects an ecologically important microzooplankton, the kinetoplastid Bodo saltans. Its 1.39 Mb genome encodes 1227 predicted ORFs, including a complex replication machinery. Yet, much of its translational apparatus has been lost, including all tRNAs. Essential genes are invaded by homing endonuclease-encoding self-splicing introns that may defend against competing viruses. Putative anti-host factors show extensive gene duplication via a genomic accordion indicating an ongoing evolutionary arms race and highlighting the rapid evolution and genomic plasticity that has led to genome gigantism and the enigma that is giant viruses. © 2018, Deeg et al.

September 21, 2019

Chromulinavorax destructans, a pathogenic TM6 bacterium with an unusual replication strategy targeting protist mitochondrion

Most of the diversity of microbial life is not available in culture, and as such we lack even a fundamental understanding of the biological diversity of several branches on the tree of life. One branch that is highly underrepresented is the candidate phylum TM6, also known as the Dependentiae. Their biology is known only from reduced genomes recovered from metagenomes around the world and two isolates infecting amoebae, all of which suggest that they live highly host-associated lifestyles as parasites or symbionts. Chromulinavorax destructans is an isolate from the TM6/Dependentiae that infects and lyses the abundant heterotrophic flagellate, Spumella elongata. Chromulinavorax destructans is characterized by a high degree of reduction and specialization for infection, so much so that it was discovered in a screen for giant viruses. Its 1.2 Mb genome shows no metabolic potential, and C. destructans instead relies on an extensive transporter system to import nutrients, and even energy in the form of ATP, from the host. Accordingly, it replicates in a viral-like fashion, while extensively reorganizing and expanding the host mitochondrion. 44% of proteins contain signal sequences for secretion, including many proteins of unknown function as well as 98 copies of ankyrin-repeat domain proteins, known effectors of host modulation, suggesting the presence of an extensive host-manipulation apparatus.

September 21, 2019

From the inside out: An epibiotic Bdellovibrio predator with an expanded genomic complement

Bdellovibrio and like organisms are abundant environmental predators of prokaryotes that show a diversity of predation strategies, ranging from intra-periplasmic to epibiotic predation. The novel epibiotic predator Bdellovibrio qaytius was isolated from a eutrophic freshwater pond in British Columbia, where it was a continual part of the microbial community. Bdellovibrio qaytius was found to preferentially prey on the beta-proteobacterium Paraburkholderia fungorum. Despite its epibiotic replication strategy, B. qaytius encodes a complex genomic complement more similar to periplasmic predators as well as several biosynthesis pathways not previously found in epibiotic predators. Bdellovibrio qaytius is representative of a widely distributed basal cluster within the genus Bdellovibrio, suggesting that epibiotic predation might be a common predation type in nature and ancestral to the genus.

September 21, 2019

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.
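The seed-selection idea in the abstract above (use the longest reads as seeds, then recruit the remaining reads onto them) can be illustrated with a small sketch. This is not the HGAP implementation: the function name `select_seed_reads`, the `target_coverage` parameter, and the rule of accumulating the longest reads until a target seed coverage is reached are all assumptions made for illustration.

```python
def select_seed_reads(read_lengths, genome_size, target_coverage=30):
    """Pick the longest reads as seeds until they reach a target coverage.

    Hypothetical helper: the coverage-based stopping rule is an assumption,
    not something stated in the abstract.
    Returns (length_cutoff, seed_lengths).
    """
    needed_bases = genome_size * target_coverage
    seeds = []
    total_bases = 0
    # Walk reads from longest to shortest, stopping once enough bases are collected.
    for length in sorted(read_lengths, reverse=True):
        if total_bases >= needed_bases:
            break
        seeds.append(length)
        total_bases += length
    # The shortest selected seed defines the length cutoff.
    cutoff = seeds[-1] if seeds else 0
    return cutoff, seeds


if __name__ == "__main__":
    lengths = [1000, 5000, 3000, 8000, 2000]
    cutoff, seeds = select_seed_reads(lengths, genome_size=1000, target_coverage=10)
    print(cutoff, seeds)
```

Reads shorter than the cutoff would then be aligned to the seeds to build the consensus "preassembled" reads; that step is outside the scope of this sketch.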

September 21, 2019

Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.

Long-read, single-molecule real-time (SMRT) sequencing is routinely used to finish microbial genomes, but available assembly methods have not scaled well to larger genomes. We introduce the MinHash Alignment Process (MHAP) for overlapping noisy, long reads using probabilistic, locality-sensitive hashing. Integrating MHAP with the Celera Assembler enabled reference-grade de novo assemblies of Saccharomyces cerevisiae, Arabidopsis thaliana, Drosophila melanogaster and a human hydatidiform mole cell line (CHM1) from SMRT sequencing. The resulting assemblies are highly continuous, include fully resolved chromosome arms and close persistent gaps in these reference genomes. Our assembly of D. melanogaster revealed previously unknown heterochromatic and telomeric transition sequences, and we assembled low-complexity sequences from CHM1 that fill gaps in the human GRCh38 reference. Using MHAP and the Celera Assembler, single-molecule sequencing can produce de novo near-complete eukaryotic assemblies that are 99.99% accurate when compared with available reference genomes.
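To make the locality-sensitive hashing idea concrete, here is a minimal MinHash sketch for estimating k-mer overlap between two reads. It illustrates the general technique only and is not the MHAP code: the function names, the k-mer size, the number of hash functions, and the use of salted BLAKE2b as the hash family are all assumptions chosen for a self-contained example.

```python
import hashlib


def kmers(seq, k):
    """Return the set of all k-length substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def minhash_sketch(seq, k=8, num_hashes=64):
    """Build a MinHash sketch: the minimum hash value of the k-mer set
    under each of num_hashes differently salted hash functions."""
    kmer_set = kmers(seq, k)
    if not kmer_set:
        raise ValueError("sequence is shorter than k")
    sketch = []
    for seed in range(num_hashes):
        salt = seed.to_bytes(8, "little")  # a distinct salt gives a distinct hash function
        sketch.append(min(
            int.from_bytes(
                hashlib.blake2b(km.encode(), digest_size=8, salt=salt).digest(), "big"
            )
            for km in kmer_set
        ))
    return sketch


def estimate_jaccard(sketch_a, sketch_b):
    """Fraction of sketch positions that agree; this estimates the Jaccard
    similarity of the two underlying k-mer sets."""
    matches = sum(1 for x, y in zip(sketch_a, sketch_b) if x == y)
    return matches / len(sketch_a)


if __name__ == "__main__":
    read_a = "ACGTACGTTAGCCTAGGATCCGATTACAGGCATCGATCGGATCA"
    read_b = read_a[10:] + "TTGACCA"  # overlaps most of read_a
    print(estimate_jaccard(minhash_sketch(read_a), minhash_sketch(read_b)))
```

Comparing fixed-size sketches rather than full k-mer sets is what lets this kind of approach screen candidate overlaps cheaply before any detailed alignment is attempted.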

September 21, 2019

Phased diploid genome assembly with single-molecule real-time sequencing.

While genome assembly projects have been successful in many haploid and inbred species, the assembly of noninbred or rearranged heterozygous genomes remains a major challenge. To address this challenge, we introduce the open-source FALCON and FALCON-Unzip algorithms (https://github.com/PacificBiosciences/FALCON/) to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes. We generate new reference sequences for heterozygous samples including an F1 hybrid of Arabidopsis thaliana, the widely cultivated Vitis vinifera cv. Cabernet Sauvignon, and the coral fungus Clavicorona pyxidata, samples that have challenged short-read assembly approaches. The FALCON-based assemblies are substantially more contiguous and complete than alternate short- or long-read approaches. The phased diploid assembly enabled the study of haplotype structure and heterozygosities between homologous chromosomes, including the identification of widespread heterozygous structural variation within coding sequences.

July 19, 2019

Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia.

Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, and nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, and Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number of repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (such as an additional hydrogenase and the absence of a phosphoenolpyruvate synthase) and substrate utilization pathways (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii. Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems.
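For readers unfamiliar with the G + C statistic quoted in the abstract above, the following sketch shows how such a percentage is typically computed from a sequence. The function name `gc_content` is hypothetical, and the toy input is not data from the study.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases among the A/C/G/T bases.

    Ambiguous symbols (e.g. N) are ignored; this handling is an assumption
    of this sketch, not a convention taken from the paper.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / total


if __name__ == "__main__":
    print(gc_content("ATGCATAT"))  # toy sequence, not from the genome
```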

July 19, 2019

Preparation of next-generation DNA sequencing libraries from ultra-low amounts of input DNA: Application to single-molecule, real-time (SMRT) sequencing on the Pacific Biosciences RS II.

We have developed and validated an amplification-free method for generating DNA sequencing libraries from very low amounts of input DNA (500 picograms to 20 nanograms) for single-molecule sequencing on the Pacific Biosciences (PacBio) RS II sequencer. The common challenge of high input requirements for single-molecule sequencing is overcome by using a carrier DNA in conjunction with optimized sequencing preparation conditions and re-use of the MagBead-bound complex. Here we describe how this method can be used to produce sequencing yields comparable to those generated from standard input amounts while using 1000-fold less starting material.
