July 7, 2019

Combining de novo and reference-guided assembly with scaffold_builder.

Genome sequencing has become routine; however, genome assembly remains a challenge despite the computational advances of the last decade. In particular, the abundance of repeat elements in genomes makes it difficult to assemble them into a single complete sequence. Identical repeats shorter than the average read length can generally be assembled without issue. However, longer repeats such as ribosomal RNA operons cannot be accurately assembled using existing tools. The application Scaffold_builder was designed to generate scaffolds (super-contigs of sequences joined by N-bases) based on similarity to a closely related reference sequence. This approach is independent of mate-pair information and can be used as a complement in genome assembly, e.g. when mate-pairs are not available or have already been exploited. Scaffold_builder was evaluated using simulated pyrosequencing reads of the bacterial genomes Escherichia coli 042, Lactobacillus salivarius UCC118 and Salmonella enterica subsp. enterica serovar Typhi str. P-stx-12. Moreover, we sequenced two genomes from Salmonella enterica serovar Typhimurium LT2 G455 and Salmonella enterica serovar Typhimurium SDT1291 and show that Scaffold_builder decreases the number of contig sequences by 53% while more than doubling their average length. Scaffold_builder is written in Python and is available at http://edwards.sdsu.edu/scaffold_builder. A web-based implementation is additionally provided to allow users to submit a reference genome and a set of contigs to be scaffolded.
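
The idea of joining contigs with N-bases according to their position on a reference can be illustrated with a minimal Python sketch. This is not Scaffold_builder's own code: the function name `build_scaffold`, the `min_gap` parameter and the input format (a reference start coordinate per contig, as an aligner might report it) are assumptions made for illustration only.

```python
def build_scaffold(placements, min_gap=1):
    """Join contigs into one scaffold, filling gaps between them with N.

    placements: list of (ref_start, contig_sequence) tuples, where ref_start
    is the 0-based reference coordinate at which the contig aligns
    (hypothetical input; a real tool would derive it from an alignment).
    """
    pieces = []
    cursor = None  # reference coordinate just past the contigs placed so far
    for ref_start, seq in sorted(placements):
        if cursor is not None:
            gap = ref_start - cursor
            # This sketch does not merge overlapping contigs; overlapping or
            # abutting contigs are separated by a minimal run of N instead.
            pieces.append("N" * max(gap, min_gap))
        pieces.append(seq)
        end = ref_start + len(seq)
        cursor = end if cursor is None else max(cursor, end)
    return "".join(pieces)


if __name__ == "__main__":
    # Two toy contigs placed at reference positions 0 and 10.
    print(build_scaffold([(10, "GGCC"), (0, "ACGT")]))
```

The gap length is taken directly from the reference coordinates, which is only reasonable when the reference is closely related to the sequenced genome, as the abstract assumes.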


July 7, 2019

Development of new methods for the quantitative detection and typing of Lactobacillus parabuchneri in dairy products

Thirty-one isolates of Lactobacillus parabuchneri were obtained from cheese containing histamine; of these, 26 were found to possess the hdcA gene encoding histidine decarboxylase. By analysing the genome data of 13 isolates, specific targets for the development of PCR-based detection and typing systems for L. parabuchneri were identified. The real-time PCR for detection showed a linear quantification over a range of 7 logs and a detection limit of 10 gene equivalents per reaction. The strain typing method utilised the amplification of repeat sequences and showed discrimination comparable with a phylogenetic tree, based on genome comparisons. The method was suitable for detecting and monitoring the development of L. parabuchneri in raw milk and cheese.
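
Linear quantification by real-time PCR is usually expressed as a standard curve relating the quantification cycle (Cq) to the log10 of the template amount. The sketch below shows that calculation under stated assumptions: the function names are hypothetical, and the dilution series in the usage example is synthetic, not data from the study.

```python
import math


def fit_standard_curve(copies, cq):
    """Least-squares fit of Cq = slope * log10(copies) + intercept."""
    x = [math.log10(c) for c in copies]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(cq) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, cq))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def quantify(cq_value, slope, intercept):
    """Invert the standard curve to estimate gene equivalents per reaction."""
    return 10 ** ((cq_value - intercept) / slope)


def amplification_efficiency(slope):
    """Efficiency implied by the slope (1.0 means perfect doubling per cycle)."""
    return 10 ** (-1.0 / slope) - 1.0


if __name__ == "__main__":
    # Synthetic ten-fold dilution series from 10 to 10^8 copies (7 logs),
    # generated from an idealised curve purely for illustration.
    copies = [10 ** k for k in range(1, 9)]
    cq = [38.0 - 3.32 * k for k in range(1, 9)]
    slope, intercept = fit_standard_curve(copies, cq)
    print(slope, intercept, amplification_efficiency(slope))
    print(quantify(24.72, slope, intercept))
```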


July 7, 2019

Detection, isolation and characterization of Fusobacterium gastrosuis sp. nov. colonizing the stomach of pigs.

Nine strains of a novel Fusobacterium sp. were isolated from the stomach of 6-8-month-old and adult pigs. The isolates were obligately anaerobic, although they endured 2 h exposure to air. Phylogenetic analysis based on 16S rRNA and gyrase B genes demonstrated that the isolates showed high sequence similarity with Fusobacterium mortiferum, Fusobacterium ulcerans, Fusobacterium varium, Fusobacterium russii and Fusobacterium necrogenes, but formed a distinct lineage in the genus Fusobacterium. Comparative analysis of the genome of the type strain of this novel Fusobacterium sp. confirmed that it is different from other recognized Fusobacterium spp. DNA-DNA hybridization, fingerprinting and genomic %GC determination further supported the conclusion that the isolates belong to a new, distinct species. The isolates were also distinguishable from these and other Fusobacterium spp. by phenotypical characterization. The strains produced indole and exhibited proline arylamidase and glutamic acid decarboxylase activity. They did not hydrolyse esculin, did not exhibit pyroglutamic acid arylamidase, valine arylamidase, α-galactosidase, β-galactosidase, β-galactosidase-6-phosphate or α-glucosidase activity, nor did they produce acid from cellobiose, glucose, lactose, mannitol, mannose, maltose, raffinose, saccharose, salicin or trehalose. The major fatty acids were C16:0 and C18:1ω9c. The name Fusobacterium gastrosuis sp. nov. is proposed for the novel isolates with the type strain CDW1(T) (=DSM 101753(T)=LMG 29236(T)). We also demonstrated that Clostridium rectum and Fusobacterium mortiferum represent the same species, with nomenclatural priority for the latter. Copyright © 2016 Elsevier GmbH. All rights reserved.


July 7, 2019

First complete genome sequence of Marinilactibacillus piezotolerans strain 15R, a marine lactobacillus isolated from coal-bearing sediment 2.0 kilometers below the seafloor, determined by PacBio single-molecule real-time technology.

Marinilactibacillus piezotolerans strain 15R is a facultatively anaerobic heterotrophic lactobacillus isolated from deep marine subsurface sediment nearly 2 km below the seafloor in the northwestern Pacific. We report here the first whole-genome sequence of strain 15R. The genome sequence comprises 2,767,908 bp with 35.4% G+C content and 2,552 predicted candidate protein-coding sequences, with no identified plasmids. Copyright © 2017 Wei et al.
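
G+C content is the percentage of G and C bases among the unambiguous bases of a sequence. A minimal Python sketch of that calculation follows; the function name `gc_content` is illustrative, and the toy input is not taken from the strain 15R genome.

```python
def gc_content(sequence):
    """Return the G+C percentage of a DNA sequence.

    Ambiguous symbols such as N are ignored, so the denominator counts
    only A, C, G and T.
    """
    seq = sequence.upper()
    gc = sum(1 for base in seq if base in "GC")
    acgt = sum(1 for base in seq if base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / acgt


if __name__ == "__main__":
    print(gc_content("ATGCATGC"))
```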


July 7, 2019

Potential probiotic-associated traits revealed from completed high quality genome sequence of Lactobacillus fermentum 3872.

The article provides an overview of the genomic features of Lactobacillus fermentum strain 3872. The genomic sequence reported here is one of three L. fermentum genome sequences completed to date. Comparative genomic analysis allowed the identification of genes that may be contributing to enhanced probiotic properties of this strain. In particular, the genes encoding putative mucus binding proteins, collagen-binding proteins, class III bacteriocin, as well as exopolysaccharide and prophage-related genes were identified. Genes related to bacterial aggregation and survival under harsh conditions in the gastrointestinal tract, along with the genes required for vitamin production were also found.


July 7, 2019

The complete genome sequence of the yogurt isolate Streptococcus thermophilus ACA-DC 2.

Streptococcus thermophilus ACA-DC 2 is a newly sequenced strain isolated from traditional Greek yogurt. Among the 14 fully sequenced strains of S. thermophilus currently deposited in the NCBI database, the ACA-DC 2 strain has the smallest chromosome, containing 1,731,838 bp. The annotation of its genome revealed the presence of 1,850 genes, including 1,556 protein-coding genes, 70 RNA genes and 224 potential pseudogenes. This large number of pseudogenes was accompanied by the absence of pathogenic features, suggesting evolution of strain ACA-DC 2 through genome decay processes, most probably due to adaptation to the milk ecosystem. Analysis revealed the existence of one complete lactose-galactose operon, several proteolytic enzymes, one exopolysaccharide cluster, stress response genes and four putative antimicrobial peptides. Interestingly, one CRISPR-cas system and one orphan CRISPR, both carrying only one spacer, were predicted, indicating low activity or inactivation of the cas proteins. Nevertheless, four putative restriction-modification systems were determined that may compensate for any deficiencies of the CRISPR-cas system. Furthermore, whole genome phylogeny indicated three distinct clades within S. thermophilus. Comparative analysis among selected strains representative of each clade, including strain ACA-DC 2, revealed a high degree of conservation at the genomic scale, but also strain-specific regions. Unique genes and genomic islands of strain ACA-DC 2 contained a number of genes potentially acquired through horizontal gene transfer events that could be related to important technological properties for dairy starters. Our study suggests genomic traits in strain ACA-DC 2 compatible with the production of dairy fermented foods.


July 7, 2019

An amoebal grazer of cyanobacteria requires cobalamin produced by heterotrophic bacteria.

Amoebae are unicellular eukaryotes that consume microbial prey through phagocytosis, playing a role in shaping microbial foodwebs. Many amoebal species can be cultivated axenically in rich media or monoxenically with single bacterial prey species. Here we characterize heterolobosean amoeba LPG3, a recent natural isolate, which is unable to grow on unicellular cyanobacteria, its primary food source, in the absence of a heterotrophic bacterium, a Pseudomonas species coisolate. To investigate the molecular basis of this requirement for heterotrophic bacteria, we performed a screen using a defined non-redundant transposon library of Vibrio cholerae which implicated genes in corrinoid uptake and biosynthesis. Furthermore, cobalamin synthase deletion mutants in V. cholerae and the Pseudomonas species coisolate do not support growth of amoeba LPG3 on cyanobacteria. While cyanobacteria are robust producers of a corrinoid variant called pseudocobalamin, this variant does not support growth of amoeba LPG3. Instead, we show that it requires cobalamin which is produced by the Pseudomonas species coisolate. The diversity of eukaryotes utilizing corrinoids is poorly understood, and this amoebal corrinoid auxotroph serves as a model for examining predator-prey interactions and micronutrient transfer in bacterivores underpinning microbial foodwebs.

Importance: Cyanobacteria are important primary producers in aquatic environments where they are grazed upon by a variety of phagotrophic protists, and hence have an impact on nutrient flux at the base of microbial foodwebs. Here we characterize amoebal isolate LPG3 which consumes cyanobacteria as its primary food source but that also requires heterotrophic bacteria as a source of corrinoid vitamins. Amoeba LPG3 specifically requires the corrinoid variant produced by the heterotrophic bacteria, and cannot grow on cyanobacteria alone, as they produce a different corrinoid variant. This same corrinoid specificity is also exhibited by other eukaryotes, including humans and algae. This amoebal model system allows us to dissect predator-prey interactions to uncover factors which may shape microbial foodwebs while also providing insight into corrinoid specificity in eukaryotes. Copyright © 2017 American Society for Microbiology.


July 7, 2019

Complete genome sequence of Lactobacillus jensenii strain SNUV360, a probiotic for treatment of bacterial vaginosis isolated from the vagina of a healthy Korean woman.

Lactobacillus jensenii SNUV360 is a potential probiotic strain that shows antimicrobial activity for the treatment of bacterial vaginosis. Here, we present the complete genomic sequence of L. jensenii SNUV360, isolated from a vaginal sample from a healthy Korean woman. Analysis of the sequence may provide insight into its functional activity. Copyright © 2017 Lee et al.


July 7, 2019

The histidine decarboxylase gene cluster of Lactobacillus parabuchneri was gained by horizontal gene transfer and is mobile within the species.

Histamine in food can cause intolerance reactions in consumers. Lactobacillus parabuchneri (L. parabuchneri) is one of the major causes of elevated histamine levels in cheese. Despite its significant economic impact and negative influence on human health, no genomic study has been published so far. We sequenced and analyzed 18 L. parabuchneri strains of which 12 were histamine positive and 6 were histamine negative. We determined the complete genome of the histamine positive strain FAM21731 with PacBio as well as Illumina and the genomes of the remaining 17 strains using the Illumina technology. We developed the synteny aware ortholog finding algorithm SynOrf to compare the genomes and we show that the histidine decarboxylase (HDC) gene cluster is located in a genomic island. It is very likely that the HDC gene cluster was transferred from other lactobacilli, as it is highly conserved within several lactobacilli species. Furthermore, we have evidence that the HDC gene cluster was transferred within the L. parabuchneri species.


July 7, 2019

Genomic changes associated with the evolutionary transition of an insect gut symbiont into a blood-borne pathogen.

The genus Bartonella comprises facultative intracellular bacteria with a unique lifestyle. After transmission by blood-sucking arthropods they colonize the erythrocytes of mammalian hosts causing acute and chronic infectious diseases. Although the pathogen-host interaction is well understood, little is known about the evolutionary origin of the infection strategy manifested by Bartonella species. Here we analyzed six genomes of Bartonella apis, a honey bee gut symbiont that to date represents the closest relative of pathogenic Bartonella species. Comparative genomics revealed that B. apis encodes a large set of vertically inherited genes for amino acid and cofactor biosynthesis and nitrogen metabolism. Most pathogenic bartonellae have lost these ancestral functions, but acquired specific virulence factors and expanded a vertically inherited gene family for harvesting cofactors from the blood. However, the deeply rooted pathogen Bartonella tamiae has retained many of the ancestral genome characteristics reflecting an evolutionary intermediate state toward a host-restricted intraerythrocytic lifestyle. Our findings suggest that the ancestor of the pathogen Bartonella was a gut symbiont of insects and that the adaptation to blood-feeding insects facilitated colonization of the mammalian bloodstream. This study highlights the importance of comparative genomics among pathogens and non-pathogenic relatives to understand disease emergence within an evolutionary-ecological framework.


July 7, 2019

The hidden perils of read mapping as a quality assessment tool in genome sequencing.

This article provides a comparative analysis of genome sequencing methods, focusing on verification of assembly quality. The results of a comparative assessment of various de novo assembly tools, as well as sequencing technologies, are presented using a recently completed sequence of the genome of Lactobacillus fermentum 3872. In particular, the quality of assemblies is assessed using CLC Genomics Workbench read mapping and optical mapping developed by OpGen. Over-extension of contigs without prior knowledge of contig location can lead to misassembled contigs, even when commonly used quality indicators such as read mapping suggest that a contig is well assembled. Precautions must also be taken when using long read sequencing technology, which may also lead to misassembled contigs.

