Menu
July 7, 2019

Isolation and complete genome sequence of the thermophilic Geobacillus sp. 12AMOR1 from an Arctic deep-sea hydrothermal vent site.

Members of the genus Geobacillus have been isolated from a wide variety of habitats worldwide and are the subject for targeted enzyme utilization in various industrial applications. Here we report the isolation and complete genome sequence of the thermophilic starch-degrading Geobacillus sp. 12AMOR1. The strain 12AMOR1 was isolated from deep-sea hot sediment at the Jan Mayen hydrothermal Vent Site. Geobacillus sp. 12AMOR1 consists of a 3,410,035 bp circular chromosome and a 32,689 bp plasmid with a G?+?C content of 52 % and 47 %, respectively. The genome comprises 3323 protein-coding genes, 88 tRNA species and 10 rRNA operons. The isolate grows on a suite of sugars, complex polysaccharides and proteinous carbon sources. Accordingly, a versatility of genes encoding carbohydrate-active enzymes (CAZy) and peptidases were identified in the genome. Expression, purification and characterization of an enzyme of the glycoside hydrolase family 13 revealed a starch-degrading capacity and high thermal stability with a melting temperature of 76.4 °C. Altogether, the data obtained point to a new isolate from a marine hydrothermal vent with a large bioprospecting potential.


July 7, 2019

PEPR: pipelines for evaluating prokaryotic references.

The rapid adoption of microbial whole genome sequencing in public health, clinical testing, and forensic laboratories requires the use of validated measurement processes. Well-characterized, homogeneous, and stable microbial genomic reference materials can be used to evaluate measurement processes, improving confidence in microbial whole genome sequencing results. We have developed a reproducible and transparent bioinformatics tool, PEPR, Pipelines for Evaluating Prokaryotic References, for characterizing the reference genome of prokaryotic genomic materials. PEPR evaluates the quality, purity, and homogeneity of the reference material genome, and purity of the genomic material. The quality of the genome is evaluated using high coverage paired-end sequence data; coverage, paired-end read size and direction, as well as soft-clipping rates, are used to identify mis-assemblies. The homogeneity and purity of the material relative to the reference genome are characterized by comparing base calls from replicate datasets generated using multiple sequencing technologies. Genomic purity of the material is assessed by checking for DNA contaminants. We demonstrate the tool and its output using sequencing data while developing a Staphylococcus aureus candidate genomic reference material. PEPR is open source and available at https://github.com/usnistgov/pepr .


July 7, 2019

Structural and functional analysis of the finished genome of the recently isolated toxic Anabaena sp. WA102.

Very few closed genomes of the cyanobacteria that commonly produce toxic blooms in lakes and reservoirs are available, limiting our understanding of the properties of these organisms. A new anatoxin-a-producing member of the Nostocaceae, Anabaena sp. WA102, was isolated from a freshwater lake in Washington State, USA, in 2013 and maintained in non-axenic culture.The Anabaena sp. WA102 5.7 Mbp genome assembly has been closed with long-read, single-molecule sequencing and separately a draft genome assembly has been produced with short-read sequencing technology. The closed and draft genome assemblies are compared, showing a correlation between long repeats in the genome and the many gaps in the short-read assembly. Anabaena sp. WA102 encodes anatoxin-a biosynthetic genes, as does its close relative Anabaena sp. AL93 (also introduced in this study). These strains are distinguished by differences in the genes for light-harvesting phycobilins, with Anabaena sp. AL93 possessing a phycoerythrocyanin operon. Biologically relevant structural variants in the Anabaena sp. WA102 genome were detected only by long-read sequencing: a tandem triplication of the anaBCD promoter region in the anatoxin-a synthase gene cluster (not triplicated in Anabaena sp. AL93) and a 5-kbp deletion variant present in two-thirds of the population. The genome has a large number of mobile elements (160). Strikingly, there was no synteny with the genome of its nearest fully assembled relative, Anabaena sp. 90.Structural and functional genome analyses indicate that Anabaena sp. WA102 has a flexible genome. Genome closure, which can be readily achieved with long-read sequencing, reveals large scale (e.g., gene order) and local structural features that should be considered in understanding genome evolution and function.


July 7, 2019

Reply to Bemm et al. and Arakawa: Identifying foreign genes in independent Hypsibius dujardini genome assemblies.

Our report (1) describing the discovery of extensive horizontal gene transfer in a tardigrade genome has raised questions from other groups who were sequencing the Hypsibius dujardini genome in parallel or who have done new experiments and analyses since our report (2??–5). Bemm et al. (2) now report filtering our data for likely contaminants, resulting in a new, prefiltered genome assembly. Arakawa (3) has sequenced genomes of starved, washed, individual animals that had been treated with antibiotics for 48 h, and used this genomic sequence and RNA-Seq data to identify likely bona fide tardigrade contigs. Two other reports have contributed data and analysis: Delmont and Eren (4) used a newly published analysis and visualization platform, Anvi’o (6), to identify likely contaminants in our genome assembly, and Koutsovoulos et al. (5) applied useful taxon-annotated GC coverage plots (Blobplots) (7) to our data and reported an independent genome assembly.


July 7, 2019

Complete genome sequence of Vibrio vulnificus FORC_017 isolated from a patient with a hemorrhagic rash after consuming raw dotted gizzard shad.

Vibrio vulnificus, a resident in the human gut, is frequently found in seafood, causing food-borne illnesses including gastroenteritis and severe septicemia. While V. vulnificus has been known to be one of the major food-borne pathogens, pathogenicity and virulence factors are not fully understood yet. To extend our understanding of the pathogenesis of V. vulnificus at the genomic level, the genome of V. vulnificus FORC_017 isolated from a female patient experiencing a hemorrhagic rash was completely sequenced and analyzed.Three discontinuous contigs were generated from a hybrid assembly using Illumina MiSeq and PacBio platforms, revealing that the genome of the FORC_017 consists of two circular chromosomes and a plasmid. Chromosome I consists of 3,253,417-bp (GC content 46.49 %) containing 2943 predicted open reading frames (ORFs) and chromosome II of 1,905,745-bp (GC content 46.90 %) containing 1638 ORFs. The plasmid pFORC17 consists of 70,069-bp (GC content 43.77 %) containing 84 ORFs. The average nucleotide identity (ANI) value of the FORC_017 and CMCP6 strains was 98.53, suggesting that they are closely related.Pathogenesis-associated genes including vvhA, rtx gene cluster, and various hemolysin genes were present in FORC_017. In addition, three complete secretion systems (Type I, II and VI) as well as iron uptake-related genes for virulence of the FORC_017 were detected, suggesting that this strain is pathogenic. Further comparative genome analysis revealed that FORC_017 and CMCP6 share major toxin genes including vvhA and rtx for pathogenesis activities. The genome information of the FORC_017 provides novel insights into pathogenicity and virulence factors of V. vulnificus.


July 7, 2019

Genomics-informed isolation and characterization of a symbiotic Nanoarchaeota system from a terrestrial geothermal environment.

Biological features can be inferred, based on genomic data, for many microbial lineages that remain uncultured. However, cultivation is important for characterizing an organism’s physiology and testing its genome-encoded potential. Here we use single-cell genomics to infer cultivation conditions for the isolation of an ectosymbiotic Nanoarchaeota (‘Nanopusillus acidilobi’) and its host (Acidilobus, a crenarchaeote) from a terrestrial geothermal environment. The cells of ‘Nanopusillus’ are among the smallest known cellular organisms (100-300?nm). They appear to have a complete genetic information processing machinery, but lack almost all primary biosynthetic functions as well as respiration and ATP synthesis. Genomic and proteomic comparison with its distant relative, the marine Nanoarchaeum equitans illustrate an ancient, common evolutionary history of adaptation of the Nanoarchaeota to ectosymbiosis, so far unique among the Archaea.


July 7, 2019

Complete genome of Nitrosospira briensis C-128, an ammonia-oxidizing bacterium from agricultural soil.

Nitrosospira briensis C-128 is an ammonia-oxidizing bacterium isolated from an acid agricultural soil. N. briensis C-128 was sequenced with PacBio RS technologies at the DOE-Joint Genome Institute through their Community Science Program (2010). The high-quality finished genome contains one chromosome of 3.21 Mb and no plasmids. We identified 3073 gene models, 3018 of which are protein coding. The two-way average nucleotide identity between the chromosomes of Nitrosospira multiformis ATCC 25196 and Nitrosospira briensis C-128 was found to be 77.2 %. Multiple copies of modules encoding chemolithotrophic metabolism were identified in their genomic context. The gene inventory supports chemolithotrophic metabolism with implications for function in soil environments.


July 7, 2019

Challenges, solutions, and quality metrics of personal genome assembly in advancing precision medicine.

Even though each of us shares more than 99% of the DNA sequences in our genome, there are millions of sequence codes or structure in small regions that differ between individuals, giving us different characteristics of appearance or responsiveness to medical treatments. Currently, genetic variants in diseased tissues, such as tumors, are uncovered by exploring the differences between the reference genome and the sequences detected in the diseased tissue. However, the public reference genome was derived with the DNA from multiple individuals. As a result of this, the reference genome is incomplete and may misrepresent the sequence variants of the general population. The more reliable solution is to compare sequences of diseased tissue with its own genome sequence derived from tissue in a normal state. As the price to sequence the human genome has dropped dramatically to around $1000, it shows a promising future of documenting the personal genome for every individual. However, de novo assembly of individual genomes at an affordable cost is still challenging. Thus, till now, only a few human genomes have been fully assembled. In this review, we introduce the history of human genome sequencing and the evolution of sequencing platforms, from Sanger sequencing to emerging “third generation sequencing” technologies. We present the currently available de novo assembly and post-assembly software packages for human genome assembly and their requirements for computational infrastructures. We recommend that a combined hybrid assembly with long and short reads would be a promising way to generate good quality human genome assemblies and specify parameters for the quality assessment of assembly outcomes. We provide a perspective view of the benefit of using personal genomes as references and suggestions for obtaining a quality personal genome. Finally, we discuss the usage of the personal genome in aiding vaccine design and development, monitoring host immune-response, tailoring drug therapy and detecting tumors. We believe the precision medicine would largely benefit from bioinformatics solutions, particularly for personal genome assembly.


July 7, 2019

SAR11 bacteria linked to ocean anoxia and nitrogen loss.

Bacteria of the SAR11 clade constitute up to one half of all microbial cells in the oxygen-rich surface ocean. SAR11 bacteria are also abundant in oxygen minimum zones (OMZs), where oxygen falls below detection and anaerobic microbes have vital roles in converting bioavailable nitrogen to N2 gas. Anaerobic metabolism has not yet been observed in SAR11, and it remains unknown how these bacteria contribute to OMZ biogeochemical cycling. Here, genomic analysis of single cells from the world’s largest OMZ revealed previously uncharacterized SAR11 lineages with adaptations for life without oxygen, including genes for respiratory nitrate reductases (Nar). SAR11 nar genes were experimentally verified to encode proteins catalysing the nitrite-producing first step of denitrification and constituted ~40% of OMZ nar transcripts, with transcription peaking in the anoxic zone of maximum nitrate reduction activity. These results link SAR11 to pathways of ocean nitrogen loss, redefining the ecological niche of Earth’s most abundant organismal group.


July 7, 2019

Genomic characterization of the Atlantic cod sex-locus.

A variety of sex determination mechanisms can be observed in evolutionary divergent teleosts. Sex determination is genetic in Atlantic cod (Gadus morhua), however the genomic location or size of its sex-locus is unknown. Here, we characterize the sex-locus of Atlantic cod using whole genome sequence (WGS) data of 227 wild-caught specimens. Analyzing more than 55 million polymorphic loci, we identify 166 loci that are associated with sex. These loci are located in six distinct regions on five different linkage groups (LG) in the genome. The largest of these regions, an approximately 55?Kb region on LG11, contains the majority of genotypes that segregate closely according to a XX-XY system. Genotypes in this region can be used genetically determine sex, whereas those in the other regions are inconsistently sex-linked. The identified region on LG11 and its surrounding genes have no clear sequence homology with genes or regulatory elements associated with sex-determination or differentiation in other species. The functionality of this sex-locus therefore remains unknown. The WGS strategy used here proved adequate for detecting the small regions associated with sex in this species. Our results highlight the evolutionary flexibility in genomic architecture underlying teleost sex-determination and allow practical applications to genetically sex Atlantic cod.


July 7, 2019

Evidence for an opportunistic and endophytic lifestyle of the Bursaphelenchus xylophilus-associated bacteria Serratia marcescens PWN146 isolated from wilting Pinus pinaster.

Pine wilt disease (PWD) results from the interaction of three elements: the pathogenic nematode, Bursaphelenchus xylophilus; the insect-vector, Monochamus sp.; and the host tree, mostly Pinus species. Bacteria isolated from B. xylophilus may be a fourth element in this complex disease. However, the precise role of bacteria in this interaction is unclear as both plant-beneficial and as plant-pathogenic bacteria may be associated with PWD. Using whole genome sequencing and phenotypic characterization, we were able to investigate in more detail the genetic repertoire of Serratia marcescens PWN146, a bacterium associated with B. xylophilus. We show clear evidence that S. marcescens PWN146 is able to withstand and colonize the plant environment, without having any deleterious effects towards a susceptible host (Pinus thunbergii), B. xylophilus nor to the nematode model C. elegans. This bacterium is able to tolerate growth in presence of xenobiotic/organic compounds, and use phenylacetic acid as carbon source. Furthermore, we present a detailed list of S. marcescens PWN146 potentials to interfere with plant metabolism via hormonal pathways and/or nutritional acquisition, and to be competitive against other bacteria and/or fungi in terms of resource acquisition or production of antimicrobial compounds. Further investigation is required to understand the role of bacteria in PWD. We have now reinforced the theory that B. xylophilus-associated bacteria may have a plant origin.


July 7, 2019

Comparative genomics and transcriptomics of Pichia pastoris.

Pichia pastoris has emerged as an important alternative host for producing recombinant biopharmaceuticals, owing to its high cultivation density, low host cell protein burden, and the development of strains with humanized glycosylation. Despite its demonstrated utility, relatively little strain engineering has been performed to improve Pichia, due in part to the limited number and inconsistent frameworks of reported genomes and transcriptomes. Furthermore, the co-mingling of genomic, transcriptomic and fermentation data collected about Komagataella pastoris and Komagataella phaffii, the two strains co-branded as Pichia, has generated confusion about host performance for these genetically distinct species. Generation of comparative high-quality genomes and transcriptomes will enable meaningful comparisons between the organisms, and potentially inform distinct biotechnological utilies for each species.Here, we present a comprehensive and standardized comparative analysis of the genomic features of the three most commonly used strains comprising the tradename Pichia: K. pastoris wild-type, K. phaffii wild-type, and K. phaffii GS115. We used a combination of long-read (PacBio) and short-read (Illumina) sequencing technologies to achieve over 1000X coverage of each genome. Construction of individual genomes was then performed using as few as seven individual contigs to create gap-free assemblies. We found substantial syntenic rearrangements between the species and characterized a linear plasmid present in K. phaffii. Comparative analyses between K. phaffii genomes enabled the characterization of the mutational landscape of the GS115 strain. We identified and examined 35 non-synonomous coding mutations present in GS115, many of which are likely to impact strain performance. Additionally, we investigated transcriptomic profiles of gene expression for both species during cultivation on various carbon sources. We observed that the most highly transcribed genes in both organisms were consistently highly expressed in all three carbon sources examined. We also observed selective expression of certain genes in each carbon source, including many sequences not previously reported as promoters for expression of heterologous proteins in yeasts.Our studies establish a foundation for understanding critical relationships between genome structure, cultivation conditions and gene expression. The resources we report here will inform and facilitate rational, organism-wide strain engineering for improved utility as a host for protein production.


July 7, 2019

Isolation and genomic characterization of ‘Desulfuromonas soudanensis WTL’, a metal- and electrode-respiring bacterium from anoxic deep subsurface brine.

Reaching a depth of 713 m below the surface, the Soudan Underground Iron Mine (Soudan, MN, USA) transects a massive Archaean (2.7 Ga) banded iron formation, providing a remarkably accessible window into the terrestrial deep biosphere. Despite organic carbon limitation, metal-reducing microbial communities are present in potentially ancient anoxic brines continuously emanating from exploratory boreholes on Level 27. Using graphite electrodes deposited in situ as bait, we electrochemically enriched and isolated a novel halophilic iron-reducing Deltaproteobacterium, ‘Desulfuromonas soudanensis’ strain WTL, from an acetate-fed three-electrode bioreactor poised at +0.24 V (vs. standard hydrogen electrode). Cyclic voltammetry revealed that ‘D. soudanensis’ releases electrons at redox potentials approximately 100 mV more positive than the model freshwater surface isolate Geobacter sulfurreducens, suggesting that its extracellular respiration is tuned for higher potential electron acceptors. ‘D. soudanensis’ contains a 3,958,620-bp circular genome, assembled to completion using single-molecule real-time (SMRT) sequencing reads, which encodes a complete TCA cycle, 38 putative multiheme c-type cytochromes, one of which contains 69 heme-binding motifs, and a LuxI/LuxR quorum sensing cassette that produces an unidentified N-acyl homoserine lactone. Another cytochrome is predicted to lie within a putative prophage, suggesting that horizontal gene transfer plays a role in respiratory flexibility among metal reducers. Isolation of ‘D. soudanensis’ underscores the utility of electrode-based approaches for enriching rare metal reducers from a wide range of habitats.


July 7, 2019

Whole-genome sequencing recommendations

Recent technological developments have revolutionized the way we perform genetic analyses. In particular whole-genome sequencing provides access to the entire genetic makeup of an individual, and it is now an affordable approach for many research groups. As a consequence genome sequencing is pervading many fields of biological research. Sequencing technologies are evolving rapidly and so do their applications. Here we provide a first primer on whole-genome sequencing, focusing on two of the most popular applications: (1) de novo genome sequencing, in which the objective is obtaining a high-quality genome assembly that can serve as a reference for a species or variety, and (2) genome resequencing, when there is an available reference genome and the objective is to map sequence variation of an individual or a set of individuals. It is not our intention to provide a comprehensive overview of current methodologies that will likely soon become obsolete, but rather focus on general principles that will have a more general applicability.


July 7, 2019

Use of genomic approaches in understanding the role of Actinomycetes as PGP in grain legumes

The advancement in molecular technologies has given a breakthrough to explore the untapped and novel microbial isolates for characterization in every aspect as we can consider microbes as an important primary natural store house for key secondary metabolites and enzymes. Actinomycetes are the most fruitful source of microorganisms for all types of bioactive secondary metabolites, including agroactive-antibiotic molecules that are best recognized and most valuable for their role in agriculture and industries. In agriculture, actinomycetes are used as biocontrol agents against some pests and pathogenic organisms as well as plant growth-promoting (PGP) agents for crops. Use of different molecular methods, e.g., metagenomics, metatranscriptomics, genetic fingerprinting, proteogenomics, and metaproteomics, are more significant for classifying and discovering the immense diversity in microbial population and for understanding their interactions with other abiotic and biotic environmental elements. The opportunity of accessing inexpensive sequencing techniques has led to the assemblies of copious genomic data for actinomycetes, such as Streptomyces and related species, with the goal of discovering novel bioactive metabolic and their utility as PGP; however, the use of actinomycetes in agriculture using genomic approaches is in its initial stages.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.