Menu
July 7, 2019

Representing genetic variation with synthetic DNA standards.

The identification of genetic variation with next-generation sequencing is confounded by the complexity of the human genome sequence and by biases that arise during library preparation, sequencing and analysis. We have developed a set of synthetic DNA standards, termed ‘sequins’, that emulate human genetic features and constitute qualitative and quantitative spike-in controls for genome sequencing. Sequencing reads derived from sequins align exclusively to an artificial in silico reference chromosome, rather than the human reference genome, which allows them them to be partitioned for parallel analysis. Here we use this approach to represent common and clinically relevant genetic variation, ranging from single nucleotide variants to large structural rearrangements and copy-number variation. We validate the design and performance of sequin standards by comparison to examples in the NA12878 reference genome, and we demonstrate their utility during the detection and quantification of variants. We provide sequins as a standardized, quantitative resource against which human genetic variation can be measured and diagnostic performance assessed.


July 7, 2019

Comparative genomics and transcriptomics of Pichia pastoris.

Pichia pastoris has emerged as an important alternative host for producing recombinant biopharmaceuticals, owing to its high cultivation density, low host cell protein burden, and the development of strains with humanized glycosylation. Despite its demonstrated utility, relatively little strain engineering has been performed to improve Pichia, due in part to the limited number and inconsistent frameworks of reported genomes and transcriptomes. Furthermore, the co-mingling of genomic, transcriptomic and fermentation data collected about Komagataella pastoris and Komagataella phaffii, the two strains co-branded as Pichia, has generated confusion about host performance for these genetically distinct species. Generation of comparative high-quality genomes and transcriptomes will enable meaningful comparisons between the organisms, and potentially inform distinct biotechnological utilies for each species.Here, we present a comprehensive and standardized comparative analysis of the genomic features of the three most commonly used strains comprising the tradename Pichia: K. pastoris wild-type, K. phaffii wild-type, and K. phaffii GS115. We used a combination of long-read (PacBio) and short-read (Illumina) sequencing technologies to achieve over 1000X coverage of each genome. Construction of individual genomes was then performed using as few as seven individual contigs to create gap-free assemblies. We found substantial syntenic rearrangements between the species and characterized a linear plasmid present in K. phaffii. Comparative analyses between K. phaffii genomes enabled the characterization of the mutational landscape of the GS115 strain. We identified and examined 35 non-synonomous coding mutations present in GS115, many of which are likely to impact strain performance. Additionally, we investigated transcriptomic profiles of gene expression for both species during cultivation on various carbon sources. We observed that the most highly transcribed genes in both organisms were consistently highly expressed in all three carbon sources examined. We also observed selective expression of certain genes in each carbon source, including many sequences not previously reported as promoters for expression of heterologous proteins in yeasts.Our studies establish a foundation for understanding critical relationships between genome structure, cultivation conditions and gene expression. The resources we report here will inform and facilitate rational, organism-wide strain engineering for improved utility as a host for protein production.


July 7, 2019

Isolation and genomic characterization of ‘Desulfuromonas soudanensis WTL’, a metal- and electrode-respiring bacterium from anoxic deep subsurface brine.

Reaching a depth of 713 m below the surface, the Soudan Underground Iron Mine (Soudan, MN, USA) transects a massive Archaean (2.7 Ga) banded iron formation, providing a remarkably accessible window into the terrestrial deep biosphere. Despite organic carbon limitation, metal-reducing microbial communities are present in potentially ancient anoxic brines continuously emanating from exploratory boreholes on Level 27. Using graphite electrodes deposited in situ as bait, we electrochemically enriched and isolated a novel halophilic iron-reducing Deltaproteobacterium, ‘Desulfuromonas soudanensis’ strain WTL, from an acetate-fed three-electrode bioreactor poised at +0.24 V (vs. standard hydrogen electrode). Cyclic voltammetry revealed that ‘D. soudanensis’ releases electrons at redox potentials approximately 100 mV more positive than the model freshwater surface isolate Geobacter sulfurreducens, suggesting that its extracellular respiration is tuned for higher potential electron acceptors. ‘D. soudanensis’ contains a 3,958,620-bp circular genome, assembled to completion using single-molecule real-time (SMRT) sequencing reads, which encodes a complete TCA cycle, 38 putative multiheme c-type cytochromes, one of which contains 69 heme-binding motifs, and a LuxI/LuxR quorum sensing cassette that produces an unidentified N-acyl homoserine lactone. Another cytochrome is predicted to lie within a putative prophage, suggesting that horizontal gene transfer plays a role in respiratory flexibility among metal reducers. Isolation of ‘D. soudanensis’ underscores the utility of electrode-based approaches for enriching rare metal reducers from a wide range of habitats.


July 7, 2019

Whole genomic sequence analysis of Bacillus infantis: defining the genetic blueprint of strain NRRL B-14911, an emerging cardiopathogenic microbe.

We recently reported the identification of Bacillus sp. NRRL B-14911 that induces heart autoimmunity by generating cardiac-reactive T cells through molecular mimicry. This marine bacterium was originally isolated from the Gulf of Mexico, but no associations with human diseases were reported. Therefore, to characterize its biological and medical significance, we sought to determine and analyze the complete genome sequence of Bacillus sp. NRRL B-14911.Based on the phylogenetic analysis of 16S ribosomal RNA (rRNA) genes, sequence analysis of the 16S-23S rDNA intergenic transcribed spacers, phenotypic microarray, and matrix-assisted laser desorption ionization time-of-flight mass spectrometry, we propose that this organism belongs to the species Bacillus infantis, previously shown to be associated with sepsis in a newborn child. Analysis of the complete genome of Bacillus sp. NRRL B-14911 revealed several virulence factors including adhesins, invasins, colonization factors, siderophores and transporters. Likewise, the bacterial genome encodes a wide range of methyl transferases, transporters, enzymatic and biochemical pathways, and insertion sequence elements that are distinct from other closely related bacilli.The complete genome sequence of Bacillus sp. NRRL B-14911 provided in this study may facilitate genetic manipulations to assess gene functions associated with bacterial survival and virulence. Additionally, this bacterium may serve as a useful tool to establish a disease model that permits systematic analysis of autoimmune events in various susceptible rodent strains.


July 7, 2019

Large-scale maps of variable infection efficiencies in aquatic Bacteroidetes phage-host model systems.

Microbes drive ecosystem functioning and their viruses modulate these impacts through mortality, gene transfer and metabolic reprogramming. Despite the importance of virus-host interactions and likely variable infection efficiencies of individual phages across hosts, such variability is seldom quantified. Here, we quantify infection efficiencies of 38 phages against 19 host strains in aquatic Cellulophaga (Bacteroidetes) phage-host model systems. Binary data revealed that some phages infected only one strain while others infected 17, whereas quantitative data revealed that efficiency of infection could vary 10 orders of magnitude, even among phages within one population. This provides a baseline for understanding and modeling intrapopulation host range variation. Genera specific host ranges were also informative. For example, the Cellulophaga Microviridae, showed a markedly broader intra-species host range than previously observed in Escherichia coli systems. Further, one phage genus, Cba41, was examined to investigate nonheritable changes in plating efficiency and burst size that depended on which host strain it most recently infected. While consistent with host modification of phage DNA, no differences in nucleotide sequence or DNA modifications were detected, leaving the observation repeatable, but the mechanism unresolved. Overall, this study highlights the importance of quantitatively considering replication variations in studies of phage-host interactions. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.


July 7, 2019

Salmonella degrades the host glycocalyx leading to altered infection and glycan remodeling.

Complex glycans cover the gut epithelial surface to protect the cell from the environment. Invasive pathogens must breach the glycan layer before initiating infection. While glycan degradation is crucial for infection, this process is inadequately understood. Salmonella contains 47 glycosyl hydrolases (GHs) that may degrade the glycan. We hypothesized that keystone genes from the entire GH complement of Salmonella are required to degrade glycans to change infection. This study determined that GHs recognize the terminal monosaccharides (N-acetylneuraminic acid (Neu5Ac), galactose, mannose, and fucose) and significantly (p?


July 7, 2019

Next-generation sequencing: a diagnostic one-stop shop for Hepatitis C?

Before starting chronic hepatitis C treatment, the viral genotype/subtype has to be accurately determined and potentially coupled with drug resistance testing. Due to the high genetic variability of the hepatitis C virus, this can be a demanding task that can potentially be streamlined by viral whole-genome sequencing using next-generation sequencing as demonstrated by an article in this issue of the Journal of Clinical Microbiology by E. Thomson, C. L. C. Ip, A. Badhan, M. T. Christiansen, W. Adamson, et al. (J Clin Microbiol. 54:2455-2469, 2016, http://dx.doi.org/10.1128/JCM.00330-16). Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Complete genome sequence of Spiroplasma turonicum Tab4cT, a bacterium isolated from horse flies (Haematopota sp.).

Spiroplasma turonicum Tab4c(T) was isolated from a horse fly (Haematopota sp.; probably Haematopota pluvialis) collected at Champchevrier, Indre-et-Loire, Touraine, France, in 1991. Here, we report the complete genome sequence of this bacterium to facilitate the investigation of its biology and the comparative genomics among Spiroplasma spp. Copyright © 2016 Lo et al.


July 7, 2019

Comparative methylome analysis of the occasional ruminant respiratory pathogen Bibersteinia trehalosi.

We examined and compared both the methylomes and the modification-related gene content of four sequenced strains of Bibersteinia trehalosi isolated from the nasopharyngeal tracts of Nebraska cattle with symptoms of bovine respiratory disease complex. The methylation patterns and the encoded DNA methyltransferase (MTase) gene sets were different between each strain, with the only common pattern being that of Dam (GATC). Among the observed patterns were three novel motifs attributable to Type I restriction-modification systems. In some cases the differences in methylation patterns corresponded to the gain or loss of MTase genes, or to recombination at target recognition domains that resulted in changes of enzyme specificity. However, in other cases the differences could be attributed to differential expression of the same MTase gene across strains. The most obvious regulatory mechanism responsible for these differences was slipped strand mispairing within short sequence repeat regions. The combined action of these evolutionary forces allows for alteration of different parts of the methylome at different time scales. We hypothesize that pleiotropic transcriptional modulation resulting from the observed methylomic changes may be involved with the switch between the commensal and pathogenic states of this common member of ruminant microflora.


July 7, 2019

Sequence assembly of Yarrowia lipolytica strain W29/CLIB89 shows transposable element diversity.

Yarrowia lipolytica, an oleaginous yeast, is capable of accumulating significant cellular mass in lipid making it an important source of biosustainable hydrocarbon-based chemicals. In spite of a similar number of protein-coding genes to that in other Hemiascomycetes, the Y. lipolytica genome is almost double that of model yeasts. Despite its economic importance and several distinct strains in common use, an independent genome assembly exists for only one strain. We report here a de novo annotated assembly of the chromosomal genome of an industrially-relevant strain, W29/CLIB89, determined by hybrid next-generation sequencing. For the first time, each Y. lipolytica chromosome is represented by a single contig. The telomeric rDNA repeats were localized by Irys long-range genome mapping and one complete copy of the rDNA sequence is reported. Two large structural variants and retroelement differences with reference strain CLIB122 including a full-length, novel Ty3/Gypsy long terminal repeat (LTR) retrotransposon and multiple LTR-like sequences are described. Strikingly, several of these are adjacent to RNA polymerase III-transcribed genes, which are almost double in number in Y. lipolytica compared to other Hemiascomycetes. In addition to previously-reported dimeric RNA polymerase III-transcribed genes, tRNA pseudogenes were identified. Multiple full-length and truncated LINE elements are also present. Therefore, although identified transposons do not constitute a significant fraction of the Y. lipolytica genome, they could have played an active role in its evolution. Differences between the sequence of this strain and of the existing reference strain underscore the utility of an additional independent genome assembly for this economically important organism.


July 7, 2019

A viral immunity chromosome in the marine picoeukaryote, Ostreococcus tauri.

Micro-algae of the genus Ostreococcus and related species of the order Mamiellales are globally distributed in the photic zone of world’s oceans where they contribute to fixation of atmospheric carbon and production of oxygen, besides providing a primary source of nutrition in the food web. Their tiny size, simple cells, ease of culture, compact genomes and susceptibility to the most abundant large DNA viruses in the sea render them attractive as models for integrative marine biology. In culture, spontaneous resistance to viruses occurs frequently. Here, we show that virus-producing resistant cell lines arise in many independent cell lines during lytic infections, but over two years, more and more of these lines stop producing viruses. We observed sweeping over-expression of all genes in more than half of chromosome 19 in resistant lines, and karyotypic analyses showed physical rearrangements of this chromosome. Chromosome 19 has an unusual genetic structure whose equivalent is found in all of the sequenced genomes in this ecologically important group of green algae.


July 7, 2019

Genomic analysis reveals novel diversity among the 1976 Philadelphia Legionnaires’ disease outbreak isolates and additional ST36 strains.

Legionella pneumophila was first recognized as a cause of severe and potentially fatal pneumonia during a large-scale outbreak of Legionnaires’ disease (LD) at a Pennsylvania veterans’ convention in Philadelphia, 1976. The ensuing investigation and recovery of four clinical isolates launched the fields of Legionella epidemiology and scientific research. Only one of the original isolates, “Philadelphia-1”, has been widely distributed or extensively studied. Here we describe the whole-genome sequencing (WGS), complete assembly, and comparative analysis of all Philadelphia LD strains recovered from that investigation, along with L. pneumophila isolates sharing the Philadelphia sequence type (ST36). Analyses revealed that the 1976 outbreak was due to multiple serogroup 1 strains within the same genetic lineage, differentiated by an actively mobilized, self-replicating episome that is shared with L. pneumophila str. Paris, and two large, horizontally-transferred genomic loci, among other polymorphisms. We also found a completely unassociated ST36 strain that displayed remarkable genetic similarity to the historical Philadelphia isolates. This similar strain implies the presence of a potential clonal population, and suggests important implications may exist for considering epidemiological context when interpreting phylogenetic relationships among outbreak-associated isolates. Additional extensive archival research identified the Philadelphia isolate associated with a non-Legionnaire case of “Broad Street pneumonia”, and provided new historical and genetic insights into the 1976 epidemic. This retrospective analysis has underscored the utility of fully-assembled WGS data for Legionella outbreak investigations, highlighting the increased resolution that comes from long-read sequencing and a sequence type-matched genomic data set.


July 7, 2019

BAC-pool sequencing and analysis confirms growth-associated QTLs in the Asian seabass genome.

The Asian seabass is an important marine food fish that has been cultured for several decades in Asia Pacific. However, the lack of a high quality reference genome has hampered efforts to improve its selective breeding. A 3D BAC pool set generated in this study was screened using 22 SSR markers located on linkage group 2 which contains a growth-related QTL region. Seventy-two clones corresponding to 22 FPC contigs were sequenced by Illumina MiSeq technology. We co-assembled the MiSeq-derived scaffolds from each FPC contig with error-corrected PacBio reads, resulting in 187 sequences covering 9.7?Mb. Eleven genes annotated within this region were found to be potentially associated with growth and their tissue-specific expression was investigated. Correlation analysis demonstrated that SNPs in ctsb, skp1 and ppp2ca can be potentially used as markers for selecting fast-growing fingerlings. Conserved syntenies between seabass LG2 and five other teleosts were identified. This study i) provided a 10?Mb targeted genome assembly; ii) demonstrated NGS of BAC pools as a potential approach for mining candidates underlying QTLs of this species; iii) detected eleven genes potentially responsible for growth in the QTL region; and iv) identified useful SNP markers for selective breeding programs of Asian seabass.


July 7, 2019

Complete genome sequence of Pseudomonas citronellolis P3B5, a candidate for microbial phyllo-remediation of hydrocarbon-contaminated sites

Pseudomonas citronellolis is a Gram negative, motile gammaproteobacterium belonging to the order Pseudomonadales and the family Pseudomonadaceae. We isolated strain P3B5 from the phyllosphere of basil plants (Ocimum basilicum L.). Here we describe the physiology of this microorganism, its full genome sequence, and detailed annotation. The 6.95 Mbp genome contains 6071 predicted protein coding sequences and 96 RNA coding sequences. P. citronellolis has been the subject of many studies including the investigation of long-chain aliphatic compounds and terpene degradation. Plant leaves are covered by long-chain aliphates making up a waxy layer that is associated with the leaf cuticle. In addition, basil leaves are known to contain high amounts of terpenoid substances, hinting to a potential nutrient niche that might be exploited by P. citronellolis. Furthermore, the isolated strain exhibited resistance to several antibiotics. To evaluate the potential of this strain as source of transferable antibiotic resistance genes on raw consumed herbs we therefore investigated if those resistances are encoded on mobile genetic elements. The availability of the genome will be helpful for comparative genomics of the phylogenetically broad pseudomonads, in particular with the sequence of the P. citronellolis type strain PRJDB205 not yet publicly available. The genome is discussed with respect to a phyllosphere related lifestyle, aliphate and terpenoid degradation, and antibiotic resistance.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.