July 7, 2019

Comparative genomics and transcriptomics of Pichia pastoris.

Pichia pastoris has emerged as an important alternative host for producing recombinant biopharmaceuticals, owing to its high cultivation density, low host cell protein burden, and the development of strains with humanized glycosylation. Despite its demonstrated utility, relatively little strain engineering has been performed to improve Pichia, due in part to the limited number and inconsistent frameworks of reported genomes and transcriptomes. Furthermore, the co-mingling of genomic, transcriptomic and fermentation data collected about Komagataella pastoris and Komagataella phaffii, the two species co-branded as Pichia, has generated confusion about host performance for these genetically distinct species. Generation of comparative high-quality genomes and transcriptomes will enable meaningful comparisons between the organisms, and potentially inform distinct biotechnological utilities for each species. Here, we present a comprehensive and standardized comparative analysis of the genomic features of the three most commonly used strains comprising the tradename Pichia: K. pastoris wild-type, K. phaffii wild-type, and K. phaffii GS115. We used a combination of long-read (PacBio) and short-read (Illumina) sequencing technologies to achieve over 1000X coverage of each genome. Construction of individual genomes was then performed using as few as seven individual contigs to create gap-free assemblies. We found substantial syntenic rearrangements between the species and characterized a linear plasmid present in K. phaffii. Comparative analyses between K. phaffii genomes enabled the characterization of the mutational landscape of the GS115 strain. We identified and examined 35 non-synonymous coding mutations present in GS115, many of which are likely to impact strain performance. Additionally, we investigated transcriptomic profiles of gene expression for both species during cultivation on various carbon sources. We observed that the most highly transcribed genes in both organisms were consistently highly expressed in all three carbon sources examined. We also observed selective expression of certain genes in each carbon source, including many sequences not previously reported as promoters for expression of heterologous proteins in yeasts. Our studies establish a foundation for understanding critical relationships between genome structure, cultivation conditions and gene expression. The resources we report here will inform and facilitate rational, organism-wide strain engineering for improved utility as a host for protein production.


July 7, 2019

Complete Genome Sequence of Mycobacterium avium, Isolated from Commercial Domestic Pekin Ducks (Anas platyrhynchos domestica), Determined Using PacBio Single-Molecule Real-Time Technology

Mycobacterium avium is an important pathogenic bacterium in birds and, to our knowledge, has never been reported to have been isolated from domestic ducks. We present here the complete genome sequence of a virulent strain of Mycobacterium avium, isolated for the first time from domestic Pekin ducks and determined by PacBio single-molecule real-time technology. Copyright © 2016 Song et al.


July 7, 2019

Complete Genome Sequences of Four Enterohemolysin-Positive (ehxA) Enterocyte Effacement-Negative Shiga Toxin-Producing Escherichia coli Strains

Shiga toxin-producing Escherichia coli (STEC) strains are important foodborne pathogens associated with human disease. Most disease-associated STEC strains carry the locus of enterocyte effacement (LEE); however, LEE-negative STEC strains are regularly recovered from ill patients. Few reference sequences are available for these isolate types. Here, we report the complete genome sequences for four LEE-negative STEC strains. Copyright © 2016 Lorenz et al.


July 7, 2019

First complete genome sequence of the skin-improving Lactobacillus curvatus strain FBA2, isolated from fermented vegetables, determined by PacBio single-molecule real-time technology.

The first complete genome sequence of Lactobacillus curvatus was determined by PacBio RS II. The single circular chromosome (1,848,756 bp, G+C content of 42.1%) of L. curvatus FBA2, isolated from fermented vegetables, contained low G+C regions (26.9% minimum) and 43 sets of >1,000-bp identical sequence pairs. No plasmids were detected. Copyright © 2016 Nakano et al.
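The G+C figures quoted above (a genome-wide percentage and a minimum over low-G+C regions) can be illustrated with a minimal Python sketch. The function names, window size, and step below are illustrative assumptions; the abstract does not state how the authors defined or scanned their regions.

```python
def gc_content(seq: str) -> float:
    """Return the G+C percentage of a sequence, counting only A, C, G and T."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / counted


def min_window_gc(seq: str, window: int, step: int) -> float:
    """Return the lowest G+C percentage over fixed-size windows along seq.

    window and step are hypothetical parameters chosen by the caller.
    """
    if window <= 0 or step <= 0 or len(seq) < window:
        raise ValueError("window and step must be positive and window <= len(seq)")
    starts = range(0, len(seq) - window + 1, step)
    return min(gc_content(seq[i:i + window]) for i in starts)
```

For a real genome, the sequence would typically be read from a FASTA file first; the sketch deliberately leaves file parsing out.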


July 7, 2019

Draft genome sequence of Mycobacterium rufum JS14(T), a polycyclic-aromatic-hydrocarbon-degrading bacterium from petroleum-contaminated soil in Hawaii.

Mycobacterium rufum JS14(T) (=ATCC BAA-1377(T), CIP 109273(T), JCM 16372(T), DSM 45406(T)), a type strain of the species Mycobacterium rufum sp. nov. belonging to the family Mycobacteriaceae, was isolated from polycyclic aromatic hydrocarbon (PAH)-contaminated soil in Hilo (HI, USA) because it harbors the capability of degrading PAH. Here, we describe the first genome sequence of strain JS14(T), with brief phenotypic characteristics. The genome is composed of 6,176,413 bp with 69.25% G+C content and contains 5810 protein-coding genes with 54 RNA genes. The genome information on M. rufum JS14(T) will provide a better understanding of the complexity of bacterial catabolic pathways for degradation of specific chemicals.


July 7, 2019

Genome sequence and analysis of Peptoclostridium difficile strain ZJCDC-S82.

Peptoclostridium difficile (Clostridium difficile) is the major pathogen associated with infectious diarrhea in humans. Concomitant with the increased incidence of C. difficile infection worldwide, there is an increasing concern regarding this infection type. This study reports a draft assembly and detailed sequence analysis of C. difficile strain ZJCDC-S82. The de novo assembled genome was 4.19 Mb in size and includes 4,013 protein-coding genes, 41 rRNA genes, and 84 tRNA genes. Along with the nuclear genome, we also assembled sequencing information for a single plasmid consisting of 11,930 nucleotides. Comparative genomic analysis of C. difficile ZJCDC-S82 and two previously published strains, M120 and CD630, showed extensive similarity. Phylogenetic analysis revealed that genetic diversity among C. difficile strains was not influenced by geographic location. Evolutionary analysis suggested that four genes encoding surface proteins exhibited positive selection in C. difficile ZJCDC-S82. Codon usage analysis indicated that C. difficile ZJCDC-S82 had high codon usage bias toward A/U-ended codons. Furthermore, codon usage patterns in C. difficile ZJCDC-S82 were predominantly affected by mutation pressure. Our results provide detailed information pertaining to the C. difficile genome associated with a strain from mainland China. This analysis will facilitate the understanding of genomic diversity and evolution of C. difficile strains in this region.
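The bias toward A/U-ended codons mentioned above can be illustrated by measuring the fraction of codons in a coding sequence whose third position is A or U (T in DNA). This is a minimal sketch under that assumption only; the helper names are hypothetical, and the study's actual codon usage statistics are not described in the abstract.

```python
from collections import Counter


def codon_counts(cds: str) -> Counter:
    """Count in-frame codons of a coding sequence, treating U as T."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    return Counter(cds[i:i + 3] for i in range(0, usable, 3))


def at_ended_fraction(cds: str) -> float:
    """Return the fraction of unambiguous codons ending in A or T (A/U in RNA)."""
    counts = codon_counts(cds)
    valid = {c: n for c, n in counts.items() if set(c) <= set("ACGT")}
    total = sum(valid.values())
    if total == 0:
        return 0.0
    at_ended = sum(n for c, n in valid.items() if c[2] in "AT")
    return at_ended / total
```

A value well above 0.5 across a genome's coding sequences would be consistent with a preference for A/U-ended codons; distinguishing mutation pressure from selection requires further analysis that this sketch does not attempt.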


July 7, 2019

An ultra-high density genetic linkage map of perennial ryegrass (Lolium perenne) using genotyping by sequencing (GBS) based on a reference shotgun genome assembly.

High density genetic linkage maps that are extensively anchored to assembled genome sequences of the organism in question are extremely useful in gene discovery. To facilitate this process in perennial ryegrass (Lolium perenne L.), a high density single nucleotide polymorphism (SNP)- and presence/absence variant (PAV)-based genetic linkage map has been developed in an F2 mapping population that has been used as a reference population in numerous studies. To provide a reference sequence to which to align genotyping by sequencing (GBS) reads, a shotgun assembly of one of the grandparents of the population, a tenth-generation inbred line, was created using Illumina-based sequencing. The assembly was based on paired-end Illumina reads, scaffolded by mate pair and long jumping distance reads in the range of 3-40 kb, with >200-fold initial genome coverage. A total of 169 individuals from an F2 mapping population were used to construct PstI-based GBS libraries tagged with unique 4-9 nucleotide barcodes, resulting in 284 million reads, with approx. 1·6 million reads per individual. A bioinformatics pipeline was employed to identify both SNPs and PAVs. A core genetic map was generated using high confidence SNPs, to which lower confidence SNPs and PAVs were subsequently fitted in a straightforward binning approach. The assembly comprises 424 750 scaffolds, covering 1·11 Gbp of the 2·5 Gbp perennial ryegrass genome, with a scaffold N50 of 25 212 bp and a contig N50 of 3790 bp. It is available for download, and access to a genome browser has been provided. Comparison of the assembly with available transcript and gene model data sets for perennial ryegrass indicates that approx. 570 Mbp of the gene-rich portion of the genome has been captured. An ultra-high density genetic linkage map with 3092 SNPs and 7260 PAVs was developed, anchoring just over 200 Mb of the reference assembly. The combined genetic map and assembly, combined with another recently released genome assembly, represent a significant resource for the perennial ryegrass genetics community. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
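The scaffold and contig N50 values reported above follow the standard definition: the length L such that sequences of length L or longer together contain at least half of the total assembled bases. A minimal Python sketch of that definition is shown below; the function name is illustrative and the input is assumed to be a plain list of sequence lengths in bp.

```python
def n50(lengths: list[int]) -> int:
    """Return the N50 of a collection of scaffold or contig lengths.

    Lengths are visited from longest to shortest; the N50 is the length at
    which the running total first reaches half of the summed length.
    """
    if not lengths:
        raise ValueError("at least one sequence length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable for non-empty, non-negative lengths")
```

Because N50 depends on the total assembled length rather than the true genome size, two assemblies of the same genome are only directly comparable on this metric when they cover a similar number of bases.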


July 7, 2019

Assemblytics: a web analytics tool for the detection of variants from an assembly.

Assemblytics is a web app for detecting and analyzing variants from a de novo genome assembly aligned to a reference genome. It incorporates a unique anchor filtering approach to increase robustness to repetitive elements, and identifies six classes of variants based on their distinct alignment signatures. Assemblytics can be applied both to compare aberrant genomes, such as human cancers, to a reference and to identify differences between related species. Multiple interactive visualizations enable in-depth explorations of the genomic distributions of variants. http://assemblytics.com, https://github.com/marianattestad/assemblytics CONTACT: mnattest@cshl.edu Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

High-quality genome assembly and annotation for Plasmodium coatneyi, generated using single-molecule real-time PacBio technology.

Plasmodium coatneyi is a protozoan parasite species that causes simian malaria and is an excellent model for studying disease caused by the human malaria parasite, P. falciparum. Here we report the complete (nontelomeric) genome sequence of P. coatneyi Hackeri generated by the application of only Pacific Biosciences RS II (PacBio RS II) single-molecule real-time (SMRT) high-resolution sequence technology and assembly using the Hierarchical Genome Assembly Process (HGAP). This is the first Plasmodium genome sequence reported to use only PacBio technology. This approach has proven to be superior to short-read only approaches for this species. Copyright © 2016 Chien et al.


July 7, 2019

The Lysobacter capsici AZ78 genome has a gene pool enabling it to interact successfully with phytopathogenic microorganisms and environmental factors.

Lysobacter capsici AZ78 has considerable potential for biocontrol of phytopathogenic microorganisms. However, lack of information about genetic cues regarding its biological characteristics may slow down its exploitation as a biofungicide. In order to obtain a comprehensive overview of genetic features, the L. capsici AZ78 genome was sequenced, annotated and compared with the phylogenetically related pathogens Stenotrophomonas maltophilia K729a and Xanthomonas campestris pv. campestris ATCC 33913. Whole genome comparison, supported by functional analysis, indicated that L. capsici AZ78 has a larger number of genes responsible for interaction with phytopathogens and environmental stress than S. maltophilia K729a and X. c. pv. campestris ATCC 33913. Genes involved in the production of antibiotics, lytic enzymes and siderophores were specific for L. capsici AZ78, as well as genes involved in resistance to antibiotics, environmental stressors, fungicides and heavy metals. The L. capsici AZ78 genome did not encompass genes involved in infection of humans and plants included in the S. maltophilia K729a and X. c. pv. campestris ATCC 33913 genomes, respectively. The L. capsici AZ78 genome provides a genetic framework for detailed analysis of other L. capsici members and the development of novel biofungicides based on this bacterial strain.


July 7, 2019

Whole-genome sequencing recommendations

Recent technological developments have revolutionized the way we perform genetic analyses. In particular, whole-genome sequencing provides access to the entire genetic makeup of an individual, and it is now an affordable approach for many research groups. As a consequence, genome sequencing is pervading many fields of biological research. Sequencing technologies are evolving rapidly, and so are their applications. Here we provide a first primer on whole-genome sequencing, focusing on two of the most popular applications: (1) de novo genome sequencing, in which the objective is obtaining a high-quality genome assembly that can serve as a reference for a species or variety, and (2) genome resequencing, in which a reference genome is available and the objective is to map sequence variation of an individual or a set of individuals. It is not our intention to provide a comprehensive overview of current methodologies that will likely soon become obsolete, but rather to focus on principles that will have more general applicability.


July 7, 2019

Whole genomic sequence analysis of Bacillus infantis: defining the genetic blueprint of strain NRRL B-14911, an emerging cardiopathogenic microbe.

We recently reported the identification of Bacillus sp. NRRL B-14911 that induces heart autoimmunity by generating cardiac-reactive T cells through molecular mimicry. This marine bacterium was originally isolated from the Gulf of Mexico, but no associations with human diseases were reported. Therefore, to characterize its biological and medical significance, we sought to determine and analyze the complete genome sequence of Bacillus sp. NRRL B-14911. Based on the phylogenetic analysis of 16S ribosomal RNA (rRNA) genes, sequence analysis of the 16S-23S rDNA intergenic transcribed spacers, phenotypic microarray, and matrix-assisted laser desorption ionization time-of-flight mass spectrometry, we propose that this organism belongs to the species Bacillus infantis, previously shown to be associated with sepsis in a newborn child. Analysis of the complete genome of Bacillus sp. NRRL B-14911 revealed several virulence factors including adhesins, invasins, colonization factors, siderophores and transporters. Likewise, the bacterial genome encodes a wide range of methyl transferases, transporters, enzymatic and biochemical pathways, and insertion sequence elements that are distinct from other closely related bacilli. The complete genome sequence of Bacillus sp. NRRL B-14911 provided in this study may facilitate genetic manipulations to assess gene functions associated with bacterial survival and virulence. Additionally, this bacterium may serve as a useful tool to establish a disease model that permits systematic analysis of autoimmune events in various susceptible rodent strains.


July 7, 2019

Molecular evolution of a Klebsiella pneumoniae ST278 isolate harboring blaNDM-7 and involved in nosocomial transmission.

During 2013, ST278 Klebsiella pneumoniae with blaNDM-7 was isolated from the urine (KpN01) and rectum (KpN02) of a patient in Calgary, Canada. The same strain (KpN04) was subsequently isolated from another patient in the same unit. Interestingly, a carbapenem-susceptible K. pneumoniae ST278 (KpN06) was obtained 1 month later from the blood of the second patient. Next-generation sequencing (NGS) revealed that the loss of carbapenem resistance in KpN06 was due to a 5-kb deletion on the blaNDM-7-harboring IncX3 plasmid. In addition, an IncFIB plasmid in KpN06 had a 27-kb deletion that removed genes encoding heavy metal resistance. Phylogenetic analysis showed that the K. pneumoniae ST278 from patient 2 was likely a descendant of KpN02 and that KpN06 was a close progenitor of an environmental ST278. It is unclear whether KpN06 lost the blaNDM-7 gene in vivo. This study detailed the remarkable plasticity and speed of evolutionary changes in multidrug-resistant K. pneumoniae, demonstrating the highly recombinant nature of this species. It also highlights the ability of NGS to clarify molecular microevolutionary events within antibiotic-resistant organisms. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.


July 7, 2019

Draft genome sequences of Armillaria fuscipes, Ceratocystiopsis minuta, Ceratocystis adiposa, Endoconidiophora laricicola, E. polonica and Penicillium freii DAOMC 242723.

The genomes of Armillaria fuscipes, Ceratocystiopsis minuta, Ceratocystis adiposa, Endoconidiophora laricicola, E. polonica, and Penicillium freii DAOMC 242723 are presented in this genome announcement. These six genomes are from plant pathogens and otherwise economically important fungal species. The genome sizes range from 21 Mb in the case of Ceratocystiopsis minuta to 58 Mb for the basidiomycete Armillaria fuscipes. These genomes include the first reports of genomes for the genus Endoconidiophora. The availability of these genome data will provide opportunities to resolve longstanding questions regarding the taxonomy of species in these genera. In addition, comparative studies of these genome sequences with those of closely related organisms will increase our understanding of how these pathogens cause disease.

