Menu
July 7, 2019

Chloroplast genomes: diversity, evolution, and applications in genetic engineering.

Chloroplasts play a crucial role in sustaining life on earth. The availability of over 800 sequenced chloroplast genomes from a variety of land plants has enhanced our understanding of chloroplast biology, intracellular gene transfer, conservation, diversity, and the genetic basis by which chloroplast transgenes can be engineered to enhance plant agronomic traits or to produce high-value agricultural or biomedical products. In this review, we discuss the impact of chloroplast genome sequences on understanding the origins of economically important cultivated species and changes that have taken place during domestication. We also discuss the potential biotechnological applications of chloroplast genomes.


July 7, 2019

Lactobacillus rhamnosus GG outcompetes Enterococcus faecium by mucus-binding pili – Evidence for a novel probiotic mechanism on a distance.

Vancomycin-resistant enterococci (VRE) have become a major nosocomial threat. Enterococcus faecium is of special concern, as it can easily acquire new antibiotic resistances and is an excellent colonizer of the human intestinal tract. Several clinical studies have explored the potential use of beneficial bacteria to weed out opportunistic pathogens. Specifically, the widely studied Lactobacillus rhamnosus strain GG has been applied successfully in the context of VRE infections. Here, we provide new insight into the molecular mechanism underlying the effects of this model probiotic on VRE decolonization. Both clinical VRE isolates and L. rhamnosus GG express pili on their cell walls, which are the key modulators of their highly efficient colonization of the intestinal mucosa. We found that one of the VRE pilus clusters shares considerable sequence similarity with the SpaCBA-SrtC1 pilus cluster of L. rhamnosus GG. Remarkable immunological and functional similarities were discovered between the mucus-binding pili of L. rhamnosus GG and those of the clinical E. faecium strain E1165, which was characterized at the genome level. Moreover, E. faecium strain E1165 bound efficiently to mucus, which may be prevented by the presence of the mucus-binding SpaC protein or antibodies against L. rhamnosus GG or SpaC. These results present experimental support for a novel probiotic mechanism, in which the mucus-binding pili of L. rhamnosus GG prevent the binding of a potential pathogen to the host. Hence, we provide a molecular basis for the further exploitation of L. rhamnosus GG and its pilins for prophylaxis and treatment of VRE infections. IMPORTANCE Concern about vancomycin-resistant Enterococcus faecium causing nosocomial infections is rising globally. The arsenal of antibiotic strategies to treat these infections is nearly exhausted, and hence, new treatment strategies are urgently needed. Here, we provide molecular evidence to underpin reports of the successful clinical application of Lactobacillus rhamnosus GG in VRE decolonization strategies. Our results provide support for a new molecular mechanism, in which probiotics can perform competitive exclusion and possibly immune interaction. Moreover, we spur further exploration of the potential of intact L. rhamnosus GG and purified SpaC pilin as prophylactic and curative agents of the VRE carrier state.


July 7, 2019

Complete genome sequencing and comparative genomic analysis of functionally diverse Lysinibacillus sphaericus III(3)7.

Lysinibacillus sphaericus III(3)7 is a native Colombian strain, the first one isolated from soil samples. This strain has shown high levels of pathogenic activity against Culex quinquefaciatus larvae in laboratory assays compared to other members of the same species. Using Pacific Biosciences sequencing technology we sequenced, annotated (de novo) and described the genome of strain III(3)7, achieving a complete genome sequence status. We then performed a comparative analysis between the newly sequenced genome and the ones previously reported for Colombian isolates L. sphaericus OT4b.31, CBAM5 and OT4b.25, with the inclusion of L. sphaericus C3-41 that has been used as a reference genome for most of previous genome sequencing projects. We concluded that L. sphaericus III(3)7 is highly similar with strain OT4b.25 and shares high levels of synteny with isolates CBAM5 and C3-41.


July 7, 2019

Homologous recombination within large chromosomal regions facilitates acquisition of beta-lactam and vancomycin resistance in Enterococcus faecium.

The transfer of DNA between Enterococcus faecium strains has been characterized by both the movement of well-defined genetic elements and by the large-scale transfer of genomic DNA fragments. In this work we report on the whole genome analysis of transconjugants resulting from mating events between the vancomycin-resistant E. faecium C68 strain and vancomycin susceptible D344RRF to discern the mechanism by which the transferred regions enter the recipient chromosome. Vancomycin-resistant transconjugants from five independent matings were analysed by whole genome sequencing. In all cases but one, the penicillin binding protein 5 gene (pbp5) and the Tn5382-vancomycin resistance transposon were transferred together and replaced the corresponding pbp5 region of D344RRF. In one instance, Tn5382 inserted independently downstream of the D344RRF pbp5 Single nucleotide variants (SNV) analysis suggests that entry of donor DNA into the recipient chromosome occurred by recombination across regions of homology between donor and recipient chromosomes, rather than through insertion sequence-mediated transposition. Transfer of genomic DNA was also associated with transfer of C68 plasmid pLRM23 and another putative plasmid. Our data are consistent with transfer initiated by a cointegration of a transferable plasmid with the donor chromosome, with subsequent circularization of the plasmid/chromosome cointegrate in the donor prior to transfer. Entry into the recipient chromosome occurs most commonly across regions of homology between donor and recipient chromosomes. Copyright © 2016 García-Solache et al.


July 7, 2019

Cloche is a bHLH-PAS transcription factor that drives haemato-vascular specification.

Vascular and haematopoietic cells organize into specialized tissues during early embryogenesis to supply essential nutrients to all organs and thus play critical roles in development and disease. At the top of the haemato-vascular specification cascade lies cloche, a gene that when mutated in zebrafish leads to the striking phenotype of loss of most endothelial and haematopoietic cells and a significant increase in cardiomyocyte numbers. Although this mutant has been analysed extensively to investigate mesoderm diversification and differentiation and continues to be broadly used as a unique avascular model, the isolation of the cloche gene has been challenging due to its telomeric location. Here we used a deletion allele of cloche to identify several new cloche candidate genes within this genomic region, and systematically genome-edited each candidate. Through this comprehensive interrogation, we succeeded in isolating the cloche gene and discovered that it encodes a PAS-domain-containing bHLH transcription factor, and that it is expressed in a highly specific spatiotemporal pattern starting during late gastrulation. Gain-of-function experiments show that it can potently induce endothelial gene expression. Epistasis experiments reveal that it functions upstream of etv2 and tal1, the earliest expressed endothelial and haematopoietic transcription factor genes identified to date. A mammalian cloche orthologue can also rescue blood vessel formation in zebrafish cloche mutants, indicating a highly conserved role in vertebrate vasculogenesis and haematopoiesis. The identification of this master regulator of endothelial and haematopoietic fate enhances our understanding of early mesoderm diversification and may lead to improved protocols for the generation of endothelial and haematopoietic cells in vivo and in vitro.


July 7, 2019

Microsatellite length scoring by Single Molecule Real Time Sequencing – Effects of sequence structure and PCR regime.

Microsatellites are DNA sequences consisting of repeated, short (1-6 bp) sequence motifs that are highly mutable by enzymatic slippage during replication. Due to their high intrinsic variability, microsatellites have important applications in population genetics, forensics, genome mapping, as well as cancer diagnostics and prognosis. The current analytical standard for microsatellites is based on length scoring by high precision electrophoresis, but due to increasing efficiency next-generation sequencing techniques may provide a viable alternative. Here, we evaluated single molecule real time (SMRT) sequencing, implemented in the PacBio series of sequencing apparatuses, as a means of microsatellite length scoring. To this end we carried out multiplexed SMRT sequencing of plasmid-carried artificial microsatellites of varying structure under different pre-sequencing PCR regimes. For each repeat structure, reads corresponding to the target length dominated. We found that pre-sequencing amplification had large effects on scoring accuracy and error distribution relative to controls, but that the effects of the number of amplification cycles were generally weak. In line with expectations enzymatic slippage decreased proportionally with microsatellite repeat unit length and increased with repetition number. Finally, we determined directional mutation trends, showing that PCR and SMRT sequencing introduced consistent but opposing error patterns in contraction and expansion of the microsatellites on the repeat motif and single nucleotide level.


July 7, 2019

Structural basis for recombinatorial permissiveness in the generation of Anaplasma marginale Msp2 antigenic variants.

Sequential expression of outer membrane protein antigenic variants is an evolutionarily convergent mechanism used by bacterial pathogens to escape host immune clearance and establish persistent infection. Variants must be sufficiently structurally distinct to escape existing immune effectors yet retain core structural elements required for localization and function within the outer membrane. We examined this balance using Anaplasma marginale, which generates antigenic variants in the outer membrane protein Msp2 using gene conversion. The overwhelming majority of Msp2 variants expressed during long-term persistent infection are mosaics, derived by recombination of oligonucleotide segments from multiple alleles to form unique hypervariable regions (HVR). As a result, the mosaics are not under long-term selective pressure to encode a functional protein; consequently, we hypothesized that the Msp2 HVR is structurally permissive for mosaic expression. Using an integrated approach of predictive modeling with determination of native Msp2 protein structure and function, we demonstrate that structured elements, most notably ß-sheets, are significantly concentrated in the highly conserved N- and C-terminal domains. In contrast the HVR is overwhelmingly random coil with the structured a-helices and ß-sheets confined to the genomically defined “structural tethers” that separate the antigenically variable microdomains. This structure is supported by the surface exposure of the HVR microdomains and the slow diffusion type porin function in native Msp2. Importantly, the predominance of random coil provides plasticity for formation of functional HVR mosaics and realization of the full potential of segmental gene conversion to dramatically expand the variant repertoire. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

The novel 2016 WHO Neisseria gonorrhoeae reference strains for global quality assurance of laboratory investigations: phenotypic, genetic and reference genome characterization.

Gonorrhoea and MDR Neisseria gonorrhoeae remain public health concerns globally. Enhanced, quality-assured, gonococcal antimicrobial resistance (AMR) surveillance is essential worldwide. The WHO global Gonococcal Antimicrobial Surveillance Programme (GASP) was relaunched in 2009. We describe the phenotypic, genetic and reference genome characteristics of the 2016 WHO gonococcal reference strains intended for quality assurance in the WHO global GASP, other GASPs, diagnostics and research worldwide.The 2016 WHO reference strains (n?=?14) constitute the eight 2008 WHO reference strains and six novel strains. The novel strains represent low-level to high-level cephalosporin resistance, high-level azithromycin resistance and a porA mutant. All strains were comprehensively characterized for antibiogram (n?=?23), serovar, prolyliminopeptidase, plasmid types, molecular AMR determinants, N. gonorrhoeae multiantigen sequence typing STs and MLST STs. Complete reference genomes were produced using single-molecule PacBio sequencing.The reference strains represented all available phenotypes, susceptible and resistant, to antimicrobials previously and currently used or considered for future use in gonorrhoea treatment. All corresponding resistance genotypes and molecular epidemiological types were described. Fully characterized, annotated and finished references genomes (n?=?14) were presented.The 2016 WHO gonococcal reference strains are intended for internal and external quality assurance and quality control in laboratory investigations, particularly in the WHO global GASP and other GASPs, but also in phenotypic (e.g. culture, species determination) and molecular diagnostics, molecular AMR detection, molecular epidemiology and as fully characterized, annotated and finished reference genomes in WGS analysis, transcriptomics, proteomics and other molecular technologies and data analysis.© The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Challenges, solutions, and quality metrics of personal genome assembly in advancing precision medicine.

Even though each of us shares more than 99% of the DNA sequences in our genome, there are millions of sequence codes or structure in small regions that differ between individuals, giving us different characteristics of appearance or responsiveness to medical treatments. Currently, genetic variants in diseased tissues, such as tumors, are uncovered by exploring the differences between the reference genome and the sequences detected in the diseased tissue. However, the public reference genome was derived with the DNA from multiple individuals. As a result of this, the reference genome is incomplete and may misrepresent the sequence variants of the general population. The more reliable solution is to compare sequences of diseased tissue with its own genome sequence derived from tissue in a normal state. As the price to sequence the human genome has dropped dramatically to around $1000, it shows a promising future of documenting the personal genome for every individual. However, de novo assembly of individual genomes at an affordable cost is still challenging. Thus, till now, only a few human genomes have been fully assembled. In this review, we introduce the history of human genome sequencing and the evolution of sequencing platforms, from Sanger sequencing to emerging “third generation sequencing” technologies. We present the currently available de novo assembly and post-assembly software packages for human genome assembly and their requirements for computational infrastructures. We recommend that a combined hybrid assembly with long and short reads would be a promising way to generate good quality human genome assemblies and specify parameters for the quality assessment of assembly outcomes. We provide a perspective view of the benefit of using personal genomes as references and suggestions for obtaining a quality personal genome. Finally, we discuss the usage of the personal genome in aiding vaccine design and development, monitoring host immune-response, tailoring drug therapy and detecting tumors. We believe the precision medicine would largely benefit from bioinformatics solutions, particularly for personal genome assembly.


July 7, 2019

SAR11 bacteria linked to ocean anoxia and nitrogen loss.

Bacteria of the SAR11 clade constitute up to one half of all microbial cells in the oxygen-rich surface ocean. SAR11 bacteria are also abundant in oxygen minimum zones (OMZs), where oxygen falls below detection and anaerobic microbes have vital roles in converting bioavailable nitrogen to N2 gas. Anaerobic metabolism has not yet been observed in SAR11, and it remains unknown how these bacteria contribute to OMZ biogeochemical cycling. Here, genomic analysis of single cells from the world’s largest OMZ revealed previously uncharacterized SAR11 lineages with adaptations for life without oxygen, including genes for respiratory nitrate reductases (Nar). SAR11 nar genes were experimentally verified to encode proteins catalysing the nitrite-producing first step of denitrification and constituted ~40% of OMZ nar transcripts, with transcription peaking in the anoxic zone of maximum nitrate reduction activity. These results link SAR11 to pathways of ocean nitrogen loss, redefining the ecological niche of Earth’s most abundant organismal group.


July 7, 2019

Representing genetic variation with synthetic DNA standards.

The identification of genetic variation with next-generation sequencing is confounded by the complexity of the human genome sequence and by biases that arise during library preparation, sequencing and analysis. We have developed a set of synthetic DNA standards, termed ‘sequins’, that emulate human genetic features and constitute qualitative and quantitative spike-in controls for genome sequencing. Sequencing reads derived from sequins align exclusively to an artificial in silico reference chromosome, rather than the human reference genome, which allows them them to be partitioned for parallel analysis. Here we use this approach to represent common and clinically relevant genetic variation, ranging from single nucleotide variants to large structural rearrangements and copy-number variation. We validate the design and performance of sequin standards by comparison to examples in the NA12878 reference genome, and we demonstrate their utility during the detection and quantification of variants. We provide sequins as a standardized, quantitative resource against which human genetic variation can be measured and diagnostic performance assessed.


July 7, 2019

Distinct Salmonella enteritidis lineages associated with enterocolitis in high-income settings and invasive disease in low-income settings.

An epidemiological paradox surrounds Salmonella enterica serovar Enteritidis. In high-income settings, it has been responsible for an epidemic of poultry-associated, self-limiting enterocolitis, whereas in sub-Saharan Africa it is a major cause of invasive nontyphoidal Salmonella disease, associated with high case fatality. By whole-genome sequence analysis of 675 isolates of S. Enteritidis from 45 countries, we show the existence of a global epidemic clade and two new clades of S. Enteritidis that are geographically restricted to distinct regions of Africa. The African isolates display genomic degradation, a novel prophage repertoire, and an expanded multidrug resistance plasmid. S. Enteritidis is a further example of a Salmonella serotype that displays niche plasticity, with distinct clades that enable it to become a prominent cause of gastroenteritis in association with the industrial production of eggs and of multidrug-resistant, bloodstream-invasive infection in Africa.


July 7, 2019

Vibrio natriegens as a fast-growing host for molecular biology.

A rapidly growing bacterial host would be desirable for a range of routine applications in molecular biology and biotechnology. The bacterium Vibrio natriegens has the fastest growth rate of any known organism, with a reported doubling time of <10 min. We report the development of genetic tools and methods to engineer V. natriegens and demonstrate the advantages of using these engineered strains in common biotech processes.


July 7, 2019

Variation of 45S rDNA intergenic spacers in Arabidopsis thaliana.

Approximately seven hundred 45S rRNA genes (rDNA) in the Arabidopsis thaliana genome are organised in two 4 Mbp-long arrays of tandem repeats arranged in head-to-tail fashion separated by an intergenic spacer (IGS). These arrays make up 5?% of the A. thaliana genome. IGS are rapidly evolving sequences and frequent rearrangements inside the rDNA loci have generated considerable interspecific and even intra-individual variability which allows to distinguish among otherwise highly conserved rRNA genes. The IGS has not been comprehensively described despite its potential importance in regulation of rDNA transcription and replication. Here we describe the detailed sequence variation in the complete IGS of A. thaliana WT plants and provide the reference/consensus IGS sequence, as well as genomic DNA analysis. We further investigate mutants dysfunctional in chromatin assembly factor-1 (CAF-1) (fas1 and fas2 mutants), which are known to have a reduced number of rDNA copies, and plant lines with restored CAF-1 function (segregated from a fas1xfas2 genetic background) showing major rDNA rearrangements. The systematic rDNA loss in CAF-1 mutants leads to the decreased variability of the IGS and to the occurrence of distinct IGS variants. We present for the first time a comprehensive and representative set of complete IGS sequences, obtained by conventional cloning and by Pacific Biosciences sequencing. Our data expands the knowledge of the A. thaliana IGS sequence arrangement and variability, which has not been available in full and in detail until now. This is also the first study combining IGS sequencing data with RFLP analysis of genomic DNA.


July 7, 2019

Comparative genomics and transcriptomics of Pichia pastoris.

Pichia pastoris has emerged as an important alternative host for producing recombinant biopharmaceuticals, owing to its high cultivation density, low host cell protein burden, and the development of strains with humanized glycosylation. Despite its demonstrated utility, relatively little strain engineering has been performed to improve Pichia, due in part to the limited number and inconsistent frameworks of reported genomes and transcriptomes. Furthermore, the co-mingling of genomic, transcriptomic and fermentation data collected about Komagataella pastoris and Komagataella phaffii, the two strains co-branded as Pichia, has generated confusion about host performance for these genetically distinct species. Generation of comparative high-quality genomes and transcriptomes will enable meaningful comparisons between the organisms, and potentially inform distinct biotechnological utilies for each species.Here, we present a comprehensive and standardized comparative analysis of the genomic features of the three most commonly used strains comprising the tradename Pichia: K. pastoris wild-type, K. phaffii wild-type, and K. phaffii GS115. We used a combination of long-read (PacBio) and short-read (Illumina) sequencing technologies to achieve over 1000X coverage of each genome. Construction of individual genomes was then performed using as few as seven individual contigs to create gap-free assemblies. We found substantial syntenic rearrangements between the species and characterized a linear plasmid present in K. phaffii. Comparative analyses between K. phaffii genomes enabled the characterization of the mutational landscape of the GS115 strain. We identified and examined 35 non-synonomous coding mutations present in GS115, many of which are likely to impact strain performance. Additionally, we investigated transcriptomic profiles of gene expression for both species during cultivation on various carbon sources. We observed that the most highly transcribed genes in both organisms were consistently highly expressed in all three carbon sources examined. We also observed selective expression of certain genes in each carbon source, including many sequences not previously reported as promoters for expression of heterologous proteins in yeasts.Our studies establish a foundation for understanding critical relationships between genome structure, cultivation conditions and gene expression. The resources we report here will inform and facilitate rational, organism-wide strain engineering for improved utility as a host for protein production.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.