July 7, 2019

E. coli O157:H7 strain EDL933 harbors multiple functional prophage-associated genes necessary for the utilization of 5-N-acetyl-9-O-acetyl neuraminic acid as a growth substrate.

Enterohemorrhagic E. coli (EHEC) O157:H7 strain EDL933 harbors multiple prophage-associated open reading frames (orfs) in its genome, which are highly homologous to the chromosomal nanS gene. The latter is part of the nanCMS operon, which is present in most E. coli strains and encodes an esterase responsible for the mono-deacetylation of 5-N-acetyl-9-O-acetyl neuraminic acid (Neu5,9Ac2). Whereas one prophage-borne orf (z1466) has been characterized in previous studies, the functions of the other nanS-homologous orfs are unknown. In the current study, the nanS-homologous orfs of EDL933 were initially studied in silico. Due to their homology to the chromosomal nanS gene and their location in prophage genomes, we designated them nanS-p, and numbered the different nanS-p alleles consecutively from 1 to 10. The two alleles nanS-p2 and nanS-p4 were selected for production of recombinant proteins; their enzymatic activities were investigated, and differences in their temperature optima were found. Furthermore, a function of these enzymes in substrate utilization could be demonstrated using an E. coli C600ΔnanS mutant in a growth medium with Neu5,9Ac2 as carbon source and supplementation with the different recombinant NanS-p proteins. Moreover, sequential deletion of all nanS-p alleles in strain EDL933 and subsequent growth experiments demonstrated a gene-dose effect on the utilization of Neu5,9Ac2. Since Neu5,9Ac2 is an important component of human and animal gut mucus, and nutrient availability in the large intestine is limited, we hypothesize that the presence of multiple Neu5,9Ac2 esterases provides the bacteria with a nutrient supply under certain conditions in the large intestine, even if particular prophages are lost. In this study, a group of homologous prophage-borne nanS-p alleles and two of the corresponding enzymes of enterohemorrhagic E. coli (EHEC) O157:H7 strain EDL933 are characterized that may be important to provide alternative genes for substrate utilization. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Homologous recombination within large chromosomal regions facilitates acquisition of beta-lactam and vancomycin resistance in Enterococcus faecium.

The transfer of DNA between Enterococcus faecium strains has been characterized by both the movement of well-defined genetic elements and by the large-scale transfer of genomic DNA fragments. In this work we report on the whole genome analysis of transconjugants resulting from mating events between the vancomycin-resistant E. faecium C68 strain and vancomycin-susceptible D344RRF to discern the mechanism by which the transferred regions enter the recipient chromosome. Vancomycin-resistant transconjugants from five independent matings were analysed by whole genome sequencing. In all cases but one, the penicillin binding protein 5 gene (pbp5) and the Tn5382-vancomycin resistance transposon were transferred together and replaced the corresponding pbp5 region of D344RRF. In one instance, Tn5382 inserted independently downstream of the D344RRF pbp5. Single nucleotide variant (SNV) analysis suggests that entry of donor DNA into the recipient chromosome occurred by recombination across regions of homology between donor and recipient chromosomes, rather than through insertion sequence-mediated transposition. Transfer of genomic DNA was also associated with transfer of C68 plasmid pLRM23 and another putative plasmid. Our data are consistent with transfer initiated by a cointegration of a transferable plasmid with the donor chromosome, with subsequent circularization of the plasmid/chromosome cointegrate in the donor prior to transfer. Entry into the recipient chromosome occurs most commonly across regions of homology between donor and recipient chromosomes. Copyright © 2016 García-Solache et al.
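
The SNV-based reasoning in this abstract can be illustrated with a small sketch. It is not the authors' pipeline: the tuple layout, the function name `donor_tract`, and the assumption of a single contiguous donor-derived tract are all hypothetical. At positions where donor and recipient differ, the sketch checks which allele the transconjugant carries and reports the outermost donor-matching positions, which bracket the region where recombination may have occurred.

```python
def donor_tract(snvs):
    """Return (first, last) position at which the transconjugant carries
    the donor allele, or None if it never does.

    snvs: iterable of (position, donor_base, recipient_base, transconjugant_base)
    restricted to positions where donor and recipient differ.
    Assumes (hypothetically) a single contiguous donor-derived tract.
    """
    donor_positions = [pos for pos, donor, _recipient, trans in sorted(snvs)
                       if trans == donor]
    if not donor_positions:
        return None
    return donor_positions[0], donor_positions[-1]


# Toy data: the transconjugant matches the donor at positions 250 and 400 only.
snvs = [
    (100, "A", "G", "G"),
    (250, "C", "T", "C"),
    (400, "G", "A", "G"),
    (900, "T", "C", "C"),
]
tract = donor_tract(snvs)
```

In this toy case the flanking positions 100 and 900 still carry recipient alleles, so the crossover points would lie somewhere between 100 and 250 and between 400 and 900.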


July 7, 2019

Cloche is a bHLH-PAS transcription factor that drives haemato-vascular specification.

Vascular and haematopoietic cells organize into specialized tissues during early embryogenesis to supply essential nutrients to all organs and thus play critical roles in development and disease. At the top of the haemato-vascular specification cascade lies cloche, a gene that when mutated in zebrafish leads to the striking phenotype of loss of most endothelial and haematopoietic cells and a significant increase in cardiomyocyte numbers. Although this mutant has been analysed extensively to investigate mesoderm diversification and differentiation and continues to be broadly used as a unique avascular model, the isolation of the cloche gene has been challenging due to its telomeric location. Here we used a deletion allele of cloche to identify several new cloche candidate genes within this genomic region, and systematically genome-edited each candidate. Through this comprehensive interrogation, we succeeded in isolating the cloche gene and discovered that it encodes a PAS-domain-containing bHLH transcription factor, and that it is expressed in a highly specific spatiotemporal pattern starting during late gastrulation. Gain-of-function experiments show that it can potently induce endothelial gene expression. Epistasis experiments reveal that it functions upstream of etv2 and tal1, the earliest expressed endothelial and haematopoietic transcription factor genes identified to date. A mammalian cloche orthologue can also rescue blood vessel formation in zebrafish cloche mutants, indicating a highly conserved role in vertebrate vasculogenesis and haematopoiesis. The identification of this master regulator of endothelial and haematopoietic fate enhances our understanding of early mesoderm diversification and may lead to improved protocols for the generation of endothelial and haematopoietic cells in vivo and in vitro.


July 7, 2019

Comparative genomic analysis of Klebsiella pneumoniae subsp. pneumoniae KP617 and PittNDM01, NUHL24835, and ATCC BAA-2146 reveals unique evolutionary history of this strain.

Klebsiella pneumoniae subsp. pneumoniae KP617 is a pathogenic strain that coproduces OXA-232 and NDM-1 carbapenemases. We sequenced the genome of KP617, which was isolated from the wound of a Korean burn patient, and performed a comparative genomic analysis with three additional strains: PittNDM01, NUHL24835 and ATCC BAA-2146. The complete genome of KP617 was obtained via multi-platform whole-genome sequencing. Phylogenetic analysis along with whole genome and multi-locus sequence typing of genes of the Klebsiella pneumoniae species showed that KP617 belongs to the WGLW2 group, which includes PittNDM01 and NUHL24835. Comparison of annotated genes showed that KP617 shares 98.3% of its genes with PittNDM01. Nineteen antibiotic resistance genes were identified in the KP617 genome: bla OXA-1 and bla SHV-28 in the chromosome, bla NDM-1 in plasmid 1, and bla OXA-232 in plasmid 2 conferred resistance to beta-lactams; however, colistin- and tetracycline-resistance genes were not found. We identified 117 virulence factors in the KP617 genome, and discovered that the genes encoding these factors were also harbored by the reference strains; eight genes were lipopolysaccharide-related and four were capsular polysaccharide-related. A comparative analysis of phage-associated regions indicated that two phage regions are specific to the KP617 genome and that prophages did not act as a vehicle for transfer of antimicrobial resistance genes in this strain. Whole-genome sequencing and bioinformatics analysis revealed similarity in the genome sequences and content, and differences in phage-related genes, plasmids and antimicrobial resistance genes between KP617 and the references. In order to elucidate the precise role of these factors in the pathogenicity of KP617, further studies are required.


July 7, 2019

Microsatellite length scoring by Single Molecule Real Time Sequencing – Effects of sequence structure and PCR regime.

Microsatellites are DNA sequences consisting of repeated, short (1-6 bp) sequence motifs that are highly mutable by enzymatic slippage during replication. Due to their high intrinsic variability, microsatellites have important applications in population genetics, forensics, genome mapping, as well as cancer diagnostics and prognosis. The current analytical standard for microsatellites is based on length scoring by high precision electrophoresis, but due to their increasing efficiency, next-generation sequencing techniques may provide a viable alternative. Here, we evaluated single molecule real time (SMRT) sequencing, implemented in the PacBio series of sequencing apparatuses, as a means of microsatellite length scoring. To this end, we carried out multiplexed SMRT sequencing of plasmid-carried artificial microsatellites of varying structure under different pre-sequencing PCR regimes. For each repeat structure, reads corresponding to the target length dominated. We found that pre-sequencing amplification had large effects on scoring accuracy and error distribution relative to controls, but that the effects of the number of amplification cycles were generally weak. In line with expectations, enzymatic slippage decreased proportionally with microsatellite repeat unit length and increased with repetition number. Finally, we determined directional mutation trends, showing that PCR and SMRT sequencing introduced consistent but opposing error patterns in contraction and expansion of the microsatellites on the repeat motif and single nucleotide level.
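
Length scoring from sequencing reads can be pictured with a minimal sketch. It is an illustration only, not the scoring method used in the study: the function names, the exact-match regular expression, and the toy reads are assumptions. Each read is scored by its longest uninterrupted run of the repeat motif, and the most frequent count across reads is taken as the length call.

```python
import re
from collections import Counter


def longest_repeat_run(read: str, motif: str) -> int:
    """Largest number of consecutive, exact copies of `motif` in `read`."""
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    runs = [len(m.group(0)) // len(motif) for m in pattern.finditer(read)]
    return max(runs, default=0)


def score_lengths(reads, motif):
    """Tally repeat counts over all reads; the modal count is the call.

    Expects at least one read.
    """
    counts = Counter(longest_repeat_run(r, motif) for r in reads)
    call, _ = counts.most_common(1)[0]
    return call, counts


# Toy reads around a (CA)10 target with hypothetical flanking sequence.
reads = [
    "GGTT" + "CA" * 10 + "AACC",
    "GGTT" + "CA" * 10 + "AACC",
    "GGTT" + "CA" * 9 + "AACC",   # contraction by one repeat unit
    "GGTT" + "CA" * 11 + "AACC",  # expansion by one repeat unit
]
call, counts = score_lengths(reads, "CA")
```

The spread of `counts` around the modal value is the kind of error distribution the abstract refers to. A real scorer would also need to tolerate base-level errors inside the repeat, which this exact-match sketch does not.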


July 7, 2019

Challenges, solutions, and quality metrics of personal genome assembly in advancing precision medicine.

Even though each of us shares more than 99% of the DNA sequences in our genome, there are millions of small regions whose sequence or structure differs between individuals, giving us different characteristics of appearance or responsiveness to medical treatments. Currently, genetic variants in diseased tissues, such as tumors, are uncovered by exploring the differences between the reference genome and the sequences detected in the diseased tissue. However, the public reference genome was derived from the DNA of multiple individuals. As a result, the reference genome is incomplete and may misrepresent the sequence variants of the general population. The more reliable solution is to compare sequences of diseased tissue with the individual's own genome sequence derived from tissue in a normal state. As the price of sequencing a human genome has dropped dramatically to around $1000, documenting a personal genome for every individual has a promising future. However, de novo assembly of individual genomes at an affordable cost is still challenging. Thus, to date, only a few human genomes have been fully assembled. In this review, we introduce the history of human genome sequencing and the evolution of sequencing platforms, from Sanger sequencing to emerging “third generation sequencing” technologies. We present the currently available de novo assembly and post-assembly software packages for human genome assembly and their requirements for computational infrastructures. We suggest that a hybrid assembly combining long and short reads is a promising way to generate good-quality human genome assemblies, and we specify parameters for the quality assessment of assembly outcomes. We offer a perspective on the benefits of using personal genomes as references and suggestions for obtaining a high-quality personal genome. Finally, we discuss the usage of the personal genome in aiding vaccine design and development, monitoring host immune-response, tailoring drug therapy and detecting tumors. We believe that precision medicine would benefit greatly from bioinformatics solutions, particularly for personal genome assembly.
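
As an illustration of an assembly quality parameter, the sketch below computes N50, a widely used contiguity statistic. Whether N50 is among the parameters this review specifies is an assumption; the abstract does not name them. N50 is the length of the contig at which the cumulative length of contigs, sorted from longest to shortest, first reaches half of the total assembly length.

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths (0 if empty)."""
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length
    return 0


# Toy assembly: total length 300, so the half-way point is 150.
example = n50([100, 80, 60, 40, 20])
```

Here the two longest contigs together cover 180 of 300 bases, so the N50 is the length of the second contig. Contiguity alone says nothing about correctness, so it is usually read alongside accuracy and completeness measures.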


July 7, 2019

Genomic characterization of the Atlantic cod sex-locus.

A variety of sex determination mechanisms can be observed in evolutionarily divergent teleosts. Sex determination is genetic in Atlantic cod (Gadus morhua); however, the genomic location and size of its sex-locus are unknown. Here, we characterize the sex-locus of Atlantic cod using whole genome sequence (WGS) data of 227 wild-caught specimens. Analyzing more than 55 million polymorphic loci, we identify 166 loci that are associated with sex. These loci are located in six distinct regions on five different linkage groups (LG) in the genome. The largest of these regions, an approximately 55 kb region on LG11, contains the majority of genotypes that segregate closely according to an XX-XY system. Genotypes in this region can be used to genetically determine sex, whereas those in the other regions are inconsistently sex-linked. The identified region on LG11 and its surrounding genes have no clear sequence homology with genes or regulatory elements associated with sex-determination or differentiation in other species. The functionality of this sex-locus therefore remains unknown. The WGS strategy used here proved adequate for detecting the small regions associated with sex in this species. Our results highlight the evolutionary flexibility in genomic architecture underlying teleost sex-determination and allow practical applications to genetically sex Atlantic cod.
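
What it means for genotypes to segregate according to an XX-XY system can be shown with a toy check. This is not the association test used in the study, which the abstract does not describe. The genotype coding (number of alternate alleles: 0, 1 or 2), the sex labels, and the function name are assumptions. Under a simple XX-XY model, males are expected to be heterozygous at a sex-linked locus and females homozygous.

```python
def xy_consistency(genotypes, sexes):
    """Fraction of individuals whose genotype fits an XX-XY pattern.

    genotypes: alternate-allele counts per individual (0, 1 or 2).
    sexes: "M" or "F" per individual, in the same order.
    Males fit when heterozygous (1); females fit when homozygous (0 or 2).
    """
    fits = 0
    for genotype, sex in zip(genotypes, sexes):
        if sex == "M" and genotype == 1:
            fits += 1
        elif sex == "F" and genotype in (0, 2):
            fits += 1
    return fits / len(genotypes)


sexes = ["M", "M", "M", "F", "F", "F"]
linked_locus = [1, 1, 1, 0, 0, 0]      # segregates perfectly with sex
unlinked_locus = [1, 0, 2, 1, 0, 1]    # no consistent relation to sex

linked_score = xy_consistency(linked_locus, sexes)
unlinked_score = xy_consistency(unlinked_locus, sexes)
```

A locus scoring close to 1.0 across many individuals could serve as a genetic sexing marker, while inconsistently sex-linked loci would score lower.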


July 7, 2019

Representing genetic variation with synthetic DNA standards.

The identification of genetic variation with next-generation sequencing is confounded by the complexity of the human genome sequence and by biases that arise during library preparation, sequencing and analysis. We have developed a set of synthetic DNA standards, termed ‘sequins’, that emulate human genetic features and constitute qualitative and quantitative spike-in controls for genome sequencing. Sequencing reads derived from sequins align exclusively to an artificial in silico reference chromosome, rather than the human reference genome, which allows them to be partitioned for parallel analysis. Here we use this approach to represent common and clinically relevant genetic variation, ranging from single nucleotide variants to large structural rearrangements and copy-number variation. We validate the design and performance of sequin standards by comparison to examples in the NA12878 reference genome, and we demonstrate their utility during the detection and quantification of variants. We provide sequins as a standardized, quantitative resource against which human genetic variation can be measured and diagnostic performance assessed.
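
The partitioning step can be sketched as follows. This is a simplified illustration, not the authors' software: alignments are reduced to hypothetical `(read_id, reference_name)` pairs, and the contig name `synthetic_chr` is a placeholder for the artificial in silico chromosome. Reads aligned to the synthetic reference are routed to the control set, and all others to the sample set.

```python
def partition_alignments(alignments, synthetic_contigs):
    """Split read IDs into (sample, controls) by the reference they align to.

    alignments: iterable of (read_id, reference_name) pairs.
    synthetic_contigs: set of reference names that belong to the
    artificial in silico chromosome (names here are hypothetical).
    """
    sample, controls = [], []
    for read_id, reference_name in alignments:
        if reference_name in synthetic_contigs:
            controls.append(read_id)
        else:
            sample.append(read_id)
    return sample, controls


alignments = [
    ("read1", "chr1"),
    ("read2", "synthetic_chr"),
    ("read3", "chr7"),
    ("read4", "synthetic_chr"),
]
sample_reads, control_reads = partition_alignments(alignments, {"synthetic_chr"})
```

Because the two sets are disjoint, variant calling on the control reads can be compared against the known sequin design without touching the sample analysis.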


July 7, 2019

Distinct Salmonella enteritidis lineages associated with enterocolitis in high-income settings and invasive disease in low-income settings.

An epidemiological paradox surrounds Salmonella enterica serovar Enteritidis. In high-income settings, it has been responsible for an epidemic of poultry-associated, self-limiting enterocolitis, whereas in sub-Saharan Africa it is a major cause of invasive nontyphoidal Salmonella disease, associated with high case fatality. By whole-genome sequence analysis of 675 isolates of S. Enteritidis from 45 countries, we show the existence of a global epidemic clade and two new clades of S. Enteritidis that are geographically restricted to distinct regions of Africa. The African isolates display genomic degradation, a novel prophage repertoire, and an expanded multidrug resistance plasmid. S. Enteritidis is a further example of a Salmonella serotype that displays niche plasticity, with distinct clades that enable it to become a prominent cause of gastroenteritis in association with the industrial production of eggs and of multidrug-resistant, bloodstream-invasive infection in Africa.


July 7, 2019

DBG2OLC: Efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies.

The highly anticipated transition from next generation sequencing (NGS) to third generation sequencing (3GS) has been difficult primarily due to high error rates and excessive sequencing cost. The high error rates make the assembly of long erroneous reads of large genomes challenging because existing software solutions are often overwhelmed by error correction tasks. Here we report a hybrid assembly approach that simultaneously utilizes NGS and 3GS data to address both issues. We gain advantages from three general and basic design principles: (i) Compact representation of the long reads leads to efficient alignments. (ii) Base-level errors can be skipped; structural errors need to be detected and corrected. (iii) Structurally correct 3GS reads are assembled and polished. In our implementation, preassembled NGS contigs are used to derive the compact representation of the long reads, motivating an algorithmic conversion from a de Bruijn graph to an overlap graph, the two major assembly paradigms. Moreover, since NGS and 3GS data can compensate for each other, our hybrid assembly approach reduces both of their sequencing requirements. Experiments show that our software is able to assemble mammalian-sized genomes orders of magnitude more quickly than existing methods without consuming a lot of memory, while saving about half of the sequencing cost.
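
The idea of a compact read representation can be sketched in a few lines. This is a loose illustration under stated assumptions, not the DBG2OLC implementation: how contig hits on a read are found is taken as given, orientation is ignored, and the function names and the `min_shared` threshold are hypothetical. Each long read is reduced to the ordered list of NGS contig identifiers that map to it, and candidate overlaps are read pairs that share enough contig identifiers.

```python
from itertools import combinations


def compress_read(anchor_hits):
    """Reduce a long read to contig IDs ordered by position on the read.

    anchor_hits: list of (position_on_read, contig_id) pairs.
    """
    return [contig_id for _position, contig_id in sorted(anchor_hits)]


def candidate_overlaps(compressed, min_shared=2):
    """List read pairs sharing at least `min_shared` contig IDs.

    compressed: dict mapping read name to its list of contig IDs.
    Returns (read_a, read_b, shared_count) tuples.
    """
    pairs = []
    for (name_a, ids_a), (name_b, ids_b) in combinations(sorted(compressed.items()), 2):
        shared = len(set(ids_a) & set(ids_b))
        if shared >= min_shared:
            pairs.append((name_a, name_b, shared))
    return pairs


# Toy data: contig IDs are arbitrary integers.
hits = {
    "r1": [(0, 7), (500, 12), (900, 3)],
    "r2": [(0, 12), (400, 3), (800, 21)],
    "r3": [(0, 40), (300, 41)],
}
compressed = {name: compress_read(h) for name, h in hits.items()}
overlaps = candidate_overlaps(compressed)
```

Comparing short lists of identifiers instead of tens of thousands of noisy bases is what makes the alignment step cheap; base-level errors are left for a later polishing stage.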


July 7, 2019

New Delhi metallo-β-lactamase-1-producing Klebsiella pneumoniae, Florida, USA.

New Delhi metallo-β-lactamase (NDM)–producing Enterobacteriaceae have swiftly spread worldwide since an initial report in 2008 from a patient who had been transferred from India back home to Sweden (1). Epidemiologically, the global diffusion of NDM-1 producers has been associated with the Indian subcontinent and the Balkan region, which are considered the primary and secondary reservoirs of these pathogens, respectively (1). However, recent reports suggest that countries in the Middle East may constitute another potential reservoir for NDM-1 producers (1). More than 100 NDM-producing isolates have been reported in the United States, most of which were associated with recent travel from the Indian subcontinent (2,3). We report an NDM-1–producing Klebsiella pneumoniae strain that was recovered from a patient who had been transferred from Iran to a hospital in Florida, United States.


July 7, 2019

The report of my death was an exaggeration: A review for researchers using microsatellites in the 21st century.

Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers.


July 7, 2019

Assemblytics: a web analytics tool for the detection of variants from an assembly.

Assemblytics is a web app for detecting and analyzing variants from a de novo genome assembly aligned to a reference genome. It incorporates a unique anchor filtering approach to increase robustness to repetitive elements, and identifies six classes of variants based on their distinct alignment signatures. Assemblytics can be applied both to comparing aberrant genomes, such as human cancers, to a reference, and to identifying differences between related species. Multiple interactive visualizations enable in-depth explorations of the genomic distributions of variants. http://assemblytics.com, https://github.com/marianattestad/assemblytics. Contact: mnattest@cshl.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

