Menu
July 7, 2019

Institutional profile: translational pharmacogenomics at the Icahn School of Medicine at Mount Sinai.

For almost 50 years, the Icahn School of Medicine at Mount Sinai has continually invested in genetics and genomics, facilitating a healthy ecosystem that provides widespread support for the ongoing programs in translational pharmacogenomics. These programs can be broadly cataloged into discovery, education, clinical implementation and testing, which are collaboratively accomplished by multiple departments, institutes, laboratories, companies and colleagues. Focus areas have included drug response association studies and allele discovery, multiethnic pharmacogenomics, personalized genotyping and survey-based education programs, pre-emptive clinical testing implementation and novel assay development. This overview summarizes the current state of translational pharmacogenomics at Mount Sinai, including a future outlook on the forthcoming expansions in overall support, research and clinical programs, genomic technology infrastructure and the participating faculty.


July 7, 2019

Glaucophyta

The Glaucophyta is by far the least species-rich phylum of the Archaeplastida comprising only four described genera, Glaucocystis, Cyanophora, Gloeochaete, and Cyanoptyche, and 15 species. However, recent molecular and morphological analyses reveal that glaucophytes are not as species poor as hitherto assumed with many novel lineages existing in natural environments. Glaucophytes are freshwater phototrophs of moderate to low abundance and retain many ancestral plastid traits derived from the cyanobacterial donor of this organelle, including the remnant peptidoglycan wall in their envelope. These plastids were originally named “cyanelles,” which was later changed to “muroplasts” when their shared ancestry with other Archaeplastida was recognized. The model glaucophyte, Cyanophora paradoxa, is well studied with respect to biochemistry, proteomics, and the gene content of the nuclear and organelle genomes. Investigation of the biosynthesis of cytosolic starch led to a model for the transition from glycogen to starch storage during plastid endosymbiosis. The photosynthetic apparatus, including phycobilisome antennae, resembles that of cyanobacteria. However, the carbon-concentrating mechanism is algal in nature and based on pyrenoids. Studies on protein import into muroplasts revealed a primordial Toc/Tic translocon. The peptidoglycan wall was elucidated with respect to composition, biosynthesis, and involvement of nuclear genes. The muroplast genome is distinct, not due to the number of encoded genes but, rather, because of the presence of unique genes not present on other plastid genomes. The mosaic nature of the gene-rich (27,000) nuclear genome came as a surprise, considering the relatively small genomes of unicellular red algae.


July 7, 2019

Microbial bioinformatics for food safety and production.

In the production of fermented foods, microbes play an important role. Optimization of fermentation processes or starter culture production traditionally was a trial-and-error approach inspired by expert knowledge of the fermentation process. Current developments in high-throughput ‘omics’ technologies allow developing more rational approaches to improve fermentation processes both from the food functionality as well as from the food safety perspective. Here, the authors thematically review typical bioinformatics techniques and approaches to improve various aspects of the microbial production of fermented food products and food safety. © The Author 2015. Published by Oxford University Press.


July 7, 2019

Wide geographical dissemination of the multiresistant Staphylococcus capitis NRCS-A clone in neonatal intensive-care units.

Nosocomial late-onset sepsis represents a frequent cause of morbidity and mortality in preterm neonates. The Staphylococcus capitis clone NRCS-A has been previously described as an emerging cause of nosocomial bacteraemia in French neonatal intensive-care units (NICUs). In this study, we aimed to explore the possible unrecognized dissemination of this clone on a larger geographical scale. One hundred methicillin-resistant S. capitis strains isolated from neonates (n = 86) and adult patients (n = 14) between 2000 and 2013 in four different countries (France, Belgium, the UK, and Australia) were analysed with SmaI pulsed-field gel electrophoresis (PFGE) and dru typing. The vast majority of NICU strains showed the NRCS-A pulsotype and the dt11c type (96%). We then randomly selected 14 isolates (from neonates, n = 12, three per country; from adult patients, n = 2), considered to be a subset of representative isolates, and performed further molecular typing (SacII PFGE, SCCmec typing, and multilocus sequence typing-like analysis), confirming the clonality of the S. capitis strains isolated from neonates, despite their distant geographical origin. Whole genome single-nucleotide polymorphism-based phylogenetic analysis of five NICU isolates (from the different countries) attested to high genetic relatedness within the NRCS-A clone. Finally, all of the NRCS-A strains showed multidrug resistance (e.g. methicillin and aminoglycoside resistance, and decreased vancomycin susceptibility), with potential therapeutic implications for infected neonates. In conclusion, this study represents the first report of clonal dissemination of methicillin-resistant coagulase-negative Staphylococcus clone on a large geographical scale. Questions remain regarding the origin and means of international spread, and the reasons for this clone’s apparent predilection for neonates. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.


July 7, 2019

First Azospirillum genome from aquatic environments: Whole-genome sequence of Azospirillum thiophilum BV-S(T), a novel diazotroph harboring a capacity of sulfur-chemolithotrophy from a sulfide spring.

Azospirillum thiophilum BV-S(T), isolated from a sulfide spring, is a novel nitrogen-fixing bacterium harboring sulfur-lithotrophy. In order to identify genetic characteristics with habitat- and metabolic features contrasting to those from terrestrial Azospirillum species, we present here the genome sequence of a novel species A. thiophilum BV-S(T), with a significance of first genome report in the aquatic Azospirillum species. The genome of strain BV-S(T) is comprised of 7.6Mb chromosome with a GC content of 68.2%. This information will contribute to expand understandings of sulfur-oxidizer microbes that preserve inherencies as a diazotroph, and further it will provide insights into genome plasticity of the genus Azospirillum for niche specific adaptations. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

ALP & FALP: C++ libraries for pairwise local alignment E-values.

Pairwise local alignment is an indispensable tool for molecular biologists. In real time (i.e. in about 1 s), ALP (Ascending Ladder Program) calculates the E-values for protein-protein or DNA-DNA local alignments of random sequences, for arbitrary substitution score matrix, gap costs and letter abundances; and FALP (Frameshift Ascending Ladder Program) performs a similar task, although more slowly, for frameshifting DNA-protein alignments.To permit other C++ programmers to implement the computational efficiencies in ALP and FALP directly within their own programs, C++ source codes are available in the public domain at http://go.usa.gov/3GTSW under ‘ALP’ and ‘FALP’, along with the standalone programs ALP and FALP.spouge@nih.govSupplementary information: Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2015. This work is written by US Government employees and is in the public domain in the US.


July 7, 2019

Genomic resources and their influence on the detection of the signal of positive selection in genome scans.

Genome scans represent powerful approaches to investigate the action of natural selection on the genetic variation of natural populations and to better understand local adaptation. This is very useful, for example, in the field of conservation biology and evolutionary biology. Thanks to Next Generation Sequencing, genomic resources are growing exponentially, improving genome scan analyses in non-model species. Thousands of SNPs called using Reduced Representation Sequencing are increasingly used in genome scans. Besides, genome sequences are also becoming increasingly available, allowing better processing of short-read data, offering physical localization of variants, and improving haplotype reconstruction and data imputation. Ultimately, genome sequences are also becoming the raw material for selection inferences. Here, we discuss how the increasing availability of such genomic resources, notably genome sequences, influences the detection of signals of selection. Mainly, increasing data density and having the information of physical linkage data expand genome scans by (i) improving the overall quality of the data, (ii) helping the reconstruction of demographic history for the population studied to decrease false-positive rates and (iii) improving the statistical power of methods to detect the signal of selection. Of particular importance, the availability of a high-quality reference genome can improve the detection of the signal of selection by (i) allowing matching the potential candidate loci to linked coding regions under selection, (ii) rapidly moving the investigation to the gene and function and (iii) ensuring that the highly variable regions of the genomes that include functional genes are also investigated. For all those reasons, using reference genomes in genome scan analyses is highly recommended. © 2015 John Wiley & Sons Ltd.


July 7, 2019

Genome mining of astaxanthin biosynthetic genes from Sphingomonas sp. ATCC 55669 for heterologous overproduction in Escherichia coli.

As a highly valued keto-carotenoid, astaxanthin is widely used in nutritional supplements and pharmaceuticals. Therefore, the demand for biosynthetic astaxanthin and improved efficiency of astaxanthin biosynthesis has driven the investigation of metabolic engineering of native astaxanthin producers and heterologous hosts. However, microbial resources for astaxanthin are limited. In this study, we found that the a-Proteobacterium Sphingomonas sp. ATCC 55669 could produce astaxanthin naturally. We used whole-genome sequencing to identify the astaxanthin biosynthetic pathway using a combined PacBio-Illumina approach. The putative astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669 was predicted. For further confirmation, a high-efficiency targeted engineering carotenoid synthesis platform was constructed in E. coli for identifying the functional roles of candidate genes. All genes involved in astaxanthin biosynthesis showed discrete distributions on the chromosome. Moreover, the overexpression of exogenous E. coli idi in Sphingomonas sp. ATCC 55669 increased astaxanthin production by 5.4-fold. This study described a new astaxanthin producer and provided more biosynthesis components for bioengineering of astaxanthin in the future. © 2015 The Authors. Biotechnology Journal published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


July 7, 2019

rHAT: fast alignment of noisy long reads with regional hashing.

Single Molecule Real-Time (SMRT) sequencing has been widely applied in cutting-edge genomic studies. However, it is still an expensive task to align the noisy long SMRT reads to reference genome by state-of-the-art aligners, which is becoming a bot-tleneck in applications with SMRT sequencing. Novel approach is on demand for improving the efficiency and effectiveness of SMRT read alignment.We propose Regional Hashing-based Alignment Tool (rHAT), a seed-and-extension-based read alignment approach specifically designed for noisy long reads. rHAT indexes reference genome by regional hash table (RHT), a hash table-based index which describes the short tokens within local windows of reference genome. In the seeding phase, rHAT utilizes RHT for efficiently calculating the occurrences of short token matches between partial read and local genomic windows to find highly possible candidate sites. In the extension phase, a sparse dynamic programming-based heuristic approach is used for reducing the cost of aligning read to the candidate sites. By benchmarking on the real and simulated datasets from various prokaryote and eukaryote genomes, we demonstrated that rHAT can effectively align SMRT reads with outstanding throughput. rHAT is implemented in C++; the source code is available at https://github.com/derekguan/rHAT CONTACT: ydwang@hit.edu.cn. © The Author (2015). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

hybridSPAdes: an algorithm for hybrid assembly of short and long reads.

Recent advances in single molecule real-time (SMRT) and nanopore sequencing technologies have enabled high-quality assemblies from long and inaccurate reads. However, these approaches require high coverage by long reads and remain expensive. On the other hand, the inexpensive short reads technologies produce accurate but fragmented assemblies. Thus, a hybrid approach that assembles long reads (with low coverage) and short reads has a potential to generate high-quality assemblies at reduced cost.We describe hybridSPAdes algorithm for assembling short and long reads and benchmark it on a variety of bacterial assembly projects. Our results demonstrate that hybridSPAdes generates accurate assemblies (even in projects with relatively low coverage by long reads) thus reducing the overall cost of genome sequencing. We further present the first complete assembly of a genome from single cells using SMRT reads.hybridSPAdes is implemented in C++?as a part of SPAdes genome assembler and is publicly available at http://bioinf.spbau.ru/en/spades CONTACT: d.antipov@spbu.ruSupplementary information: supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Timing, rates and spectra of human germline mutation.

Germline mutations are a driving force behind genome evolution and genetic disease. We investigated genome-wide mutation rates and spectra in multi-sibling families. The mutation rate increased with paternal age in all families, but the number of additional mutations per year differed by more than twofold between families. Meta-analysis of 6,570 mutations showed that germline methylation influences mutation rates. In contrast to somatic mutations, we found remarkable consistency in germline mutation spectra between the sexes and at different paternal ages. In parental germ line, 3.8% of mutations were mosaic, resulting in 1.3% of mutations being shared by siblings. The number of these shared mutations varied significantly between families. Our data suggest that the mutation rate per cell division is higher during both early embryogenesis and differentiation of primordial germ cells but is reduced substantially during post-pubertal spermatogenesis. These findings have important consequences for the recurrence risks of disorders caused by de novo mutations.


July 7, 2019

Complete genome sequence of Acinetobacter baumannii XH386 (ST208), a multi-drug resistant bacteria isolated from pediatric hospital in China.

Acinetobacter baumannii is an important bacterium that emerged as a significant nosocomial pathogen worldwide. The rise of A. baumannii was due to its multi-drug resistance (MDR), while it was difficult to treat multi-drug resistant A. baumannii with antibiotics, especially in pediatric patients for the therapeutic options with antibiotics were quite limited in pediatric patients. A. baumannii ST208 was identified as predominant sequence type of carbapenem resistant A. baumannii in the United States and China. As we knew, there was no complete genome sequence reproted for A. baumannii ST208, although several whole genome shotgun sequences had been reported. Here, we sequenced the 4087-kilobase (kb) chromosome and 112-kb plasmid of A. baumannii XH386 (ST208), which was isolated from a pediatric hospital in China. The genome of A. baumannii XH386 contained 3968 protein-coding genes and 94 RNA-only encoding genes. Genomic analysis and Minimum inhibitory concentration assay showed that A. baumannii XH386 was multi-drug resistant strain, which showed resistance to most of antibiotics, except for tigecycline. The data may be accessed via the GenBank accession number CP010779 and CP010780.


July 7, 2019

Analysis of hepatitis C NS5A resistance associated polymorphisms using ultra deep single molecule real time (SMRT) sequencing.

Development of Hepatitis C virus (HCV) resistance against direct-acting antivirals (DAAs), including NS5A inhibitors, is an obstacle to successful treatment of HCV when DAAs are used in sub-optimal combinations. Furthermore, it has been shown that baseline (pre-existing) resistance against DAAs is present in treatment naïve-patients and this will potentially complicate future treatment strategies in different HCV genotypes (GTs). Thus the aim was to detect low levels of NS5A resistant associated variants (RAVs) in a limited sample set of treatment-naïve patients of HCV GT1a and 3a, since such polymorphisms can display in vitro resistance as high as 60000 fold. Ultra-deep single molecule real time (SMRT) sequencing with the Pacific Biosciences (PacBio) RSII instrument was used to detect these RAVs. The SMRT sequencing was conducted on ten samples; three of them positive with Sanger sequencing (GT1a Q30H and Y93N, and GT3a Y93H), five GT1a samples, and two GT3a non-positive samples. The same methods were applied to the HCV GT1a H77-plasmid in a dilution series, in order to determine the error rates of replication, which in turn was used to determine the limit of detection (LOD), as defined by mean + 3SD, of minority variants down to 0.24%. We found important baseline NS5A RAVs at levels between 0.24 and 0.5%, which could potentially have clinical relevance. This new method with low level detection of baseline RAVs could be useful in predicting the most cost-efficient combination of DAA treatment, and reduce the treatment duration for an HCV infected individual. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

MuffinEc: Error correction for de novo assembly via greedy partitioning and sequence alignment

Error correction is typically the first step of de novo genome assembly from NGS data. This step has an important impact on the quality and speed of the assembly process. However, the majority of available stand-alone error correction solutions can only detect and correct mismatches. Therefore, these solutions only support correcting reads generated by Illumina sequencers. Several solutions support insertions and deletions (indels) and are capable of working with multiple technologies. However, these solutions are limited by correction performance and resource consumption. In this paper, we introduce MuffinEc, an indel-aware multi-technology correction method for NGS data. This method uses a greedy approach to create groups of reads and subsequently corrects them using their consensus. MuffinEc surpasses existing solutions by offering better correction ratios for multiple technologies. This method also exploits parallel processing via OpenMP and uses less computational resources than similar programs, thereby being capable of handling large datasets. MuffinEc is open source and freely available at http://muffinec.sourceforge.net.


July 7, 2019

Long read and single molecule DNA sequencing simplifies genome assembly and TAL effector gene analysis of Xanthomonas translucens.

The species Xanthomonas translucens encompasses a complex of bacterial strains that cause diseases and yield loss on grass species including important cereal crops. Three pathovars, X. translucens pv. undulosa, X. translucens pv. translucens and X. translucens pv.cerealis, have been described as pathogens of wheat, barley, and oats. However, no complete genome sequence for a strain of this complex is currently available.A complete genome sequence of X. translucens pv. undulosa strain XT4699 was obtained by using PacBio long read, single molecule, real time (SMRT) DNA sequences and Illumina sequences. Draft genome sequences of nineteen additional X. translucens strains, which were collected from wheat or barley in different regions and at different times, were generated by Illumina sequencing. Phylogenetic relationships among different Xanthomonas strains indicates that X. translucens are members of a distinct clade from so-called group 2 xanthomonads and three pathovars of this species, undulosa, translucens and cerealis, represent distinct subclades in the group 1 clade. Knockout mutation of type III secretion system of XT4699 eliminated the ability to cause water-soaking symptoms on wheat and barley and resulted in a reduction in populations on wheat in comparison to the wild type strain. Sequence comparison of X. translucens strains revealed the genetic variation on type III effector repertories among different pathovars or within one pathovar. The full genome sequence of XT4699 reveals the presence of eight members of the Transcription-Activator Like (TAL) effector genes, which are phylogenetically distant from previous known TAL effector genes of group 2 xanthomonads. Microarray and qRT-PCR analyses revealed TAL effector-specific wheat gene expression modulation.PacBio long read sequencing facilitates the assembly of Xanthomonas genomes and the multiple TAL effector genes, which are difficult to assemble from short read platforms. The complete genome sequence of X. translucens pv. undulosa strain XT4699 and draft genome sequences of nineteen additional X. translucens strains provides a resource for further genetic analyses of pathogenic diversity and host range of the X. translucens species complex. TAL effectors of XT4699 strain play roles in modulating wheat host gene expressions.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.