Menu
July 7, 2019

Ultraaccurate genome sequencing and haplotyping of single human cells.

Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10-8and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.


July 7, 2019

Disease onset in X-linked dystonia-parkinsonism correlates with expansion of a hexameric repeat within an SVA retrotransposon in TAF1.

X-linked dystonia-parkinsonism (XDP) is a neurodegenerative disease associated with an antisense insertion of a SINE-VNTR-Alu (SVA)-type retrotransposon within an intron ofTAF1This unique insertion coincides with six additional noncoding sequence changes inTAF1, the gene that encodes TATA-binding protein-associated factor-1, which appear to be inherited together as an identical haplotype in all reported cases. Here we examined the sequence of this SVA in XDP patients (n= 140) and detected polymorphic variation in the length of a hexanucleotide repeat domain, (CCCTCT)nThe number of repeats in these cases ranged from 35 to 52 and showed a highly significant inverse correlation with age at disease onset. Because other SVAs exhibit intrinsic promoter activity that depends in part on the hexameric domain, we assayed the transcriptional regulatory effects of varying hexameric lengths found in the unique XDP SVA retrotransposon using luciferase reporter constructs. When inserted sense or antisense to the luciferase reading frame, the XDP variants repressed or enhanced transcription, respectively, to an extent that appeared to vary with length of the hexamer. Further in silico analysis of this SVA sequence revealed multiple motifs predicted to form G-quadruplexes, with the greatest potential detected for the hexameric repeat domain. These data directly link sequence variation within the XDP-specific SVA sequence to phenotypic variability in clinical disease manifestation and provide insight into potential mechanisms by which this intronic retroelement may induce transcriptional interference inTAF1expression. Copyright © 2017 the Author(s). Published by PNAS.


July 7, 2019

Comparative whole-genomic analysis of an ancient L2 lineage Mycobacterium novel phylogenetic clade and common genetic determinants of hypervirulent strains.

Background: Development of improved therapeutics against tuberculosis (TB) is hindered by an inadequate understanding of the relationship between disease severity and genetic diversity of its causative agent, Mycobacterium tuberculosis. We previously isolated a hypervirulent M. tuberculosis strain H112 from an HIV-negative patient with an aggressive disease progression from pulmonary TB to tuberculous meningitis—the most severe manifestation of tuberculosis. Human macrophage challenge experiment demonstrated that the strain H112 exhibited significantly better intracellular survivability and induced lower level of TNF-a than the reference virulent strain H37Rv and other 123 clinical isolates. Aim: The present study aimed to identify the potential genetic determinants of mycobacterial virulence that were common to strain H112 and hypervirulent M. tuberculosis strains of the same phylogenetic clade isolated in other global regions. Methods: A low-virulent M. tuberculosis strain H54 which belonged to the same phylogenetic lineage (L2) as strain H112 was selected from a collection of 115 clinical isolates. Both H112 and H54 were whole-genome-sequenced using PacBio sequencing technology. A comparative genomics approach was adopted to identify mutations present in strain H112 but absent in strain H54. Subsequently, an extensive phylogenetic analysis was conducted by including all publically available M. tuberculosis genomes. Single-nucleotide-polymorphisms (SNPs) and structural variations (SVs) common to hypervirulent strains in the global collection of genomes were considered as potential genetic determinants of hypervirulence. Results: Sequencing data revealed that both H112 and H54 were identified as members of the same sub-lineage L2.2.1. After excluding the lineage-related mutations shared between H112 and H54, we analyzed the phylogenetic relatedness of H112 with global collection of M. tuberculosis genomes (n = 4,338), and identified a novel phylogenetic clade in which four hypervirulent strains isolated from geographically diverse regions were clustered together. All hypervirulent strains in the clade shared 12 SNPs and 5 SVs with H112, including those affecting key virulence-associated loci, notably, a deleterious SNP (rv0178 p. D150E) within mce1 operon and an intergenic deletion (854259_ 854261delCC) in close-proximity to phoP. Conclusion: The present study identified common genetic factors in a novel phylogenetic clade of hypervirulent M. tuberculosis. The causative role of these mutations in mycobacterial virulence should be validated in future study.


July 7, 2019

Genomic clues to the parental origin of the wild flowering cherry Prunus yedoensis var. nudiflora (Rosaceae)

Prunus yedoensis Matsumura is one of the popular ornamental flowering cherry trees native to northeastern Asia, and its wild populations have only been found on Jeju Island, Korea. Previous studies suggested that wild P. yedoensis (P. yedoensis var. nudiflora) is a hybrid species; however, there is no solid evidence on its exact parental origin and genomic organization. In this study, we developed a total of 38 nuclear gene-based DNA markers that can be universally amplifiable in the Prunus species using 586 Prunus Conserved Orthologous Gene Set (Prunus COS). Using the Prunus COS markers, we investigated the genetic structure of wild P. yedoensis populations and evaluated the putative parental species of wild P. yedoensis. Population structure and phylogenetic analysis of 73 wild P. yedoensis accessions and 54 accessions of other Prunus species revealed that the wild P. yedoensis on Jeju Island is a natural homoploid hybrid. Sequence-level comparison of Prunus COS markers between species suggested that wild P. yedoensis might originate from a cross between maternal P. pendula f. ascendens and paternal P. jamasakura. Moreover, approximately 81% of the wild P. yedoensis accessions examined were likely F1 hybrids, whereas the remaining 19% were backcross hybrids resulting from additional asymmetric introgression of parental genotypes. These findings suggest that complex hybridization of the Prunus species on Jeju Island can produce a range of variable hybrid offspring. Overall, this study makes a significant contribution to address issues of the origin, nomenclature, and genetic relationship of ornamental P. yedoensis.


July 7, 2019

Mechanisms of adaptive divergence and speciation in Littorina saxatilis: Integrating knowledge from ecology and genetics with new data emerging from genomic studies

New opportunities to understand marine speciation and evolution of local adaptation come with genomic approaches and with the development of comprehensive model systems. The marine snail Littorina saxatilis is one example of a developing marine model for investigating genetic mechanisms of rapid divergence and evolution in natural systems. This species is strongly polymorphic and shows formation of local ecotypes throughout its distribution. Support is strong for primary (in situ) and parallel formation of reproductively semi-isolated ecotypes with contact zones between heterogeneous intertidal microhabitats. This makes this species an ideal organism for gaining new insights into the interplay of divergent selection, gene flow and genetic drift during local adaptation and speciation. A relatively well-resolved draft genome and a genetic map describing 17 linkage groups (“chromosomes”) are key tools for investigating the role of structural genomic variation, such as inversions, gene duplications and translocations. Whole genome re-sequencing of pools of individuals and the first comprehensive study of a contact zone contribute direct information on selection and barriers to gene flow present in specific regions of the genome. Linking selection at the phenotypic level to patterns obser ved in the genome is under way by quantitative trait loci mapping and annotation of candidate genes, while the role of single mutations on individual fitness will have to await development of gene manipulation tools. The features of the snail system facilitate the study of local adaptation and speciation and its genomic basis, but the underlying evolutionary processes are expected to be similar in other organisms, and hence this species is a useful model.


July 7, 2019

Genome sequence-based marker development and genotyping in potato

Potato (Solanum tuberosum L.) is one of the world’s most economically important food crops and holds major significance for future food security. Despite its importance, the study of potato genetics and breeding has lagged behind mainly due to its polyploid genome and high levels of heterozygosity. Conventional marker and genotyping approaches have been helpful in progressing potato genetic research but have also had limitations in exploiting the outcome from these studies for gene discovery and applied research applications. The sequencing of the potato genome, followed by advancements in marker and genotyping technologies, has brought a step change in the way potato genetic studies are conducted. Potato is now amenable to modern sequence-based marker and genotyping methods with their increased ability to put thousands of markers on any population of interest without a priori knowledge. This has increased the precision and resolution of genetic studies previously not feasible in potato. A diverse range of fixed and flexible genotyping platforms, for a wide variety of research and breeding applications, are now available. Concerted research efforts are now needed to screen the available genetic diversity for this important crop to identify novel and beneficial trait alleles in order to enable efficient and precise introgression breeding permitting breeding of climate smart, and resilient, potato cultivars. This chapter provides an overview of sequence-based marker development and genotyping methods along with their implications for potato research and breeding in the post-genomics era.


July 7, 2019

The state of whole-genome sequencing

Over the last decade, a technological paradigm shift has slashed the cost of DNA sequencing by over five orders of magnitude. Today, the cost of sequencing a human genome is a few thousand dollars, and it continues to fall. Here, we review the most cost-effective platforms for whole-genome sequencing (WGS) as well as emerging technologies that may displace or complement these. We also discuss the practical challenges of generating and analyzing WGS data, and how WGS has unlocked new strategies for discovering genes and variants underlying both rare and common human diseases.


July 7, 2019

Two orangutan species have evolved different KIR alleles and haplotypes.

The immune and reproductive functions of human NK cells are regulated by interactions of the C1 and C2 epitopes of HLA-C with C1-specific and C2-specific lineage III killer cell Ig-like receptors (KIR). This rapidly evolving and diverse system of ligands and receptors is restricted to humans and great apes. In this context, the orangutan has particular relevance because it represents an evolutionary intermediate, one having the C1 epitope and corresponding KIR but lacking the C2 epitope. Through a combination of direct sequencing, KIR genotyping, and data mining from the Great Ape Genome Project, we characterized the KIR alleles and haplotypes for panels of 10 Bornean orangutans and 19 Sumatran orangutans. The orangutan KIR haplotypes have between 5 and 10 KIR genes. The seven orangutan lineage III KIR genes all locate to the centromeric region of the KIR locus, whereas their human counterparts also populate the telomeric region. One lineage III KIR gene is Bornean specific, one is Sumatran specific, and five are shared. Of 12 KIR gene-content haplotypes, 5 are Bornean specific, 5 are Sumatran specific, and 2 are shared. The haplotypes have different combinations of genes encoding activating and inhibitory C1 receptors that can be of higher or lower affinity. All haplotypes encode an inhibitory C1 receptor, but only some haplotypes encode an activating C1 receptor. Of 130 KIR alleles, 55 are Bornean specific, 65 are Sumatran specific, and 10 are shared. Copyright © 2017 by The American Association of Immunologists, Inc.


July 7, 2019

Letting go: bacterial genome reduction solves the dilemma of adapting to predation mortality in a substrate-restricted environment.

Resource limitation and predation mortality are major determinants of microbial population dynamics, and optimization for either aspect is considered to imply a trade-off with respect to the other. Adaptation to these selective factors may, moreover, lead to disadvantages at rich growth conditions. We present an example of a concomitant evolutionary optimization to both, substrate limitation and predation in an aggregate-forming freshwater bacterial isolate, and we elucidate an underlying genomic mechanism. Bacteria were propagated in serial batch culture in a nutrient-restricted environment either with or without a bacterivorous flagellate. Strains isolated after 26 growth cycles of the predator-prey co-cultures formed as much total biomass as the ancestor at ancestral growth conditions, albeit largely reallocated to cell aggregates. A ~273?kbp genome fragment was lost in three strains that had independently evolved with predators. These strains had significantly higher growth yield on substrate-restricted media than others that were isolated from the same treatment before the excision event. Under predation pressure, the isolates with the deletion outcompeted both, the ancestor and the strains evolved without predators even at rich growth conditions. At the same time, genome reduction led to a growth disadvantage in the presence of benzoate due to the loss of the respective degradation pathway, suggesting that niche constriction might be the price for the bidirectional optimization.


July 7, 2019

Institutional profile: translational pharmacogenomics at the Icahn School of Medicine at Mount Sinai.

For almost 50 years, the Icahn School of Medicine at Mount Sinai has continually invested in genetics and genomics, facilitating a healthy ecosystem that provides widespread support for the ongoing programs in translational pharmacogenomics. These programs can be broadly cataloged into discovery, education, clinical implementation and testing, which are collaboratively accomplished by multiple departments, institutes, laboratories, companies and colleagues. Focus areas have included drug response association studies and allele discovery, multiethnic pharmacogenomics, personalized genotyping and survey-based education programs, pre-emptive clinical testing implementation and novel assay development. This overview summarizes the current state of translational pharmacogenomics at Mount Sinai, including a future outlook on the forthcoming expansions in overall support, research and clinical programs, genomic technology infrastructure and the participating faculty.


July 7, 2019

Characterization of ESBL disseminating plasmids.

Bacteria producing extended-spectrum ß-lactamases (ESBLs) constitute a globally increasing problem that contributes to treatment complications and elevated death rates. The extremely successful dissemination by ESBL-producing Enterobacteriaceae during the latest decades is a result of the combination of mobilization, evolution and horizontal spread of ß-lactamase genes on plasmids. In parallel, spread of these plasmids to particularly well-adapted bacterial clones (outbreak clones) has expanded. In this review we describe ESBL-producing bacteria and the genetic mechanisms for dissemination of ESBL resistance. We describe available methodology for studying plasmids and the importance of including plasmids in epidemiological typing as natural parts of the organisms. Plasmids play a fundamental role in how resistance arises and disseminates.


July 7, 2019

Effects of genome structure variation, homeologous genes and repetitive DNA on polyploid crop research in the age of genomics.

Compared to diploid species, allopolyploid crop species possess more complex genomes, higher productivity, and greater adaptability to changing environments. Next generation sequencing techniques have produced high-density genetic maps, whole genome sequences, transcriptomes and epigenomes for important polyploid crops. However, several problems interfere with the full application of next generation sequencing techniques to these crops. Firstly, different types of genomic variation affect sequence assembly and QTL mapping. Secondly, duplicated or homoeologous genes can diverge in function and then lead to emergence of many minor QTL, which increases difficulties in fine mapping, cloning and marker assisted selection. Thirdly, repetitive DNA sequences arising in polyploid crop genomes also impact sequence assembly, and are increasingly being shown to produce small RNAs to regulate gene expression and hence phenotypic traits. We propose that these three key features should be considered together when analyzing polyploid crop genomes. It is apparent that dissection of genomic structural variation, elucidation of the function and mechanism of interaction of homoeologous genes, and investigation of the de novo roles of repeat sequences in agronomic traits are necessary for genomics-based crop breeding in polyploids. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.


July 7, 2019

Wide geographical dissemination of the multiresistant Staphylococcus capitis NRCS-A clone in neonatal intensive-care units.

Nosocomial late-onset sepsis represents a frequent cause of morbidity and mortality in preterm neonates. The Staphylococcus capitis clone NRCS-A has been previously described as an emerging cause of nosocomial bacteraemia in French neonatal intensive-care units (NICUs). In this study, we aimed to explore the possible unrecognized dissemination of this clone on a larger geographical scale. One hundred methicillin-resistant S. capitis strains isolated from neonates (n = 86) and adult patients (n = 14) between 2000 and 2013 in four different countries (France, Belgium, the UK, and Australia) were analysed with SmaI pulsed-field gel electrophoresis (PFGE) and dru typing. The vast majority of NICU strains showed the NRCS-A pulsotype and the dt11c type (96%). We then randomly selected 14 isolates (from neonates, n = 12, three per country; from adult patients, n = 2), considered to be a subset of representative isolates, and performed further molecular typing (SacII PFGE, SCCmec typing, and multilocus sequence typing-like analysis), confirming the clonality of the S. capitis strains isolated from neonates, despite their distant geographical origin. Whole genome single-nucleotide polymorphism-based phylogenetic analysis of five NICU isolates (from the different countries) attested to high genetic relatedness within the NRCS-A clone. Finally, all of the NRCS-A strains showed multidrug resistance (e.g. methicillin and aminoglycoside resistance, and decreased vancomycin susceptibility), with potential therapeutic implications for infected neonates. In conclusion, this study represents the first report of clonal dissemination of methicillin-resistant coagulase-negative Staphylococcus clone on a large geographical scale. Questions remain regarding the origin and means of international spread, and the reasons for this clone’s apparent predilection for neonates. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.


July 7, 2019

Estimating fitness of viral quasispecies from next-generation sequencing data.

The quasispecies model is ubiquitous in the study of viruses. While having lead to a number of insights that have stood the test of time, the quasispecies model has mostly been discussed in a theoretical fashion with little support of data. With next-generation sequencing (NGS), this situation is changing and a wealth of data can now be produced in a time- and cost-efficient manner. NGS can, after removal of technical errors, yield an exceedingly detailed picture of the viral population structure. The widespread availability of cross-sectional data can be used to study fitness landscapes of viral populations in the quasispecies model. This chapter highlights methods that estimate the strength of selection in selective sweeps, assesses marginal fitness effects of quasispecies, and finally infers the fitness landscape of a viral quasispecies, all on the basis of NGS data.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.