Menu
July 19, 2019

The dentin phosphoprotein repeat region and inherited defects of dentin.

Nonsyndromic dentin defects classified as type II dentin dysplasia and types II and III dentinogenesis imperfecta are caused by mutations in DSPP (dentin sialophosphoprotein). Most reported disease-causing DSPP mutations occur within the repetitive DPP (dentin phosphoprotein) coding sequence. We characterized the DPP sequences of five probands with inherited dentin defects using single molecule real-time (SMRT) DNA sequencing. Eight of the 10 sequences matched previously reported DPP length haplotypes and two were novel. Alignment with known DPP sequences showed 32 indels arranged in 36 different patterns. Sixteen of the 32 indels were not represented in more than one haplotype. The 25 haplotypes with confirmed indels were aligned to generate a tree that describes how the length variations might have evolved. Some indels were independently generated in multiple lines. A previously reported disease-causing DSPP mutation in Family 1 was confirmed and its position clarified (c.3135delC; p.Ser1045Argfs*269). A novel frameshift mutation (c.3504_3508dup; p.Asp1170Alafs*146) caused the dentin defects in Family 2. A COL1A2 (c.2027G>A or p.Gly676Asp) missense mutation, discovered by whole-exome sequencing, caused the dentin defects in Family 3. We conclude that SMRT sequencing characterizes the DPP repeat region without cloning and can improve our understanding of normal and pathological length variations in DSPP alleles.


July 19, 2019

Mind the gap; seven reasons to close fragmented genome assemblies.

Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including those that study non-model organisms. Thus, hundreds of fungal genomes have been sequenced and are publically available today, although these initiatives have typically yielded considerably fragmented genome assemblies that often lack large contiguous genomic regions. Many important genomic features are contained in intergenic DNA that is often missing in current genome assemblies, and recent studies underscore the significance of non-coding regions and repetitive elements for the life style, adaptability and evolution of many organisms. The study of particular types of genetic elements, such as telomeres, centromeres, repetitive elements, effectors, and clusters of co-regulated genes, but also of phenomena such as structural rearrangements, genome compartmentalization and epigenetics, greatly benefits from having a contiguous and high-quality, preferably even complete and gapless, genome assembly. Here we discuss a number of important reasons to produce gapless, finished, genome assemblies to help answer important biological questions. Copyright © 2015 Elsevier Inc. All rights reserved.


July 19, 2019

An improved genome reference for the African cichlid, Metriaclima zebra.

Problems associated with using draft genome assemblies are well documented and have become more pronounced with the use of short read data for de novo genome assembly. We set out to improve the draft genome assembly of the African cichlid fish, Metriaclima zebra, using a set of Pacific Biosciences SMRT sequencing reads corresponding to 16.5× coverage of the genome. Here we characterize the improvements that these long reads allowed us to make to the state-of-the-art draft genome previously assembled from short read data.Our new assembly closed 68 % of the existing gaps and added 90.6Mbp of new non-gap sequence to the existing draft assembly of M. zebra. Comparison of the new assembly to the sequence of several bacterial artificial chromosome clones confirmed the accuracy of the new assembly. The closure of sequence gaps revealed thousands of new exons, allowing significant improvement in gene models. We corrected one known misassembly, and identified and fixed other likely misassemblies. 63.5 Mbp (70 %) of the new sequence was classified as repetitive and the new sequence allowed for the assembly of many more transposable elements.Our improvements to the M. zebra draft genome suggest that a reasonable investment in long reads could greatly improve many comparable vertebrate draft genome assemblies.


July 19, 2019

Heterogeneous composition of key metabolic gene clusters in a vent mussel symbiont population.

Chemosynthetic symbiosis is one of the successful systems for adapting to a wide range of habitats including extreme environments, and the metabolic capabilities of symbionts enable host organisms to expand their habitat ranges. However, our understanding of the adaptive strategies that enable symbiotic organisms to expand their habitats is still fragmentary. Here, we report that a single-ribotype endosymbiont population in an individual of the host vent mussel, Bathymodiolus septemdierum has heterogeneous genomes with regard to the composition of key metabolic gene clusters for hydrogen oxidation and nitrate reduction. The host individual harbours heterogeneous symbiont subpopulations that either possess or lack the gene clusters encoding hydrogenase or nitrate reductase. The proportions of the different symbiont subpopulations in a host appeared to vary with the environment or with the host’s development. Furthermore, the symbiont subpopulations were distributed in patches to form a mosaic pattern in the gill. Genomic heterogeneity in an endosymbiont population may enable differential utilization of diverse substrates and confer metabolic flexibility. Our findings open a new chapter in our understanding of how symbiotic organisms alter their metabolic capabilities and expand their range of habitats.


July 19, 2019

Genome-directed lead discovery: biosynthesis, structure elucidation, and biological evaluation of two families of polyene macrolactams against Trypanosoma brucei.

Marine natural products are an important source of lead compounds against many pathogenic targets. Herein, we report the discovery of lobosamides A-C from a marine actinobacterium, Micromonospora sp., representing three new members of a small but growing family of bacterially produced polyene macrolactams. The lobosamides display growth inhibitory activity against the protozoan parasite Trypanosoma brucei (lobosamide A IC50 = 0.8 µM), the causative agent of human African trypanosomiasis (HAT). The biosynthetic gene cluster of the lobosamides was sequenced and suggests a conserved cluster organization among the 26-membered macrolactams. While determination of the relative and absolute configurations of many members of this family is lacking, the absolute configurations of the lobosamides were deduced using a combination of chemical modification, detailed spectroscopic analysis, and bioinformatics. We implemented a “molecules-to-genes-to-molecules” approach to determine the prevalence of similar clusters in other bacteria, which led to the discovery of two additional macrolactams, mirilactams A and B from Actinosynnema mirum. These additional analogs have allowed us to identify specific structure-activity relationships that contribute to the antitrypanosomal activity of this class. This approach illustrates the power of combining chemical analysis and genomics in the discovery and characterization of natural products as new lead compounds for neglected disease targets.


July 19, 2019

Recurrent methicillin-resistant Staphylococcus aureus cutaneous abscesses and selection of reduced chlorhexidine susceptibility during chlorhexidine use.

We describe the selection of reduced chlorhexidine susceptibility during chlorhexidine use in a patient with two episodes of cutaneous USA300 methicillin-resistant Staphylococcus aureus abscess. The second clinical isolate harbors a novel plasmid that encodes the QacA efflux pump. Greater use of chlorhexidine for disease prevention warrants surveillance for resistance. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 19, 2019

Genetic variation and the de novo assembly of human genomes.

The discovery of genetic variation and the assembly of genome sequences are both inextricably linked to advances in DNA-sequencing technology. Short-read massively parallel sequencing has revolutionized our ability to discover genetic variation but is insufficient to generate high-quality genome assemblies or resolve most structural variation. Full resolution of variation is only guaranteed by complete de novo assembly of a genome. Here, we review approaches to genome assembly, the nature of gaps or missing sequences, and biases in the assembly process. We describe the challenges of generating a complete de novo genome assembly using current technologies and the impact that being able to perfectly sequence the genome would have on understanding human disease and evolution. Finally, we summarize recent technological advances that improve both contiguity and accuracy and emphasize the importance of complete de novo assembly as opposed to read mapping as the primary means to understanding the full range of human genetic variation.


July 19, 2019

Genomic epidemiology of hypervirulent serogroup W, ST-11 Neisseria meningitidis

Neisseria meningitidis is a leading bacterial cause of sepsis and meningitis globally with dynamic strain distribution over time. Beginning with an epidemic among Hajj pilgrims in 2000, serogroup W (W) sequence type (ST) 11 emerged as a leading cause of epidemic meningitis in the African ‘meningitis belt’ and endemic cases in South America, Europe, Middle East and China. Previous genotyping studies were unable to reliably discriminate sporadic W ST-11 strains in circulation since 1970 from the Hajj outbreak strain (Hajj clone). It is also unclear what proportion of more recent W ST-11 disease clusters are caused by direct descendants of the Hajj clone. Whole genome sequences of 270 meningococcal strains isolated from patients with invasive meningococcal disease globally from 1970 to 2013 were compared using whole genome phylogenetic and major antigen-encoding gene sequence analyses. We found that all W ST-11 strains were descendants of an ancestral strain that had undergone unique capsular switching events. The Hajj clone and its descendants were distinct from other W ST-11 strains in that they shared a common antigen gene profile and had undergone recombination involving virulence genes encoding factor H binding protein, nitric oxide reductase, and nitrite reductase. These data demonstrate that recent acquisition of a distinct antigen-encoding gene profile and variations in meningococcal virulence genes was associated with the emergence of the Hajj clone. Importantly, W ST-11 strains unrelated to the Hajj outbreak contribute a significant proportion of W ST-11 cases globally. This study helps illuminate genomic factors associated with meningococcal strain emergence and evolution.


July 19, 2019

Stepwise evolution of pandrug-resistance in Klebsiella pneumoniae.

Carbapenem resistant Enterobacteriaceae (CRE) pose an urgent risk to global human health. CRE that are non-susceptible to all commercially available antibiotics threaten to return us to the pre-antibiotic era. Using Single Molecule Real Time (SMRT) sequencing we determined the complete genome of a pandrug-resistant Klebsiella pneumoniae isolate, representing the first complete genome sequence of CRE resistant to all commercially available antibiotics. The precise location of acquired antibiotic resistance elements, including mobile elements carrying genes for the OXA-181 carbapenemase, were defined. Intriguingly, we identified three chromosomal copies of an ISEcp1-blaOXA-181 mobile element, one of which has disrupted the mgrB regulatory gene, accounting for resistance to colistin. Our findings provide the first description of pandrug-resistant CRE at the genomic level, and reveal the critical role of mobile resistance elements in accelerating the emergence of resistance to other last resort antibiotics.


July 19, 2019

Variable genetic architectures produce virtually identical molecules in bacterial symbionts of fungus-growing ants.

Small molecules produced by Actinobacteria have played a prominent role in both drug discovery and organic chemistry. As part of a larger study of the actinobacterial symbionts of fungus-growing ants, we discovered a small family of three previously unreported piperazic acid-containing cyclic depsipeptides, gerumycins A-C. The gerumycins are slightly smaller versions of dentigerumycin, a cyclic depsipeptide that selectively inhibits a common fungal pathogen, Escovopsis. We had previously identified this molecule from a Pseudonocardia associated with Apterostigma dentigerum, and now we report the molecule from an associate of the more highly derived ant Trachymyrmex cornetzi. The three previously unidentified compounds, gerumycins A-C, have essentially identical structures and were produced by two different symbiotic Pseudonocardia spp. from ants in the genus Apterostigma found in both Panama and Costa Rica. To understand the similarities and differences in the biosynthetic pathways that produced these closely related molecules, the genomes of the three producing Pseudonocardia were sequenced and the biosynthetic gene clusters identified. This analysis revealed that dramatically different biosynthetic architectures, including genomic islands, a plasmid, and the use of spatially separated genetic loci, can lead to molecules with virtually identical core structures. A plausible evolutionary model that unifies these disparate architectures is presented.


July 19, 2019

Pangenome analysis of Bifidobacterium longum and site-directed mutagenesis through by-pass of restriction-modification systems.

Bifidobacterial genome analysis has provided insights as to how these gut commensals adapt to and persist in the human GIT, while also revealing genetic diversity among members of a given bifidobacterial (sub)species. Bifidobacteria are notoriously recalcitrant to genetic modification, which prevents exploration of their genomic functions, including those that convey (human) health benefits.PacBio SMRT sequencing was used to determine the whole genome seqeunces of two B. longum subsp. longum strains. The B. longum pan-genome was computed using PGAP v1.2 and the core B. longum phylogenetic tree was constructed using a maximum-likelihood based approach in PhyML v3.0. M.blmNCII was cloned in E. coli and an internal fragment if arfBarfB was cloned into pORI19 for insertion mutagenesis.In this study we present the complete genome sequences of two Bifidobacterium longum subsp. longum strains. Comparative analysis with thirty one publicly available B. longum genomes allowed the definition of the B. longum core and dispensable genomes. This analysis also highlighted differences in particular metabolic abilities between members of the B. longum subspecies infantis, longum and suis. Furthermore, phylogenetic analysis of the B. longum core genome indicated the existence of a novel subspecies. Methylome data, coupled to the analysis of restriction-modification systems, allowed us to substantially increase the genetic accessibility of B. longum subsp. longum NCIMB 8809 to a level that was shown to permit site-directed mutagenesis.Comparative genomic analysis of thirty three B. longum representatives revealed a closed pan-genome for this bifidobacterial species. Phylogenetic analysis of the B. longum core genome also provides evidence for a novel fifth B. longum subspecies. Finally, we improved genetic accessibility for the strain B. longum subsp. longum NCIMB 8809, which allowed the generation of a mutant of this strain.


July 19, 2019

The pineapple genome and the evolution of CAM photosynthesis.

Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ? duplication event. The pineapple lineage has transitioned from C3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues. CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.


July 19, 2019

Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships.

Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33-35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution.


July 19, 2019

Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16?kilobases) reads with random errors, we assembled 99% (244?megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4?megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.


July 19, 2019

A supergene determines highly divergent male reproductive morphs in the ruff.

Three strikingly different alternative male mating morphs (aggressive ‘independents’, semicooperative ‘satellites’ and female-mimic ‘faeders’) coexist as a balanced polymorphism in the ruff, Philomachus pugnax, a lek-breeding wading bird. Major differences in body size, ornamentation, and aggressive and mating behaviors are inherited as an autosomal polymorphism. We show that development into satellites and faeders is determined by a supergene consisting of divergent alternative, dominant and non-recombining haplotypes of an inversion on chromosome 11, which contains 125 predicted genes. Independents are homozygous for the ancestral sequence. One breakpoint of the inversion disrupts the essential CENP-N gene (encoding centromere protein N), and pedigree analysis confirms the lethality of homozygosity for the inversion. We describe new differences in behavior, testis size and steroid metabolism among morphs and identify polymorphic genes within the inversion that are likely to contribute to the differences among morphs in reproductive traits.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.