Menu
July 7, 2019  |  

Antibiotic failure mediated by a resistant subpopulation in Enterobacter cloacae.

Antibiotic resistance is a major public health threat, further complicated by unexplained treatment failures caused by bacteria that appear antibiotic susceptible. We describe an Enterobacter cloacae isolate harbouring a minor subpopulation that is highly resistant to the last-line antibiotic colistin. This subpopulation was distinct from persisters, became predominant in colistin, returned to baseline after colistin removal and was dependent on the histidine kinase PhoQ. During murine infection, but in the absence of colistin, innate immune defences led to an increased frequency of the resistant subpopulation, leading to inefficacy of subsequent colistin therapy. An isolate with a lower-frequency colistin-resistant subpopulation similarly caused treatment failure but was misclassified as susceptible by current diagnostics once cultured outside the host. These data demonstrate the ability of low-frequency bacterial subpopulations to contribute to clinically relevant antibiotic resistance, elucidating an enigmatic cause of antibiotic treatment failure and highlighting the critical need for more sensitive diagnostics.


July 7, 2019  |  

deBWT: parallel construction of Burrows-Wheeler Transform for large collection of genomes with de Bruijn-branch encoding.

With the development of high-throughput sequencing, the number of assembled genomes continues to rise. It is critical to well organize and index many assembled genomes to promote future genomics studies. Burrows-Wheeler Transform (BWT) is an important data structure of genome indexing, which has many fundamental applications; however, it is still non-trivial to construct BWT for large collection of genomes, especially for highly similar or repetitive genomes. Moreover, the state-of-the-art approaches cannot well support scalable parallel computing owing to their incremental nature, which is a bottleneck to use modern computers to accelerate BWT construction.We propose de Bruijn branch-based BWT constructor (deBWT), a novel parallel BWT construction approach. DeBWT innovatively represents and organizes the suffixes of input sequence with a novel data structure, de Bruijn branch encoding. This data structure takes the advantage of de Bruijn graph to facilitate the comparison between the suffixes with long common prefix, which breaks the bottleneck of the BWT construction of repetitive genomic sequences. Meanwhile, deBWT also uses the structure of de Bruijn graph for reducing unnecessary comparisons between suffixes. The benchmarking suggests that, deBWT is efficient and scalable to construct BWT for large dataset by parallel computing. It is well-suited to index many genomes, such as a collection of individual human genomes, with multiple-core servers or clusters.deBWT is implemented in C language, the source code is available at https://github.com/hitbc/deBWT or https://github.com/DixianZhu/deBWTContact: ydwang@hit.edu.cnSupplementary data are available at Bioinformatics online.© The Author 2016. Published by Oxford University Press.


July 7, 2019  |  

Normocyte-binding protein required for human erythrocyte invasion by the zoonotic malaria parasite Plasmodium knowlesi.

The dominant cause of malaria in Malaysia is now Plasmodium knowlesi, a zoonotic parasite of cynomolgus macaque monkeys found throughout South East Asia. Comparative genomic analysis of parasites adapted to in vitro growth in either cynomolgus or human RBCs identified a genomic deletion that includes the gene encoding normocyte-binding protein Xa (NBPXa) in parasites growing in cynomolgus RBCs but not in human RBCs. Experimental deletion of the NBPXa gene in parasites adapted to growth in human RBCs (which retain the ability to grow in cynomolgus RBCs) restricted them to cynomolgus RBCs, demonstrating that this gene is selectively required for parasite multiplication and growth in human RBCs. NBPXa-null parasites could bind to human RBCs, but invasion of these cells was severely impaired. Therefore, NBPXa is identified as a key mediator of P. knowlesi human infection and may be a target for vaccine development against this emerging pathogen.


July 7, 2019  |  

Challenges, solutions, and quality metrics of personal genome assembly in advancing precision medicine.

Even though each of us shares more than 99% of the DNA sequences in our genome, there are millions of sequence codes or structure in small regions that differ between individuals, giving us different characteristics of appearance or responsiveness to medical treatments. Currently, genetic variants in diseased tissues, such as tumors, are uncovered by exploring the differences between the reference genome and the sequences detected in the diseased tissue. However, the public reference genome was derived with the DNA from multiple individuals. As a result of this, the reference genome is incomplete and may misrepresent the sequence variants of the general population. The more reliable solution is to compare sequences of diseased tissue with its own genome sequence derived from tissue in a normal state. As the price to sequence the human genome has dropped dramatically to around $1000, it shows a promising future of documenting the personal genome for every individual. However, de novo assembly of individual genomes at an affordable cost is still challenging. Thus, till now, only a few human genomes have been fully assembled. In this review, we introduce the history of human genome sequencing and the evolution of sequencing platforms, from Sanger sequencing to emerging “third generation sequencing” technologies. We present the currently available de novo assembly and post-assembly software packages for human genome assembly and their requirements for computational infrastructures. We recommend that a combined hybrid assembly with long and short reads would be a promising way to generate good quality human genome assemblies and specify parameters for the quality assessment of assembly outcomes. We provide a perspective view of the benefit of using personal genomes as references and suggestions for obtaining a quality personal genome. Finally, we discuss the usage of the personal genome in aiding vaccine design and development, monitoring host immune-response, tailoring drug therapy and detecting tumors. We believe the precision medicine would largely benefit from bioinformatics solutions, particularly for personal genome assembly.


July 7, 2019  |  

High-quality genome assembly and annotation for Plasmodium coatneyi, generated using single-molecule real-time PacBio technology.

Plasmodium coatneyi is a protozoan parasite species that causes simian malaria and is an excellent model for studying disease caused by the human malaria parasite, P. falciparum Here we report the complete (nontelomeric) genome sequence of P. coatneyi Hackeri generated by the application of only Pacific Biosciences RS II (PacBio RS II) single-molecule real-time (SMRT) high-resolution sequence technology and assembly using the Hierarchical Genome Assembly Process (HGAP). This is the first Plasmodium genome sequence reported to use only PacBio technology. This approach has proven to be superior to short-read only approaches for this species. Copyright © 2016 Chien et al.


July 7, 2019  |  

Divergent isoprenoid biosynthesis pathways in Staphylococcus species constitute a drug target for treating infections in companion animals.

Staphylococcus species are a leading cause of skin and soft tissue infections in humans and animals, and the antibiotics used to treat these infections are often the same. Methicillin- and multidrug-resistant staphylococcal infections are becoming more common in human and veterinary medicine. From a “One Health” perspective, this overlap in antibiotic use and resistance raises concerns over the potential spread of antibiotic resistance genes. Whole-genome sequencing and comparative genomics analysis revealed that Staphylococcus species use divergent pathways to synthesize isoprenoids. Species frequently associated with skin and soft tissue infections in companion animals, including S. schleiferi and S. pseudintermedius, use the nonmevalonate pathway. In contrast, S. aureus, S. epidermidis, and S. lugdunensis use the mevalonate pathway. The antibiotic fosmidomycin, an inhibitor of the nonmevalonate pathway, was effective in killing canine clinical staphylococcal isolates but had no effect on the growth or survival of S. aureus and S. epidermidis. These data identify an essential metabolic pathway in Staphylococcus that differs among members of this genus and suggest that drugs such as fosmidomycin, which targets enzymes in the nonmevalonate pathway, may be an effective treatment for certain staphylococcal infections. IMPORTANCE Drug-resistant Staphylococcus species are a major concern in human and veterinary medicine. There is a need for new antibiotics that exhibit a selective effect in treating infections in companion and livestock animals and that would not be used to treat human bacterial infections. We have identified fosmidomycin as an antibiotic that selectively targets certain Staphylococcus species that are often encountered in skin infections in cats and dogs. These findings expand our understanding of Staphylococcus evolution and may have direct implications for treating staphylococcal infections in veterinary medicine.


July 7, 2019  |  

Interchromosomal core duplicons drive both evolutionary instability and disease susceptibility of the Chromosome 8p23.1 region.

Recurrent rearrangements of Chromosome 8p23.1 are associated with congenital heart defects and developmental delay. The complexity of this region has led to inconsistencies in the current reference assembly, confounding studies of genetic variation. Using comparative sequence-based approaches, we generated a high-quality 6.3-Mbp alternate reference assembly of an inverted Chromosome 8p23.1 haplotype. Comparison with nonhuman primates reveals a 746-kbp duplicative transposition and two separate inversion events that arose in the last million years of human evolution. The breakpoints associated with these rearrangements map to an ape-specific interchromosomal core duplicon that clusters at sites of evolutionary inversion (P = 7.8 × 10(-5)). Refinement of microdeletion breakpoints identifies a subgroup of patients that map to the same interchromosomal core involved in the evolutionary formation of the duplication blocks. Our results define a higher-order genomic instability element that has shaped the structure of specific chromosomes during primate evolution contributing to rearrangements associated with inversion and disease.© 2016 Mohajeri et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019  |  

Highlights of the 11th International Bordetella Symposium: from basic biology to vaccine development.

Pertussis is a severe respiratory disease caused by infection with the bacterial pathogen Bordetella pertussis The disease affects individuals of all ages but is particularly severe and sometimes fatal in unvaccinated young infants. Other Bordetella species cause diseases in humans, animals, and birds. Scientific, clinical, public health, vaccine company, and regulatory agency experts on these pathogens and diseases gathered in Buenos Aires, Argentina from 5 to 8 April 2016 for the 11th International Bordetella Symposium to discuss recent advances in our understanding of the biology of these organisms, the diseases they cause, and the development of new vaccines and other strategies to prevent these diseases. Highlights of the meeting included pertussis epidemiology in developing nations, genomic analysis of Bordetella biology and evolution, regulation of virulence factor expression, new model systems to study Bordetella biology and disease, effects of different vaccines on immune responses, maternal immunization as a strategy to prevent newborn disease, and novel vaccine development for pertussis. In addition, the group approved the formation of an International Bordetella Society to promote research and information exchange on bordetellae and to organize future meetings. A new Bordetella.org website will also be developed to facilitate these goals. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019  |  

Expansion of lysine-rich repeats in Plasmodium proteins generates novel localisation sequences that target the periphery of the host erythrocyte.

Repetitive low-complexity sequences, mostly assumed to have no function, are common in proteins that are exported by the malaria parasite into its host erythrocyte. We identify a group of exported proteins containing short lysine-rich tandemly repeated sequences that are sufficient to localise to the erythrocyte periphery where key virulence-related modifications to the plasma membrane and the underlying cytoskeleton are known to occur. Efficiency of targeting is dependent on repeat number, indicating that novel targeting modules could evolve by expansion of short lysine-rich sequences. Indeed, expression GARP fragments from different species shows that two novel targeting sequences have arisen via the process of repeat expansion in this protein. In the protein Hyp12, the targeting function of a lysine-rich sequence is masked by a neighbouring repetitive acidic sequence, further highlighting the importance of repetitive low complexity sequences. We show that sequences capable of targeting the erythrocyte periphery are present in at least nine proteins from Plasmodium falciparum, and one from Plasmodium knowlesi. We find these sequences in proteins known to be involved in erythrocyte rigidification and cytoadhesion, as well as in previously uncharacterised exported proteins. Together, these data suggest that expansion and contraction of lysine-rich repeats could generate targeting sequences de novo as well as modulate protein targeting efficiency and function in response to selective pressure. Copyright © 2016, The American Society for Biochemistry and Molecular Biology.


July 7, 2019  |  

Deep sequencing of 10,000 human genomes.

We report on the sequencing of 10,545 human genomes at 30×-40× coverage with an emphasis on quality metrics and novel variant and sequence discovery. We find that 84% of an individual human genome can be sequenced confidently. This high-confidence region includes 91.5% of exon sequence and 95.2% of known pathogenic variant positions. We present the distribution of over 150 million single-nucleotide variants in the coding and noncoding genome. Each newly sequenced genome contributes an average of 8,579 novel variants. In addition, each genome carries on average 0.7 Mb of sequence that is not found in the main build of the hg38 reference genome. The density of this catalog of variation allowed us to construct high-resolution profiles that define genomic sites that are highly intolerant of genetic variation. These results indicate that the data generated by deep genome sequencing is of the quality necessary for clinical use.


July 7, 2019  |  

Variant exported blood-stage proteins encoded by Plasmodium multigene families are expressed in liver stages where they are exported into the parasitophorous vacuole.

Many variant proteins encoded by Plasmodium-specific multigene families are exported into red blood cells (RBC). P. falciparum-specific variant proteins encoded by the var, stevor and rifin multigene families are exported onto the surface of infected red blood cells (iRBC) and mediate interactions between iRBC and host cells resulting in tissue sequestration and rosetting. However, the precise function of most other Plasmodium multigene families encoding exported proteins is unknown. To understand the role of RBC-exported proteins of rodent malaria parasites (RMP) we analysed the expression and cellular location by fluorescent-tagging of members of the pir, fam-a and fam-b multigene families. Furthermore, we performed phylogenetic analyses of the fam-a and fam-b multigene families, which indicate that both families have a history of functional differentiation unique to RMP. We demonstrate for all three families that expression of family members in iRBC is not mutually exclusive. Most tagged proteins were transported into the iRBC cytoplasm but not onto the iRBC plasma membrane, indicating that they are unlikely to play a direct role in iRBC-host cell interactions. Unexpectedly, most family members are also expressed during the liver stage, where they are transported into the parasitophorous vacuole. This suggests that these protein families promote parasite development in both the liver and blood, either by supporting parasite development within hepatocytes and erythrocytes and/or by manipulating the host immune response. Indeed, in the case of Fam-A, which have a steroidogenic acute regulatory-related lipid transfer (START) domain, we found that several family members can transfer phosphatidylcholine in vitro. These observations indicate that these proteins may transport (host) phosphatidylcholine for membrane synthesis. This is the first demonstration of a biological function of any exported variant protein family of rodent malaria parasites.


July 7, 2019  |  

An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes.

Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity.


July 7, 2019  |  

Improved assembly of noisy long reads by k-mer validation.

Genome assembly depends critically on read length. Two recent technologies, from Pacific Biosciences (PacBio) and Oxford Nanopore, produce read lengths >20 kb, which yield de novo genome assemblies with vastly greater contiguity than those based on Sanger, Illumina, or other technologies. However, the very high error rates of these two new technologies (~15% per base) makes assembly imprecise at repeats longer than the read length and computationally expensive. Here we show that the contiguity and quality of the assembly of these noisy long reads can be significantly improved at a minimal cost, by leveraging on the low error rate and low cost of Illumina short reads. Namely, k-mers from the PacBio raw reads that are not present in Illumina reads (which account for ~95% of the distinct k-mers) are deemed sequencing errors and ignored at the seed alignment step. By focusing on the ~5% of k-mers that are error free, read overlap sensitivity is dramatically increased. Of equal importance, the validation procedure can be extended to exclude repetitive k-mers, which prevents read miscorrection at repeats and further improves the resulting assemblies. We tested the k-mer validation procedure using one long-read technology (PacBio) and one assembler (MHAP/Celera Assembler), but it is very likely to yield analogous improvements with alternative long-read technologies and assemblers, such as Oxford Nanopore and BLASR/DALIGNER/Falcon, respectively.© 2016 Carvalho et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019  |  

A high throughput screen for active human transposable elements.

Transposable elements (TEs) are mobile genetic sequences that randomly propagate within their host’s genome. This mobility has the potential to affect gene transcription and cause disease. However, TEs are technically challenging to identify, which complicates efforts to assess the impact of TE insertions on disease. Here we present a targeted sequencing protocol and computational pipeline to identify polymorphic and novel TE insertions using next-generation sequencing: TE-NGS. The method simultaneously targets the three subfamilies that are responsible for the majority of recent TE activity (L1HS, AluYa5/8, and AluYb8/9) thereby obviating the need for multiple experiments and reducing the amount of input material required.Here we describe the laboratory protocol and detection algorithm, and a benchmark experiment for the reference genome NA12878. We demonstrate a substantial enrichment for on-target fragments, and high sensitivity and precision to both reference and NA12878-specific insertions. We report 17 previously unreported loci for this individual which are supported by orthogonal long-read evidence, and we identify 1470 polymorphic and novel TEs in 12 additional samples that were previously undocumented in databases of insertion polymorphisms.We anticipate that future applications of TE-NGS alongside exome sequencing of patients with sporadic disease will reduce the number of unresolved cases, and improve estimates of the contribution of TEs to human genetic disease.


July 7, 2019  |  

De novo mutations resolve disease transmission pathways in clonal malaria

Detecting de novo mutations in viral and bacterial pathogens enables researchers to reconstruct detailed networks of disease transmission and is a key technique in genomic epidemiology. However, these techniques have not yet been applied to the malaria parasite, Plasmodium falciparum, in which a larger genome, slower generation times, and a complex life cycle make them difficult to implement. Here, we demonstrate the viability of de novo mutation studies in P. falciparum for the first time. Using a combination of sequencing, library preparation, and genotyping methods that have been optimized for accuracy in low-complexity genomic regions, we have detected de novo mutations that distinguish nominally identical parasites from clonal lineages. Despite its slower evolutionary rate compared with bacterial or viral species, de novo mutation can be detected in P. falciparum across timescales of just 1-2?years and evolutionary rates in low-complexity regions of the genome can be up to twice that detected in the rest of the genome. The increased mutation rate allows the identification of separate clade expansions that cannot be found using previous genomic epidemiology approaches and could be a crucial tool for mapping residual transmission patterns in disease elimination campaigns and reintroduction scenarios.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.