Menu
July 7, 2019

Integrated genomic and proteomic analyses of high-level chloramphenicol resistance in Campylobacter jejuni.

Campylobacter jejuni is a major zoonotic pathogen, and its resistance to antibiotics is of great concern for public health. However, few studies have investigated the global changes of the entire organism with respect to antibiotic resistance. Here, we provide mechanistic insights into high-level resistance to chloramphenicol in C. jejuni, using integrated genomic and proteomic analyses. We identified 27 single nucleotide polymorphisms (SNPs) as well as an efflux pump cmeB mutation that conferred modest resistance. We determined two radical S-adenosylmethionine (SAM) enzymes, one each from an SNP gene and a differentially expressed protein. Validation of major metabolic pathways demonstrated alterations in oxidative phosphorylation and ABC transporters, suggesting energy accumulation and increase in methionine import. Collectively, our data revealed a novel rRNA methylation mechanism by a radical SAM superfamily enzyme, indicating that two resistance mechanisms existed in Campylobacter. This work provided a systems biology perspective on understanding the antibiotic resistance mechanisms in bacteria.


July 7, 2019

Complete genome sequences of Clostridium perfringens Del1 strain isolated from chickens affected by necrotic enteritis.

Clostridium perfringens is ubiquitous in nature. It is a normal inhabitant in the intestinal tract of animals and humans. As the primary etiological agent of gas gangrene, necrosis and bacteremia, C. perfringens causes food poisoning, necrotic enteritis (NE), and even death. Epidemiology research has indicated that the increasing incidence of NE in poultry is associated with the withdrawal of in-feed antibiotic growth promoters in poultry production in response to government regulations. The recent omics studies have indicated that bacterial virulence is typically linked to highly efficient conjugative transfer of toxins, or plasmids carrying antibiotic-resistance traits. Currently, there is limited information on understanding of host-pathogen interaction in NE caused by virulent strains of C. perfringens. Elucidating such pathogenesis has practical impacts on fighting infectious diseases through adopting strategies of prophylactic or therapeutic interventions. In this report, we sequenced and analyzed the genome of C. perfringens Del1 strain using the hybrid of PacBio and Illumina sequencing technologies.Sequence analysis indicated that Del1 strain comprised a single circular chromosome with a complete 3,559,163 bp and 4 plasmids: pDel1_1 (82,596 bp), pDel1_2 (69,827 bp), pDel1_3 (49,582 bp), and pDel1_4 (49,728 bp). The genome had 3361 predicted coding DNA sequences, harbored numerous genes for pathogenesis and virulence factors, including 6 for antibiotic and antimicrobial resistance, and 3 phage-encoded genes. Phylogenetic analysis revealed that Del1 strain had similar genome and plasmid sequences to the CP4 strain.Complete chromosomal and plasmid sequences of Del1 strain are presented in this report. Since Del1 was isolated from a field disease outbreak, this strain is a good source to identify virulent genes that cause many damaging effects of Clostridial infections in chicken gut. Genome sequencing of the chicken pathogenic isolates from commercial farms provides valuable insights into the molecular pathogenesis of C. perfringens as a gastrointestinal pathogen in food animals. The detailed information on gene sequencing of this important field strain will benefit the development of novel vaccines specific for C. perfringens-induced NE in chickens.


July 7, 2019

Streptococcal toxic shock syndrome caused by the dissemination of an invasive emm3/ST15 strain of Streptococcus pyogenes.

Streptococcus pyogenes (group A Streptococcus [GAS]) is a major human pathogen that causes a wide spectrum of clinical manifestations. Although invasive GAS (iGAS) infections are relatively uncommon, emm3/ST15 GAS is a highly virulent, invasive, and pathogenic strain. Global molecular epidemiology analysis has suggested that the frequency of emm3 GAS has been recently increasing.A 14-year-old patient was diagnosed with streptococcal toxic shock syndrome and severe pneumonia, impaired renal function, and rhabdomyolysis. GAS was isolated from a culture of endotracheal aspirates and designated as KS030. Comparative genome analysis suggested that KS030 is classified as emm3 (emm-type) and ST15 (multilocus sequencing typing [MLST]), which is similar to iGAS isolates identified in the UK (2013) and Switzerland (2015).We conclude that the global dissemination of emm3/ST15 GAS strain has the potential to cause invasive disease.


July 7, 2019

A novel Tn1696-like composite transposon (Tn6404) harboring bla IMP-4 in a Klebsiella pneumoniae isolate carrying a rare ESBL gene bla SFO-1.

Genetic determinants of a clinical Klebsiella pneumoniae isolate (KP1814) coproducing IMP-4 and a rare ESBL gene SFO-1 was investigated. KP1814 belongs to a novel sequence type (ST) assigned to ST2270. WGS identified four circular DNA sequences in KP1814, including two multidrug-resistance (MDR) plasmids, one virulence plasmid, and one circular form. The MDR plasmid pKP1814-1 (299.9 Kb) is untypeable, and carries two large mosaic multiresistance regions (MRRs). bla SFO-1 and bla IMP-4 co-exists on MRR1, and bla SFO-1 is associated with an IS/Tn-independent genetic context. bla IMP-4 is carried by a novel In804-like integron (intlI-bla IMP-4-Kl.pn.I3-qacG2-aacA4-catB3?) associated with a novel Tn1696-like transposon (designed Tn6404) flanked by IS5075. The other MDR plasmid pKP1814-3 is a 95,701-bp IncFII plasmid, and is a hybrid of a Shigella flexneri plasmid pSF07201 and an E. coli plasmid pCA08. All resistance genes of pKP1814-3 were detected in a ~16-kb IS26-flanked composite transposon carried by a Tn5396 transposon. The circular form (18.3 Kb) was composed of two parts belonging to pKP1814-1 and pKP1814-3, respectively. The plasmid pKP1814-2, carrying multiple virulence factors, encodes IncFIBK and IncFIIK replicons with a size of 187,349?bp. The coexistence of MDR and virulence plasmids largely enhances the bacterial fitness in the host and environment.


July 7, 2019

Complete circularized genome sequences of four strains of Elizabethkingia anophelis, including two novel strains isolated from wild-caught Anopheles sinensis.

We provide complete circularized genome sequences of two mosquito-derived Elizabethkingia anophelis strains with draft sequences currently in the public domain (R26 and Ag1), and two novel E. anophelis strains derived from a different mosquito species, Anopheles sinensis (AR4-6 and AR6-8). The genetic similarity of all four mosquito-derived strains is remarkable.


July 7, 2019

SV2: Accurate structural variation genotyping and de novo mutation detection from whole genomes.

Structural Variation (SV) detection from short-read whole genome sequencing is error prone, presenting significant challenges for population or family-based studies of disease.Here we describe SV2, a machine-learning algorithm for genotyping deletions and duplications from paired-end sequencing data. SV2 can rapidly integrate variant calls from multiple structural variant discovery algorithms into a unified call set with high genotyping accuracy and capability to detect de novo mutations. SV2 is freely available on GitHub (https://github.com/dantaki/SV2).Supplementary data are available at Bioinformatics online.© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com


July 7, 2019

Ultraaccurate genome sequencing and haplotyping of single human cells.

Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10-8and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.


July 7, 2019

Disease onset in X-linked dystonia-parkinsonism correlates with expansion of a hexameric repeat within an SVA retrotransposon in TAF1.

X-linked dystonia-parkinsonism (XDP) is a neurodegenerative disease associated with an antisense insertion of a SINE-VNTR-Alu (SVA)-type retrotransposon within an intron ofTAF1This unique insertion coincides with six additional noncoding sequence changes inTAF1, the gene that encodes TATA-binding protein-associated factor-1, which appear to be inherited together as an identical haplotype in all reported cases. Here we examined the sequence of this SVA in XDP patients (n= 140) and detected polymorphic variation in the length of a hexanucleotide repeat domain, (CCCTCT)nThe number of repeats in these cases ranged from 35 to 52 and showed a highly significant inverse correlation with age at disease onset. Because other SVAs exhibit intrinsic promoter activity that depends in part on the hexameric domain, we assayed the transcriptional regulatory effects of varying hexameric lengths found in the unique XDP SVA retrotransposon using luciferase reporter constructs. When inserted sense or antisense to the luciferase reading frame, the XDP variants repressed or enhanced transcription, respectively, to an extent that appeared to vary with length of the hexamer. Further in silico analysis of this SVA sequence revealed multiple motifs predicted to form G-quadruplexes, with the greatest potential detected for the hexameric repeat domain. These data directly link sequence variation within the XDP-specific SVA sequence to phenotypic variability in clinical disease manifestation and provide insight into potential mechanisms by which this intronic retroelement may induce transcriptional interference inTAF1expression. Copyright © 2017 the Author(s). Published by PNAS.


July 7, 2019

A recurrence-based approach for validating structural variation using long-read sequencing technology.

Although numerous algorithms have been developed to identify structural variations (SVs) in genomic sequences, there is a dearth of approaches that can be used to evaluate their results. This is significant as the accurate identification of structural variation is still an outstanding but important problem in genomics. The emergence of new sequencing technologies that generate longer sequence reads can, in theory, provide direct evidence for all types of SVs regardless of the length of the region through which it spans. However, current efforts to use these data in this manner require the use of large computational resources to assemble these sequences as well as visual inspection of each region. Here we present VaPoR, a highly efficient algorithm that autonomously validates large SV sets using long-read sequencing data. We assessed the performance of VaPoR on SVs in both simulated and real genomes and report a high-fidelity rate for overall accuracy across different levels of sequence depths. We show that VaPoR can interrogate a much larger range of SVs while still matching existing methods in terms of false positive validations and providing additional features considering breakpoint precision and predicted genotype. We further show that VaPoR can run quickly and efficiency without requiring a large processing or assembly pipeline. VaPoR provides a long read-based validation approach for genomic SVs that requires relatively low read depth and computing resources and thus will provide utility with targeted or low-pass sequencing coverage for accurate SV assessment. The VaPoR Software is available at: https://github.com/mills-lab/vapor.© The Authors 2017. Published by Oxford University Press.


July 7, 2019

The state of whole-genome sequencing

Over the last decade, a technological paradigm shift has slashed the cost of DNA sequencing by over five orders of magnitude. Today, the cost of sequencing a human genome is a few thousand dollars, and it continues to fall. Here, we review the most cost-effective platforms for whole-genome sequencing (WGS) as well as emerging technologies that may displace or complement these. We also discuss the practical challenges of generating and analyzing WGS data, and how WGS has unlocked new strategies for discovering genes and variants underlying both rare and common human diseases.


July 7, 2019

Whole-genome sequencing: opportunities and challenges for public health, food-borne outbreak investigations, and the global food supply.

Food-borne disease is burdensome, af- fecting 1 in 6 persons or an estimated 48 million ill, 128 000 hospitalized, and 3000 deaths in the United States annually. In addition, societal costs from lost lives, lost labor, lost wages, and even lost revenue in the food industry are substan- tial. Globally the burden is even higher, and multinational outbreaks due to the global movement of contaminated foods are being described increasingly. The glo- bal food supply links nations and econo- mies, emphasizing the need to view food safety with an integrated farm-to-fork lens. As predicted, advances in molecular techniques and information management have been transformative for food-borne disease investigation.


July 7, 2019

HapCol: accurate and memory-efficient haplotype assembly from long reads.

Haplotype assembly is the computational problem of reconstructing haplotypes in diploid organisms and is of fundamental importance for characterizing the effects of single-nucleotide polymorphisms on the expression of phenotypic traits. Haplotype assembly highly benefits from the advent of ‘future-generation’ sequencing technologies and their capability to produce long reads at increasing coverage. Existing methods are not able to deal with such data in a fully satisfactory way, either because accuracy or performances degrade as read length and sequencing coverage increase or because they are based on restrictive assumptions.By exploiting a feature of future-generation technologies-the uniform distribution of sequencing errors-we designed an exact algorithm, called HapCol, that is exponential in the maximum number of corrections for each single-nucleotide polymorphism position and that minimizes the overall error-correction score. We performed an experimental analysis, comparing HapCol with the current state-of-the-art combinatorial methods both on real and simulated data. On a standard benchmark of real data, we show that HapCol is competitive with state-of-the-art methods, improving the accuracy and the number of phased positions. Furthermore, experiments on realistically simulated datasets revealed that HapCol requires significantly less computing resources, especially memory. Thanks to its computational efficiency, HapCol can overcome the limits of previous approaches, allowing to phase datasets with higher coverage and without the traditional all-heterozygous assumption. Our source code is available under the terms of the GNU General Public License at http://hapcol.algolab.eu/.bonizzoni@disco.unimib.itSupplementary information: Supplementary data are available at Bioinformatics online.© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Genomic resources and their influence on the detection of the signal of positive selection in genome scans.

Genome scans represent powerful approaches to investigate the action of natural selection on the genetic variation of natural populations and to better understand local adaptation. This is very useful, for example, in the field of conservation biology and evolutionary biology. Thanks to Next Generation Sequencing, genomic resources are growing exponentially, improving genome scan analyses in non-model species. Thousands of SNPs called using Reduced Representation Sequencing are increasingly used in genome scans. Besides, genome sequences are also becoming increasingly available, allowing better processing of short-read data, offering physical localization of variants, and improving haplotype reconstruction and data imputation. Ultimately, genome sequences are also becoming the raw material for selection inferences. Here, we discuss how the increasing availability of such genomic resources, notably genome sequences, influences the detection of signals of selection. Mainly, increasing data density and having the information of physical linkage data expand genome scans by (i) improving the overall quality of the data, (ii) helping the reconstruction of demographic history for the population studied to decrease false-positive rates and (iii) improving the statistical power of methods to detect the signal of selection. Of particular importance, the availability of a high-quality reference genome can improve the detection of the signal of selection by (i) allowing matching the potential candidate loci to linked coding regions under selection, (ii) rapidly moving the investigation to the gene and function and (iii) ensuring that the highly variable regions of the genomes that include functional genes are also investigated. For all those reasons, using reference genomes in genome scan analyses is highly recommended. © 2015 John Wiley & Sons Ltd.


July 7, 2019

Novel FANCI mutations in Fanconi anemia with VACTERL association.

Fanconi anemia (FA) is an inherited bone marrow failure syndrome caused by mutations in DNA repair genes; some of these patients may have features of the VACTERL association. Autosomal recessive mutations in FANCI are a rare cause of FA. We identified FANCI mutations by next generation sequencing in three patients in our FA cohort among several whose mutated gene was unknown. Four of the six mutations are novel and all mutations are likely deleterious to protein function. There are now 16 reported cases of FA due to FANCI of whom 7 have at least 3 features of the VACTERL association (44%). This suggests that the VACTERL association in patients with FA may be seen in patients with FANCI mutations more often than previously recognized. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.