Menu
July 7, 2019

SVachra: a tool to identify genomic structural variation in mate pair sequencing data containing inward and outward facing reads.

Characterization of genomic structural variation (SV) is essential to expanding the research and clinical applications of genome sequencing. Reliance upon short DNA fragment paired end sequencing has yielded a wealth of single nucleotide variants and internal sequencing read insertions-deletions, at the cost of limited SV detection. Multi-kilobase DNA fragment mate pair sequencing has supplemented the void in SV detection, but introduced new analytic challenges requiring SV detection tools specifically designed for mate pair sequencing data. Here, we introduce SVachra – Structural Variation Assessment of CHRomosomal Aberrations, a breakpoint calling program that identifies large insertions-deletions, inversions, inter- and intra-chromosomal translocations utilizing both inward and outward facing read types generated by mate pair sequencing.We demonstrate SVachra’s utility by executing the program on large-insert (Illumina Nextera) mate pair sequencing data from the personal genome of a single subject (HS1011). An additional data set of long-read (Pacific BioSciences RSII) was also generated to validate SV calls from SVachra and other comparison SV calling programs. SVachra exhibited the highest validation rate and reported the widest distribution of SV types and size ranges when compared to other SV callers.SVachra is a highly specific breakpoint calling program that exhibits a more unbiased SV detection methodology than other callers.


July 7, 2019

Genomic insights into the pathogenicity and environmental adaptability of Enterococcus hirae R17 isolated from pork offered for retail sale.

Genetic information about Enterococcus hirae is limited, a feature that has compromised our understanding of these clinically challenging bacteria. In this study, comparative analysis was performed of E. hirae R17, a daptomycin-resistant strain isolated from pork purchased from a retail market in Beijing, China, and three other enterococcal genomes (Enterococcus faecium DO, Enterococcus faecalis V583, and E. hirae ATCC™ 9790). Some 1,412 genes were identified that represented the core genome together with an additional 139 genes that were specific to E. hirae R17. The functions of these R17 strain-specific coding sequences relate to the COGs categories of carbohydrate transport and metabolism and transcription, a finding that suggests the carbohydrate utilization capacity of E. hirae R17 may be more extensive when compared with the other three bacterial species (spp.). Analysis of genomic islands and virulence genes highlighted the potential that horizontal gene transfer played as a contributor of variations in pathogenicity in this isolate. Drug-resistance gene prediction and antibiotic susceptibility testing indicated E. hirae R17 was resistant to several antimicrobial compounds, including bacitracin, ciprofloxacin, daptomycin, erythromycin, and tetracycline, thereby limiting chemotherapeutic treatment options. Further, tolerance to biocides and metals may confer a phenotype that facilitates the survival and adaptation of this isolate against food preservatives, disinfectants, and antibacterial coatings. The genomic plasticity, mediated by IS elements, transposases, and tandem repeats, identified in the E. hirae R17 genome may support adaptation to new environmental niches, such as those that are found in hospitalized patients. A predicted transmissible plasmid, pRZ1, was found to carry several antimicrobial determinants, along with some predicted pathogenic genes. These data supported the previously determined phenotype confirming that the foodborne E. hirae R17 is a multidrug-resistant pathogenic bacterium with evident genome plasticity and environmental adaptability.© 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.


July 7, 2019

Complete genome sequence of Ralstonia solanacearum FJAT-91, a high-virulence pathogen of tomato wilt.

Ralstonia solanacearum FJAT-91, which displays higher virulence toward plants belonging to the family Solanaceae, was isolated from a wilted tomato plant vessel in Fujian province, southeast China. Here, we report the complete genome sequence of R. solanacearum FJAT-91 using long-read single-molecule PacBio sequencing technology. The genome comprises a 3,873,214-bp circular chromosome and a 2,000,873-bp circular megaplasmid with an overall G+C content of 66.85%. Copyright © 2017 Chen et al.


July 7, 2019

Recent expansion and adaptive evolution of the carcinoembryonic antigen family in bats of the Yangochiroptera subgroup.

Expansions of gene families are predictive for ongoing genetic adaptation to environmental cues. We describe such an expansion of the carcinoembryonic antigen (CEA) gene family in certain bat families. Members of the CEA family in humans and mice are exploited as cellular receptors by a number of pathogens, possibly due to their function in immunity and reproduction. The CEA family is composed of CEA-related cell adhesion molecules (CEACAMs) and secreted pregnancy-specific glycoproteins (PSGs). PSGs are almost exclusively expressed by trophoblast cells at the maternal-fetal interface. The reason why PSGs exist only in a minority of mammals is still unknown.Analysis of the CEA gene family in bats revealed that in certain bat families, belonging to the subgroup Yangochiroptera but not the Yinpterochiroptera subgroup an expansion of the CEA gene family took place, resulting in approximately one hundred CEA family genes in some species of the Vespertilionidae. The majority of these genes encode secreted PSG-like proteins (further referred to as PSG). Remarkably, we found strong evidence that the ligand-binding domain (IgV-like domain) of PSG is under diversifying positive selection indicating that bat PSGs may interact with structurally highly variable ligands. Such ligands might represent bacterial or viral pathogen adhesins. We have identified two distinct clusters of PSGs in three Myotis species. The two PSG cluster differ in the amino acids under positive selection. One cluster was only expanded in members of the Vespertilionidae while the other was found to be expanded in addition in members of the Miniopteridae and Mormoopidae. Thus one round of PSG expansion may have occurred in an ancestry of all three families and a second only in Vespertilionidae. Although maternal ligands of PSGs may exist selective challenges by two distinct pathogens seem to be likely responsible for the expansion of PSGs in Vespertilionidae.The rapid expansion of PSGs in certain bat species together with selection for diversification suggest that bat PSGs could be part of a pathogen defense system by serving as decoy receptors and/or regulators of feto-maternal interactions.


July 7, 2019

Draft genome sequence of the plant pathogen Streptomyces sp. strain 11-1-2.

Streptomyces sp. strain 11-1-2 is a Gram-positive filamentous bacterium that was isolated from a common scab lesion on a potato tuber. The strain is highly pathogenic to plants but does not produce the virulence-associated Streptomyces phytotoxin thaxtomin A. Here, we report the draft genome sequence of Streptomyces sp. 11-1-2. Copyright © 2017 Bown and Bignell.


July 7, 2019

Complete genome sequence of the fruiting myxobacterium Myxococcus macrosporus strain DSM 14697, generated by PacBio sequencing.

Members of the Myxococcales order initiate a developmental program in response to starvation that culminates in formation of spore-filled fruiting bodies. To investigate the genetic basis for fruiting body formation, we present the complete 8.9-Mb genome sequence of Myxococcus macrosporus strain DSM 14697, generated using the PacBio sequencing platform. Copyright © 2017 Treuner-Lange et al.


July 7, 2019

Key features of mcr-1-bearing plasmids from Escherichia coli isolated from humans and food.

Mcr-1-harboring Enterobacteriaceae are reported worldwide since their first discovery in 2015. However, a limited number of studies are available that compared full-length plasmid sequences of human and animal origins.In this study, mcr-1-bearing plasmids from seven Escherichia coli isolates recovered from patients (n = 3), poultry meat (n = 2) and turkey meat (n = 2) in Switzerland were further analyzed and compared. Isolates were characterized by multilocus sequence typing (MLST). The mcr-1-bearing plasmids were transferred by transformation into reference strain E. coli DH5a and MCR-1-producing transformants were selected on LB-agar supplemented with 2 mg/L colistin. Purified plasmids were then sequenced and compared.MLST revealed six distinct STs, illustrating the high clonal diversity among mcr-1-positive E. coli isolates of different origins. Two different mcr-1-positive plasmids were identified from a single E. coli ST48 human isolate. All other isolates possessed a single mcr-1 harboring plasmid. Transferable IncI2 (size ca. 60-61 kb) and IncX4 (size ca. 33-35 kb) type plasmids each bearing mcr-1 were found associated with human and food isolates. None of the mcr-1-positive IncI2 and IncX4 plasmids possessed any additional resistance determinants. Surprisingly, all but one of the sequenced mcr-1-positive plasmids lacked the ISApl1 element, which is a key element mediating acquisition of mcr-1 into various plasmid backbones.There is strong evidence that the food chain may be an important transmission route for mcr-1-bearing plasmids. Our data suggest that some “epidemic” plasmids rather than specific E. coli clones might be responsible for the spread of the mcr-1 gene along the food chain.


July 7, 2019

Convergent evolution of Y chromosome gene content in flies.

Sex-chromosomes have formed repeatedly across Diptera from ordinary autosomes, and X-chromosomes mostly conserve their ancestral genes. Y-chromosomes are characterized by abundant gene-loss and an accumulation of repetitive DNA, yet the nature of the gene repertoire of fly Y-chromosomes is largely unknown. Here we trace gene-content evolution of Y-chromosomes across 22 Diptera species, using a subtraction pipeline that infers Y genes from male and female genome, and transcriptome data. Few genes remain on old Y-chromosomes, but the number of inferred Y-genes varies substantially between species. Young Y-chromosomes still show clear evidence of their autosomal origins, but most genes on old Y-chromosomes are not simply remnants of genes originally present on the proto-sex-chromosome that escaped degeneration, but instead were recruited secondarily from autosomes. Despite almost no overlap in Y-linked gene content in different species with independently formed sex-chromosomes, we find that Y-linked genes have evolved convergent gene functions associated with testis expression. Thus, male-specific selection appears as a dominant force shaping gene-content evolution of Y-chromosomes across fly species.While X-chromosome gene content tends to be conserved, Y-chromosome evolution is dynamic and difficult to reconstruct. Here, Mahajan and Bachtrog use a subtraction pipeline to identify Y-linked genes in 22 Diptera species, revealing patterns of Y-chromosome gene-content evolution.


July 7, 2019

LOGAN: A framework for LOssless Graph-based ANalysis of high throughput sequence data

Recent massive growth in the production of sequencing data necessitates matching improvements in bioinformatics tools to effectively utilize it. Existing tools suffer from limitations in both scalability and applicability which are inherent to their underlying algorithms and data structures. We identify the key requirements for the ideal data structure for sequence analyses: it should be informationally lossless, locally updatable, and memory efficient; requirements which are not met by data structures underlying the major assembly strategies Overlap Layout Consensus and De Bruijn Graphs. We therefore propose a new data structure, the LOGAN graph, which is based on a memory efficient Sparse De Bruijn Graph with routing information. Innovations in storing routing information and careful implementation allow sequence datasets for Escherichia coli (4.6Mbp, 117x coverage), Arabidopsis thaliana (135Mbp, 17.5x coverage) and Solanum pennellii (1.2Gbp, 47x coverage) to be loaded into memory on a desktop computer in seconds, minutes, and hours respectively. Memory consumption is competitive with state of the art alternatives, while losslessly representing the reads in an indexed and updatable form. Both Second and Third Generation Sequencing reads are supported. Thus, the LOGAN graph is positioned to be the backbone for major breakthroughs in sequence analysis such as integrated hybrid assembly, assembly of exceptionally large and repetitive genomes, as well as assembly and representation of pan-genomes.


July 7, 2019

The Mobile Element Locator Tool (MELT): population-scale mobile element discovery and biology.

Mobile element insertions (MEIs) represent ~25% of all structural variants in human genomes. Moreover, when they disrupt genes, MEIs can influence human traits and diseases. Therefore, MEIs should be fully discovered along with other forms of genetic variation in whole genome sequencing (WGS) projects involving population genetics, human diseases, and clinical genomics. Here, we describe the Mobile Element Locator Tool (MELT), which was developed as part of the 1000 Genomes Project to perform MEI discovery on a population scale. Using both Illumina WGS data and simulations, we demonstrate that MELT outperforms existing MEI discovery tools in terms of speed, scalability, specificity, and sensitivity, while also detecting a broader spectrum of MEI-associated features. Several run modes were developed to perform MEI discovery on local and cloud systems. In addition to using MELT to discover MEIs in modern humans as part of the 1000 Genomes Project, we also used it to discover MEIs in chimpanzees and ancient (Neanderthal and Denisovan) hominids. We detected diverse patterns of MEI stratification across these populations that likely were caused by (1) diverse rates of MEI production from source elements, (2) diverse patterns of MEI inheritance, and (3) the introgression of ancient MEIs into modern human genomes. Overall, our study provides the most comprehensive map of MEIs to date spanning chimpanzees, ancient hominids, and modern humans and reveals new aspects of MEI biology in these lineages. We also demonstrate that MELT is a robust platform for MEI discovery and analysis in a variety of experimental settings.© 2017 Gardner et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

The rapid in vivo evolution of Pseudomonas aeruginosa in ventilator-associated pneumonia patients leads to attenuated virulence.

Pseudomonas aeruginosa is an opportunistic pathogen that causes severe airway infections in humans. These infections are usually difficult to treat and associated with high mortality rates. While colonizing the human airways, P. aeruginosa could accumulate genetic mutations that often lead to its better adaptability to the host environment. Understanding these evolutionary traits may provide important clues for the development of effective therapies to treat P. aeruginosa infections. In this study, 25 P. aeruginosa isolates were longitudinally sampled from the airways of four ventilator-associated pneumonia (VAP) patients. Pacbio and Illumina sequencing were used to analyse the in vivo evolutionary trajectories of these isolates. Our analysis showed that positive selection dominantly shaped P. aeruginosa genomes during VAP infections and led to three convergent evolution events, including loss-of-function mutations of lasR and mpl, and a pyoverdine-deficient phenotype. Specifically, lasR encodes one of the major transcriptional regulators in quorum sensing, whereas mpl encodes an enzyme responsible for recycling cell wall peptidoglycan. We also found that P. aeruginosa isolated at late stages of VAP infections produce less elastase and are less virulent in vivo than their earlier isolated counterparts, suggesting the short-term in vivo evolution of P. aeruginosa leads to attenuated virulence.© 2017 The Authors.


July 7, 2019

Draft genome sequences of Trichophyton rubrum CMCC(F)T1i and Trichophyton violaceum CMCC(F)T3l by Illumina 2000 and Pacific Biosciences.

One strain of Trichophyton rubrum CMCC(F)T1i (=CBS 139224) isolated from onychomycosis and one strain of Trichophyton violaceum CMCC(F)T3l (=CBS 141829) isolated from tinea capitis in China were whole-genome sequenced by Illumina/Solexa, while the former was also sequenced by Pacific Biosciences sequencing in parallel. Copyright © 2017 Zhan et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.