Menu
September 22, 2019

IMSindel: An accurate intermediate-size indel detection tool incorporating de novo assembly and gapped global-local alignment with split read analysis.

Insertions and deletions (indels) have been implicated in dozens of human diseases through the radical alteration of gene function by short frameshift indels as well as long indels. However, the accurate detection of these indels from next-generation sequencing data is still challenging. This is particularly true for intermediate-size indels (=50?bp), due to the short DNA sequencing reads. Here, we developed a new method that predicts intermediate-size indels using BWA soft-clipped fragments (unmatched fragments in partially mapped reads) and unmapped reads. We report the performance comparison of our method, GATK, PINDEL and ScanIndel, using whole exome sequencing data from the same samples. False positive and false negative counts were determined through Sanger sequencing of all predicted indels across these four methods. The harmonic mean of the recall and precision, F-measure, was used to measure the performance of each method. Our method achieved the highest F-measure of 0.84 in one sample, compared to 0.56 for GATK, 0.52 for PINDEL and 0.46 for ScanIndel. Similar results were obtained in additional samples, demonstrating that our method was superior to the other methods for detecting intermediate-size indels. We believe that this methodology will contribute to the discovery of intermediate-size indels associated with human disease.


September 22, 2019

Complete genome analysis of Gluconacetobacter xylinus CGMCC 2955 for elucidating bacterial cellulose biosynthesis and metabolic regulation.

Complete genome sequence of Gluconacetobacter xylinus CGMCC 2955 for fine control of bacterial cellulose (BC) synthesis is presented here. The genome, at 3,563,314?bp, was found to contain 3,193 predicted genes without gaps. There are four BC synthase operons (bcs), among which only bcsI is structurally complete, comprising bcsA, bcsB, bcsC, and bcsD. Genes encoding key enzymes in glycolytic, pentose phosphate, and BC biosynthetic pathways and in the tricarboxylic acid cycle were identified. G. xylinus CGMCC 2955 has a complete glycolytic pathway because sequence data analysis revealed that this strain possesses a phosphofructokinase (pfk)-encoding gene, which is absent in most BC-producing strains. Furthermore, combined with our previous results, the data on metabolism of various carbon sources (monosaccharide, ethanol, and acetate) and their regulatory mechanism of action on BC production were explained. Regulation of BC synthase (Bcs) is another effective method for precise control of BC biosynthesis, and cyclic diguanylate (c-di-GMP) is the key activator of BcsA-BcsB subunit of Bcs. The quorum sensing (QS) system was found to positively regulate phosphodiesterase, which decomposed c-di-GMP. Thus, in this study, we demonstrated the presence of QS in G. xylinus CGMCC 2955 and proposed a possible regulatory mechanism of QS action on BC production.


September 22, 2019

Genomic analysis of oral Campylobacter concisus strains identified a potential bacterial molecular marker associated with active Crohn’s disease.

Campylobacter concisus is an oral bacterium that is associated with inflammatory bowel disease (IBD) including Crohn’s disease (CD) and ulcerative colitis (UC). C. concisus consists of two genomospecies (GS) and diverse strains. This study aimed to identify molecular markers to differentiate commensal and IBD-associated C. concisus strains. The genomes of 63 oral C. concisus strains isolated from patients with IBD and healthy controls were examined, of which 38 genomes were sequenced in this study. We identified a novel secreted enterotoxin B homologue, Csep1. The csep1 gene was found in 56% of GS2 C. concisus strains, presented in the plasmid pICON or the chromosome. A six-nucleotide insertion at the position 654-659?bp in csep1 (csep1-6bpi) was found. The presence of csep1-6bpi in oral C. concisus strains isolated from patients with active CD (47%, 7/15) was significantly higher than that in strains from healthy controls (0/29, P?=?0.0002), and the prevalence of csep1-6bpi positive C. concisus strains was significantly higher in patients with active CD (67%, 4/6) as compared to healthy controls (0/23, P?=?0.0006). Proteomics analysis detected the Csep1 protein. A csep1 gene hot spot in the chromosome of different C. concisus strains was found. The pICON plasmid was only found in GS2 strains isolated from the two relapsed CD patients with small bowel complications. This study reports a C. concisus molecular marker (csep1-6bpi) that is associated with active CD.


September 22, 2019

The N6-adenine methylation in yeast genome profiled by single-molecule technology.

The most common and abundant DNA modification is 5-meth- ylcytosine (5mC), which has been well-established as an epigenetic mark regulating gene expression in eukaryotes (Jones, 2012). Another DNA modification N6-methyldeoxyadenosine (6mA), pre- viously reported as a widespread DNA methylation in prokaryotes, plays an important role in gene expression, DNA replication, DNA repair, cell cycle progression and host-pathogen interaction (Messer and Noyer-Weidner, 1988; Lu et al., 1994; Collier et al., 2007). The knowledge of 6mA in eukaryotes has been very limited until the recent development of high-throughput sequencing and high-sensitive mass spectrometry technologies, which have greatly contributed to the investigation of 6mA in fungi, animals and plants (Fu et al., 2015; Greer et al., 2015; Zhang et al., 2015; Koziol et al., 2016; Liu et al., 2016; Wu et al., 2016; Liang et al., 2017; Mondo et al., 2017). Recent studies revealed that 6mA abundance is vari- able, and it is relative higher in Chlamydomonas and early- diverging fungi species than other eukaryotes. The distribution pat- terns of 6mA and their functions are not quite conserved among or- ganisms. 6mA was found enriched near the transcription start sites (TSS) in Chlamydomonas (Fu et al., 2015) and at the repeats in Drosophila, Mus musculus and Danio rerio (Zhang et al., 2015; Liu et al., 2016; Wu et al., 2016), and commonly depleted from gene exons in Xenopus laevis and M. musculus (Koziol et al., 2016). In several species, 6mA was associated with transcriptionally active genes (Fu et al., 2015; Mondo et al., 2017), and it was also found correlated with gene silencing in mammalian embryonic stem cells (Wu et al., 2016).


September 22, 2019

SvABA: genome-wide detection of structural variants and indels by local assembly.

Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a powerful approach to identifying SVs, but is difficult to apply at scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA’s performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (<1000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types and found that short templated-sequence insertions occur in ~4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized (50-300 bp) SVs.© 2018 Wala et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Conserved genomic and amino acid traits of cold adaptation in subzero-growing Arctic permafrost bacteria.

Permafrost accounts for 27% of all soil ecosystems and harbors diverse microbial communities. Our understanding of microorganisms in permafrost, their activities and adaptations, remains limited. Using five subzero-growing (cryophilic) permafrost bacteria, we examined features of cold adaptation through comparative genomic analyses with mesophilic relatives. The cryophiles possess genes associated with cold adaptation, including cold shock proteins, RNA helicases, and oxidative stress and carotenoid synthesis enzymes. Higher abundances of genes associated with compatible solutes were observed, important for osmoregulation in permafrost brine veins. Most cryophiles in our study have higher transposase copy numbers than mesophiles. We investigated amino acid (AA) modifications in the cryophiles favoring increased protein flexibility at cold temperatures. Although overall there were few differences with the mesophiles, we found evidence of cold adaptation, with significant differences in proline, serine, glycine and aromaticity, in several cryophiles. The use of cold/hot AA ratios of >1, used in previous studies to indicate cold adaptation, was found to be inadequate on its own. Comparing the average of all cryophiles to all mesophiles, we found that overall cryophiles had a higher ratio of cold adapted proteins for serine (more serine), and to a lesser extent, proline and acidic residues (fewer prolines/acidic residues).


September 22, 2019

Evolution of sequence type 4821 clonal complex meningococcal strains in China from prequinolone to quinolone era, 1972-2013.

The expansion of hypervirulent sequence type 4821 clonal complex (CC4821) lineage Neisseria meningitidis bacteria has led to a shift in meningococcal disease epidemiology in China, from serogroup A (MenA) to MenC. Knowledge of the evolution and genetic origin of the emergent MenC strains is limited. In this study, we subjected 76 CC4821 isolates collected across China during 1972-1977 and 2005-2013 to phylogenetic analysis, traditional genotyping, or both. We show that successive recombination events within genes encoding surface antigens and acquisition of quinolone resistance mutations possibly played a role in the emergence of CC4821 as an epidemic clone in China. MenC and MenB CC4821 strains have spread across China and have been detected in several countries in different continents. Capsular switches involving serogroups B and C occurred among epidemic strains, raising concerns regarding possible increases in MenB disease, given that vaccines in use in China do not protect against MenB.


September 22, 2019

Complete genome sequences of seven Vibrio anguillarum strains as derived from PacBio sequencing.

We report here the complete genome sequences of seven Vibrio anguillarum strains isolated from multiple geographic locations, thus increasing the total number of genomes of finished quality to 11. The genomes were de novo assembled from long-sequence PacBio reads. Including draft genomes, a total of 44?V. anguillarum genomes are currently available in the genome databases. They represent an important resource in the study of, for example, genetic variations and for identifying virulence determinants. In this article, we present the genomes and basic genome comparisons of the 11 complete genomes, including a BRIG analysis, and pan genome calculation. We also describe some structural features of superintegrons on chromosome 2?s, and associated insertion sequence (IS) elements, including 18 new ISs (ISVa3?-?ISVa20), both of importance in the complement of V. anguillarum genomes.


September 22, 2019

The genome sequence of “Candidatus Fokinia solitaria”: Insights on reductive evolution in Rickettsiales.

Candidatus Fokinia solitaria is an obligate intracellular endosymbiont of a unicellular eukaryote, a ciliate of the genus Paramecium. Here, we present the genome sequence of this bacterium and subsequent analysis. Phylogenomic analysis confirmed the previously reported positioning of the symbiont within the “Candidatus Midichloriaceae” family (order Rickettsiales), as well as its high sequence divergence from other members of the family, indicative of fast sequence evolution. Consistently with this high evolutionary rate, a comparative genomic analysis revealed that the genome of this symbiont is the smallest of the Rickettsiales to date. The reduced genome does not present flagellar genes, nor the pathway for the biosynthesis of lipopolysaccharides (present in all the other so far sequenced members of the family “Candidatus Midichloriaceae”) or genes for the Krebs cycle (present, although not always complete, in Rickettsiales). These results indicate an evolutionary trend toward a stronger dependence on the host, in comparison with other members of the family. Two alternative scenarios are compatible with our results; “Candidatus Fokinia solitaria” could be either a recently evolved, vertically transmitted mutualist, or a parasite with a high host-specificity.


September 22, 2019

Developing collaborative works for faster progress on fungal respiratory infections in cystic fibrosis.

Cystic fibrosis (CF) is the major genetic inherited disease in Caucasian populations. The respiratory tract of CF patients displays a sticky viscous mucus, which allows for the entrapment of airborne bacteria and fungal spores and provides a suitable environment for growth of microorganisms, including numerous yeast and filamentous fungal species. As a consequence, respiratory infections are the major cause of morbidity and mortality in this clinical context. Although bacteria remain the most common agents of these infections, fungal respiratory infections have emerged as an important cause of disease. Therefore, the International Society for Human and Animal Mycology (ISHAM) has launched a working group on Fungal respiratory infections in Cystic Fibrosis (Fri-CF) in October 2006, which was subsequently approved by the European Confederation of Medical Mycology (ECMM). Meetings of this working group, comprising both clinicians and mycologists involved in the follow-up of CF patients, as well as basic scientists interested in the fungal species involved, provided the opportunity to initiate collaborative works aimed to improve our knowledge on these infections to assist clinicians in patient management. The current review highlights the outcomes of some of these collaborative works in clinical surveillance, pathogenesis and treatment, giving special emphasis to standardization of culture procedures, improvement of species identification methods including the development of nonculture-based diagnostic methods, microbiome studies and identification of new biological markers, and the description of genotyping studies aiming to differentiate transient carriage and chronic colonization of the airways. The review also reports on the breakthrough in sequencing the genomes of the main Scedosporium species as basis for a better understanding of the pathogenic mechanisms of these fungi, and discusses treatment options of infections caused by multidrug resistant microorganisms, such as Scedosporium and Lomentospora species and members of the Rasamsonia argillacea species complex.


September 22, 2019

Haemophilus influenzae genome evolution during persistence in the human airways in chronic obstructive pulmonary disease.

Nontypeable Haemophilus influenzae (NTHi) exclusively colonize and infect humans and are critical to the pathogenesis of chronic obstructive pulmonary disease (COPD). In vitro and animal models do not accurately capture the complex environments encountered by NTHi during human infection. We conducted whole-genome sequencing of 269 longitudinally collected cleared and persistent NTHi from a 15-y prospective study of adults with COPD. Genome sequences were used to elucidate the phylogeny of NTHi isolates, identify genomic changes that occur with persistence in the human airways, and evaluate the effect of selective pressure on 12 candidate vaccine antigens. Strains persisted in individuals with COPD for as long as 1,422 d. Slipped-strand mispairing, mediated by changes in simple sequence repeats in multiple genes during persistence, regulates expression of critical virulence functions, including adherence, nutrient uptake, and modification of surface molecules, and is a major mechanism for survival in the hostile environment of the human airways. A subset of strains underwent a large 400-kb inversion during persistence. NTHi does not undergo significant gene gain or loss during persistence, in contrast to other persistent respiratory tract pathogens. Amino acid sequence changes occurred in 8 of 12 candidate vaccine antigens during persistence, an observation with important implications for vaccine development. These results indicate that NTHi alters its genome during persistence by regulation of critical virulence functions primarily by slipped-strand mispairing, advancing our understanding of how a bacterial pathogen that plays a critical role in COPD adapts to survival in the human respiratory tract.


September 22, 2019

Heterologous expression guides identification of the biosynthetic gene cluster of chuangxinmycin, an indole alkaloid antibiotic.

The indole alkaloid antibiotic chuangxinmycin, from Actinobacteria Actinoplanes tsinanensis, containing a unique thiopyrano[4,3,2- cd]indole scaffold, is a potent and selective inhibitor of bacterial tryptophanyl-tRNA synthetase. The chuangxinmycin biosynthetic gene cluster was identified by in silico analysis of the genome sequence, then verified by heterologous expression. Systemic gene inactivation and intermediate identification determined the minimum set of genes for unique thiopyrano[4,3,2- cd]indole formation and the concerted action of a radical S-adenosylmethionine protein plus an unknown protein for addition of the 3-methyl group. These findings set a solid foundation for comprehensively investigating the biosynthesis, optimizing yield, and generating new analogues of chuangxinmycin.


September 22, 2019

Distinct evolutionary patterns of Neisseria meningitidis serogroup B disease outbreaks at two universities in the USA.

Neisseria meningitidis serogroup B (MnB) was responsible for two independent meningococcal disease outbreaks at universities in the USA during 2013. The first at University A in New Jersey included nine confirmed cases reported between March 2013 and March 2014. The second outbreak occurred at University B in California, with four confirmed cases during November 2013. The public health response to these outbreaks included the approval and deployment of a serogroup B meningococcal vaccine that was not yet licensed in the USA. This study investigated the use of whole-genome sequencing(WGS) to examine the genetic profile of the disease-causing outbreak isolates at each university. Comparative WGS revealed differences in evolutionary patterns between the two disease outbreaks. The University A outbreak isolates were very closely related, with differences primarily attributed to single nucleotide polymorphisms/insertion-deletion (SNP/indel) events. In contrast, the University B outbreak isolates segregated into two phylogenetic clades, differing in large part due to recombination events covering extensive regions (>30?kb) of the genome including virulence factors. This high-resolution comparison of two meningococcal disease outbreaks further demonstrates the genetic complexity of meningococcal bacteria as related to evolution and disease virulence.


September 22, 2019

Case report of an extensively drug-resistant Klebsiella pneumoniae infection with genomic characterization of the strain and review of similar cases in the United States

Reports of extensively drug-resistant and pan-drug-resistant Klebsiella pneumoniae (XDR-KP and PDR-KP) cases are increasing worldwide. Here, we report a case of XDR-KP with an in-depth molecular characterization of resistance genes using whole-genome sequencing, and we review all cases of XDR-KP and PDR-KP reported in the United States to date.


September 22, 2019

Virgibacillus phasianinus sp. nov., a halophilic bacterium isolated from faeces of a Swinhoe’s pheasant, Lophura swinhoii.

A rod-shaped, Gram-stain-positive, motile and aerobic bacterium, designated LM2416T, was isolated from faeces of Lophuras winhoii living in Seoul Grand Park, Gyeonggi-do, Republic of Korea. Phylogenetic analyses based on 16S rRNA gene sequences indicated that strain LM2416T belonged to the genus Virgibacillus, sharing high 16S rRNA gene sequence similarities to Virgibacillus necropolis LMG 19488T (99.0?%), Virgibacillus carmonensis LMG 20964T (98.4?%), Virgibacillus arcticus Hal 1T (98.3?%) and Virgibacillus flavescens S1-20T (97.9?%). The isolate grew at 10-30?°C, pH 6-7 and 0-20?% (w/v) NaCl. Optimal growth was observed at 30?°C, pH 6-7 and 10?% (w/v) NaCl. The major fatty acid was anteiso-C15?:?0. Polar lipids were composed of phosphatidylglycerol, diphosphatidylglycerol, three unknown phospholipids and two unknown aminophospholipids. The main menaquinone was MK-7. Strain LM2416T had alanine, lysine, glutamic acid, glycine and aspartic acid as cell-wall amino acids and ribose as a cell-wall sugar. The whole genome sequences of strain LM2416T and V. necropolis KCTC 3820T were sequenced by PacBio RS II sequencing. The genome sequence-based G+C?content of strain LM2416T was 39.5?mol%. The orthologous average nucleotide identity value, showing genetic relatedness between strain LM2416T and V. necropolis KCTC 3820T, was 78.3?%. Based on the phylogenetic, biochemical, chemotaxonomic and genotypic data presented in this study, strain LM2416T is considered to represent a novel species of the genus Virgibacillus, for which the name Virgibacillus phasianinus is proposed. The type strain is LM2416T (=KCTC 33927T=JCM 32144T).


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.