Menu
July 7, 2019

Wham: Identifying structural variants of biological consequence.

Existing methods for identifying structural variants (SVs) from short read datasets are inaccurate. This complicates disease-gene identification and efforts to understand the consequences of genetic variation. In response, we have created Wham (Whole-genome Alignment Metrics) to provide a single, integrated framework for both structural variant calling and association testing, thereby bypassing many of the difficulties that currently frustrate attempts to employ SVs in association testing. Here we describe Wham, benchmark it against three other widely used SV identification tools-Lumpy, Delly and SoftSearch-and demonstrate Wham’s ability to identify and associate SVs with phenotypes using data from humans, domestic pigeons, and vaccinia virus. Wham and all associated software are covered under the MIT License and can be freely downloaded from github (https://github.com/zeeev/wham), with documentation on a wiki (http://zeeev.github.io/wham/). For community support please post questions to https://www.biostars.org/.


July 7, 2019

Complete genome sequences of 11 Bordetella pertussis strains representing the pandemic ptxP3 lineage.

Pathogen adaptation has contributed to the resurgence of pertussis. To facilitate our understanding of this adaptation we report here 11 completely closed and annotated Bordetella pertussis genomes representing the pandemic ptxP3 lineage. Our analyses included six strains which do not produce the vaccine components pertactin and/or filamentous hemagglutinin. Copyright © 2015 Bart et al.


July 7, 2019

Insights into sex chromosome evolution and aging from the genome of a short-lived fish.

The killifish Nothobranchius furzeri is the shortest-lived vertebrate that can be bred in the laboratory. Its rapid growth, early sexual maturation, fast aging, and arrested embryonic development (diapause) make it an attractive model organism in biomedical research. Here, we report a draft sequence of its genome that allowed us to uncover an intra-species Y chromosome polymorphism representing-in real time-different stages of sex chromosome formation that display features of early mammalian XY evolution “in action.” Our data suggest that gdf6Y, encoding a TGF-ß family growth factor, is the master sex-determining gene in N. furzeri. Moreover, we observed genomic clustering of aging-related genes, identified genes under positive selection, and revealed significant similarities of gene expression profiles between diapause and aging, particularly for genes controlling cell cycle and translation. The annotated genome sequence is provided as an online resource (http://www.nothobranchius.info/NFINgb). Copyright © 2015 Elsevier Inc. All rights reserved.


July 7, 2019

Complete genome sequence of Salinicoccus halodurans H3B36, isolated from the Qaidam Basin in China.

Salinicoccus halodurans H3B36 is a moderately halophilic bacterium isolated from a sediment sample of Qaidam Basin at 3.2 m vertical depth. Strain H3B36 accumulate N (a)-acetyl-a-lysine as compatible solute against salinity and heat stresses and may have potential applications in industrial biotechnology. In this study, we sequenced the genome of strain H3B36 using single molecule, real-time sequencing technology on a PacBio RS II instrument. The complete genome of strain H3B36 was 2,778,379 bp and contained 2,853 protein-coding genes, 12 rRNA genes, and 61 tRNA genes with 58 tandem repeats, six minisatellite DNA sequences, 11 genome islands, and no CRISPR repeat region. Further analysis of epigenetic modifications revealed the presence of 11,000 m4C-type modified bases, 7,545 m6A-type modified bases, and 89,064 other modified bases. The data on the genome of this strain may provide an insight into the metabolism of N (a)-acetyl-a-lysine.


July 7, 2019

Hybrid de novo genome assembly of the Chinese herbal plant danshen (Salvia miltiorrhiza Bunge)

Danshen (Salvia miltiorrhiza Bunge), also known as Chinese red sage, is a member of Lamiaceae family. It is valued in traditional Chinese medicine, primarily for the treatment of cardiovascular and cerebrovascular diseases. Because of its pharmacological potential, ongoing research aims to identify novel bioactive compounds in danshen, and their biosynthetic pathways. To date, only expressed sequence tag (EST) and RNA-seq data for this herbal plant are available to the public. We therefore propose that the construction of a reference genome for danshen will help elucidate the biosynthetic pathways of important secondary metabolites, thereby advancing the investigation of novel drugs from this plant.


July 7, 2019

Molecular characterization using next generation sequencing of plasmids containing blaNDM-7 in Enterobacteriaceae from Calgary, Canada.

Enterobacteriaceae with blaNDM-7 is relatively uncommon and had previously been described in Europe, India, USA and Japan. This study describes the characteristics of Enterobacteriaceae [Klebsiella pneumoniae (n=2), Escherichia coli (n=2), Serratia marcescens (n=1), Enterobacter hormaechei (n=1)] with blaNDM-7 obtained in 4 patients from Calgary, Canada during 2013-4. The 46,161 bp IncX3 plasmids with blaNDM-7 are highly similar to other blaNDM-harboring IncX3 plasmids and interestingly, showed identical structures within the different isolates. This finding may indicate horizontal transmission within our health region or may indicate contact with individuals from endemic areas within the hospital setting. Patients infected or colonized with bacteria containing blaNDM-7 IncX3 plasmids will generate infection control challenges. Epidemiological and molecular studies are required to better understand the dynamics of transmission, risk factors and reservoirs for bacteria harboring blaNDM-7. To the best of our knowledge, this is the first report of S. marcescens, and E. hormaechei with blaNDM-7. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Organellar genomes of white spruce (Picea glauca): assembly and annotation.

The genome sequences of the plastid and mitochondrion of white spruce (Picea glauca) were assembled from whole-genome shotgun sequencing data using ABySS. The sequencing data contained reads from both the nuclear and organellar genomes, and reads of the organellar genomes were abundant in the data as each cell harbors hundreds of mitochondria and plastids. Hence, assembly of the 123-kb plastid and 5.9-Mb mitochondrial genomes were accomplished by analyzing data sets primarily representing low coverage of the nuclear genome. The assembled organellar genomes were annotated for their coding genes, ribosomal RNA, and transfer RNA. Transcript abundances of the mitochondrial genes were quantified in three developmental tissues and five mature tissues using data from RNA-seq experiments. C-to-U RNA editing was observed in the majority of mitochondrial genes, and in four genes, editing events were noted to modify ACG codons to create cryptic AUG start codons. The informatics methodology presented in this study should prove useful to assemble organellar genomes of other plant species using whole-genome shotgun sequencing data. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Next-generation sequencing and comparative analysis of sequential outbreaks caused by multidrug-resistant Acinetobacter baumannii at a large academic burn center.

Next-generation sequencing (NGS) analysis has emerged as a promising molecular epidemiological method for investigating health care-associated outbreaks. Here, we used NGS to investigate a 3-year outbreak of multidrug-resistant Acinetobacter baumannii (MDRAB) at a large academic burn center. A reference genome from the index case was generated using de novo assembly of PacBio reads. Forty-six MDRAB isolates were analyzed by pulsed-field gel electrophoresis (PFGE) and sequenced using an Illumina platform. After mapping to the index case reference genome, four samples were excluded due to low coverage, leaving 42 samples for further analysis. Multilocus sequence types (MLST) and the presence of acquired resistance genes were also determined from the sequencing data. A transmission network was inferred from genomic and epidemiological data using a Bayesian framework. Based on single-nucleotide variant (SNV) differences, this MDRAB outbreak represented three sequential outbreaks caused by distinct clones. The first and second outbreaks were caused by sequence type 2 (ST2), while the third outbreak was caused by ST79. For the second outbreak, the MLST and PFGE results were discordant. However, NGS-based SNV typing detected a recombination event and consequently enabled a more accurate phylogenetic analysis. The distribution of resistance genes varied among the three outbreaks. The first- and second-outbreak strains possessed a blaOXA-23-like group, while the third-outbreak strains harbored a blaOXA-40-like group. NGS-based analysis demonstrated the superior resolution of outbreak transmission networks for MDRAB and provided insight into the mechanisms of strain diversification between sequential outbreaks through recombination. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Global insights into acetic acid resistance mechanisms and genetic stability of Acetobacter pasteurianus strains by comparative genomics.

Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains.


July 7, 2019

Bovine NK-lysin: Copy number variation and functional diversification.

NK-lysin is an antimicrobial peptide and effector protein in the host innate immune system. It is coded by a single gene in humans and most other mammalian species. In this study, we provide evidence for the existence of four NK-lysin genes in a repetitive region on cattle chromosome 11. The NK2A, NK2B, and NK2C genes are tandemly arrayed as three copies in ~30-35-kb segments, located 41.8 kb upstream of NK1. All four genes are functional, albeit with differential tissue expression. NK1, NK2A, and NK2B exhibited the highest expression in intestine Peyer’s patch, whereas NK2C was expressed almost exclusively in lung. The four peptide products were synthesized ex vivo, and their antimicrobial effects against both Gram-positive and Gram-negative bacteria were confirmed with a bacteria-killing assay. Transmission electron microcopy indicated that bovine NK-lysins exhibited their antimicrobial activities by lytic action in the cell membranes. In summary, the single NK-lysin gene in other mammals has expanded to a four-member gene family by tandem duplications in cattle; all four genes are transcribed, and the synthetic peptides corresponding to the core regions are biologically active and likely contribute to innate immunity in ruminants.


July 7, 2019

Complete genome sequence of Bacillus methylotrophicus JJ-D34 isolated from deonjang, a Korean traditional fermented soybean paste.

Bacillus methylotrophicus JJ-D34 showing good proteolytic and antipathogenic activities was isolated from doenjang, a Korean traditional fermented soybean paste. Here, we report the complete genome sequence of strain JJ-D34 harboring a 4,105,955bp circular chromosome encoding 4044 genes with a 46.24% G+C content, which will provide insights into the genomic basis of its effects and facilitating its application to doenjang fermentation. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

The complete genome sequence of Bacillus thuringiensis serovar Hailuosis YWC2-8.

Bacillus thuringiensis, a typical aerobic, Gram-positive, spore-forming bacterium, is an important microbial insecticide widely used in the control of agricultural pests. B. thuringiensis serovar Hailuosis YWC2-8 with high insecticidal activity against Diptera and Lepidoptera insects has three insecticidal crystal protein genes, such as cry4Cb2, cry30Ea2, and cry56Aa1. In this study, the complete genome sequence of B. thuringiensis YWC2-8 was analyzed, which contains one circular gapless chromosome and six circular plasmids. Copyright © 2015. Published by Elsevier B.V.


July 7, 2019

Exploring the genomic traits of fungus-feeding bacterial genus Collimonas.

Collimonas is a genus belonging to the class of Betaproteobacteria and consists mostly of soil bacteria with the ability to exploit living fungi as food source (mycophagy). Collimonas strains differ in a range of activities, including swimming motility, quorum sensing, extracellular protease activity, siderophore production, and antimicrobial activities.In order to reveal ecological traits possibly related to Collimonas lifestyle and secondary metabolites production, we performed a comparative genomics analysis based on whole-genome sequencing of six strains representing 3 recognized species. The analysis revealed that the core genome represents 43.1 to 52.7 % of the genomes of the six individual strains. These include genes coding for extracellular enzymes (chitinase, peptidase, phospholipase), iron acquisition and type II secretion systems. In the variable genome, differences were found in genes coding for secondary metabolites (e.g. tripropeptin A and volatile terpenes), several unknown orphan polyketide synthase-nonribosomal peptide synthetase (PKS-NRPS), nonribosomal peptide synthetase (NRPS) gene clusters, a new lipopeptide and type III and type VI secretion systems. Potential roles of the latter genes in the interaction with other organisms were investigated. Mutation of a gene involved in tripropeptin A biosynthesis strongly reduced the antibacterial activity against Staphylococcus aureus, while disruption of a gene involved in the biosynthesis of the new lipopeptide had a large effect on the antifungal/oomycetal activities.Overall our results indicated that Collimonas genomes harbour many genes encoding for novel enzymes and secondary metabolites (including terpenes) important for interactions with other organisms and revealed genomic plasticity, which reflect the behaviour, antimicrobial activity and lifestylesof Collimonas spp.


July 7, 2019

SMRT sequencing of the Campylobacter coli BfR-CA-9557 genome sequence reveals unique methylation motifs.

Campylobacter species are the most prevalent bacterial pathogen causing acute enteritis worldwide. In contrast to Campylobacter jejuni, about 5 % of Campylobacter coli strains exhibit susceptibility to restriction endonuclease digestion by DpnI cutting specifically 5′-G(m)ATC-3′ motifs. This indicates significant differences in DNA methylation between both microbial species. The goal of the study was to analyze the methylome of a C. coli strain susceptible to DpnI digestion, to identify its methylation motifs and restriction modification systems (RM-systems), and compare them to related organisms like C. jejuni and Helicobacter pylori. Using one SMRT cell and the PacBio RS sequencing technology followed by PacBio Modification and Motif Analysis the complete genome of the DpnI susceptible strain C. coli BfR-CA-9557 was sequenced to 500-fold coverage and assembled into a single contig of 1.7 Mbp. The genome contains a CJIE1-like element prophage and is phylogenetically closer to C. coli clade 1 isolates than clade 3. 45,881 6-methylated adenines (ca. 2.7 % of genome positions) that are predominantly arranged in eight different methylation motifs and 1,788 4-methylated cytosines (ca. 0.1 %) have been detected. Only two of these motifs correspond to known restriction modification motifs. Characteristic for this methylome was the very high fraction of methylation of motifs with mostly above 99 %.Only five dominant methylation motifs have been identified in C. jejuni, which have been associated with known RM-systems. C. coli BFR-CA-9557 shares one (RAATTY) of these, but four ORFs could be assigned to putative Type I RM-systems, seven ORFs to Type II RM-systems and three ORFs to Type IV RM-systems. In accordance with DpnI prescreening RM-system IIP, methylation of GATC motifs was detected in C. coli BfR-CA-9557. A homologous IIP RM-system has been described for H. pylori. The remaining methylation motifs are specific for C. coli BfR-CA-9557 and have been neither detected in C. jejuni nor in H. pylori. The results of this study give us new insights into epigenetics of Campylobacteraceae and provide the groundwork to resolve the function of RM-systems in C. coli.


July 7, 2019

Whole-genome sequence of an evolved Clostridium pasteurianum strain reveals Spo0A deficiency responsible for increased butanol production and superior growth.

Biodiesel production results in crude glycerol waste from the transesterification of fatty acids (10 % w/w). The solventogenic Clostridium pasteurianum, an anaerobic Firmicute, can produce butanol from glycerol as the sole carbon source. Coupling butanol fermentation with biodiesel production can improve the overall economic viability of biofuels. However, crude glycerol contains growth-inhibiting byproducts which reduce feedstock consumption and solvent production.To obtain a strain with improved characteristics, a random mutagenesis and directed evolution selection technique was used. A wild-type C. pasteurianum (ATCC 6013) culture was chemically mutagenized, and the resulting population underwent 10 days of selection in increasing concentrations of crude glycerol (80-150 g/L). The best-performing mutant (M150B) showed a 91 % increase in butanol production in 100 g/L crude glycerol compared to the wild-type strain, as well as increased growth rate, a higher final optical density, and less production of the side product PDO (1,3-propanediol). Wild-type and M150B strains were sequenced via Single Molecule Real-Time (SMRT) sequencing. Mutations introduced to the M150B genome were identified by sequence comparison to the wild-type and published closed sequences. A major mutation (a deletion) in the gene of the master transcriptional regulator of sporulation, Spo0A, was identified. A spo0A single gene knockout strain was constructed using a double–crossover genome-editing method. The Spo0A-deficient strain showed similar tolerance to crude glycerol as the evolved mutant strain M150B. Methylation patterns on genomic DNA identified by SMRT sequencing were used to transform plasmid DNA to overcome the native C. pasteurianum restriction endonuclease.Solvent production in the absence of Spo0A shows C. pasteurianum differs in solvent-production regulation compared to other solventogenic Clostridium. Growth-associated butanol production shows C. pasteurianum to be an attractive option for further engineering as it may prove a better candidate for butanol production through continuous fermentation.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.