Menu
July 7, 2019

Improving and correcting the contiguity of long-read genome assemblies of three plant species using optical mapping and chromosome conformation capture data.

Long-read sequencing can overcome the weaknesses of short reads in the assembly of eukaryotic genomes, however, at present additional scaffolding is needed to achieve chromosome-level assemblies. We generated PacBio long-read data of the genomes of three relatives of the model plant Arabidopsis thaliana and assembled all three genomes into only a few hundred contigs. To improve the contiguities of these assemblies, we generated BioNano Genomics optical mapping and Dovetail Genomics chromosome conformation capture data for genome scaffolding. Despite their technical differences, optical mapping and chromosome conformation capture performed similarly and doubled N50 values. After improving both integration methods, assembly contiguity reached chromosome-arm-levels. We rigorously assessed the quality of contigs and scaffolds using Illumina mate-pair libraries and genetic map information. This showed that PacBio assemblies have high sequence accuracy but can contain several misassemblies, which join unlinked regions of the genome. Most, but not all of these mis-joints were removed during the integration of the optical mapping and chromosome conformation capture data. Even though none of the centromeres was fully assembled, the scaffolds revealed large parts of some centromeric regions, even including some of the heterochromatic regions, which are not present in gold standard reference sequences. Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Antibiotic discovery throughout the Small World Initiative: A molecular strategy to identify biosynthetic gene clusters involved in antagonistic activity.

The emergence of bacterial pathogens resistant to all known antibiotics is a global health crisis. Adding to this problem is that major pharmaceutical companies have shifted away from antibiotic discovery due to low profitability. As a result, the pipeline of new antibiotics is essentially dry and many bacteria now resist the effects of most commonly used drugs. To address this global health concern, citizen science through the Small World Initiative (SWI) was formed in 2012. As part of SWI, students isolate bacteria from their local environments, characterize the strains, and assay for antibiotic production. During the 2015 fall semester at Bowling Green State University, students isolated 77 soil-derived bacteria and genetically characterized strains using the 16S rRNA gene, identified strains exhibiting antagonistic activity, and performed an expanded SWI workflow using transposon mutagenesis to identify a biosynthetic gene cluster involved in toxigenic compound production. We identified one mutant with loss of antagonistic activity and through subsequent whole-genome sequencing and linker-mediated PCR identified a 24.9 kb biosynthetic gene locus likely involved in inhibitory activity in that mutant. Further assessment against human pathogens demonstrated the inhibition of Bacillus cereus, Listeria monocytogenes, and methicillin-resistant Staphylococcus aureus in the presence of this compound, thus supporting our molecular strategy as an effective research pipeline for SWI antibiotic discovery and genetic characterization.© 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.


July 7, 2019

Genome scaffolding and annotation for the pathogen vector Ixodes ricinus by ultra-long single molecule sequencing.

Global warming and other ecological changes have facilitated the expansion of Ixodes ricinus tick populations. Ixodes ricinus is the most important carrier of vector-borne pathogens in Europe, transmitting viruses, protozoa and bacteria, in particular Borrelia burgdorferi (sensu lato), the causative agent of Lyme borreliosis, the most prevalent vector-borne disease in humans in the Northern hemisphere. To faster control this disease vector, a better understanding of the I. ricinus tick is necessary. To facilitate such studies, we recently published the first reference genome of this highly prevalent pathogen vector. Here, we further extend these studies by scaffolding and annotating the first reference genome by using ultra-long sequencing reads from third generation single molecule sequencing. In addition, we present the first genome size estimation for I. ricinus ticks and the embryo-derived cell line IRE/CTVM19.235,953 contigs were integrated into 204,904 scaffolds, extending the currently known genome lengths by more than 30% from 393 to 516 Mb and the N50 contig value by 87% from 1643 bp to a N50 scaffold value of 3067 bp. In addition, 25,263 sequences were annotated by comparison to the tick’s North American relative Ixodes scapularis. After (conserved) hypothetical proteins, zinc finger proteins, secreted proteins and P450 coding proteins were the most prevalent protein categories annotated. Interestingly, more than 50% of the amino acid sequences matching the homology threshold had 95-100% identity to the corresponding I. scapularis gene models. The sequence information was complemented by the first genome size estimation for this species. Flow cytometry-based genome size analysis revealed a haploid genome size of 2.65Gb for I. ricinus ticks and 3.80 Gb for the cell line.We present a first draft sequence map of the I. ricinus genome based on a PacBio-Illumina assembly. The I. ricinus genome was shown to be 26% (500 Mb) larger than the genome of its American relative I. scapularis. Based on the genome size of 2.65 Gb we estimated that we covered about 67% of the non-repetitive sequences. Genome annotation will facilitate screening for specific molecular pathways in I. ricinus cells and provides an overview of characteristics and functions.


July 7, 2019

Assessment of insertion sequence mobilization as an adaptive response to oxidative stress in Acinetobacter baumannii using IS-Seq.

Insertion sequence (IS) elements are found throughout bacterial genomes and contribute to genome variation by interrupting genes or altering gene expression. Few of the more than thirty IS elements described in Acinetobacter baumannii have been characterized for transposition activity or expression effects. A targeted sequencing method, IS-seq, was developed to efficiently map the locations of new insertion events in A. baumannii genomes and was used to identify novel IS sites following growth in the presence of hydrogen peroxide, which causes oxidative stress. Serial subculture in the presence of sub-inhibitory concentrations of hydrogen peroxide led to rapid selection of cells carrying an ISAba1 element upstream of the catalase/peroxidase gene katG Several additional sites for the elements ISAba1, ISAba13, ISAba25, ISAba26, and ISAba125 were found at low abundance after serial subculture, indicating that each element is active and contributes to genetic variation that may be subject to selection. Following hydrogen peroxide exposure, rapid changes in gene expression were observed in genes related to iron homeostasis. The IS insertions adjacent to katG resulted in more than 20-fold overexpression of the gene and increased hydrogen peroxide tolerance.Importance Insertion sequences (IS) are contribute to genomic and phenotypic variation in many bacterial species, but little is known about how transposition rates vary among elements or how selective pressure influences this process. A new method, termed “IS-seq” for identifying new insertion locations that arise under experimental growth conditions in the genome was developed and tested with cells grown in the presence of hydrogen peroxide, which causes oxidative stress. Gene expression changes in response to hydrogen peroxide exposure are similar to those observed in other species and include genes that control free iron concentrations. New IS insertions adjacent to a gene encoding a catalase enzyme confirm that IS elements can rapidly contribute to adaptive variation in the presence of selection. Copyright © 2017 Wright et al.


July 7, 2019

Draft genome sequence of Halolamina pelagica CDK2 isolated from natural salterns from Rann of Kutch, Gujarat, India.

Halolamina pelagica strain CDK2, a halophilic archaeon (growth range 1.36 to 5.12 M NaCl), was isolated from rhizosphere of wild grasses of hypersaline soil of the Rann of Kutch, Gujarat, India. Its draft genome contains 2,972,542 bp and 3,485 coding sequences, depicting genes for halophilic serine proteases and trehalose synthesis. Copyright © 2017 Gaba et al.


July 7, 2019

Genomic sequencing of a strain of Acinetobacter baumannii and potential mechanisms to antibiotics resistance.

Acinetobacter baumannii has been becoming a great challenge to clinicians due to their resistance to almost all available antibiotics. In this study, we sequenced the genome from a multiple antibiotics resistant Acinetobacter baumannii stain which was named A. baumannii-1isolated from China by SMRT sequencing technology to explore its potential mechanisms to antibiotic resistance. We found that several mechanisms might contribute to the antibiotic resistance of Acinetobacter baumannii. Specifically, we found that SNP in genes associated with nucleotide excision repair and ABC transporter might contribute to its resistance to multiple antibiotics; we also found that specific genes associated with bacterial DNA integration and recombination, DNA-mediated transposition and response to antibiotics might contribute to its resistance to multiple antibiotics; Furthermore, specific genes associated with penicillin and cephalosporin biosynthetic pathway and specific genes associated with CHDL and MBL ß-lactamase genes might contribute to its resistance to multiple antibiotics. Thus, the detailed mechanisms by which Acinetobacter baumannii show extensive resistance to multiple antibiotics are very complicated. Such a study might be helpful to develop new strategies to control Acinetobacter baumannii infection. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Patterns of polymorphism at the self-incompatibility locus in 1,083 Arabidopsis thaliana genomes.

Although the transition to selfing in the model plant Arabidopsis thaliana involved the loss of the self-incompatibility (SI) system, it clearly did not occur due to the fixation of a single inactivating mutation at the locus determining the specificities of SI (the S-locus). At least three groups of divergent haplotypes (haplogroups), corresponding to ancient functional S-alleles, have been maintained at this locus, and extensive functional studies have shown that all three carry distinct inactivating mutations. However, the historical process of loss of SI is not well understood, in particular its relation with the last glaciation. Here, we took advantage of recently published genomic resequencing data in 1,083 Arabidopsis thaliana accessions that we combined with BAC sequencing to obtain polymorphism information for the whole S-locus region at a species-wide scale. The accessions differed by several major rearrangements including large deletions and interhaplogroup recombinations, forming a set of haplogroups that are widely distributed throughout the native range and largely overlap geographically. “Relict” A. thaliana accessions that directly derive from glacial refugia are polymorphic at the S-locus, suggesting that the three haplogroups were already present when glacial refugia from the last Ice Age became isolated. Interhaplogroup recombinant haplotypes were highly frequent, and detailed analysis of recombination breakpoints suggested multiple independent origins. These findings suggest that the complete loss of SI in A. thaliana involved independent self-compatible mutants that arose prior to the last Ice Age, and experienced further rearrangements during postglacial colonization.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Review of the algal biology program within the National Alliance for Advanced Biofuels and Bioproducts

In 2010, when the National Alliance for Advanced Biofuels and Bioproducts (NAABB) consortium began, little was known about the molecular basis of algal biomass or oil production. Very few algal genome sequences were available and efforts to identify the best-producing wild species through bioprospecting approaches had largely stalled after the U.S. Department of Energy’s Aquatic Species Program. This lack of knowledge included how reduced carbon was partitioned into storage products like triglycerides or starch and the role played by metabolite remodeling in the accumulation of energy-dense storage products. Furthermore, genetic transformation and metabolic engineering approaches to improve algal biomass and oil yields were in their infancy. Genome sequencing and transcriptional profiling were becoming less expensive, however; and the tools to annotate gene expression profiles under various growth and engineered conditions were just starting to be developed for algae. It was in this context that an integrated algal biology program was introduced in the NAABB to address the greatest constraints limiting algal biomass yield. This review describes the NAABB algal biology program, including hypotheses, research objectives, and strategies to move algal biology research into the twenty-first century and to realize the greatest potential of algae biomass systems to produce biofuels.


July 7, 2019

Complete genome sequence of Edwardsiella hoshinae ATCC 35051.

Edwardsiella hoshinae is a Gram-negative facultative anaerobe that has primarily been isolated from avians and reptiles. We report here the complete and annotated genome sequence of an isolate from a monitor lizard (Varanus sp.), which contains a chromosome of 3,811,650 bp and no plasmids. Copyright © 2017 Reichley et al.


July 7, 2019

Draft genome sequence of Karnal bunt pathogen (Tilletia indica) of wheat provides insights into the pathogenic mechanisms of quarantined fungus.

Karnal bunt disease in wheat is caused by hemibiotrophic fungus, Tilletia indica that has been placed as quarantine pest in more than 70 countries. Despite its economic importance, little knowledge about the molecular components of fungal pathogenesis is known. In this study, first time the genome sequence of T. indica has been deciphered for unraveling the effectors’ functions of molecular pathogenesis of Karnal bunt disease. The T. indica genome was sequenced employing hybrid approach of PacBio Single Molecule Real Time (SMRT) and Illumina HiSEQ 2000 sequencing platforms. The genome was assembled into 10,957 contigs (N50 contig length 3 kb) with total size of 26.7 Mb and GC content of 53.99%. The number of predicted putative genes were 11,535, which were annotated with Gene Ontology databases. Functional annotation of Karnal bunt pathogen genome and classification of identified effectors into protein families revealed interesting functions related to pathogenesis. Search for effectors’ genes using pathogen host interaction database identified 135 genes. The T. indica genome sequence and putative genes involved in molecular pathogenesis would further help in devising novel and effective disease management strategies including development of resistant wheat genotypes, novel biomarkers for pathogen detection and new targets for fungicide development.


July 7, 2019

Identification and bacterial characteristics of Xenorhabdus hominickii ANU101 from an entomopathogenic nematode, Steinernema monticolum.

An entomopathogenic nematode, Steinernema monticolum, was collected in Korea. Its identity was confirmed by morphological and molecular characters. Its symbiotic bacterium, Xenorhabdus hominickii ANU101, was isolated and assessed in terms of bacterial characteristics. Sixty-eight different carbon sources were utilized by X. hominickii ANU101 out of 95 different sources from a Biolog assay. Compared to other Xenorhabdus species, X. hominickii ANU101 was relatively susceptible to high temperatures and did not grow above 34°C. Furthermore, its growth rate was much slower than other Xenorhabdus species. X. hominickii exhibited insecticidal activities against coleopteran, dipteran, and lepidopteran insect pests. The bacterial virulence was not correlated with its host nematode virulence with respect to relative insecticidal activity against target insects. X. hominickii ANU101 exhibited antibiotics tolerance. The bacterium possesses four different plasmids (Xh-P1 (104,132bp), Xh-P2 (95,975bp), Xh-P3 (88,536bp), and Xh-P4 (11,403bp)) and encodes 332 open reading frames. Subsequent predicted genes include toxin/antitoxins comprising a multidrug export ATP-binding/permease. This study reports bacterial characters of X. hominickii and its entomopathogenicity. Copyright © 2017 Elsevier Inc. All rights reserved.


July 7, 2019

The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplification and insertion of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene across the genome. Increased EPSPS gene copy numbers results in higher titers of the EPSPS enzyme, the target of glyphosate, and confers resistance to glyphosate treatment. To understand the genomic unit and mechanism of EPSPS gene copy number proliferation, we developed and used a bacterial artificial chromosome (BAC) library from a highly resistant biotype to sequence the local genomic landscape flanking the EPSPS gene.By sequencing overlapping BACs, a 297 kb sequence was generated, hereafter referred to as the “EPSPS cassette.” This region included several putative genes, dense clusters of tandem and inverted repeats, putative helitron and autonomous replication sequences, and regulatory elements. Whole genome shotgun sequencing (WGS) of two biotypes exhibiting high and no resistance to glyphosate was performed to compare genomic representation across the EPSPS cassette. Mapping of sequences for both biotypes to the reference EPSPS cassette revealed significant differences in upstream and downstream sequences relative to EPSPS with regard to both repetitive units and coding content between these biotypes. The differences in sequence may have resulted from a compounded-building mechanism such as repetitive transpositional events. The association of putative helitron sequences with the cassette suggests a possible amplification and distribution mechanism. Flow cytometry revealed that the EPSPS cassette added measurable genomic content.The adoption of glyphosate resistant cropping systems in major crops such as corn, soybean, cotton and canola coupled with excessive use of glyphosate herbicide has led to evolved glyphosate resistance in several important weeds. In Amaranthus palmeri, the amplification of the EPSPS cassette, characterized by a complex array of repetitive elements and putative helitron sequences, suggests an adaptive structural genomic mechanism that drives amplification and distribution around the genome. The added genomic content not found in glyphosate sensitive plants may be driving evolution through genome expansion.


July 7, 2019

A genomic view of short tandem repeats.

Short tandem repeats (STRs) are some of the fastest mutating loci in the genome. Tools for accurately profiling STRs from high-throughput sequencing data have enabled genome-wide interrogation of more than a million STRs across hundreds of individuals. These catalogs have revealed that STRs are highly multiallelic and may contribute more de novo mutations than any other variant class. Recent studies have leveraged these catalogs to show that STRs play a widespread role in regulating gene expression and other molecular phenotypes. These analyses suggest that STRs are an underappreciated but rich reservoir of variation that likely make significant contributions to Mendelian diseases, complex traits, and cancer. Copyright © 2017 Elsevier Ltd. All rights reserved.


July 7, 2019

A Clostridioides difficile bacteriophage genome encodes functional binary toxin-associated genes.

Pathogenic clostridia typically produce toxins as virulence factors which cause severe diseases in both humans and animals. Whereas many clostridia like e.g., Clostridium perfringens, Clostridium botulinum or Clostridium tetani were shown to contain toxin-encoding plasmids, only toxin genes located on the chromosome were detected in Clostridioides difficile so far. In this study, we determined, annotated, and analyzed the complete genome of the bacteriophage phiSemix9P1 using single-molecule real-time sequencing technology (SMRT). To our knowledge, this represents the first C. difficile-associated bacteriophage genome that carries a complete functional binary toxin locus in its genome. Copyright © 2017 Elsevier B.V. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.