Menu
July 19, 2019

Next generation sequencing of Actinobacteria for the discovery of novel natural products.

Like many fields of the biosciences, actinomycete natural products research has been revolutionised by next-generation DNA sequencing (NGS). Hundreds of new genome sequences from actinobacteria are made public every year, many of them as a result of projects aimed at identifying new natural products and their biosynthetic pathways through genome mining. Advances in these technologies in the last five years have meant not only a reduction in the cost of whole genome sequencing, but also a substantial increase in the quality of the data, having moved from obtaining a draft genome sequence comprised of several hundred short contigs, sometimes of doubtful reliability, to the possibility of obtaining an almost complete and accurate chromosome sequence in a single contig, allowing a detailed study of gene clusters and the design of strategies for refactoring and full gene cluster synthesis. The impact that these technologies are having in the discovery and study of natural products from actinobacteria, including those from the marine environment, is only starting to be realised. In this review we provide a historical perspective of the field, analyse the strengths and limitations of the most relevant technologies, and share the insights acquired during our genome mining projects.


July 19, 2019

AgIn: Measuring the landscape of CpG methylation of individual repetitive elements.

Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when repetitive regions are long and nearly identical to each other. Single-molecule real-time (SMRT) sequencing is a promising method for observing such regions, because it is not vulnerable to GC bias, it produces long read lengths, and its kinetic information is sensitive to DNA modifications.We propose a novel linear-time algorithm that combines the kinetic information for neighboring CpG sites and increases the confidence in identifying the methylation states of those sites. Using a practical read coverage of ~30-fold from an inbred strain medaka (Oryzias latipes), we observed that both the sensitivity and precision of our method on individual CpG sites were ~93.7%. We also observed a high correlation coefficient (R?=?0.884) between our method and bisulfite sequencing, and for 92.0% of CpG sites, methylation levels ranging over [0, 1] were in concordance within an acceptable difference 0.25. Using this method, we characterized the landscape of the methylation status of repetitive elements, such as LINEs, in the human genome, thereby revealing the strong correlation between CpG density and hypomethylation and detecting hypomethylation hot spots of LTRs and LINEs. We uncovered the methylation states for nearly identical active transposons, two novel LINE insertions of identity ~99% and length 6050 base pairs (bp) in the human genome, and 16 Tol2 elements of identity >99.8% and length 4682?bp in the medaka genome.AgIn (Aggregate on Intervals) is available at: https://github.com/hacone/AgIn CONTACT: ysuzuki@cb.k.u-tokyo.ac.jp, moris@cb.k.u-tokyo.ac.jp SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. © The Author(s) 2016. Published by Oxford University Press.


July 19, 2019

Aquaculture genomics, genetics and breeding in the United States: current status, challenges, and priorities for future research.

Advancing the production efficiency and profitability of aquaculture is dependent upon the ability to utilize a diverse array of genetic resources. The ultimate goals of aquaculture genomics, genetics and breeding research are to enhance aquaculture production efficiency, sustainability, product quality, and profitability in support of the commercial sector and for the benefit of consumers. In order to achieve these goals, it is important to understand the genomic structure and organization of aquaculture species, and their genomic and phenomic variations, as well as the genetic basis of traits and their interrelationships. In addition, it is also important to understand the mechanisms of regulation and evolutionary conservation at the levels of genome, transcriptome, proteome, epigenome, and systems biology. With genomic information and information between the genomes and phenomes, technologies for marker/causal mutation-assisted selection, genome selection, and genome editing can be developed for applications in aquaculture. A set of genomic tools and resources must be made available including reference genome sequences and their annotations (including coding and non-coding regulatory elements), genome-wide polymorphic markers, efficient genotyping platforms, high-density and high-resolution linkage maps, and transcriptome resources including non-coding transcripts. Genomic and genetic control of important performance and production traits, such as disease resistance, feed conversion efficiency, growth rate, processing yield, behaviour, reproductive characteristics, and tolerance to environmental stressors like low dissolved oxygen, high or low water temperature and salinity, must be understood. QTL need to be identified, validated across strains, lines and populations, and their mechanisms of control understood. Causal gene(s) need to be identified. Genetic and epigenetic regulation of important aquaculture traits need to be determined, and technologies for marker-assisted selection, causal gene/mutation-assisted selection, genome selection, and genome editing using CRISPR and other technologies must be developed, demonstrated with applicability, and application to aquaculture industries.Major progress has been made in aquaculture genomics for dozens of fish and shellfish species including the development of genetic linkage maps, physical maps, microarrays, single nucleotide polymorphism (SNP) arrays, transcriptome databases and various stages of genome reference sequences. This paper provides a general review of the current status, challenges and future research needs of aquaculture genomics, genetics, and breeding, with a focus on major aquaculture species in the United States: catfish, rainbow trout, Atlantic salmon, tilapia, striped bass, oysters, and shrimp. While the overall research priorities and the practical goals are similar across various aquaculture species, the current status in each species should dictate the next priority areas within the species. This paper is an output of the USDA Workshop for Aquaculture Genomics, Genetics, and Breeding held in late March 2016 in Auburn, Alabama, with participants from all parts of the United States.


July 19, 2019

Complete genome sequence of Vibrio campbellii strain 20130629003S01 isolated from shrimp with acute hepatopancreatic necrosis disease.

Vibrio campbellii is widely distributed in the marine environment and is an important pathogen of aquatic organisms such as shrimp, fish, and mollusks. An isolate of V. campbellii carrying the pirAB(vp) gene, causing acute hepatopancreatic necrosis disease (AHPND), has been reported. There are no previous reports about the complete genome of V. campbellii causing AHPND (VCAHPND). To extend our understanding of the pathogenesis of VCAHPND at the genomic level, the genome of V. campbellii 20130629003S01 isolated from a shrimp with AHPND was sequenced and analysed.The complete genome sequence of V. campbellii 20130629003S01 was generated using the PacBio RSII platform with single molecule, real-time sequencing. The 20130629003S01 strain consists of two circular chromosomes (3,621,712 bp in chromosome 1 and 2,245,751 bp in chromosome 2) and four plasmids of 70,066, 204,531, 143,140, and 86,121 bp. The genome contains a total of 5855 protein coding genes, 134 tRNA genes and 37 rRNA genes. The average nucleotide identity value of 20130629003S01 and other reference V. campbellii strains was 97.46%, suggesting that they are closely related.The genome sequence of V. campbellii 20130629003S01 and its comparative analysis with other V. campbellii strains that we present here are important for a better understanding of the genomic characteristics of VCAHPND.


July 19, 2019

The highly heterogeneous methylated genomes and diverse restriction-modification systems of bloom-forming Microcystis.

The occurrence of harmful Microcystis blooms is increasing in frequency in a myriad of freshwater ecosystems. Despite considerable research pertaining to the cause and nature of these blooms, the molecular mechanisms behind the cosmopolitan distribution and phenotypic diversity in Microcystis are still unclear. We compared the patterns and extent of DNA methylation in three strains of Microcystis, PCC 7806SL, NIES-2549 and FACHB-1757, using Single Molecule Real-Time (SMRT) sequencing technology. Intact restriction-modification (R-M) systems were identified from the genomes of these strains, and from two previously sequenced strains of Microcystis, NIES-843 and TAIHU98. A large number of methylation motifs and R-M genes were identified in these strains, which differ substantially among different strains. Of the 35 motifs identified, eighteen had not previously been reported. Strain NIES-843 contains a larger number of total putative methyltransferase genes than have been reported previously from any bacterial genome. Genomic comparisons reveal that methyltransferases (some partial) may have been acquired from the environment through horizontal gene transfer. Copyright © 2018 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of Yersinia ruckeri strain CSF007-82, etiologic agent of red mouth disease in salmonid fish.

We present the complete, closed, and finished chromosomal and extrachromosomal genome sequences of Yersinia ruckeri strain CSF007-82, the etiologic agent of enteric red mouth disease in salmonid fish. The chromosome is 3,799,036 bp with a G+C content of 47.5% and encodes 3,530 predicted coding sequences (CDS), 7 ribosomal operons, and 80 tRNAs. Copyright © 2015 Nelson et al.


July 7, 2019

Complete genome sequence of the fish pathogen Flavobacterium psychrophilum ATCC 49418(T.).

Flavobacterium psychrophilum is the causative agent of bacterial cold water disease and rainbow trout fry mortality syndrome in salmonid fishes and is associated with significant losses in the aquaculture industry. The virulence factors and molecular mechanisms of pathogenesis of F. psychrophilum are poorly understood. Moreover, at the present time, there are no effective vaccines and control using antimicrobial agents is problematic due to growing antimicrobial resistance and the fact that sick fish don’t eat. In the hopes of identifying vaccine and therapeutic targets, we sequenced the genome of the type strain ATCC 49418 which was isolated from the kidney of a Coho salmon (Oncorhychus kisutch) in Washington State (U.S.A.) in 1989. The genome is 2,715,909 bp with a G+C content of 32.75%. It contains 6 rRNA operons, 49 tRNA genes, and is predicted to encode 2,329 proteins.


July 7, 2019

Do echinoderm genomes measure up?

Echinoderm genome sequences are a corpus of useful information about a clade of animals that serve as research models in fields ranging from marine ecology to cell and developmental biology. Genomic information from echinoids has contributed to insights into the gene interactions that drive the developmental process at the molecular level. Such insights often rely heavily on genomic information and the kinds of questions that can be asked thus depend on the quality of the sequence information. Here we describe the history of echinoderm genomic sequence assembly and present details about the quality of the data obtained. All of the sequence information discussed here is posted on the echinoderm information web system, Echinobase.org. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Draft genome sequence of Pseudoalteromonas luteoviolacea HI1, determined using Roche 454 and PacBio single-molecule real-time hybrid sequencing.

We report here the 6.0-Mb draft genome assembly of Pseudoalteromonas luteoviolacea strain HI1 using Roche 454 and PacBio single-molecule real-time hybrid-sequencing analysis. This strain is of biological importance since it has the capacity to induce the settlement and metamorphosis of the serpulid polychaete Hydroides elegans and the coral Pocillopora damicornis. Copyright © 2015 Asahina and Hadfield.


July 7, 2019

Genome sequence of Polycyclovorans algicola strain TG408, an obligate polycyclic aromatic hydrocarbon-degrading bacterium associated with marine eukaryotic phytoplankton.

Polycyclovorans algicola strain TG408 is a recently discovered bacterium associated with marine eukaryotic phytoplankton and exhibits the ability to utilize polycyclic aromatic hydrocarbons (PAHs) almost exclusively as sole sources of carbon and energy. Here, we present the genome sequence of this strain, which is 3,653,213 bp, with 3,477 genes and an average G+C content of 63.8%. Copyright © 2015 Gutierrez et al.


July 7, 2019

Saccharina genomes provide novel insight into kelp biology.

Seaweeds are essential for marine ecosystems and have immense economic value. Here we present a comprehensive analysis of the draft genome of Saccharina japonica, one of the most economically important seaweeds. The 537-Mb assembled genomic sequence covered 98.5% of the estimated genome, and 18,733 protein-coding genes are predicted and annotated. Gene families related to cell wall synthesis, halogen concentration, development and defence systems were expanded. Functional diversification of the mannuronan C-5-epimerase and haloperoxidase gene families provides insight into the evolutionary adaptation of polysaccharide biosynthesis and iodine antioxidation. Additional sequencing of seven cultivars and nine wild individuals reveal that the genetic diversity within wild populations is greater than among cultivars. All of the cultivars are descendants of a wild S. japonica accession showing limited admixture with S. longissima. This study represents an important advance toward improving yields and economic traits in Saccharina and provides an invaluable resource for plant genome studies.


July 7, 2019

Complete genome of Jeotgalibacillus malaysiensis D5(T) consisting of a chromosome and a circular megaplasmid.

Jeotgalibacillus spp. are halophilic bacteria within the family Planococcaceae. No genomes of Jeotgalibacillus spp. have been reported to date, and their metabolic pathways are unknown. How the bacteria survive in hypertonic conditions such as seawater is yet to be discovered. As only few studies have been conducted on Jeotgalibacillus spp., potential applications of these bacteria are unknown. Here, we present the complete genome of J. malaysiensis D5(T) (=DSM 28777(T) =KCTC 33350(T)), which is invaluable in identifying interesting applications for this genus. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Complete genome sequence of Haloarcula sp. CBA1115 isolated from non-purified solar salts.

Haloarcula sp. CBA1115, isolated from non-purified solar salts from South Korea, is a halophilic archaeon belonging to the family Halobacteriaceae. Here, we present the complete genome sequence of the strain Haloarcula sp. CBA1115 (4,225,046bp, with a G+C content of 61.98%), which is distributed over one chromosome and five plasmids. A comparison of the genome sequence of Haloarcula sp. CBA1115 with those of members of its closely related taxa showed that the closest neighbor is Haloarcula hispanica Y27, a popular model organism for archaeal studies. The strain was found to possess a number of genes predicted to be involved in osmo-regulatory strategies and metal regulation, suggesting that it might be useful for bioremediation in extreme environments. Copyright © 2015 Elsevier B.V. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.