Menu
July 7, 2019

DNA extraction protocols for whole-genome sequencing in marine organisms.

The marine environment harbors a large proportion of the total biodiversity on this planet, including the majority of the earths’ different phyla and classes. Studying the genomes of marine organisms can bring interesting insights into genome evolution. Today, almost all marine organismal groups are understudied with respect to their genomes. One potential reason is that extraction of high-quality DNA in sufficient amounts is challenging for many marine species. This is due to high polysaccharide content, polyphenols and other secondary metabolites that will inhibit downstream DNA library preparations. Consequently, protocols developed for vertebrates and plants do not always perform well for invertebrates and algae. In addition, many marine species have large population sizes and, as a consequence, highly variable genomes. Thus, to facilitate the sequence read assembly process during genome sequencing, it is desirable to obtain enough DNA from a single individual, which is a challenge in many species of invertebrates and algae. Here, we present DNA extraction protocols for seven marine species (four invertebrates, two algae, and a marine yeast), optimized to provide sufficient DNA quality and yield for de novo genome sequencing projects.


July 7, 2019

Spontaneous chloroplast mutants mostly occur by replication slippage and show a biased pattern in the plastome of Oenothera.

Spontaneous plastome mutants have been used as a research tool since the beginning of genetics. However, technical restrictions have severely limited their contributions to research in physiology and molecular biology. Here, we used full plastome sequencing to systematically characterize a collection of 51 spontaneous chloroplast mutants in Oenothera (evening primrose). Most mutants carry only a single mutation. Unexpectedly, the vast majority of mutations do not represent single nucleotide polymorphisms but are insertions/deletions originating from DNA replication slippage events. Only very few mutations appear to be caused by imprecise double-strand break repair, nucleotide misincorporation during replication, or incorrect nucleotide excision repair following oxidative damage. U-turn inversions were not detected. Replication slippage is induced at repetitive sequences that can be very small and tend to have high A/T content. Interestingly, the mutations are not distributed randomly in the genome. The underrepresentation of mutations caused by faulty double-strand break repair might explain the high structural conservation of seed plant plastomes throughout evolution. In addition to providing a fully characterized mutant collection for future research on plastid genetics, gene expression, and photosynthesis, our work identified the spectrum of spontaneous mutations in plastids and reveals that this spectrum is very different from that in the nucleus.© 2016 American Society of Plant Biologists. All rights reserved.


July 7, 2019

Transfer of the potato plant isolates of Pectobacterium wasabiae to Pectobacterium parmentieri sp. nov.

Pectobacterium wasabiae was originally isolated from Japanese horseradish (Eutrema wasabi), but recently some Pectobacterium isolates collected from potato plants and tubers displaying blackleg and soft rot symptoms were also assigned to P. wasabiae. Here, combining genomic and phenotypical data, we re-evaluated their taxonomic position. PacBio and Illumina technologies were used to complete the genome sequences of P. wasabiae CFBP 3304T and RNS 08-42-1A. Multi-locus sequence analysis showed that the P. wasabiae strains RNS 08-42-1A, SCC3193, CFIA1002 and WPP163, which were collected from potato plant environment, constituted a separate clade from the original Japanese horseradish P. wasabiae. The taxonomic position of these strains was also supported by calculation of the in-silico DNA-DNA hybridization, genome average nucleotide indentity, alignment fraction and average nucleotide indentity values. In addition, they were phenotypically distinguished from P. wasabiae strains by producing acids from (+)-raffinose, a-d(+)-a-lactose, d(+)-galactose and (+)-melibiose but not from methyl a-d-glycopyranoside, (+)-maltose or malonic acid. The name Pectobacterium parmentieri sp. nov. is proposed for this taxon; the type strain is RNS 08-42-1AT (=CFBP 8475T=LMG 29774T).


July 7, 2019

Characterization of tet(Y)-carrying LowGC plasmids exogenously captured from cow manure at a conventional dairy farm.

Manure from dairy farms has been shown to contain diverse tetracycline resistance genes that are transferable to soil. Here, we focus on conjugative plasmids that may spread tetracycline resistance at a conventional dairy farm. We performed exogenous plasmid isolation from cattle feces using chlortetracycline for transconjugant selection. The transconjugants obtained harbored LowGC-type plasmids and tet(Y). A representative plasmid (pFK2-7) was fully sequenced and this was compared with previously described LowGC plasmids from piggery manure-treated soil and a GenBank record from Acinetobacter nosocomialis that we also identified as a LowGC plasmid. The pFK2-7 plasmid had the conservative backbone typical of LowGC plasmids, though this region was interrupted with an insert containing the tet(Y)-tet(R) tetracycline resistance genes and the strA-strB streptomycin resistance genes. Despite Acinetobacter populations being considered natural hosts of LowGC plasmids, these plasmids were not found in three Acinetobacter isolates from the study farm. The isolates harbored tet(Y)-tet(R) genes in identical genetic surroundings as pFK2-7, however, suggesting genetic exchange between Acinetobacter and LowGC plasmids. Abundance of LowGC plasmids and tet(Y) was correlated in manure and soil samples from the farm, indicating that LowGC plasmids may be involved in the spread of tet(Y) in the environment.© FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Genomic insights into a sustained national outbreak of Yersinia pseudotuberculosis.

In 2014, a sustained outbreak of yersiniosis due to Yersinia pseudotuberculosis occurred across all major cities in New Zealand (NZ), with a total of 220 laboratory-confirmed cases, representing one of the largest ever reported outbreaks of Y. pseudotuberculosis. Here, we performed whole genome sequencing of outbreak-associated isolates to produce the largest population analysis to date of Y. pseudotuberculosis, giving us unprecedented capacity to understand the emergence and evolution of the outbreak clone. Multivariate analysis incorporating our genomic and clinical epidemiological data strongly suggested a single point-source contamination of the food chain, with subsequent nationwide distribution of contaminated produce. We additionally uncovered significant diversity in key determinants of virulence, which we speculate may help explain the high morbidity linked to this outbreak.


July 7, 2019

Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes.

Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits.© The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Complete genome sequence of Brevibacterium linens BS258, a potential marine Actinobacterium for environmental remediation via microbially induced calcite precipitation

Brevibacterium linens BS258 is a urease positive actinobacterium isolated from marine sediment of China Yellow Sea, which demonstrated to have strong capability of calcite precipitation and bioremediation of heavy metal pollution. Here, we report the complete genome sequence of this strain, which might provide a lot of valuable information for environmental remediation, wastewater treatment and atmospheric CO2 sequestration.


July 7, 2019

Susan Celniker: Foundational resources to study a dynamic genome.

The Genetics Society of America’s George W. Beadle Award honors individuals who have made outstanding contributions to the community of genetics researchers and who exemplify the qualities of its namesake. The 2016 recipient, Susan E. Celniker, played a key role in the sequencing, annotation, and characterization of the Drosophila genome. She participated in early sequencing efforts at the Lawrence Berkeley National Laboratory and led the modENCODE Fly Transcriptome Consortium. Her efforts were critical to ensuring that the Drosophila genome was well-annotated, making it one of the best curated animal genomes available. As the Principal Investigator for the BDGP, Celniker has enabled the study of proteomes by creating a collection of over 13,000 clones that match annotated genes for protein expression in cells or transgenic flies, and she has established the most comprehensive spatial gene expression atlas in any organism, with in situ imaging of more than 80% of the Drosophila protein-coding transcriptome through embryogenesis. In addition to providing the research community with these invaluable resources and reagents, she continues to develop new tools and datasets for genetics researchers to explore the spatial and temporal control of gene expression.


July 7, 2019

Exploiting next-generation sequencing to solve the haplotyping puzzle in polyploids: a simulation study.

Haplotypes are the units of inheritance in an organism, and many genetic analyses depend on their precise determination. Methods for haplotyping single individuals use the phasing information available in next-generation sequencing reads, by matching overlapping single-nucleotide polymorphisms while penalizing post hoc nucleotide corrections made. Haplotyping diploids is relatively easy, but the complexity of the problem increases drastically for polyploid genomes, which are found in both model organisms and in economically relevant plant and animal species. Although a number of tools are available for haplotyping polyploids, the effects of the genomic makeup and the sequencing strategy followed on the accuracy of these methods have hitherto not been thoroughly evaluated.We developed the simulation pipeline haplosim to evaluate the performance of three haplotype estimation algorithms for polyploids: HapCompass, HapTree and SDhaP, in settings varying in sequencing approach, ploidy levels and genomic diversity, using tetraploid potato as the model. Our results show that sequencing depth is the major determinant of haplotype estimation quality, that 1?kb PacBio circular consensus sequencing reads and Illumina reads with large insert-sizes are competitive and that all methods fail to produce good haplotypes when ploidy levels increase. Comparing the three methods, HapTree produces the most accurate estimates, but also consumes the most resources. There is clearly room for improvement in polyploid haplotyping algorithms.


July 7, 2019

Complete genome sequence of Marivivens sp. JLT3646, a potential aromatic compound degrader

Marivivens sp. JLT3646 (CGMCC 1.15778), belonging to the phylum Alphaproteobacteria, was isolated from seawater, Kueishan Islet, offshore northeast of Taiwan. Here, we present the complete genome sequence of Marivivens sp. JLT3646, which contains a circular 2,978,145 bp chromosome with 56.2% G + C content, and one circular plasmid which is 169,066 bp in length. The genome data suggested that Marivivens sp. JLT3646 has the potential to degrade aromatic monomers, which might provide insight into biotechnological applications and facilitate the investigation of environmental bioremediation.


July 7, 2019

Microbial sequence typing in the genomic era.

Next-generation sequencing (NGS), also known as high-throughput sequencing, is changing the field of microbial genomics research. NGS allows for a more comprehensive analysis of the diversity, structure and composition of microbial genes and genomes compared to the traditional automated Sanger capillary sequencing at a lower cost. NGS strategies have expanded the versatility of standard and widely used typing approaches based on nucleotide variation in several hundred DNA sequences and a few gene fragments (MLST, MLVA, rMLST and cgMLST). NGS can now accommodate variation in thousands or millions of sequences from selected amplicons to full genomes (WGS, NGMLST and HiMLST). To extract signals from high-dimensional NGS data and make valid statistical inferences, novel analytic and statistical techniques are needed. In this review, we describe standard and new approaches for microbial sequence typing at gene and genome levels and guidelines for subsequent analysis, including methods and computational frameworks. We also present several applications of these approaches to some disciplines, namely genotyping, phylogenetics and molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Genomic insights into Photobacterium damselae subsp. damselae strain KC-Na-1, isolated from the finless porpoise (Neophocaena asiaeorientalis)

Photobacterium damselae subsp. damselae (PDD) is a marine bacterium that can infect a variety of marine animals and humans. Although this bacterium has been isolated from several stranded dolphins and whales, its pathogenic role in cetaceans is still unclear. In this study, we report the complete genome of PDD strain KC-Na-1 isolated from a finless porpoise (Neophocaena asiaeorientalis) rescued from the South Sea (Republic of Korea). The sequenced genome comprised two chromosomes and four plasmids. Among the recently identified major virulence factors in PDD, only phospholipase (plpV) was found in strain KC-Na-1. Interestingly, two genes homologous to Vibrio thermostable direct hemolysin (tdh) and its transcriptional regulator toxR, which are known virulence factors associated with Vibrio parahaemolyticus, were encoded on the plasmid pPDD-Na-1-3. Based on these results, strain KC-Na-1 may have potential pathogenicity in humans and other marine animals and also could act as a potential virulent strain. To the best of our knowledge, this is the first report of the complete genome sequence of P. damselae.


July 7, 2019

New high copy tandem repeat in the content of the chicken W chromosome.

The content of repetitive DNA in avian genomes is considerably less than in other investigated vertebrates. The first descriptions of tandem repeats were based on the results of routine biochemical and molecular biological experiments. Both satellite DNA and interspersed repetitive elements were annotated using library-based approach and de novo repeat identification in assembled genome. The development of deep-sequencing methods provides datasets of high quality without preassembly allowing one to annotate repetitive elements from unassembled part of genomes. In this work, we search the chicken assembly and annotate high copy number tandem repeats from unassembled short raw reads. Tandem repeat (GGAAA)n has been identified and found to be the second after telomeric repeat (TTAGGG)n most abundant in the chicken genome. Furthermore, (GGAAA)n repeat forms expanded arrays on the both arms of the chicken W chromosome. Our results highlight the complexity of repetitive sequences and update data about organization of sex W chromosome in chicken.


July 7, 2019

Comparative genomic analysis of Lactobacillus plantarum GB-LP4 and identification of evolutionarily divergent genes in high-osmolarity environment.

Lactobacillus plantarum is one of the widely-used probiotics and there have been a large number of advanced researches on the effectiveness of this species. However, the difference between previously reported plantarum strains, and the source of genomic variation among the strains were not clearly specified. In order to understand further on the molecular basis of L. plantarum on Korean traditional fermentation, we isolated the L. plantarum GB-LP4 from Korean fermented vegetable and conducted whole genome assembly. With comparative genomics approach, we identified the candidate genes that are expected to have undergone evolutionary acceleration. These genes have been reported to associate with the maintaining homeostasis, which are generally known to overcome instability in external environment including low pH or high osmotic pressure. Here, our results provide an evolutionary relationship between L. plantarum species and elucidate the candidate genes that play a pivotal role in evolutionary acceleration of GB-LP4 in high osmolarity environment. This study may provide guidance for further studies on L. plantarum.


July 7, 2019

Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D.

Completion of eukaryal genomes can be difficult task with the highly repetitive sequences along the chromosomes and short read lengths of second-generation sequencing. Saccharomyces cerevisiae strain CEN.PK113-7D, widely used as a model organism and a cell factory, was selected for this study to demonstrate the superior capability of very long sequence reads for de novo genome assembly. We generated long reads using two common third-generation sequencing technologies (Oxford Nanopore Technology (ONT) and Pacific Biosciences (PacBio)) and used short reads obtained using Illumina sequencing for error correction. Assembly of the reads derived from all three technologies resulted in complete sequences for all 16 yeast chromosomes, as well as the mitochondrial chromosome, in one step. Further, we identified three types of DNA methylation (5mC, 4mC and 6mA). Comparison between the reference strain S288C and strain CEN.PK113-7D identified chromosomal rearrangements against a background of similar gene content between the two strains. We identified full-length transcripts through ONT direct RNA sequencing technology. This allows for the identification of transcriptional landscapes, including untranslated regions (UTRs) (5′ UTR and 3′ UTR) as well as differential gene expression quantification. About 91% of the predicted transcripts could be consistently detected across biological replicates grown either on glucose or ethanol. Direct RNA sequencing identified many polyadenylated non-coding RNAs, rRNAs, telomere-RNA, long non-coding RNA and antisense RNA. This work demonstrates a strategy to obtain complete genome sequences and transcriptional landscapes that can be applied to other eukaryal organisms.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.