Menu
April 21, 2020

A comprehensive evaluation of long read error correction methods

Motivation: Third-generation sequencing technologies can sequence long reads, which is advancing the frontiers of genomics research. However, their high error rates prohibit accurate and efficient downstream analysis. This difficulty has motivated the development of many long read error correction tools, which tackle this problem through sampling redundancy and/or leveraging accurate short reads of the same biological samples. Existing studies to asses these tools use simulated data sets, and are not sufficiently comprehensive in the range of software covered or diversity of evaluation measures used. Results: In this paper, we present a categorization and review of long read error correction methods, and provide a comprehensive evaluation of the corresponding long read error correction tools. Leveraging recent real sequencing data, we establish benchmark data sets and set up evaluation criteria for a comparative assessment which includes quality of error correction as well as run-time and memory usage. We study how trimming and long read sequencing depth affect error correction in terms of length distribution and genome coverage post-correction, and the impact of error correction performance on an important application of long reads, genome assembly. We provide guidelines for practitioners for choosing among the available error correction tools and identify directions for future research.


April 21, 2020

Genome sequence analysis of 91 Salmonella Enteritidis isolates from mice caught on poultry farms in the mid 1990s.

A total of 91 draft genome sequences were used to analyze isolates of Salmonella enterica serovar Enteritidis obtained from feral mice caught on poultry farms in Pennsylvania. One objective was to find mutations disrupting open reading frames (ORFs) and another was to determine if ORF-disruptive mutations were present in isolates obtained from other sources. A total of 83 mice were obtained between 1995-1998. Isolates separated into two genomic clades and 12 subgroups due to 742 mutations. Nineteen ORF-disruptive mutations were found, and in addition, bigA had exceptional heterogeneity requiring additional evaluation. The TRAMS algorithm detected only 6 ORF disruptions. The sefD mutation was the most frequently encountered mutation and it was prevalent in human, poultry, environmental and mouse isolates. These results confirm previous assessments of the mouse as a rich source of Salmonella enterica serovar Enteritidis that varies in genotype and phenotype. Copyright © 2019. Published by Elsevier Inc.


April 21, 2020

Rapid transcriptional responses to serum exposure are associated with sensitivity and resistance to antibody-mediated complement killing in invasive Salmonella Typhimurium ST313

Background: Salmonella Typhimurium ST313 exhibits signatures of adaptation to invasive human infection, including higher resistance to humoral immune responses than gastrointestinal isolates. Full resistance to antibody-mediated complement killing (serum resistance) among nontyphoidal Salmonellae is uncommon, but selection of highly resistant strains could compromise vaccine-induced antibody immunity. Here, we address the hypothesis that serum resistance is due to a distinct genotype or transcriptome response in S. Typhimurium ST313.


April 21, 2020

Draft Genome Sequence of Streptomyces sp. Strain RKND-216, an Antibiotic Producer Isolated from Marine Sediment in Prince Edward Island, Canada.

Streptomyces sp. strain RKND-216 was isolated from marine sediment collected in Prince Edward Island, Canada, and produces a putatively novel bioactive natural product with antitubercular activity. The genome assembly consists of two contigs covering 5.61?Mb. Genome annotation identified 4,618 predicted protein-coding sequences and 19 predicted natural product biosynthetic gene clusters.Copyright © 2019 Liang et al.


April 21, 2020

Genome Sequences and Methylation Patterns of Natrinema versiforme BOL5-4 and Natrinema pallidum BOL6-1, Two Extremely Halophilic Archaea from a Bolivian Salt Mine.

Two extremely halophilic archaea, namely, Natrinema versiforme BOL5-4 and Natrinema pallidum BOL6-1, were isolated from a Bolivian salt mine and their genomes sequenced using single-molecule real-time sequencing. The GC-rich genomes of BOL5-4 and BOL6-1 were 4.6 and 3.8 Mbp, respectively, with large chromosomes and multiple megaplasmids. Genome annotation was incorporated into HaloWeb and methylation patterns incorporated into REBASE.Copyright © 2019 DasSarma et al.


April 21, 2020

Complete Genome Sequence of Leptospira kmetyi LS 001/16, Isolated from a Soil Sample Associated with a Leptospirosis Patient in Kelantan, Malaysia.

The Gram-negative pathogenic spirochetal bacteria Leptospira spp. cause leptospirosis in humans and livestock animals. Leptospira kmetyi strain LS 001/16 was isolated from a soil sample associated with a leptospirosis patient in Kelantan, which is among the states in Malaysia with a high reported number of disease cases. Here, we report the complete genome sequence of Leptospira kmetyi strain LS 001/16. Copyright © 2019 Yusof et al.


April 21, 2020

The Genome Sequence of the Halobacterium salinarum Type Strain Is Closely Related to That of Laboratory Strains NRC-1 and R1.

High-coverage long-read sequencing of the Halobacterium salinarum type strain (91-R6) revealed a 2.17-Mb chromosome and two large plasmids (148 and 102 kb). Population heterogeneity and long repeats were observed. Strain 91-R6 and laboratory strain R1 showed 99.63% sequence identity in common chromosomal regions and only 38 strain-specific segments. This information resolves the previously uncertain relationship between type and laboratory strains.Copyright © 2019 Pfeiffer et al.


April 21, 2020

Complete Whole-Genome Sequences of Two Raoultella terrigena Strains, NCTC 13097 and NCTC 13098, Isolated from Human Cases.

Raoultella terrigena is a bacterial species associated with soil and aquatic environments; however, sporadic cases of opportunistic disease in humans have been reported. Here, we report the first two complete genome sequences from clinical strains isolated from human sources that have been deposited in the National Collection of Type Cultures (NCTC). © Crown copyright 2019.


April 21, 2020

Complete Genome Sequence of Leuconostoc kimchii Strain NKJ218, Isolated from Homemade Kimchi.

Leuconostoc kimchii strain NKJ218 was isolated from homemade kimchi in South Korea. The whole genome was sequenced using the PacBio RS II and Illumina NovoSeq 6000 platforms. Here, we report a genome sequence of strain NKJ218, which consists of a 1.9-Mbp chromosome and three plasmid contigs. A total of 2,005 coding sequences (CDS) were predicted, including 1,881 protein-coding sequences.Copyright © 2019 Jung et al.


April 21, 2020

Complete Genome Sequences of Two USA300-Related Community-Associated Methicillin-Resistant Staphylococcus aureus Clinical Isolates.

USA300 is a predominant community-associated methicillin-resistant Staphylococcus aureus strain causing significant morbidity and mortality in North America. We present the full annotated genome sequences of two methicillin-resistant Staphylococcus aureus isolates related to the USA300 pulsotype with the goal of studying the evolutionary relationships of this highly successful strain type.Copyright © 2019 McClure and Zhang.


April 21, 2020

Genomic Islands in the Full-Genome Sequence of an NAD-Hemin-Independent Avibacterium paragallinarum Strain Isolated from Peru.

Here, we report the full-genome sequence of an NAD-hemin-independent Avibacterium paragallinarum serovar C-2 strain, FARPER-174, isolated from layer hens in Peru. This genome contained 12 potential genomic islands that include ribosomal protein-coding genes, a nadR gene, hemocin-coding genes, sequences of fagos, an rtx operon, and drug resistance genes. Copyright © 2019 Tataje-Lavanda et al.


April 21, 2020

Whole Genome Sequencing and Analysis of Chlorimuron-Ethyl Degrading Bacteria Klebsiella pneumoniae 2N3.

Klebsiella pneumoniae 2N3 is a strain of gram-negative bacteria that can degrade chlorimuron-ethyl and grow with chlorimuron-ethyl as the sole nitrogen source. The complete genome of Klebsiella pneumoniae 2N3 was sequenced using third generation high-throughput DNA sequencing technology. The genomic size of strain 2N3 was 5.32 Mb with a GC content of 57.33% and a total of 5156 coding genes and 112 non-coding RNAs predicted. Two hydrolases expressed by open reading frames (ORFs) 0934 and 0492 were predicted and experimentally confirmed by gene knockout to be involved in the degradation of chlorimuron-ethyl. Strains of ?ORF 0934, ?ORF 0492, and wild type (WT) reached their highest growth rates after 8-10 hours in incubation. The degradation rates of chlorimuron-ethyl by both ?ORF 0934 and ?ORF 0492 decreased in comparison to the WT during the first 8 hours in culture by 25.60% and 24.74%, respectively, while strains ?ORF 0934, ?ORF 0492, and the WT reached the highest degradation rates of chlorimuron-ethyl in 36 hours of 74.56%, 90.53%, and 95.06%, respectively. This study provides scientific evidence to support the application of Klebsiella pneumoniae 2N3 in bioremediation to control environmental pollution.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.