Menu
July 7, 2019

RIFRAF: a frame-resolving consensus algorithm.

Protein coding genes can be studied using long-read next generation sequencing. However, high rates of indel sequencing errors are problematic, corrupting the reading frame. Even the consensus of multiple independent sequence reads retains indel errors. To solve this problem, we introduce Reference-Informed Frame-Resolving multiple-Alignment Free template inference algorithm (RIFRAF), a sequence consensus algorithm that takes a set of error-prone reads and a reference sequence and infers an accurate in-frame consensus. RIFRAF uses a novel structure, analogous to a two-layer hidden Markov model: the consensus is optimized to maximize alignment scores with both the set of noisy reads and with a reference. The template-to-reads component of the model encodes the preponderance of indels, and is sensitive to the per-base quality scores, giving greater weight to more accurate bases. The reference-to-template component of the model penalizes frame-destroying indels. A local search algorithm proceeds in stages to find the best consensus sequence for both objectives.Using Pacific Biosciences SMRT sequences from an HIV-1 env clone, NL4-3, we compare our approach to other consensus and frame correction methods. RIFRAF consistently finds a consensus sequence that is more accurate and in-frame, especially with small numbers of reads. It was able to perfectly reconstruct over 80% of consensus sequences from as few as three reads, whereas the best alternative required twice as many. RIFRAF is able to achieve these results and keep the consensus in-frame even with a distantly related reference sequence. Moreover, unlike other frame correction methods, RIFRAF can detect and keep true indels while removing erroneous ones.RIFRAF is implemented in Julia, and source code is publicly available at https://github.com/MurrellGroup/Rifraf.jl.Supplementary data are available at Bioinformatics online.


July 7, 2019

Isolation and identification of an anthracimycin analogue from Nocardiopsis kunsanensis, a halophile from a saltern, by genomic mining strategy.

Modern medicine is unthinkable without antibiotics; yet, growing issues with microbial drug resistance require intensified search for new active compounds. Natural products generated by Actinobacteria have been a rich source of candidate antibiotics, for example anthracimycin that, so far, is only known to be produced by Streptomyces species. Based on sequence similarity with the respective biosynthetic cluster, we sifted through available microbial genome data with the goal to find alternative anthracimycin-producing organisms. In this work, we report about the prediction and experimental verification of the production of anthracimycin derivatives by Nocardiopsis kunsanensis, a non-Streptomyces actinobacterial microorganism. We discovered N. kunsanensis to predominantly produce a new anthracimycin derivative with methyl group at C-8 and none at C-2, labeled anthracimycin BII-2619, besides a minor amount of anthracimycin. It displays activity against Gram-positive bacteria with similar low level of mammalian cytotoxicity as that of anthracimycin.


July 7, 2019

The draft genome of the lichen-forming fungus Lasallia hispanica (Frey) Sancho & A. Crespo

Lasallia hispanica (Frey) Sancho & A. Crespo is one of three Lasallia species occurring in central-western Europe. It is an orophytic, photophilous Mediterranean endemic which is sympatric with the closely related, widely distributed, highly clonal sister taxon L. pustulata in the supra- and oro-Mediterranean belts. We sequenced the genome of L. hispanica from a multispore isolate. The total genome length is 41·2 Mb, including 8488 gene models. We present the annotation of a variety of genes that are involved in protein secretion, mating processes and secondary metabolism, and we report transposable elements. Additionally, we compared the genome of L. hispanica to the closely related, yet ecologically distant, L. pustulata and found high synteny in gene content and order. The newly assembled and annotated L. hispanica genome represents a useful resource for future investigations into niche differentiation, speciation and microevolution in L. hispanica and other members of the genus.


July 7, 2019

Auroramycin, a potent antibiotic from Streptomyces roseosporus by CRISPR-Cas9 activation.

Silent biosynthetic gene clusters represent a potentially rich source for new bioactive compounds. We report the discovery, characterization and biosynthesis of a novel doubly glycosylated 24-membered polyene macrolactam from a silent biosynthetic gene cluster in Streptomyces roseosporus using the CRISPR-Cas9 gene cluster activation strategy. Structural characterization of this polyketide, named auroramycin, revealed a rare isobutyrylmalonyl extender unit and a unique pair of aminosugars. Relative and absolute stereochemistry were determined using a combination of spectroscopic analyses, chemical derivatization, and computational analysis. The activated gene cluster for auroramycin production was also verified by transcriptional analyses and gene deletions. Finally, auroramycin exhibited potent anti-methicillin-resistant Staphylococcus aureus (anti-MRSA) activity towards clinical drug-resistant isolates.© 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


July 7, 2019

Complete genome sequences of three Campylobacter jejuni phage-propagating strains.

Bacteriophage therapy can potentially reduce Campylobacter jejuni numbers in livestock, but it requires a detailed understanding of phage-host interactions. C. jejuni strains readily infected by certain phages are designated as phage-propagating strains. Here, we report the complete genome sequences of three such strains, NCTC 12660, NCTC 12661, and NCTC 12664. Copyright © 2018 Sacher et al.


July 7, 2019

Complete genome sequence of a novel mutant strain of Vibrio parahaemolyticus from Pacific White Shrimp (Penaeus vannamei).

The acute hepatopancreatic necrosis disease (AHPND) of Penaeus vannamei shrimp is caused by Vibrio parahaemolyticus carrying toxin genes, pirA and pirB We report the complete genome sequence of the novel V. parahaemolyticus strain R14, which did not display AHPND symptoms in P. vannamei despite containing the binary toxin genes. Copyright © 2018 Kanrar and Dhar.


July 7, 2019

Complete genome sequence of Bacillus subtilis strain DKU_NT_02, isolated from traditional Korean food using soybean (chung-gook-jang) for high-quality poly-?-glutamic acid activity.

The complete genome sequence of Bacillus subtilis strain DKU_NT_02, isolated from traditional Korean food using soybeans (chung-gook-jang), is presented here. This strain was chosen to help identify genetic factors with high-quality poly-?-glutamic acid (?PGA) activity. Copyright © 2018 Bang et al.


July 7, 2019

Complete genome sequences of three Bacillus amyloliquefaciens strains that inhibit the growth of Listeria monocytogenes in vitro.

Here, we report the complete genome sequences of three Bacillus amyloliquefaciens strains isolated from alfalfa, almond drupes, and grapes that inhibited the growth of Listeria monocytogenes strain 2011L-2857 in vitro We also report multiple gene clusters encoding secondary metabolites that may be responsible for the growth inhibition of L. monocytogenes. Copyright © 2018 Tran et al.


July 7, 2019

Genome sequences of Shewanella baltica and Shewanella morhuae strains isolated from the gastrointestinal tract of freshwater fish.

We present here the genome sequences of Shewanella baltica strain CW2 and Shewanella morhuae strain CW7, isolated from the gastrointestinal tract of Salvelinus namaycush (lean lake trout) and Coregonus clupeaformis (whitefish), respectively. These genome sequences provide insights into the niche adaptation of these specific species in freshwater systems. Copyright © 2018 Castillo et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.