Menu
July 7, 2019

Complete genome sequences of three Bacillus amyloliquefaciens strains that inhibit the growth of Listeria monocytogenes in vitro.

Here, we report the complete genome sequences of three Bacillus amyloliquefaciens strains isolated from alfalfa, almond drupes, and grapes that inhibited the growth of Listeria monocytogenes strain 2011L-2857 in vitro We also report multiple gene clusters encoding secondary metabolites that may be responsible for the growth inhibition of L. monocytogenes. Copyright © 2018 Tran et al.


July 7, 2019

Complete genome sequence of Mycobacterium shigaense.

Mycobacterium shigaense is a slowly growing scotochromogenic species and a member of the Mycobacterium simiae complex group. Here, we report the complete sequence of its genome, comprising a 5.2-Mb chromosome. The sequence will represent the essential data for future phylogenetic and comparative genome studies of the Mycobacterium simiae complex group. Copyright © 2018 Yoshida et al.


July 7, 2019

ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.

The long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-based mapping approach. The kmer mapping strategy has several advantages over read alignment methods, including better usability and faster processing, as it precludes the need for input sequence formatting and draft sequence assembly indexing. The reliance on kmers instead of read alignments for pairing sequences relaxes the workflow requirements, and drastically reduces the run time.Here, we show how linked reads, when used in conjunction with Hi-C data for scaffolding, improve a draft human genome assembly of PacBio long-read data five-fold (baseline vs. ARKS NG50?=?4.6 vs. 23.1 Mbp, respectively). We also demonstrate how the method provides further improvements of a megabase-scale Supernova human genome assembly (NG50?=?14.74 Mbp vs. 25.94 Mbp before and after ARKS), which itself exclusively uses linked read data for assembly, with an execution speed six to nine times faster than competitive linked read scaffolders (~?10.5 h compared to 75.7 h, on average). Following ARKS scaffolding of a human genome 10xG Supernova assembly (of cell line NA12878), fewer than 9 scaffolds cover each chromosome, except the largest (chromosome 1, n?=?13).ARKS uses a kmer mapping strategy instead of linked read alignments to record and associate the barcode information needed to order and orient draft assembly sequences. The simplified workflow, when compared to that of our initial implementation, ARCS, markedly improves run time performances on experimental human genome datasets. Furthermore, the novel distance estimator in ARKS utilizes barcoding information from linked reads to estimate gap sizes. It accomplishes this by modeling the relationship between known distances of a region within contigs and calculating associated Jaccard indices. ARKS has the potential to provide correct, chromosome-scale genome assemblies, promptly. We expect ARKS to have broad utility in helping refine draft genomes.


July 7, 2019

The challenge of analyzing the sugarcane genome.

Reference genome sequences have become key platforms for genetics and breeding of the major crop species. Sugarcane is probably the largest crop produced in the world (in weight of crop harvested) but lacks a reference genome sequence. Sugarcane has one of the most complex genomes in crop plants due to the extreme level of polyploidy. The genome of modern sugarcane hybrids includes sub-genomes from two progenitors Saccharum officinarum and S. spontaneum with some chromosomes resulting from recombination between these sub-genomes. Advancing DNA sequencing technologies and strategies for genome assembly are making the sugarcane genome more tractable. Advances in long read sequencing have allowed the generation of a more complete set of sugarcane gene transcripts. This is supporting transcript profiling in genetic research. The progenitor genomes are being sequenced. A monoploid coverage of the hybrid genome has been obtained by sequencing BAC clones that cover the gene space of the closely related sorghum genome. The complete polyploid genome is now being sequenced and assembled. The emerging genome will allow comparison of related genomes and increase understanding of the functioning of this polyploidy system. Sugarcane breeding for traditional sugar and new energy and biomaterial uses will be enhanced by the availability of these genomic resources.


July 7, 2019

Assembly of a complete genome sequence for Gemmata obscuriglobus reveals a novel prokaryotic rRNA operon gene architecture.

Gemmata obscuriglobus is a Gram-negative bacterium with several intriguing biological features. Here, we present a complete, de novo whole genome assembly for G. obscuriglobus which consists of a single, circular 9 Mb chromosome, with no plasmids detected. The genome was annotated using the NCBI Prokaryotic Genome Annotation pipeline to generate common gene annotations. Analysis of the rRNA genes revealed three interesting features for a bacterium. First, linked G. obscuriglobus rrn operons have a unique gene order, 23S-5S-16S, compared to typical prokaryotic rrn operons (16S-23S-5S). Second, G. obscuriglobus rrn operons can either be linked or unlinked (a 16S gene is in a separate genomic location from a 23S and 5S gene pair). Third, all of the 23S genes (5 in total) have unique polymorphisms. Genome analysis of a different Gemmata species (SH-PL17), revealed a similar 23S-5S-16S gene order in all of its linked rrn operons and the presence of an unlinked operon. Together, our findings show that unique and rare features in Gemmata rrn operons among prokaryotes provide a means to better define the evolutionary relatedness of Gemmata species and the divergence time for different Gemmata species. Additionally, these rrn operon differences provide important insights into the rrn operon architecture of common ancestors of the planctomycetes.


July 7, 2019

Improved draft genome sequence of a monoteliosporic culture of the karnal bunt (Tilletia indica) pathogen of wheat.

Karnal bunt of wheat is an internationally quarantined fungal pathogen disease caused by Tilletia indica and affects the international commercial seed trade of wheat. We announce here the first improved draft genome assembly of a monoteliosporic culture of the Tilletia indica fungus, consisting of 787 scaffolds with an approximate total genome size of 31.83 Mbp, which is more accurate and near to complete than the previous version. Copyright © 2018 Kumar et al.


July 7, 2019

Complete genome sequence of Acinetobacter radioresistens strain LH6, a multidrug-resistant bacteriophage-propagating strain.

Antimicrobial resistance is a major problem worldwide. Understanding the interplay between drug-resistant pathogens, such as Acinetobacter baumannii and related species, potentially acting as environmental reservoirs is critical for preventing the spread of resistance determinants. Here we report the complete genome sequence of a multidrug-resistant bacteriophage-propagating strain of Acinetobacter radioresistens.


July 7, 2019

Complete genome sequence of Aeromonas rivipollensis KN-Mc-11N1, isolated from a wild nutria (Myocastor coypus) in South Korea.

We report here the complete genome sequence of Aeromonas rivipollensis KN-Mc-11N1, which was isolated from a wild nutria (Myocastor coypus) in South Korea. Genomic analysis indicated that A. rivipollensis may have zoonotic potential similar to that of other aeromonads, and nutria could be one of the sources of transmission of zoonotic pathogens to humans.


July 7, 2019

An improved approach for reconstructing consensus repeats from short sequence reads

Repeat elements are important components of most eukaryotic genomes. Most existing tools for repeat analysis rely either on high quality reference genomes or existing repeat libraries. Thus, it is still challenging to do repeat analysis for species with highly repetitive or complex genomes which often do not have good reference genomes or annotated repeat libraries. Recently we developed a computational method called REPdenovo that constructs consensus repeat sequences directly from short sequence reads, which outperforms an existing tool called RepARK. One major issue with REPdenovo is that it doesn’t perform well for repeats with relatively high divergence rates or low copy numbers. In this paper, we present an improved approach for constructing consensus repeats directly from short reads. Comparing with the original REPdenovo, the improved approach uses more repeat-related k-mers and improves repeat assembly quality using a consensus-based k-mer processing method.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.