Learn how Single Molecule, Real-Time (SMRT) Sequencing and the Sequel IIe System will accelerate your research by delivering highly accurate long reads to provide the most comprehensive view of genomes, transcriptomes and epigenomes.
Mario Caccamo, head of bioinformatics at The Genome Analysis Centre (TGAC) in the UK, integrates many different sequencing technologies to get the best of each for optimal genome assemblies, analysis, and annotation. He uses PacBio’s SMRT Sequencing due to its unique long reads for scaffolding and finishing genomes.
Many genomes have been sequenced to high-quality draft status using Sanger capillary electrophoresis and/or newer short-read sequence data and whole genome assembly techniques. However, even the best draft genomes contain gaps and other imperfections due to limitations in the input data and the techniques used to build draft assemblies. Sequencing biases, repetitive genomic features, genomic polymorphism, and other complicating factors all come together to make some regions difficult or impossible to assemble. Traditionally, draft genomes were upgraded to “phase 3 finished” status using time-consuming and expensive Sanger-based manual finishing processes. For more facile assembly and automated finishing of draft genomes,…
Pseudomonas knackmussii B13, isolated in 1974, was the first strain shown to degrade chlorinated aromatic hydrocarbons. This discovery was the prologue for subsequent characterization of numerous bacterial metabolic pathways and for genetic and biochemical studies, and it spurred ideas for pollutant bioremediation. In this study, we determined the complete genome sequence of B13 using next generation sequencing technologies and optical mapping. Genome annotation indicated that B13 has a variety of metabolic pathways for degrading monoaromatic hydrocarbons including chlorobenzoate, aminophenol, anthranilate and hydroxyquinol, but not polyaromatic compounds. Comparative genome analysis revealed that B13 is closest to Pseudomonas denitrificans and Pseudomonas…
Thermoanaerobacter thermohydrosulfuricus BSB-33 is a thermophilic Gram-positive obligate anaerobe isolated from a hot spring in West Bengal, India. Unlike other T. thermohydrosulfuricus strains, BSB-33 is able to anaerobically reduce Fe(III) and Cr(VI) optimally at 60 °C. BSB-33 is the first Cr(VI) reducing T. thermohydrosulfuricus genome sequenced and of particular interest for bioremediation of environmental chromium contaminations. Here we discuss features of T. thermohydrosulfuricus BSB-33 and the unique genetic elements that may account for the peculiar metal reducing properties of this organism. The T. thermohydrosulfuricus BSB-33 genome comprises 2597606 bp encoding 2581 protein genes, 12 rRNA, 193 pseudogenes and has a G + C…
Burkholderia sp. strain WSM4176 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected in Nieuwoudtville, Western Cape of South Africa, in October 2007. This plant persists in infertile, acidic and deep sandy soils, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. Here we describe the features of Burkholderia sp. strain WSM4176, which represents a potential inoculant quality strain for L. ambigua, together with sequence and annotation. The 9,065,247 bp high-quality-draft genome is arranged in 13 scaffolds of 65 contigs, contains 8369 protein-coding genes…
In recent years, the number of human infections caused by opportunistic pathogens has increased dramatically. Plant rhizospheres are one of the most typical natural reservoirs for these pathogens but they also represent a great source for beneficial microbes with potential for biotechnological applications. However, understanding the natural variation and possible differences between pathogens and beneficials is the main challenge in furthering these possibilities. The genus Stenotrophomonas contains representatives found to be associated with human and plant hosts. We used comparative genomics as well as transcriptomic and physiological approaches to detect significant borders between the Stenotrophomonas strains: the multi-drug resistant pathogenic S.…
The fast reduction of prices of DNA sequencing allowed rapid accumulation of genome data. However, the process of obtaining complete genome sequences is still very time consuming and labor demanding. In addition, data produced from various sequencing technologies or alternative assemblies remain underexplored to improve assembly of incomplete genome sequences. We have developed FGAP, a tool for closing gaps of draft genome sequences that takes advantage of different datasets. FGAP uses BLAST to align multiple contigs against a draft genome assembly aiming to find sequences that overlap gaps. The algorithm selects the best sequence to fill and eliminate the gap. FGAP reduced…
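The gap-closing idea described in this abstract can be illustrated with a small Python sketch. It is not FGAP's implementation: exact substring matching of the gap flanks stands in for BLAST alignment, and "shortest bridging sequence" is a placeholder for whatever criterion FGAP uses to select the best sequence. The function names find_gaps and fill_gap and the flank parameter are hypothetical.

```python
import re


def find_gaps(seq, min_len=1):
    """Return half-open (start, end) coordinates of each run of N in seq."""
    return [
        (m.start(), m.end())
        for m in re.finditer(r"[Nn]+", seq)
        if m.end() - m.start() >= min_len
    ]


def fill_gap(draft, gap, candidates, flank=5):
    """Replace one gap with sequence from a candidate that spans both flanks.

    Assumption: exact matching of the flanking bases stands in for a real
    alignment step, and the shortest bridging sequence is a toy selection rule.
    """
    start, end = gap
    left = draft[max(0, start - flank):start]
    right = draft[end:end + flank]
    if not left or not right:
        return draft  # gap touches a sequence end; nothing to anchor on

    best = None
    for cand in candidates:
        i = cand.find(left)
        if i < 0:
            continue
        j = cand.find(right, i + len(left))
        if j < 0:
            continue
        bridge = cand[i + len(left):j]
        if best is None or len(bridge) < len(best):
            best = bridge

    if best is None:
        return draft  # no candidate spans the gap; leave it open
    return draft[:start] + best + draft[end:]


draft = "ACGTACGTNNNNTTGACCA"
gaps = find_gaps(draft)
closed = fill_gap(draft, gaps[0], ["GGTACGTCCCTTGACAA"])
```

A real pipeline would need to tolerate mismatches, check both strands, and process gaps from right to left so that earlier coordinates stay valid after each fill.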
The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information is promised to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data. Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner using PacBio RS long read information as a backbone. On a test set comprising six bacterial draft genomes, assembled using either a single Illumina MiSeq or Roche 454 library, we show that even a 50×…
Ensifer (syn. Sinorhizobium) meliloti is an important symbiotic bacterial species that fixes nitrogen. Strains BO21CC and AK58 were previously investigated for their substrate utilization and their plant-growth promoting abilities showing interesting features. Here, we describe the complete genome sequence and annotation of these strains. BO21CC and AK58 genomes are 6,985,065 and 6,974,333 bp long with 6,746 and 6,992 genes predicted, respectively.
Advances in sequencing technologies have dramatically reduced costs in producing high-quality draft genomes. However, there are still many contigs and possible misassembled regions in those draft genomes. Improving the quality of these genomes requires an efficient and economical means to close gaps and resequence some regions. Sequencing pooled gap region PCR products with Pacific Biosciences (PacBio) provides a significantly less expensive means for this need. We have developed a genome improvement pipeline with this strategy after decreasing a loading bias against larger PCR products in the PacBio process. Compared with Sanger technology, this approach is not only cost-effective but also…