The main challenge in assembling plant genome is its ploidy level, repeats content, and polymorphism. The second-generation sequencing delivered the throughput and the accuracy that is crucial to whole-genome sequencing but insufficient and remained challenging for some plant species. It is known that genomes produced by next-gen- eration sequencing produced small contigs that would inflate the number of annotated genes (Varshney et al. 2011) and missed on the transposable elements that are abun- dant in plant genome due to their repetitive nature (Michael and Jackson 2013).
The full genome sequence of Bacillus safensis KCTC 12796BP which had been isolated from the marine sponge in the seawater of Jeju Island, was determined by Pac-Bio next- generation sequencing system. A circular chromosome in the length of 3,935,874 bp was obtained in addition to a circular form of plasmid having 36,690 bp. The G + C content of chromosome was 41.4%, and that of plasmid was 37.3%. The number of deduced CDSs in the chromosome was 3,980, whereas 36 CDS regions were determined in a plasmid. Among the deduced CDSs in chromosome, 81 tRNA genes and 24 rRNA genes…
In this technical application note we demonstrate how long reads can be used to study hard-to-sequence regions. We will give an overview of the methods and considerations when targeting genomic regions or transcript sequencing and present various results and examples of applications.
This tutorial provides an overview of the Circular Consensus Sequence (CCS) analysis application. The CCS algorithm is used in applications that require distinguishing closely related DNA molecules in the same sample. Applications of CCS include profiling microbial communities, resolving viral populations and accurately identifying somatic variations within heterogeneous tumor cells.
This tutorial provides an overview of the Long Amplicon Analysis (LAA) application. The LAA algorithm generates highly accurate, phased and full-length consensus sequences from long amplicons. Applications of LAA include HLA typing, alternative haplotyping, and localized de novo assemblies of targeted genes.
This tutorial provides an overview of the Isoform Sequencing (Iso-Seq) analysis application. The Iso-Seq application provides reads that span entire transcript isoforms, from the 5′ end to the 3′ polyA-tail. Generation of accurate, full-length transcript sequences greatly simplifies analysis by eliminating the need for transcript reconstruction to infer isoforms using error-prone assembly of short RNA-seq reads.
This tutorial provides an overview of the Hierarchical Genome Assembly Process (HGAP4) de novo assembly analysis application. HGAP4 generates accurate de novo assemblies using only PacBio data. HGAP4 is suitable for assembling a wide range of genome sizes and complexity. HGAP4 now includes some support for diploid-aware assembly.
This tutorial provides an overview of the Base Modification and Motif analysis application for identifying common bacterial epigenetic modifications and analyzing methyltransferase recognition motifs. SMRT Analysis software supports epigenetic research by measuring the rate of DNA base incorporation during Single Molecule, Real-Time Sequencing.
This tutorial provides a high-level overview of the features contained within the SMRT Link software. SMRT Link is the web-based end-to-end software workflow manager for run design and set-up on the Sequel System, Data Management, and SMRT Analysis.