The throughput and read length provided by Pacific Bioscience (PacBio) Single Molecule Real Time (SMRT) sequencing platform makes it feasible to construct contiguous, non-chimeric sequences. This is especially useful for genes with repetitive sequences in their gene bodies in gene families. We illustrate the use of PacBio to sequence and assemble hundreds of transcripts of gluten gene families from different cultivars of wheat using sequence from a single SMRT cell. To this end, we barcoded amplicons from different cultivars, then pooled these into one library for sequencing. Sequencing reads were later separated by the barcodes and further sorted into different gene groups by blast. The reads from each gene are then assembled by SeqmanNGen software. Given the length of 1 kb for each sequence derived from an initial molecule, the phase of the polymorphisms is not lost and can be used to infer also haplotype differences between different cultivars.
Journal: Methods in molecular biology