July 19, 2019

Reconstructing complex regions of genomes using long-read sequencing technology.

Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single-molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and found that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state.


July 19, 2019

Revealing complete complex KIR haplotypes phased by long-read sequencing technology

The killer cell immunoglobulin-like receptor (KIR) region of human chromosome 19 contains up to 16 genes for natural killer (NK) cell receptors that recognize human leukocyte antigen (HLA)/peptide complexes and other ligands. The KIR proteins fulfill functional roles in infections, pregnancy, autoimmune diseases and transplantation. However, their characterization remains a constant challenge. Not only are the genes highly homologous due to their recent evolution by tandem duplications, but the region is structurally dynamic due to frequent transposon-mediated recombination. A sequencing approach that precisely captures the complexity of KIR haplotypes for functional annotation is desirable. We present a unique approach to haplotype the KIR loci using single-molecule, real-time (SMRT) sequencing. Using this method, we have, for the first time, comprehensively sequenced and phased sixteen KIR haplotypes from eight individuals without imputation. The information revealed four novel haplotype structures, a novel gene-fusion allele, novel and confirmed insertion/deletion events, a homozygous individual, and overall diversity for the structural haplotypes and their alleles. These KIR haplotypes augment our existing knowledge by providing high-quality references, evolutionary informers, and source material for imputation. The haplotype sequences and gene annotations provide alternative loci for the KIR region in the human genome reference GRCh38.p8.


July 7, 2019

The assembly and characterisation of two structurally distinct cattle MHC class I haplotypes point to the mechanisms driving diversity.

In cattle, there are six classical MHC class I genes that are variably present between different haplotypes. Almost all known haplotypes contain between one and three genes, with an allele of Gene 2 present on the vast majority. However, very little is known about the sequence and therefore structure and evolutionary history of this genomic region. To address this, we have refined the MHC class I region in the Hereford cattle genome assembly and sequenced a complete A14 haplotype from a homozygous Holstein. Comparison of the two haplotypes revealed extensive variation within the MHC class Ia region, but not within the flanking regions, with each gene contained within a conserved 63- to 68-kb sequence block. This variable region appears to have undergone block gene duplication and likely deletion at regular breakpoints, suggestive of a site-specific mechanism. Phylogenetic analysis using complete gene sequences provided evidence of allelic diversification via gene conversion, with breakpoints between each of the extracellular domains that were associated with high guanine-cytosine (GC) content. Advancing our knowledge of cattle MHC class I evolution will help inform investigations of cattle genetic diversity and disease resistance.


July 7, 2019

Assembly and characterization of the MHC class I region of the Yangtze finless porpoise (Neophocaena asiaeorientalis asiaeorientalis).

The Yangtze finless porpoise (Neophocaena asiaeorientalis asiaeorientalis; YFP) is the sole freshwater subspecies of N. asiaeorientalis and is now critically endangered. The major histocompatibility complex (MHC) is a family of highly polymorphic genes that play an important immunological role in antigen presentation in vertebrates. Currently, however, little is known about the MHC region in the genome of the YFP, which hampers conservation genetics and evolutionary ecology studies using MHC genes. In this work, a nucleotide sequence of 774,811 bp covering the YFP MHC class I region was obtained by screening a YFP bacterial artificial chromosome (BAC) library, followed by sequencing and assembly of positive BAC clones. A total of 45 genes were successfully annotated, of which four were MHC class I genes. The four YFP MHC class I genes are highly similar to one another (>94%). Divergence in the coding region of the four YFP MHC class I genes is mainly localized to exons 2 and 3, which encode the antigen-binding sites of MHC class I genes. Additionally, comparison of the MHC structure in YFP to those of cattle, sheep, and pig showed that MHC class I genes are located in genome regions with regard to the conserved genes, and the YFP contains the fewest MHC class I genes among these species. This is the first report characterizing a cetacean MHC class I region and describing its organization, which would be valuable for further investigation of adaptation in natural populations of the YFP and other cetaceans.


January 23, 2017

Tutorial: Long Amplicon Analysis application

This tutorial provides an overview of the Long Amplicon Analysis (LAA) application. The LAA algorithm generates highly accurate, phased and full-length consensus sequences from long amplicons. Applications of LAA include…

