New technologies and analysis methods are enabling genomic structural variants (SVs) to be detected with ever-increasing accuracy, resolution, and comprehensiveness. Translating these methods to routine research and clinical practice requires robust benchmark sets. We developed the first benchmark set for identification of both false negative and false positive germline SVs, which complements recent efforts emphasizing increasingly comprehensive characterization of SVs. To create this benchmark for a broadly consented son in a Personal Genome Project trio with broadly available cells and DNA, the Genome in a Bottle (GIAB) Consortium integrated 19 sequence-resolved variant calling methods, both alignment- and de novo assembly-based,…
Primary cutaneous follicle center lymphoma (PCFCL) is a rare mature B-cell lymphoma with an unknown etiology. PCFCL resembles follicular lymphoma (FL) by cytomorphologic and microarchitectural criteria. FL B cells are selected for N-linked glycosylation motifs in their B-cell receptors (BCRs) that are acquired during continuous somatic hypermutation. The stimulation of mannosylated BCR by lectins on the tumor microenvironment is therefore a candidate driver in FL pathogenesis. We investigated whether the same mechanism could play a role in PCFCL pathogenesis. Full-length functional variable, diversity, and joining gene sequences of 18 PCFCL and 8 primary cutaneous diffuse large B-cell lymphoma, leg-type were…
Unlike the normal anadromous lifestyle, Chinese native Dolly Varden char (Salvelinus malma) is locked in land and lives in fresh water lifetime. To explore the effect of freshwater adaption on its immune system, we constructed a pooled cDNA library of hepatopancreas and spleen of Chinese freshwater Dolly Varden char (S. malma). A total of 27,829 unigenes were generated from 31,233 high-quality transcripts and 17,670 complete open reading frames (ORF) were identified. Totally 25,809 unigenes were successfully annotated and it classified more native than adaptive immunity-associated genes, and more genes involved in toll-like receptor signal pathway than those in complement and…
The antibody repertoire of Bos taurus is characterized by a subset of variable heavy (VH) chain regions with ultralong third complementarity determining regions (CDR3) which, compared to other species, can provide a potent response to challenging antigens like HIV env. These unusual CDR3 can range to over seventy highly diverse amino acids in length and form unique ß-ribbon ‘stalk’ and disulfide bonded ‘knob’ structures, far from the typical antigen binding site. The genetic components and processes for forming these unusual cattle antibody VH CDR3 are not well understood. Here we analyze sequences of Bos taurus antibody VH domains and find…
The genomic structure of the Major Histocompatibility Complex (MHC) region and variation in selected MHC class I related genes in Old World camels, Camelus bactrianus and Camelus dromedaries were studied. The overall genomic organization of the camel MHC region follows a general pattern observed in other mammalian species and individual MHC loci appear to be well conserved. Selected MHC class I genes B-67 and BL3-7 exhibited unexpectedly low variability, even when compared to other camel MHC class I related genes MR1 and MICA. Interspecific SNP and allele sharing are relatively common, and frequencies of heterozygotes are usually low. Such a…
Long-read sequencing technologies have advantages in genome assembly, structural variant detection and haplotype phasing, but are less suited for single-nucleotide variant (SNV) and insertion/deletion (indel) calling due to the high error rate in comparison with short-read sequencing. Wenger et al., from Pacific Biosciences, optimized the circular consensus sequencing (CCS) protocol to achieve long, high-fidelity reads, in which they selected the SMRTbell library with fractions tightly distributed at 15 kb for high-coverage sequencing.