Targeted sequencing, Archives - Page 53 of 55

July 7, 2019 |

Third-generation sequencing and analysis of four complete pig liver esterase gene sequences in clones identified by screening BAC library.

Pig liver carboxylesterase (PLE) gene sequences in GenBank are incomplete, which has led to difficulties in studying the genetic structure and regulation mechanisms of gene expression of PLE family genes. The aim of this study was to obtain and analysis of complete gene sequences of PLE family by screening from a Rongchang pig BAC library and third-generation PacBio gene sequencing.After a number of existing incomplete PLE isoform gene sequences were analysed, primers were designed based on conserved regions in PLE exons, and the whole pig genome used as a template for Polymerase chain reaction (PCR) amplification. Specific primers were then selected based on the PCR amplification results. A three-step PCR screening method was used to identify PLE-positive clones by screening a Rongchang pig BAC library and PacBio third-generation sequencing was performed. BLAST comparisons and other bioinformatics methods were applied for sequence analysis.Five PLE-positive BAC clones, designated BAC-10, BAC-70, BAC-75, BAC-119 and BAC-206, were identified. Sequence analysis yielded the complete sequences of four PLE genes, PLE1, PLE-B9, PLE-C4, and PLE-G2. Complete PLE gene sequences were defined as those containing regulatory sequences, exons, and introns. It was found that, not only did the PLE exon sequences of the four genes show a high degree of homology, but also that the intron sequences were highly similar. Additionally, the regulatory region of the genes contained two 720bps reverse complement sequences that may have an important function in the regulation of PLE gene expression.This is the first report to confirm the complete sequences of four PLE genes. In addition, the study demonstrates that each PLE isoform is encoded by a single gene and that the various genes exhibit a high degree of sequence homology, suggesting that the PLE family evolved from a single ancestral gene. Obtaining the complete sequences of these PLE genes provides the necessary foundation for investigation of the genetic structure, function, and regulatory mechanisms of the PLE gene family.

July 7, 2019 |

BAC-pool sequencing and analysis confirms growth-associated QTLs in the Asian seabass genome.

The Asian seabass is an important marine food fish that has been cultured for several decades in Asia Pacific. However, the lack of a high quality reference genome has hampered efforts to improve its selective breeding. A 3D BAC pool set generated in this study was screened using 22 SSR markers located on linkage group 2 which contains a growth-related QTL region. Seventy-two clones corresponding to 22 FPC contigs were sequenced by Illumina MiSeq technology. We co-assembled the MiSeq-derived scaffolds from each FPC contig with error-corrected PacBio reads, resulting in 187 sequences covering 9.7?Mb. Eleven genes annotated within this region were found to be potentially associated with growth and their tissue-specific expression was investigated. Correlation analysis demonstrated that SNPs in ctsb, skp1 and ppp2ca can be potentially used as markers for selecting fast-growing fingerlings. Conserved syntenies between seabass LG2 and five other teleosts were identified. This study i) provided a 10?Mb targeted genome assembly; ii) demonstrated NGS of BAC pools as a potential approach for mining candidates underlying QTLs of this species; iii) detected eleven genes potentially responsible for growth in the QTL region; and iv) identified useful SNP markers for selective breeding programs of Asian seabass.

July 7, 2019 |

Conservation genetics of an endangered grassland butterfly (Oarisma poweshiek) reveals historically high gene flow despite recent and rapid range loss

1. In poorly dispersing species gene flow can be facilitated when suitable habitat is widespread, allowing for increased dispersal between neighbouring locations. The Poweshiek skipperling [Oarisma poweshiek (Parker)], a federally endangered butterfly, has undergone a rapid, recent demographic decline following the loss of tallgrass prairie and fen habitats range wide. The loss of habitat, now restricted geographic range, and poor dispersal ability have left O. poweshiek at increased risk of extinction. 2. We studied the population genetics of six remaining populations of O. poweshiek in order to test the hypothesis that gene flow was historically high despite limited long-distance dispersal capability. Utilising nine microsatellite loci developed by PacBio sequencing, we tested for patterns of isolation by distance, low population genetic structure and alternative gene flow models. 3. Populations from southern Manitoba, Canada to the Lower Peninsula of Michigan, USA are only weakly genetically differentiated despite having low diversity. We found no support for isolation by distance, and Bayesian estimates of historical gene flow support our hypothesis that high levels of gene flow previously connected populations from Michigan to Wisconsin. 4. Prairie grasslands have been reduced tremendously over the past century, but the low mobility of O. poweshiek suggests that rapid loss of populations over the past decade cannot be simply explained by fragmentation of habitat. 5. As a species at high risk of extinction, understanding historical processes of gene flow will allow for informed management decisions with respect to head-starting individuals for population reintroductions and for conserving networks of habitat that will allow for high levels of gene flow.

July 7, 2019 |

Interchromosomal core duplicons drive both evolutionary instability and disease susceptibility of the Chromosome 8p23.1 region.

Recurrent rearrangements of Chromosome 8p23.1 are associated with congenital heart defects and developmental delay. The complexity of this region has led to inconsistencies in the current reference assembly, confounding studies of genetic variation. Using comparative sequence-based approaches, we generated a high-quality 6.3-Mbp alternate reference assembly of an inverted Chromosome 8p23.1 haplotype. Comparison with nonhuman primates reveals a 746-kbp duplicative transposition and two separate inversion events that arose in the last million years of human evolution. The breakpoints associated with these rearrangements map to an ape-specific interchromosomal core duplicon that clusters at sites of evolutionary inversion (P = 7.8 × 10(-5)). Refinement of microdeletion breakpoints identifies a subgroup of patients that map to the same interchromosomal core involved in the evolutionary formation of the duplication blocks. Our results define a higher-order genomic instability element that has shaped the structure of specific chromosomes during primate evolution contributing to rearrangements associated with inversion and disease.© 2016 Mohajeri et al.; Published by Cold Spring Harbor Laboratory Press.

July 7, 2019 |

A photoreceptor contributes to the natural variation of diapause induction in Daphnia magna.

Diapause is an adaptation that allows organisms to survive harsh environmental conditions. In species occurring over broad habitat ranges, both the timing and the intensity of diapause induction can vary across populations, revealing patterns of local adaptation. Understanding the genetic architecture of this fitness-related trait would help clarify how populations adapt to their local environments. In the cyclical parthenogenetic crustacean Daphnia magna, diapause induction is a phenotypic plastic life history trait linked to sexual reproduction, as asexual females have the ability to switch to sexual reproduction and produce resting stages, their sole strategy for surviving habitat deterioration. We have previously shown that the induction of resting stage production correlates with changes in photoperiod that indicate the imminence of habitat deterioration and have identified a Quantitative Trait Locus (QTL) responsible for some of the variation in the induction of resting stages. Here, new data allows us to anchor the QTL to a large scaffold and then, using a combination of a new mapping panel, targeted association mapping and selection analysis in natural populations, to identify candidate genes within the QTL. Our results show that variation in a rhodopsin photoreceptor gene plays a significant role in the variation observed in resting stage induction. This finding provides a mechanistic explanation for the link between diapause and day-length perception that has been suggested in diverse arthropod taxa. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

July 7, 2019 |

HLA-A23:01:19, a novel variant of HLA-A23:01, discovered in a West African stem cell transplantation patient.

July 7, 2019 |

Unbiased identification of signal-activated transcription factors by barcoded synthetic tandem repeat promoter screening (BC-STAR-PROM).

The discovery of transcription factors (TFs) controlling pathways in health and disease is of paramount interest. We designed a widely applicable method, dubbed barcorded synthetic tandem repeat promoter screening (BC-STAR-PROM), to identify signal-activated TFs without any a priori knowledge about their properties. The BC-STAR-PROM library consists of ~3000 luciferase expression vectors, each harboring a promoter (composed of six tandem repeats of synthetic random DNA) and an associated barcode of 20 base pairs (bp) within the 3′ untranslated mRNA region. Together, the promoter sequences encompass >400,000 bp of random DNA, a sequence complexity sufficient to capture most TFs. Cells transfected with the library are exposed to a signal, and the mRNAs that it encodes are counted by next-generation sequencing of the barcodes. This allows the simultaneous activity tracking of each of the ~3000 synthetic promoters in a single experiment. Here we establish proof of concept for BC-STAR-PROM by applying it to the identification of TFs induced by drugs affecting actin and tubulin cytoskeleton dynamics. BC-STAR-PROM revealed that serum response factor (SRF) is the only immediate early TF induced by both actin polymerization and microtubule depolymerization. Such changes in cytoskeleton dynamics are known to occur during the cell division cycle, and real-time bioluminescence microscopy indeed revealed cell-autonomous SRF-myocardin-related TF (MRTF) activity bouts in proliferating cells.© 2016 Gosselin et al.; Published by Cold Spring Harbor Laboratory Press.

July 7, 2019 |

Chimeras link to tandem repeats and transposable elements in tetraploid hybrid fish

Abstract The formation of the allotetraploid hybrid lineage (4nAT) encompasses both distant hybridization and polyploidization processes. The allotetraploid offspring have two sets of sub-genomes inherited from both parental species and therefore it is important to explore its genetic structure. Herein, we construct a bacterial artificial chromosome library of allotetraploids, and then sequence and analyze the full-length sequences of 19 bacterial artificial chromosomes. Sixty-eight DNA chimeras are identified, which are divided into four models according to the distribution of the genomic DNA derived from the parents. Among the 68 genetic chimeras, 44 (64.71%) are linked to tandem repeats (TRs) and 23 (33.82%) are linked to transposable elements (TEs). The chimeras linked to TRs are related to slipped-strand mispairing and double-strand break repair while the chimeras linked to TEs are benefit from the intervention of recombinases. In addition, TRs and TEs are linked not only with the recombinations, but also with the insertions/deletions of DNA segments. We conclude that DNA chimeras accompanied by TRs and TEs coordinate a balance between the sub-genomes derived from the parents which reduces the genomic shock effects and favors the evolutionary and adaptive capacity of the allotetraploidization. It is the first report on the relationship between formation of the DNA chimeras and TRs and TEs in the polyploid animals.

July 7, 2019 |

Complete sequence of a F33:A-:B- conjugative plasmid carrying the oqxAB, fosA3, and blaCTX-M-55 elements from a foodborne Escherichia coli strain.

This study reports the complete sequence of pE80, a conjugative IncFII plasmid recovered from an Escherichia coli strain isolated from chicken meat. This plasmid harbors multiple resistance determinants including oqxAB, fosA3, blaCTX-M-55, and blaTEM-1, and is a close variant of the recently reported p42-2 element, which was recovered from E. coli of veterinary source. Recovery of pE80 constitutes evidence that evolution or genetic re-arrangement of IncFII type plasmids residing in animal-borne organisms is an active event, which involves acquisition and integration of foreign resistance elements into the plasmid backbone. Dissemination of these plasmids may further compromise the effectiveness of current antimicrobial strategies.

July 7, 2019 |

MICADo – Looking for mutations in targeted PacBio cancer data: an alignment-free method.

Targeted sequencing is commonly used in clinical application of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease causing mutations remains a challenging problem even in such favorable context. The difficulty is particularly salient in the case of third generation sequencing technology, such as PacBio. We present MICADo, a de Bruijn graph based method, implemented in python, that makes possible to distinguish between patient specific mutations and other alterations for targeted sequencing of a cohort of patients. MICADo analyses NGS reads for each sample within the context of the data of the whole cohort in order to capture the differences between specificities of the sample with respect to the cohort. MICADo is particularly suitable for sequencing data from highly heterogeneous samples, especially when it involves high rates of non-uniform sequencing errors. It was validated on PacBio sequencing datasets from several cohorts of patients. The comparison with two widely used available tools, namely VarScan and GATK, shows that MICADo is more accurate, especially when true mutations have frequencies close to backgound noise. The source code is available at http://github.com/cbib/MICADo.

July 7, 2019 |

SRinversion: a tool for detecting short inversions by splitting and re-aligning poorly mapped and unmapped sequencing reads.

Rapid development in sequencing technologies has dramatically improved our ability to detect genetic variants in human genome. However, current methods have variable sensitivities in detecting different types of genetic variants. One type of such genetic variants that is especially hard to detect is inversions. Analysis of public databases showed that few short inversions have been reported so far. Unlike reads that contain small insertions or deletions, which will be considered through gap alignment, reads carrying short inversions often have poor mapping quality or are unmapped, thus are often not further considered. As a result, the majority of short inversions might have been overlooked and require special algorithms for their detection.Here, we introduce SRinversion, a framework to analyze poorly mapped or unmapped reads by splitting and re-aligning them for the purpose of inversion detection. SRinversion is very sensitive to small inversions and can detect those less than 10?bp in size. We applied SRinversion to both simulated data and high-coverage sequencing data from the 1000 Genomes Project and compared the results with those from Pindel, BreakDancer, DELLY, Gustaf and MID. A better performance of SRinversion was achieved for both datasets for the detection of small inversions.SRinversion is implemented in Perl and is publicly available at http://paed.hku.hk/genome/software/SRinversion/index.html CONTACT: yangwl@hku.hkSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

July 7, 2019 |

TeloPCR-seq: a high-throughput sequencing approach for telomeres.

We have developed a high-throughput sequencing approach that enables us to determine terminal telomere sequences from tens of thousands of individual Schizosaccharomyces pombe telomeres. This method provides unprecedented coverage of telomeric sequence complexity in fission yeast. S. pombe telomeres are composed of modular degenerate repeats that can be explained by variation in usage of the TER1 RNA template during reverse transcription. Taking advantage of this deep sequencing approach, we find that ‘like’ repeat modules are highly correlated within individual telomeres. Moreover, repeat module preference varies with telomere length, suggesting that existing repeats promote the incorporation of like repeats and/or that specific conformations of the telomerase holoenzyme efficiently and/or processively add repeats of like nature. After the loss of telomerase activity, this sequencing and analysis pipeline defines a population of telomeres with altered sequence content. This approach will be adaptable to study telomeric repeats in other organisms and also to interrogate repetitive sequences throughout the genome that are inaccessible to other sequencing methods.© 2016 Federation of European Biochemical Societies.

July 7, 2019 |

Efficient, cost-effective, high-throughput, Multilocus Sequencing Typing (MLST) method, NGMLST, and the analytical software program MLSTEZ.

Multilocus sequence typing (MLST) has become the preferred method for genotyping many biological species. It can be used to identify major phylogenetic clades, molecular groups, or subpopulations of a species, as well as individual strains or clones. However, conventional MLST is costly and time consuming, which limits its power for genotyping large numbers of samples. Here, we describe a new MLST method that uses next-generation sequencing, a multiplexing protocol, and appropriate analytical software to provide accurate, rapid, and economical MLST genotyping of 96 or more isolates in a single assay.

July 7, 2019 |

Exploiting next-generation sequencing to solve the haplotyping puzzle in polyploids: a simulation study.

Haplotypes are the units of inheritance in an organism, and many genetic analyses depend on their precise determination. Methods for haplotyping single individuals use the phasing information available in next-generation sequencing reads, by matching overlapping single-nucleotide polymorphisms while penalizing post hoc nucleotide corrections made. Haplotyping diploids is relatively easy, but the complexity of the problem increases drastically for polyploid genomes, which are found in both model organisms and in economically relevant plant and animal species. Although a number of tools are available for haplotyping polyploids, the effects of the genomic makeup and the sequencing strategy followed on the accuracy of these methods have hitherto not been thoroughly evaluated.We developed the simulation pipeline haplosim to evaluate the performance of three haplotype estimation algorithms for polyploids: HapCompass, HapTree and SDhaP, in settings varying in sequencing approach, ploidy levels and genomic diversity, using tetraploid potato as the model. Our results show that sequencing depth is the major determinant of haplotype estimation quality, that 1?kb PacBio circular consensus sequencing reads and Illumina reads with large insert-sizes are competitive and that all methods fail to produce good haplotypes when ploidy levels increase. Comparing the three methods, HapTree produces the most accurate estimates, but also consumes the most resources. There is clearly room for improvement in polyploid haplotyping algorithms.

July 7, 2019 |

Collection and storage of HLA NGS genotyping data for the 17th International HLA and Immunogenetics Workshop.

For over 50?years, the International HLA and Immunogenetics Workshops (IHIW) have advanced the fields of histocompatibility and immunogenetics (H&I) via community sharing of technology, experience and reagents, and the establishment of ongoing collaborative projects. Held in the fall of 2017, the 17th IHIW focused on the application of next generation sequencing (NGS) technologies for clinical and research goals in the H&I fields. NGS technologies have the potential to allow dramatic insights and advances in these fields, but the scope and sheer quantity of data associated with NGS raise challenges for their analysis, collection, exchange and storage. The 17th IHIW adopted a centralized approach to these issues, and we developed the tools, services and systems to create an effective system for capturing and managing these NGS data. We worked with NGS platform and software developers to define a set of distinct but equivalent NGS typing reports that record NGS data in a uniform fashion. The 17th IHIW database applied our standards, tools and services to collect, validate and store those structured, multi-platform data in an automated fashion. We have created community resources to enable exploration of the vast store of curated sequence and allele-name data in the IPD-IMGT/HLA Database, with the goal of creating a long-term community resource that integrates these curated data with new NGS sequence and polymorphism data, for advanced analyses and applications. Copyright © 2017 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

Asset Tag: Targeted sequencing,

Third-generation sequencing and analysis of four complete pig liver esterase gene sequences in clones identified by screening BAC library.

BAC-pool sequencing and analysis confirms growth-associated QTLs in the Asian seabass genome.

Conservation genetics of an endangered grassland butterfly (Oarisma poweshiek) reveals historically high gene flow despite recent and rapid range loss

Interchromosomal core duplicons drive both evolutionary instability and disease susceptibility of the Chromosome 8p23.1 region.

A photoreceptor contributes to the natural variation of diapause induction in Daphnia magna.

HLA-A23:01:19, a novel variant of HLA-A23:01, discovered in a West African stem cell transplantation patient.

Unbiased identification of signal-activated transcription factors by barcoded synthetic tandem repeat promoter screening (BC-STAR-PROM).

Chimeras link to tandem repeats and transposable elements in tetraploid hybrid fish

Complete sequence of a F33:A-:B- conjugative plasmid carrying the oqxAB, fosA3, and blaCTX-M-55 elements from a foodborne Escherichia coli strain.

MICADo – Looking for mutations in targeted PacBio cancer data: an alignment-free method.

SRinversion: a tool for detecting short inversions by splitting and re-aligning poorly mapped and unmapped sequencing reads.

TeloPCR-seq: a high-throughput sequencing approach for telomeres.

Efficient, cost-effective, high-throughput, Multilocus Sequencing Typing (MLST) method, NGMLST, and the analytical software program MLSTEZ.

Exploiting next-generation sequencing to solve the haplotyping puzzle in polyploids: a simulation study.

Collection and storage of HLA NGS genotyping data for the 17th International HLA and Immunogenetics Workshop.

Subscribe for blog updates:

Filter by topic

Talk with an expert

ALS case study

Subscribe for blog updates:

Filter by topic

Talk with an expert