Menu
April 24, 2025  |  Genomic sequencing

Estimating gene conversion tract length and rate from PacBio HiFi data

Authors: Anders Poulsen Charmouh, Peter Porsborg Sørud, Lasse Thorup Hansen, Søren Besenbacher, Sofia Boeg Winge, Kristian Almstrup, Asger Hobolth, Thomas Bataillon, Mikkel Heide Schierup

Gene conversions are broadly defined as the transfer of genetic material from a ‘donor’ to an ‘acceptor’ sequence and can happen both in meiosis and mitosis. They are a subset of non-crossover events and, like crossover events, gene conversion can generate new combinations of alleles and counteract mutation load by reverting germline mutations through GC-biased gene conversion. Estimating gene conversion rate and the distribution of gene conversion tract lengths remains challenging. We present a new method for estimating tract length, rate and detection probability of non-crossover events directly in HiFi PacBio long read data. The method can be used to make inference from sequencing of gametes from a single individual. The method is unbiased even under low single nucleotide variant (SNV) densities and does not necessitate any demographic or evolutionary assumptions. We test the accuracy and robustness of our method using simulated datasets where we vary length of tracts, number of tracts, the genomic SNV density and levels of correlation between SNV density and NCO event position. Our simulations show that under low SNV densities, like those found in humans, only a minute fraction (∼2%) of NCO events are expected to become visible as gene conversions by moving at least one SNV. We finally illustrate our method by applying it to PacBio sequencing data from human sperm.

Journal: bioRXiv
DOI: 10.1101/2024.07.05.601865
Year: 2024

Read publication

Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.