Accessible, realistic genome simulation with selection using stdpopsim
Gower, G., Pope, N.S., Rodrigues, M.F., Tittes, S., Tran, L.N., Alam, O., Cavassim, M.I.A., Fields, P.D., Haller, B.C., Huang, X., Jeffrey, B., Korfmann, K., Kyriazis, C.C., Min, J., Rebollo, I., Rehmann, C.T., Small, S.T., Smith, C.C.R., Tsambos, G., Wong, Y., Zhang, Y., Huber, C.D., Gorjanc, G., Ragsdale, A.P., Gronau, I., Gutenkunst, R.N., Kelleher, J., Lohmueller, K.E., Schrider, D.R., Ralph, P.L., Kern, A.D., 2025. Accessible, realistic genome simulation with selection using stdpopsim. Molecular Biology and Evolution.
Selection is a fundamental evolutionary force that shapes patterns of genetic variation across species. However, simulations incorporating realistic selection along heterogeneous genomes in complex demographic histories are challenging, limiting our ability to benchmark statistical methods aimed at detecting selection and to explore theoretical predictions. stdpopsim is a community-maintained simulation library that already provides an extensive catalog of species-specific population genetic models. Here we present a major extension to the stdpopsim framework that enables simulation of various modes of selection, including background selection, selective sweeps, and arbitrary distributions of fitness effects (DFE) acting on annotated subsets of the genome (for instance, exons). This extension maintains stdpopsim’s core principles of reproducibility and accessibility while adding support for species-specific genomic annotations and published DFE estimates. We demonstrate the utility of this framework by comparing methods for demographic inference, DFE estimation, and selective sweep detection across several species and scenarios. Our results demonstrate the robustness of demographic inference methods to selection on linked sites, reveal the sensitivity of DFE-inference methods to model assumptions, and show how genomic features, like recombination rate and functional sequence density, influence power to detect selective sweeps. This extension to stdpopsim provides a powerful new resource for the population genetics community to explore the interplay between selection and other evolutionary forces in a reproducible, user-friendly framework.
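To make the idea of a distribution of fitness effects concrete, the following is a minimal sketch, not stdpopsim's API or the authors' implementation. It assumes a simple mixture in which a fraction of new mutations is neutral and the rest are deleterious with gamma-distributed selection coefficients. The function name and all parameter values (mean_s, shape, neutral_fraction) are hypothetical and chosen only for illustration; they are not published DFE estimates.

```python
import random


def sample_selection_coefficients(n, mean_s=-0.01, shape=0.2,
                                  neutral_fraction=0.3, seed=1):
    """Draw n selection coefficients from a neutral + gamma mixture DFE.

    All default values are illustrative assumptions, not published estimates.
    """
    rng = random.Random(seed)
    # A gamma distribution has mean = shape * scale, so pick the scale
    # that gives the requested mean magnitude.
    scale = abs(mean_s) / shape
    coefficients = []
    for _ in range(n):
        if rng.random() < neutral_fraction:
            # Neutral mutation: no effect on fitness.
            coefficients.append(0.0)
        else:
            # Deleterious mutation: negated gamma draw.
            coefficients.append(-rng.gammavariate(shape, scale))
    return coefficients


coefficients = sample_selection_coefficients(10_000)
neutral_share = sum(1 for s in coefficients if s == 0.0) / len(coefficients)
```

In a simulation of the kind described above, such coefficients would apply only to mutations that fall within annotated regions (for instance, exons), while mutations elsewhere would be treated as neutral.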