Accessible, realistic genome simulation with selection using stdpopsim
Gower, G., Pope, N.S., Rodrigues, M.F., Tittes, S., Tran, L.N., Alam, O., Cavassim, M.I.A., Fields, P.D., Haller, B.C., Huang, X., Jeffrey, B., Korfmann, K., Kyriazis, C.C., Min, J., Rebollo, I., Rehmann, C.T., Small, S.T., Smith, C.C.R., Tsambos, G., Wong, Y., Zhang, Y., Huber, C.D., Gorjanc, G., Ragsdale, A.P., Gronau, I., Gutenkunst, R.N., Kelleher, J., Lohmueller, K.E., Schrider, D.R., Ralph, P.L., Kern, A.D., 2025. Accessible, realistic genome simulation with selection using stdpopsim. Molecular Biology and Evolution.
Selection is a fundamental evolutionary force that shapes patterns of genetic variation across species. However, simulations incorporating realistic selection along heterogeneous genomes in complex demographic histories are challenging, limiting our ability to benchmark statistical methods aimed at detecting selection and to explore theoretical predictions. stdpopsim is a community-maintained simulation library that already provides an extensive catalog of species-specific population genetic models. Here we present a major extension to the stdpopsim framework that enables simulation of various modes of selection, including background selection, selective sweeps, and arbitrary distributions of fitness effects (DFE) acting on annotated subsets of the genome (for instance, exons). This extension maintains stdpopsim’s core principles of reproducibility and accessibility while adding support for species-specific genomic annotations and published DFE estimates. We demonstrate the utility of this framework by comparing methods for demographic inference, DFE estimation, and selective sweep detection across several species and scenarios. Our results demonstrate the robustness of demographic inference methods to selection on linked sites, reveal the sensitivity of DFE-inference methods to model assumptions, and show how genomic features, like recombination rate and functional sequence density, influence power to detect selective sweeps. This extension to stdpopsim provides a powerful new resource for the population genetics community to explore the interplay between selection and other evolutionary forces in a reproducible, user-friendly framework.
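To make the idea of a DFE "acting on annotated subsets of the genome" concrete, the following is a minimal, standard-library sketch of the concept. It is not the stdpopsim API and not the authors' implementation. The function names, the exon intervals, and the gamma parameters (`mean_s`, `shape`) are hypothetical values chosen for illustration only. Mutations that fall inside an annotated interval draw a deleterious selection coefficient from a gamma distribution, and all other mutations are treated as neutral.

```python
import random
from bisect import bisect_right


def in_intervals(pos, intervals):
    """Return True if pos lies in one of the sorted, non-overlapping
    half-open intervals [start, end)."""
    starts = [start for start, _ in intervals]
    i = bisect_right(starts, pos) - 1
    return i >= 0 and pos < intervals[i][1]


def draw_selection_coefficients(positions, exons, mean_s=-0.01, shape=0.2, rng=None):
    """Assign a selection coefficient to each mutation position.

    Positions inside an exon draw from a gamma DFE with the given mean
    (negative, i.e. deleterious) and shape; positions elsewhere are neutral.
    The default parameter values are illustrative assumptions, not
    published estimates.
    """
    rng = rng or random.Random()
    scale = abs(mean_s) / shape  # gamma mean = shape * scale
    coefficients = []
    for pos in positions:
        if in_intervals(pos, exons):
            coefficients.append(-rng.gammavariate(shape, scale))
        else:
            coefficients.append(0.0)
    return coefficients


if __name__ == "__main__":
    exons = [(100, 200), (500, 600)]          # hypothetical annotation
    positions = [50, 100, 199, 200, 550, 700]  # hypothetical mutation sites
    for pos, s in zip(positions, draw_selection_coefficients(positions, exons)):
        print(pos, s)
```

In stdpopsim itself, the abstract indicates that species-specific annotations and published DFE estimates are supplied by the catalog, so a user would not normally hand-code intervals or parameters as done here.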