Skip to main navigation Skip to search Skip to main content

BVSim: A benchmarking variation simulator mimicking human variation spectrum

  • Yongyi Luo
  • , Zhen Zhang
  • , Shu Wang
  • , Jiandong Shi
  • , Jingyu Hao
  • , Sheng Lian
  • , Taobo Hu
  • , Toyotaka Ishibashi
  • , Depeng Wang
  • , Weichuan Yu*
  • , Xiaodan Fan*
  • *Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

Background Genomic variations, including single-nucleotide polymorphisms, small insertions and deletions, and structural variations, are crucial for understanding evolution and disease. However, comprehensive simulation tools for benchmarking genomic analysis methods are lacking. Existing simulators do not accurately represent the nonuniform distribution and length patterns of structural variations in human genomes, and simulating complex structural variations remains challenging. Results We present BVSim, a flexible tool that provides probabilistic simulations of genomic variations, primarily focusing on human patterns while accommodating diverse species. BVSim effectively simulates both simple and complex structural variations and small variants by mimicking real-life variation distributions, which often exhibit higher frequencies near telomeres and within tandem repeat regions. Notably, BVSim allows users to input single or multiple benchmark samples from any reference genome, enabling the tool to summarize and represent the unique distribution patterns of structural variation positions and lengths specific to those species. Its compatibility with standard file formats facilitates seamless integration into various genomic research workflows, making it a very useful resource for benchmarking downstream tools such as variant callers. With numerical experiments, we show that BVSim generated more realistic sequences significantly different from other simulators' outputs. Conclusions BVSim is written in Python and freely available to noncommercial users under the GPL3 license. Source code, application guide, and toy examples are provided on the GitHub page at https://github.com/YongyiLuo98/BVSim. The tool is registered in SciCrunch (RRID:SCR_026926), bio.tools (biotools:BVSim), and WorkflowHub (doi:10.48546/WORKFLOWHUB.WORKFLOW.1361.1).

Original languageEnglish
Article numbergiaf095
Number of pages8
JournalGigaScience
Volume14
DOIs
Publication statusPublished - 30 Aug 2025

Bibliographical note

Publisher Copyright:
© 2025 The Author(s). Published by Oxford University Press on behalf of GigaScience.

Keywords

  • genomic variations
  • sequence simulation
  • benchmarking

Fingerprint

Dive into the research topics of 'BVSim: A benchmarking variation simulator mimicking human variation spectrum'. Together they form a unique fingerprint.

Cite this