Safety and Robustness of Audio Watermarking
Date
2025
Authors
Advisors
Journal Title
Journal ISSN
Volume Title
Repository Usage Stats
views
downloads
Abstract
The rapid evolution of text-to-speech (TTS) technology has greatly enhanced the realism of synthetic speech, enabling numerous beneficial applications. However, these advancements also introduce significant ethical concerns, particularly regarding impersonation, disinformation, and copyright violations. To mitigate these risks, audio watermarking has emerged as a viable solution by embedding imperceptible yet verifiable watermarks into AI-generated speech. Despite its potential, the resilience of existing audio watermarking methods against both common and adversarial perturbations remains insufficiently studied.
This research presents AudioMarkBench, the first systematic benchmark aimed at assessing the robustness of audio watermarking against two major threats: watermark removal and watermark forgery. The benchmark is structured around three core components: (1) a newly developed dataset sourced from Common Voice, ensuring diversity across languages, biological sexes, and age groups; (2) an evaluation of three leading audio watermarking techniques; and (3) an analysis of watermark robustness against 15 distinct perturbation types under three adversarial settings—no-box, black-box, and white-box attacks.
Through extensive experimentation, this study evaluates the effectiveness and vulnerabilities of state-of-the-art audio watermarking methods when subjected to these perturbations. The results reveal that while current watermarking techniques perform reliably in ideal conditions, they demonstrate notable weaknesses, particularly under black-box and white-box attack scenarios. Additionally, this study identifies potential fairness concerns, as robustness inconsistencies are observed across different demographic groups, underscoring the need for more equitable and resilient audio watermarking solutions.
Type
Department
Description
Provenance
Subjects
Citation
Permalink
Citation
Guo, Moyang (2025). Safety and Robustness of Audio Watermarking. Master's thesis, Duke University. Retrieved from https://hdl.handle.net/10161/32940.
Collections
Except where otherwise noted, student scholarship that was shared on DukeSpace after 2009 is made available to the public under a Creative Commons Attribution / Non-commercial / No derivatives (CC-BY-NC-ND) license. All rights in student work shared on DukeSpace before 2009 remain with the author and/or their designee, whose permission may be required for reuse.