Safety and Robustness of Audio Watermarking

Loading...

Date

2025

Advisors

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

0
views
2
downloads

Abstract

The rapid evolution of text-to-speech (TTS) technology has greatly enhanced the realism of synthetic speech, enabling numerous beneficial applications. However, these advancements also introduce significant ethical concerns, particularly regarding impersonation, disinformation, and copyright violations. To mitigate these risks, audio watermarking has emerged as a viable solution by embedding imperceptible yet verifiable watermarks into AI-generated speech. Despite its potential, the resilience of existing audio watermarking methods against both common and adversarial perturbations remains insufficiently studied.

This research presents AudioMarkBench, the first systematic benchmark aimed at assessing the robustness of audio watermarking against two major threats: watermark removal and watermark forgery. The benchmark is structured around three core components: (1) a newly developed dataset sourced from Common Voice, ensuring diversity across languages, biological sexes, and age groups; (2) an evaluation of three leading audio watermarking techniques; and (3) an analysis of watermark robustness against 15 distinct perturbation types under three adversarial settings—no-box, black-box, and white-box attacks.

Through extensive experimentation, this study evaluates the effectiveness and vulnerabilities of state-of-the-art audio watermarking methods when subjected to these perturbations. The results reveal that while current watermarking techniques perform reliably in ideal conditions, they demonstrate notable weaknesses, particularly under black-box and white-box attack scenarios. Additionally, this study identifies potential fairness concerns, as robustness inconsistencies are observed across different demographic groups, underscoring the need for more equitable and resilient audio watermarking solutions.

Description

Provenance

Subjects

Computer engineering, Audio, Trustworthy AI, Watermarking

Citation

Citation

Guo, Moyang (2025). Safety and Robustness of Audio Watermarking. Master's thesis, Duke University. Retrieved from https://hdl.handle.net/10161/32940.

Collections


Except where otherwise noted, student scholarship that was shared on DukeSpace after 2009 is made available to the public under a Creative Commons Attribution / Non-commercial / No derivatives (CC-BY-NC-ND) license. All rights in student work shared on DukeSpace before 2009 remain with the author and/or their designee, whose permission may be required for reuse.