Safety and Robustness of Audio Watermarking

dc.contributor.advisor

Gong, Neil

dc.contributor.author

Guo, Moyang

dc.date.accessioned

2025-07-02T19:08:06Z

dc.date.available

2025-07-02T19:08:06Z

dc.date.issued

2025

dc.department

Electrical and Computer Engineering

dc.description.abstract

The rapid evolution of text-to-speech (TTS) technology has greatly enhanced the realism of synthetic speech, enabling numerous beneficial applications. However, these advancements also introduce significant ethical concerns, particularly regarding impersonation, disinformation, and copyright violations. To mitigate these risks, audio watermarking has emerged as a viable solution by embedding imperceptible yet verifiable watermarks into AI-generated speech. Despite its potential, the resilience of existing audio watermarking methods against both common and adversarial perturbations remains insufficiently studied.

This research presents AudioMarkBench, the first systematic benchmark aimed at assessing the robustness of audio watermarking against two major threats: watermark removal and watermark forgery. The benchmark is structured around three core components: (1) a newly developed dataset sourced from Common Voice, ensuring diversity across languages, biological sexes, and age groups; (2) an evaluation of three leading audio watermarking techniques; and (3) an analysis of watermark robustness against 15 distinct perturbation types under three adversarial settings: no-box, black-box, and white-box.

Through extensive experimentation, this study evaluates the effectiveness and vulnerabilities of state-of-the-art audio watermarking methods when subjected to these perturbations. The results reveal that while current watermarking techniques perform reliably in ideal conditions, they demonstrate notable weaknesses, particularly under black-box and white-box attack scenarios. Additionally, this study identifies potential fairness concerns, as robustness inconsistencies are observed across different demographic groups, underscoring the need for more equitable and resilient audio watermarking solutions.

dc.identifier.uri

https://hdl.handle.net/10161/32940

dc.rights.uri

https://creativecommons.org/licenses/by-nc-nd/4.0/

dc.subject

Computer engineering

dc.subject

Audio

dc.subject

Trustworthy AI

dc.subject

Watermarking

dc.title

Safety and Robustness of Audio Watermarking

dc.type

Master's thesis

duke.embargo.months

0.01

duke.embargo.release

2025-07-08

Files

Original bundle

Name:
Guo_duke_0066N_18611.pdf
Size:
2.15 MB
Format:
Adobe Portable Document Format
