World Scientific
DNN-based speech watermarking resistant to desynchronization attacks

    https://doi.org/10.1142/S0219691323500091
    Cited by: 1 (Source: Crossref)

    Desynchronization attacks have proved to be the greatest challenge to audio watermarking systems, as they introduce misalignment between the signal carrier and the watermark. This paper proposes a DNN-based speech watermarking system with two adversarial networks jointly trained on a set of desynchronization attacks to embed a randomly generated watermark. The detector neural network is extended with spatial pyramid pooling layers so that it can handle signals affected by these attacks. A detailed training procedure for this DNN system, with gradual introduction of attacks, is proposed in order to achieve robustness. Experiments performed on a speech dataset show that the system achieves satisfactory results on all the benchmarks it was tested against. The system preserves signal quality after watermark embedding. Most importantly, the system achieved resistance to all considered desynchronization attacks. The majority of the attacks cause less than 1.70% of incorrectly detected watermarked bits on average, which outperforms comparative techniques in this regard.
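    The abstract does not describe how the spatial pyramid pooling layers are configured, so the following is only a minimal NumPy sketch of the general idea, not the authors' detector. It shows why such a layer helps with signals whose length changes under attack: the output size depends only on the number of channels and the pyramid levels, not on the number of time steps. The function name `spatial_pyramid_pool_1d`, the `levels` tuple, and the choice of max pooling are all assumptions made for illustration.

```python
import numpy as np


def spatial_pyramid_pool_1d(features, levels=(1, 2, 4)):
    """Max-pool a (channels, time) feature map into a fixed-length vector.

    Hypothetical helper: for each pyramid level n, the time axis is split
    into n roughly equal bins and each bin is max-pooled. The output length
    is channels * sum(levels), regardless of the number of time steps.
    """
    channels, steps = features.shape
    if steps < max(levels):
        raise ValueError("need at least max(levels) time steps")
    pooled = []
    for n in levels:
        # Integer bin edges that cover the whole time axis without gaps.
        edges = [(i * steps) // n for i in range(n + 1)]
        for start, stop in zip(edges[:-1], edges[1:]):
            pooled.append(features[:, start:stop].max(axis=1))
    return np.concatenate(pooled)


# Two feature maps of different duration, e.g. before and after a
# time-scaling attack, map to vectors of the same length.
rng = np.random.default_rng(0)
original = rng.standard_normal((8, 100))
stretched = rng.standard_normal((8, 73))
assert spatial_pyramid_pool_1d(original).shape == spatial_pyramid_pool_1d(stretched).shape
```

    Because the pooled vector has a fixed size, any fully connected layers that follow can stay unchanged when the input is cropped, stretched, or otherwise resampled.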

    AMSC: 68T07