List of contributions
Images
- Watermarking Images in Self-Supervised Latent Spaces
An approach to embed invisible watermarks in images using self-supervised learning features. - Active Image Indexing
A method for efficient better image fingerprinting and retrieval by modifying images before their release. - The Stable Signature
A framework for watermarking diffusion models by embedding binary messages in their latent decoder. - Watermark Anything With Localized Messages
Models to embed localized watermarks in images, enabling the extraction of one or multiple messages from different regions of an image, and the segmentation of the watermarkβs presence.
Audio
- Proactive Detection of Voice Cloning with Localized Watermarking
Model for localized watermarking to detect AI-generated speech with precision. The detector directly predicts watermark presence for each time step, making it fast and suitable for voice cloning applications. - Latent Watermarking of Audio Generative Models
A method to watermark audio generative models by watermarking their training data in a way that is robust to the audio tokenizer.
Text
- Three Bricks to Consolidate Watermarks for Large Language Models
Three key improvements to watermarking methods for LLMs: theoretically grounded statistical tests that guarantee false positive rates, evaluation on classical NLP benchmarks, and extension to multi-bit watermarking. - Watermarking Makes Language Models Radioactive
Demonstrates that training on watermarked text can be easily detected, linking contamination level to watermark robustness and fine-tuning process. Shows that training on watermarked synthetic instructions can be detected with high confidence.
Model
- Functional Invariants to Watermark Large Transformers
A method to watermark transformer models using functional invariants while preserving model utility.
Miscellaneous
- DINOv2
Self-supervised vision model that learns robust visual features without any supervision. - A Cookbook of Self-Supervised Learning
A comprehensive guide to self-supervised learning methods, best practices, and implementation details. - Seamless: Multilingual Expressive and Streaming Speech Translation
A unified framework for multilingual speech translation supporting streaming and expressive speech.
Links
Slides