Posts

Is Text Watermarking a lost cause? 2024-10-01T16:20:51.113Z

Comments

Comment by egor.timatkov on Is Text Watermarking a lost cause? · 2024-10-01T19:01:30.151Z · LW · GW

I haven't, no. I really wish I could somehow investigate all 3 pillars of a good watermark (Decisiveness, Invisibility, Robustness), but I couldn't think of any way to quantify a general text watermark's invisibility. For any given watermark you can technically rate "how invisible it is" by using an LLM's loss function to see how different the watermarked text is from the original text, but I can't come up with a way to generalize this.
So unfortunately my analysis was only about the interplay between decisiveness and robustness.