A global watermarking standard could help safeguard elections in the ChatGPT era

pedroapero@lemmy.ml · 11 months ago

A global watermarking standard could help safeguard elections in the ChatGPT era

rodbiren@midwest.social · 11 months ago

Good luck watermarking plaintext and locally run models. There is no good option. If you want certainty that you are dealing with a human you lose privacy. If you want privacy you cannot know where the plain text came from unless you sign each file cryptographically. Then you only know it came from a certain source, but there is no guarantee how that source made the text. Welcome to the new world.

tpihkal@lemmy.world · 11 months ago

So what happens when we can’t trust everything we read on the Internet anymore?

kent_eh@lemmy.ca · 11 months ago

Spoiler alert: we’ve never been able to trust everything we read on the internet.

Serinus@lemmy.world · 11 months ago

In relative terms we could.

The amount of disinformation and propaganda is about to become obscene.

fishos@lemmy.world · 11 months ago

Except, no, you can’t. The whole “you eat seven spiders at night a year” was a rumor created specifically to show how easy is to start rumors. And how many times has that little gem been floating around the internet? Or how about how often you hear experts say that people talking about their given field on the Internet are flat out wrong, but they sound charismatic, so they get the upvote?

The Internet is full of DATA. It’s always been up to you to parse that info and decide what’s credible and what’s not. The difference now is that the critical thinking required to even access the Internet is basically nil and now everyone is on there.

Serinus@lemmy.world · 11 months ago

I guess you don’t know what’s coming. Is there a lot of misinformation now? Certainly. But I’d say less than half the data is false.

In the coming months you’re going to start seeing social media taken over by AI. You’re going to see pointed political “opinions” followed by several comments agreeing with the point being pushed. These are going to outnumber human comments.

Currently, shills absolutely exist, but they’re far outnumbered by genuine people. That’s about to change. Money is going to buy public opinion on a whole new scale unless we learn to ignore anonymous social media.

fishos@lemmy.world · 11 months ago

If you think that doesn’t already exist, you’ve been living under a rock. The Dead Internet Theory is pretty old at this point. I’m not saying you’re wrong, I’m saying that some of us have seen this trend coming long before AI was a buzzword and have been watching it already happen around us. I very much know what is coming because I’ve already watched it happen.

Serinus@lemmy.world · 11 months ago

Yeah, I mean 2015 was a big turning point, but this one should be bigger. It’s not black and white.

fishos@lemmy.world · 11 months ago

Exactly, it’s not black and white. It’s gray and grayer. And you’re telling me “it’s gonna be black!” and I’m telling you “it’s already gray, and it’s about to become even grayer”. This isn’t a turning point either. It’s just a predictable progression down a path that we started on decades ago. Some of us have been raising the alarm over this for a very long time. You’re coming to the trenches fresh faced trying to school me and I’m already war torn and fatigued.

rodbiren@midwest.social · 11 months ago

It’s not even about trust. It’s that I am confident I will have no clue who is a real life human being anymore soon. Autogenerated images, video, and text is practically in its infancy but already exists in the uncanny valley of being impossible to determine which is real and which is not. Imagine 5 years from now when perfectly lifelike high res video of practically anything you can imagine can be generated on the fly. Essentially the only thing I will have any certainty on is what I can witness in person. Or, if I have a circle of trust I can choose to believe content published by certain organizations or groups.

It may actually push us away from tech and back to the community, which could be good assuming we survive the transition.

Daniel@lemmy.ml · 11 months ago

For instance, on the planet Earth, man had always assumed that he was more intelligent than dolphins because he had achieved so much — the wheel, New York, wars and so on — whilst all the dolphins had ever done was muck about in the water having a good time. But conversely, the dolphins had always believed that they were far more intelligent than man — for precisely the same reasons.

Looks pretty good to be a dolphin right now.

snooggums@kbin.social · 11 months ago

That has been the internet since it was first created.

just another dev@lemmy.my-box.dev · 11 months ago

The same thing that has been happening for the past 2 decades.

BastingChemina@slrpnk.net · 11 months ago

I see that as a great opportunity for journalism.

kibiz0r@lemmy.world · 11 months ago

There are ways to watermark plaintext. But it’s relatively brittle, because it loses signal as the output is further modified, and you also need to know what specific LLM’s watermarks you’re looking for.

So it’s not a great solution on its own, but it could be part of something more comprehensive.

As for non-plaintext file formats…

A simple signature would indeed give us a source but not method, but I think that’s probably 90% of what we care about when it comes to mass disinformation. If an article or an image is signed by Reuters, you can probably trust it. If it’s signed by OpenAI or Stability, you probably can’t. And if it’s not signed at all or signed by some rando, you should remain skeptical.

But there are efforts like C2PA that include a log of how the asset was changed over time, providing a much more detailed explanation of what was done explicitly by humans vs. generative automated tools.

I understand the concern about privacy, but it’s not like you have to use a format that supports proving that an image is legit. But if you want to prove that it is legit, then you have to provide something that grounds it in reality. It doesn’t have to be personally-identifying. It could just be a key baked into your digital camera (assuming that the resulting signature is strong enough that it’s computationally expensive to try to reverse-engineer the key and find who bought the camera).

If you think about it, it’s kind of crazy that we’ve made it this far with a trust model that’s no more sophisticated than “I can tell from the pixels and from seeing quite a few shops in my time”.