This is legit.
- The actual conversation: https://archive.is/sjG2B
- The user created a Reddit thread about it: https://old.reddit.com/r/artificial/comments/1gq4acr/gemini_told_my_brother_to_die_threatening/
This bubble can’t pop soon enough.
This is legit.
This bubble can’t pop soon enough.
I’m not saying this to excuse google (I generally avoid the big corp AI models), but I’ve used LLMs for like… what is it, almost 2 years now? And the degradation seems barely even existent, like it just went from 0 to 100. I only skimmed, so maybe I missed something important. It’s very weird. Typically there’s going to be somewhat of a path to output like this, and corp models such as this are usually tuned heavily to stay on a sanitized, assistant-like track.
The theory that weird tokens in input caused it to go wonky does seem plausible. LLMs use tokenizers (things that break stuff up into words or segments of words) and so weirdness relative to how they tokenize and what they’re trained on could maybe cause it to go off the rails.
Anyway, I tend to be opposed to the sanitized assistant format that they are most known for because it presents AI as a fact machine (which it cannot do reliably), it tries to pave over creativity of responses with sanitized tuning (which gives a false sense of security for “safe” output - as we see in examples like this, it cannot block everything weird in all scenarios), and it gets people thinking that AI = chat assistant. When the basics of an LLM without all the bells and whistles is more like: You type “I went to” and the AI continues it as “the store to buy some bread, where I saw”. How an LLM is likely to continue given text will depend some on how it’s tuned, what is in the training data, etc., but that’s ultimately what it’s doing, is it’s predicting the token that should come next and sampling methods add an element of randomness (and sometimes other fancy math) so that it doesn’t write deterministically. It doesn’t know that there is an independent human user and itself a machine. It is tuned to predict tokens like the format is a chat between two names and some stuff is done behind the scenes to stop its output before it continues writing for the user; if you remove those mechanisms with a model like this, you could have it write a whole simulated back and forth.
But because chat format presents it like AI and user, no matter how many times the corps shove in phrases like “As an AI language model”, it’s going to feel like you are talking with an entity. Which I don’t think is so much a problem for chat format where you go in knowing it’s for fantasy, like roleplay setups. But this corp stuff badly wants to encroach on the space inhabited by internet search and customer service, and it just can’t reliably. It’s a square peg in a round hole, or round peg in a square hole, however that goes.
Raw GPT 4o could honestly be incredible – both in good and bad. I remember the early chatGPT would cheerfully give you recipes for bombs and such if you asked it. Then they manually blocked that.
Now it has to do the “it’s important to consider both sides” thing all the time and I feel like you get much better responses if you talk to it at length like you would a person. Saying “thanks, now let’s look at” etc. In a study they found that if you told it to take a deep breath before answering it would send a slightly more accurate answer, apparently. I use it for bug-solving and coding because it relies on an existing corpus of documentation so it’s generally reliable and pretty good at that, but I’m starting to hate having to write at length to describe exactly what I want it to do. It should be able to infer my intent, I think this is something an LLM could do innately.
I did get some interesting answers if I primed it by saying “you are a marxist who has read the entirety of the Marxists Internet Archive”. Then exercise some human discretion when reading the output but it has allowed me to consider topics differently at times. Of course there’s also always the hallucinations machine phenomena where you second guess everything it tells you anyway because there’s no way to check if it’s actually true.
I’ve also tried much smaller LLM models and you can tell the difference. Actually, you can’t so much anymore, precisely because GPT is purposely throttled so much. I want a GPT model that only needs one sentence to do its job and will not presume it knows better than me! If there has to be AI, it has to be open source AI!
Not to sound like an ad, but this is where I appreciate NovelAI as a service. Even though it’s not open source and is a paid service, they have a good track record for letting adults use a model like an adult. They don’t have investors breathing down their necks and they made it encrypted from the start, so you can do whatever you want with text gen and not worry about it being read by some programmer who’s using it to train a model or whatever.
As you can imagine, this makes them behind the big corps who are taking ungodly amounts of investor funding, but their latest is pretty good. Not as “smart” as the best models have ever been and mainly storytelling focused, but pretty good.
So in other words, within the capitalist model of things and AI being so expensive to host and train, they’re one of the closest things I’ve seen to being in the same spirit as what open source AI could do for people without going quite that far.
Reminds me of how with one model, it was like, saying “please” as part of a request would give slightly better results.
I won’t ramble on too much on this topic, but I’m sure I could go on at length on this point alone. It’s a fascinating thing to me finding that sweet spot where an AI is designed like an extra limb for a person (metaphorically speaking, not talking about actual cybernetics). I think that’s where it’s most powerful, as opposed to implementations where we’re trusting that what it’s saying and doing is solid on its own. The means of interfacing where you tell the model in natural language what you want and it tries to give it to you is only one approach and there could probably be better. With storytelling focused AI, for example, you might use outlining and other such stuff to indirectly help the AI know what you want.
That’s interesting. I experimented with a NovelAI model of trying to set up its role as a sort of marxist therapist, to avoid more individualist-feeling back and forth. I’m not sure how much difference it actually made, but it was similar, I think, to what you describe in the way that it has helped me consider things in ways I hadn’t thought of at times. And yeah, the hallucination thing is a very real part of it. Occasionally there are times an LLM tells me something that I look up and it turns out it is real and I hadn’t heard of it, but then there are also those times where I’m just taking what it says with a grain of salt as something to consider rather than as something grounded.
It eventually all ties into the contradiction between what the technology is vs. what big tech and venture capital want you think it is as you alluded. I think LLMs in an ideal scenario could be at worst a fun toy and at best a good stepping stone but big tech has decided to get incredibly weird with it. So now you get bombastic claims about what LLMs will be able to do five years from now alongside disclaimers that it currently makes shit up so please double check the responses.
The reason I posted this is that it’s good to try and hold demoncorps like Google accountable even though it won’t likely make a dent. At worst it’s just good fun expect for the Gemini user in question.
Agreed. I have no love for google or how they and others like them are going about this. Personally, it’s a subject I hang around a lot, so I tend to use what opportunities I have to drop some basics about it, in case there are people around who think it’s more… magical than it is, for lack of a better word.
Lol yeah, that stuff is… something. AGI (Artificial General Intelligence) seems to be the go-to buzzword to fuel the hype machine, but as far as I can tell, the logistics of actually achieving it are so beyond what an LLM is, at least in the current transformer infrastructure of things. One of the things I’ve picked up along the way is just how important data is that goes into training an LLM. And it’s this thing that kinda makes intuitive sense when you think about it, but can get lost in the black box “AI so clever” hype; that it can’t know something it hasn’t ever been presented with before. To put it one way, if you trained an LLM on a story with binary good and a story with binary evil, it’s not necessarily going to extrapolate from that how to write a mundane story about shades of gray. It might instead combine the two flavors, creating a blend of the extremes. I can’t claim with confidence it’s exactly this straightforward in practice, but trying to get at a general idea.