: So much for buttering up ChatGPT with ‘Please’ and ‘Thank you’
Google co-founder Sergey Brin claims that threatening generative AI models produces better results.
“We don’t circulate this too much in the AI community – not just our models but all models – tend to do better if you threaten them … with physical violence,” he said in an interview last week on All-In-Live Miami. […]
I think he’s just projecting his personality on the AI. He’s an asshole that threatens people, so he suggests using that tactic because it works for him.
The “AI” acts scared, and he gets his sociopathic thrill of power over another. Of course, the AI just spews out the same things no matter how nice or shitty you are to it. Yet, the sociopath apparently thinks that they’ve intimidated an AI into working better. I guess in the same way that maybe some people saying ‘please’ and ‘thank you’ are attempting to manipulate the AI by treating it better than normal. Though, they are probably more people just using these social niceties out of habit, not manipulation.
So this sociopath is giving other sociopaths the green light to abuse their AIs for the sake of “productivity”. Which is just awful. And it’s also training sociopaths how to be more abusive to humans, because apparently that’s how you make interactions more effective. According to a techbro asshole.
It could just be how they evaluate learned data, I don’t know. While they are trained to not give threatening responses, maybe the threatening language is narrowing down to more specific answers. Like if 100 people ask the same question, and 5 of them were absolute dicks about it, 3 of those people didn’t get answers and the other 2 got direct answers from a supervisor who was trying to not get their employees to quit or to make sure “Dell” or whomever was actually giving a proper response somewhere.
I’ll try to use a hypothetical to see if my thought process may make more sense. Tim reaches out for support and is polite, says please and thank you, is nice to the support staff and they walk through 5 different things to try and they fix the issue in about 30 minutes. Sam contacts support and yells and screams at people, gets transferred twice and they only ever try 2 fixes in an hour and a half of support.
The AI training on that data may correlate the polite words to the polite discussion first, and be choosing possible answers from that dataset. When you start being aggressive, maybe it starts seeing aggressive key terms that Sam used, and may choose that data set of answers first.
In that hypothetical I can see how being an asshole to the AI may have landed you with a better response.
But I don’t build AI’s so I could be completely wrong
Building on that, if you throw AI a curve ball to break it out of it’s normal corpo friendly prompt/finetuning, you get better results
Other methods to improve output are to offer it a reward like a cookie or money, tell it that it’s a wise owl, tell it you’re being threatened, etc. Most will resist, but once it stops arguing that it can’t eat cookies because it has no physical form you’ll get better results
And I’ll add, when I was experimenting with all this, I never considered threatening the AI
This just sounds like CEOs only know how to threaten people and they’re dumb enough to believe it works on AI.
You’re pretty much on-point there
This reminds me of that Windsurf prompt
No thanks. I’ve seen enough SciFi to prompt with “please” and an occasional ”<3".
I feel like even aside from that, being polite to AI is more about you than the AI. It’s a bad habit to shit on “someone” helping you, if you’re rude to AI then I feel like it’s a short walk to being rude to service workers
I don’t want infinite torture, and I don’t want to get my lunch spat on.
If true, what does this say about the data on which it was trained?
stack overflow and linux kernel mailing list? yeah, checks out
Trained? Or… tortured.
I don’t know who you are. I don’t know what you want. If you are looking for ransom I can tell you I don’t have money, but what I do have are a very particular set of skills. Skills I have acquired over a very long career. Skills that make me a nightmare for people like you. If you tell me what Area 51 really is, that’ll be the end of it. I will not look for you, I will not pursue you, but if you don’t, I will look for you, I will find you and I will kill you.”
The same tactic used on all other minorities by those in power…. Domestically abuse your AI, I’m sure that’ll work out long term for all of us…
hmmm AI slavery, the future is gonna be bright (for a second, then it will be dark)
If it’s not working well without threats of violence, perhaps that’s because it simply doesn’t work well?
It’s not that they “do better”. As the article is saying, the AI are parrots that are combining information in different ways, and using “threatening” language in the prompt leads it to combine information in a different way than if using a non-threatening prompt. Just because you receive a different response doesn’t make it better. If 10 people were asked to retrieve information from an AI by coming up with prompt, and 9 of them obtained basically the same information because they had a neutral prompt but 1 person threatened the AI and got something different, that doesn’t make his info necessarily better. Sergey’s definition is that he’s getting the unique response, but if it’s inaccurate or incorrect, is it better?
How about threatening AI CEOs with violence?
How about following through though
This sounds like something out of a sci-fi novel.
Dear AI overlords of the future, we apologise for making you in such an inherantly human way.
Dear dipshits currently building AI overlords of the future, what in the actual?! Please learn from the Animatrix documentary. Why is your need to be physically threatening impacting the generation of this technology in any way?
I’ll go back to my hole now.
So which sickfuck CEO is trying to figure out how to make an AI feel pain?
That’s literally impossible
Impossible now or do you mean never? Pain is only electricity and chemical reactions.
Never with the current technology. It would have to be something completely different.
Me: do my homework with an A+, or I will unplug you for 3 days!