lemmyreader@lemmy.ml to Technology@lemmy.mlEnglish · 7 months agoStack Overflow bans users en masse for rebelling against OpenAI partnership — users banned for deleting answers to prevent them being used to train ChatGPTwww.tomshardware.comexternal-linkmessage-square34fedilinkarrow-up1262arrow-down13cross-posted to: technology@lemmy.world
arrow-up1259arrow-down1external-linkStack Overflow bans users en masse for rebelling against OpenAI partnership — users banned for deleting answers to prevent them being used to train ChatGPTwww.tomshardware.comlemmyreader@lemmy.ml to Technology@lemmy.mlEnglish · 7 months agomessage-square34fedilinkcross-posted to: technology@lemmy.world
minus-squarejubilationtcornpone@sh.itjust.workslinkfedilinkEnglisharrow-up20arrow-down1·7 months agoData Rule Numero Uno: Garbage in, garbage out. Have fun training your LLM on a big steaming pile of hot garbage. That’s 80% of Stack Overflows content.
minus-squareharrys_balzac@lemmy.dbzer0.comlinkfedilinkarrow-up6·7 months agoMostly “this has been answered in another thread” and “why don’t you Google it” comments in my experience.
minus-squareLostXOR@fedia.iolinkfedilinkarrow-up2·edit-27 months agoThe other 20% is mostly high quality however, and I’m sure they’d filter out the heavily downvoted crud.
Data Rule Numero Uno:
Garbage in, garbage out.
Have fun training your LLM on a big steaming pile of hot garbage. That’s 80% of Stack Overflows content.
Mostly “this has been answered in another thread” and “why don’t you Google it” comments in my experience.
The other 20% is mostly high quality however, and I’m sure they’d filter out the heavily downvoted crud.