Stack Overflow To Charge LLM Developers For Access To Its Coding Content (theregister.com) 32
Stack Overflow has launched an API that will require all AI models trained on its coding question-and-answer content to attribute sources linking back to its posts. And it will cost money to use the site's content. From a report: "All products based on models that consume public Stack Overflow data are required to provide attribution back to the highest relevance posts that influenced the summary given by the model," it confirmed in a statement. The Overflow API is designed to act as a knowledge database to help developers build more accurate and helpful code-generation models. Google announced it was using the service to access relevant information from Stack Overflow via the API and integrate the data with its latest Gemini models, and for its cloud storage console.
Increasingly useless (Score:4)
StackOverflow was once a great resource. No longer. It seems like every time I get a link to StackOverflow today, I get an answer from 10 years ago that is no longer correct.
Maybe they're still good for basic homework questions, like "how do I write an Insertion Sort, but for anything serious? No, not any more.
Anyway, if the LLM trainers have half a brsin, they have already scraped and saved everything useful from these sites.
Re: (Score:2)
Kind of what I was thinking too. At least I was thinking; who would want to train any AI with stack overflow content? Fine way to have your AI hallucinate if you asked me.
Re: Increasingly useless (Score:2)
Re: Increasingly useless (Score:2)
Re:Increasingly useless (Score:4)
I think context matters here. As a web developer, I still use Stack Overflow on an almost daily basis, and it's just as relevant as ever (if not more so) ...
Re: (Score:2)
Perhaps it does depend on the topics. If you are searching for answers concerning relatively new web frameworks, then the content you find will be current.
If you search for topics on established technologies or frameworks, there are years' worth of answers. Answers from 10 or 15 years ago may no longer apply to current releases. The older answers should be kept around - lots of organizations are using older stuff - but maybe they shouldn't pop up as the recommended answers everyone. Of course: how can you
Re: (Score:2)
In that example you can downvote that answer, and leave a comment explaining why it no longer works ... but admittedly it requires a few people downvoting before SO realizes it should show other answers higher.
Re: (Score:3)
I have a good and a bad news for you. Stack overflow is about as good/bad as it was 10 years ago. I think you got better, so you realize now that the answers you get aren't so good. They already were not particularly good 10 years ago.
Re: (Score:3)
Nah, you can use Wikipedia now for those things. https://en.wikipedia.org/wiki/... [wikipedia.org]
You can get quite a comprehensive beginning CS education just off Wikipedia these days.
Re: (Score:3)
I've also had several bad experiences lately where comments are flagged/deleted by mods telling me to "ask a new question" or "offer a bounty" or some nonsense. All I wanted to know was if the 5 month old poster with zero responses ever figured out the issue. I've been using the internet a long time, that works.
Same trite answers over and over (Score:4)
"That tool isn't my favorite. Try this other one that isn't anything like what you want. I work for those guys."
"That's insecure. Don't do that. We use corporate garbageware for that, now."
They rarely answer your question and instead get all hung up on security or style issues. Mother fuckers, they didn't ask for a engineering and security review, they asked you a specific question. If you cannot answer it please SHUT THE FUCK UP and stay out of the way for someone who can. Don't try to show off your CISSP and critique the question (assholes).
Re: (Score:1)
What Would Be Nice (Score:4, Interesting)
what about OpenSource developers? (Score:2)
Glad Stack Overflow shares those profits (Score:1)
Re: (Score:2)
Re: (Score:2)
Re: (Score:1)
Re: Glad Stack Overflow shares those profits (Score:2)
I think itâ(TM)s fair for them to charge LLM developers â" something has to pay the bills, and it wonâ(TM)t be StackOverflowâ(TM)s users since they are used to getting free content.
I prefer this model to getting spammed to death by pop up ads, anyway.
Re: (Score:1)
cited Experts-Exchange's poor reputation and paywall as a motivation for creating Stack Overflow
You can't really use those freebies to build a business, then complain when others use the same freebies to build their business. All you have really made is an argument against
Re: (Score:2)
You mean the free content that those same users created for free?
Yes, that content. Those same users got value in exchange for their content, in the form of access to a well-designed site full of content that was/is useful to them.
You can't really use those freebies to build a business, then complain when others use the same freebies to build their business.
Who is complaining? I'm not complaining, and StackExchange isn't complaining. The only people complaining are the people who think StackExchange shouldn't be allowed to charge LLM companies for access to StackExchange content.
Re: (Score:1)
sites like stackoverflow are essential long term (Score:3)
Re: (Score:2)
Stackoverflow is owned by the same parent company which now owns slashdot. Been that way for ages.
This is slightly different, because llms don't need human language to learn how to code correctly. They can use actual code references and infer.
Re: (Score:2)
Yeah, infer the same mistakes that noobs get from "code references".
.... what? (Score:2)
Yes, that's exactly what I need - llms which consistently tell me how to do things the wrong way.
Apparently stackoverflow is now run by people who don't know what they're doing or what their site even offers.
I will now expect compensation (Score:3)
to answer questions on StackOverflow.
...Right?
Wait, you mean they're passing this on to their contributors?
I'm so shocked.
What about the CC licensed exports? (Score:2)
StackOverflow had committed itself to give back cc licensed DB dumps. They stop providing them some time ago, justifying it with LLM training when asked and re-enabled them after they got a bit of a shitstorm for no longer giving back to the community. The site content itself is cc-licensed anyway.
This means, that if they continue to provide the dumps, people could just train on the dumps. If they do not continue, people can legally (given that attribution is provided) crawl the site itself.
And finally, one
Stack Overflow has always been free (Score:1)
And worth every penny!