Developers News | Slashdot

Microsoft Favors Anthropic Over OpenAI For Visual Studio Code (theverge.com) 7

Posted by BeauHD on Wednesday September 17, 2025 @04:40PM from the would-you-look-at-that dept.

Gemini AI Solves Coding Problem That Stumped 139 Human Teams At ICPC World Finals (arstechnica.com) 75

Posted by BeauHD on Wednesday September 17, 2025 @04:02PM from the artificial-brain-power dept.

An anonymous reader quotes a report from Ars Technica: Like the rest of its Big Tech cadre, Google has spent lavishly on developing generative AI models. Google's AI can clean up your text messages and summarize the web, but the company is constantly looking to prove that its generative AI has true intelligence. The International Collegiate Programming Contest (ICPC) helps make the point. Google says Gemini 2.5 participated in the 2025 ICPC World Finals, turning in a gold medal performance. According to Google this marks "a significant step on our path toward artificial general intelligence."

Every year, thousands of college-level coders participate in the ICPC event, facing a dozen deviously complex coding and algorithmic puzzles over five grueling hours. This is the largest and longest-running competition of its type. To compete in the ICPC, Google connected Gemini 2.5 Deep Think to a remote online environment approved by the ICPC. The human competitors were given a head start of 10 minutes before Gemini began "thinking."

According to Google, it did not create a freshly trained model for the ICPC like it did for the similar International Mathematical Olympiad (IMO) earlier this year. The Gemini 2.5 AI that participated in the ICPC is the same general model that we see in other Gemini applications. However, it was "enhanced" to churn through thinking tokens for the five-hour duration of the competition in search of solutions. At the end of the time limit, Gemini managed to get correct answers for 10 of the 12 problems, which earned it a gold medal. Only four of 139 human teams managed the same feat. "The ICPC has always been about setting the highest standards in problem-solving," said ICPC director Bill Poucher. "Gemini successfully joining this arena, and achieving gold-level results, marks a key moment in defining the AI tools and academic standards needed for the next generation." Gemini's solutions are available on GitHub.

AI's Ability To Displace Jobs is Advancing Quickly, Anthropic CEO Says (axios.com) 50

Posted by msmash on Wednesday September 17, 2025 @02:46PM from the ring-the-alarm-bells dept.

OpenAI Says Models Programmed To Make Stuff Up Instead of Admitting Ignorance (theregister.com) 90

Posted by msmash on Wednesday September 17, 2025 @01:28PM from the fault-in-our-stars dept.

Business Insider Reportedly Tells Journalists They Can Use AI To Draft Stories (theverge.com) 17

Posted by msmash on Wednesday September 17, 2025 @11:27AM from the AI-slop dept.

Anthropic Denies Federal Agencies Use of Claude for Surveillance Tasks (semafor.com) 19

Posted by msmash on Wednesday September 17, 2025 @10:05AM from the drawing-a-line dept.

China Tells Its Tech Companies To Stop Buying All of Nvidia's AI Chips (ft.com) 52

Posted by msmash on Wednesday September 17, 2025 @04:43AM from the breaking-news dept.

ChatGPT Will Guess Your Age and Might Require ID For Age Verification 111

Posted by BeauHD on Tuesday September 16, 2025 @08:02PM from the safety-first dept.

OpenAI is rolling out stricter safety measures for ChatGPT after lawsuits linked the chatbot to multiple suicides. "ChatGPT will now attempt to guess a user's age, and in some cases might require users to share an ID in order to verify that they are at least 18 years old," reports 404 Media. "We know this is a privacy compromise for adults but believe it is a worthy tradeoff," the company said in its announcement. "I don't expect that everyone will agree with these tradeoffs, but given the conflict it is important to explain our decisionmaking," OpenAI CEO Sam Altman said on X. From the report: OpenAI introduced parental controls to ChatGPT earlier in September, but has now introduced new, more strict and invasive security measures. In addition to attempting to guess or verify a user's age, ChatGPT will now also apply different rules to teens who are using the chatbot. "For example, ChatGPT will be trained not to do the above-mentioned flirtatious talk if asked, or engage in discussions about suicide of self-harm even in a creative writing setting," the announcement said. "And, if an under-18 user is having suicidal ideation, we will attempt to contact the users' parents and if unable, will contact the authorities in case of imminent harm."

OpenAI's post explains that it is struggling to manage an inherent problem with large language models that 404 Media has tracked for several years. ChatGPT used to be a far more restricted chatbot that would refuse to engage users on a wide variety of issues the company deemed dangerous or inappropriate. Competition from other models, especially locally hosted and so-called "uncensored" models, and a political shift to the right which sees many forms of content moderation as censorship, has caused OpenAI to loosen those restrictions.

"We want users to be able to use our tools in the way that they want, within very broad bounds of safety," OpenAI said in its announcement. The position it seems to have landed on, given these recent stories about teen suicide, is summed up in its own words: "'Treat our adult users like adults' is how we talk about this internally, extending freedom as far as possible without causing harm or undermining anyone else's freedom."

Microsoft Announces $30 Billion Investment In AI Infrastructure, Operations In UK 22

Posted by BeauHD on Tuesday September 16, 2025 @07:20PM from the spending-commitments dept.

Another Lawsuit Accuses an AI Company of Complicity In a Teenager's Suicide 63

Posted by BeauHD on Tuesday September 16, 2025 @04:40PM from the here-we-go-again dept.

Zoom CEO Latest Executive To Forecast Shortened Workweeks From AI Adoption (fortune.com) 52

Posted by msmash on Tuesday September 16, 2025 @12:01PM from the future-of-work dept.

The Mac App Flea Market 40

Posted by msmash on Tuesday September 16, 2025 @10:40AM from the flea-market dept.

Google Releases VaultGemma, Its First Privacy-Preserving LLM 23

Posted by BeauHD on Tuesday September 16, 2025 @09:00AM from the first-of-its-kind dept.

An anonymous reader quotes a report from Ars Technica: The companies seeking to build larger AI models have been increasingly stymied by a lack of high-quality training data. As tech firms scour the web for more data to feed their models, they could increasingly rely on potentially sensitive user data. A team at Google Research is exploring new techniques to make the resulting large language models (LLMs) less likely to 'memorize' any of that content. LLMs have non-deterministic outputs, meaning you can't exactly predict what they'll say. While the output varies even for identical inputs, models do sometimes regurgitate something from their training data -- if trained with personal data, the output could be a violation of user privacy. In the event copyrighted data makes it into training data (either accidentally or on purpose), its appearance in outputs can cause a different kind of headache for devs. Differential privacy can prevent such memorization by introducing calibrated noise during the training phase.

Adding differential privacy to a model comes with drawbacks in terms of accuracy and compute requirements. No one has bothered to figure out the degree to which that alters the scaling laws of AI models until now. The team worked from the assumption that model performance would be primarily affected by the noise-batch ratio, which compares the volume of randomized noise to the size of the original training data. By running experiments with varying model sizes and noise-batch ratios, the team established a basic understanding of differential privacy scaling laws, which is a balance between the compute budget, privacy budget, and data budget. In short, more noise leads to lower-quality outputs unless offset with a higher compute budget (FLOPs) or data budget (tokens). The paper details the scaling laws for private LLMs, which could help developers find an ideal noise-batch ratio to make a model more private. The work the team has done here has led to a new Google model called VaultGemma, its first open-weight model trained with differential privacy to minimize memorization risks. It's built on the older Gemma 2 foundation and sized at 1 billion parameters, which the company says performs comparably to non-private models of similar size.

It's available now from Hugging Face and Kaggle.
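As a rough illustration of the "calibrated noise during the training phase" idea described above, here is a minimal Python sketch of one differentially private gradient step in the style of DP-SGD: clip each example's gradient, sum, add Gaussian noise, then average. This is not VaultGemma's training code. The function names, the parameters `max_norm` and `noise_multiplier`, and the definition of `noise_batch_ratio` as noise multiplier divided by batch size are all assumptions made for illustration.

```python
import math
import random


def clip(grad, max_norm):
    """Scale one example's gradient so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, max_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def dp_average_gradient(per_example_grads, max_norm, noise_multiplier, rng):
    """Clip per-example gradients, sum them, add Gaussian noise, then average.

    The noise standard deviation is noise_multiplier * max_norm, so it is
    calibrated to the largest contribution any single example can make.
    """
    batch_size = len(per_example_grads)
    dim = len(per_example_grads[0])
    clipped = [clip(g, max_norm) for g in per_example_grads]
    summed = [sum(g[i] for g in clipped) for i in range(dim)]
    sigma = noise_multiplier * max_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]
    return [n / batch_size for n in noisy]


def noise_batch_ratio(noise_multiplier, batch_size):
    """Hypothetical helper: noise level relative to batch size (an assumed definition)."""
    return noise_multiplier / batch_size


# Toy batch of three two-dimensional gradients (made-up numbers).
rng = random.Random(0)
grads = [[3.0, 4.0], [0.3, 0.4], [-6.0, 8.0]]
print(dp_average_gradient(grads, max_norm=1.0, noise_multiplier=1.1, rng=rng))
print(noise_batch_ratio(1.1, len(grads)))
```

In this sketch, a larger batch shrinks the noise relative to the averaged signal, which is one way to read the summary's point that more noise can be offset with a higher compute or data budget.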

Online Marketplace Fiverr To Lay Off 30% of Workforce In AI Push 41

Posted by BeauHD on Tuesday September 16, 2025 @03:00AM from the would-you-look-at-that dept.

OpenAI's First Study On ChatGPT Usage (arstechnica.com) 20

Posted by BeauHD on Monday September 15, 2025 @11:30PM from the behind-the-scenes dept.

An anonymous reader quotes a report from Ars Technica: Today, OpenAI's Economic Research Team went a long way toward answering that question, on a population level, releasing a first-of-its-kind National Bureau of Economic Research working paper (in association with Harvard economist David Deming) detailing how people end up using ChatGPT across time and tasks. While other research has sought to estimate this kind of usage data using self-reported surveys, this is the first such paper with direct access to OpenAI's internal user data. As such, it gives us an unprecedented direct window into reliable usage stats for what is still the most popular application of LLMs by far. After digging through the dense 65-page paper, here are seven of the most interesting and/or surprising things we discovered about how people are using OpenAI today.

1. ChatGPT is now used by "nearly 10% of the world's adult population," up from 100 million users in early 2024 to over 700 million users in 2025. Daily traffic is about one-fifth of Google's at 2.6 billion GPT messages per day.

2. Long-term users' daily activity has plateaued since June 2025. Almost all recent growth comes from new sign-ups experimenting with ChatGPT, not from established users increasing their usage.

3. 46% of users are aged 18-25, making ChatGPT especially popular among the youngest adult cohort. Factoring in under-18 users (not counted in the study), the majority of ChatGPT users likely weren't alive in the 20th century.

4. At launch in 2022, about 80% of ChatGPT users were male. By 2025, the balance has shifted: 52.4% of users are now female.

5. In 2024, work vs. personal use was close to even. By mid-2025, 72% of usage is non-work related -- people are using ChatGPT more for personal, creative, and casual needs than for productivity.

6. 28% of all conversations involve writing assistance (emails, edits, translations). For work-related queries, that jumps to 42% overall, and 52% among business/management jobs. Furthermore, the report found that editing and critiquing text is more common than generating text from scratch.

7. 14.9% of work-related usage deals with "making decisions and solving problems." This shows people don't just use ChatGPT to do tasks -- they use it as an advisor or co-pilot to help weigh options and guide choices.

'Meta Ray-Ban Display' Glasses Design, HUD Clips Leak (uploadvr.com) 25

Posted by BeauHD on Monday September 15, 2025 @08:45PM from the sneak-peak dept.

Robinhood Plans To Launch a Startups Fund Open To All Retail Investors (techcrunch.com) 21

Posted by BeauHD on Monday September 15, 2025 @08:02PM from the no-more-pre-IPO-FOMO dept.

Vibe Coding Has Turned Senior Devs Into 'AI Babysitters' 86

Posted by BeauHD on Monday September 15, 2025 @07:20PM from the is-it-worth-it? dept.

An anonymous reader quotes a report from TechCrunch: Carla Rover once spent 30 minutes sobbing after having to restart a project she vibe coded. Rover has been in the industry for 15 years, mainly working as a web developer. She's now building a startup, alongside her son, that creates custom machine learning models for marketplaces. She called vibe coding a beautiful, endless cocktail napkin on which one can perpetually sketch ideas. But dealing with AI-generated code that one hopes to use in production can be "worse than babysitting," she said, as these AI models can mess up work in ways that are hard to predict.

She had turned to AI coding in a need for speed with her startup, as is the promise of AI tools. "Because I needed to be quick and impressive, I took a shortcut and did not scan those files after the automated review," she said. "When I did do it manually, I found so much wrong. When I used a third-party tool, I found more. And I learned my lesson." She and her son wound up restarting their whole project -- hence the tears. "I handed it off like the copilot was an employee," she said. "It isn't."

Rover is like many experienced programmers turning to AI for coding help. But such programmers are also finding themselves acting like AI babysitters -- rewriting and fact-checking the code the AI spits out. A recent report by content delivery platform company Fastly found that at least 95% of the nearly 800 developers it surveyed said they spend extra time fixing AI-generated code, with the load of such verification falling most heavily on the shoulders of senior developers. These experienced coders have discovered issues with AI-generated code ranging from hallucinated package names to deleted important information and security risks. Left unchecked, AI code can leave a product far more buggy than what humans would produce.

Working with AI-generated code has become such a problem that it's given rise to a new corporate coding job known as "vibe code cleanup specialist." TechCrunch spoke to experienced coders about their time using AI-generated code and about what they see as the future of vibe coding. Thoughts varied, but one thing remained certain: The technology still has a long way to go. "Using a coding co-pilot is kind of like giving a coffee pot to a smart six-year-old and saying, 'Please take this into the dining room and pour coffee for the family,'" Rover said. Can they do it? Possibly. Could they fail? Definitely. And most likely, if they do fail, they aren't going to tell you. "It doesn't make the kid less clever," she continued. "It just means you can't delegate [a task] like that completely." Further reading: The Software Engineers Paid To Fix Vibe Coded Messes

Microsoft's Office Apps Now Have Free Copilot Chat Features (theverge.com) 26

Posted by msmash on Monday September 15, 2025 @04:44PM from the free-bloatware dept.

Hard Drive Shortage Intensifies as AI Training Data Pushes Lead Times Beyond 12 Months (tomshardware.com) 24

Posted by msmash on Monday September 15, 2025 @04:01PM from the AI-needs-a-bigger-boat dept.
