#Did xAI lie about Grok 3’s benchmarks?

#Did xAI lie about Grok 3’s benchmarks?

Debates over AI benchmarks — and how they’re reported by AI labs — are spilling out into public view. This week, an OpenAI employee accused Elon Musk’s AI company, xAI, of publishing misleading benchmark results for its latest AI model, Grok 3. One of the co-founders of xAI, Igor Babushkin, insisted that the company was…

Read More
#Court filings show Meta staffers discussed using copyrighted content for AI training

#Court filings show Meta staffers discussed using copyrighted content for AI training

For years, Meta employees have internally discussed using copyrighted works obtained through legally questionable means to train the company’s AI models, according to court documents unsealed on Thursday. The documents were submitted by plaintiffs in the case Kadrey v. Meta, one of many AI copyright disputes slowly winding through the U.S. court system. The defendant,…

Read More
#Guidde taps AI to help create software training videos

#Guidde taps AI to help create software training videos

Creating corporate training videos for software is a time-consuming ordeal, especially if you’re an organization with a lot of software licenses. Training videos can help get employees up to speed, but they’re a big lift. They often take entire teams to produce. Tel Aviv-based entrepreneur Yoav Einav thought there might be an alternative, cheaper way…

Read More
#These researchers used NPR Sunday Puzzle questions to benchmark AI ‘reasoning’ models

#These researchers used NPR Sunday Puzzle questions to benchmark AI ‘reasoning’ models

Every Sunday, NPR host Will Shortz, The New York Times’ crossword puzzle guru, gets to quiz thousands of listeners in a long-running segment called the Sunday Puzzle. While written to be solvable without too much foreknowledge, the brainteasers are usually challenging even for skilled contestants. That’s why some experts think they’re a promising way to…

Read More