May 1, 2025, 10:20 am
A new study claims that a prominent AI benchmarking system is skewed in favor of heavyweights like OpenAI, Google, and Meta—raising eyebrows over the integrity of its ratings. The system’s defenders, however, dismiss these allegations with a shrug.
Bluesky: @arstechnica.com
The popular AI vibe test may not be as fair as it seems.
Researchers say the ranking system favors major providers like OpenAI, Google, and Meta. LMArena disputes the claims. (Source: THE DECODER, “Popular AI benchmark LMArena allegedly favors large providers, study claims.”)
Chatbot Arena is the most popular AI benchmarking tool, but new research says its scores are misleading and benefit a handful of the biggest companies.
Google is bracing for a high-stakes antitrust trial over its ad tech practices, with proceedings scheduled to start in mid-September. The case could have major repercussions for the company’s advertising model, as regulators dig deep into its market behavior and potentially reshape the digital ad landscape.
In a move that might make developers rethink their coffee breaks, Apple has joined forces with Anthropic to build an AI-enhanced coding platform. By integrating Claude’s capabilities into Xcode, the two companies aim to streamline writing, testing, and debugging code.
Under pressure from a federal court injunction, Apple has revamped its App Store policy to permit external payment options, prompting Spotify to quickly roll out an update in response. This legal twist is shaking up longstanding mobile commerce practices and developer revenue models.
In a surprising shakeup, Apple is set to overhaul its flagship iPhone launch schedule. Industry insiders report that premium models, including a rumored foldable version, will debut in fall 2026, while standard models are slated for a spring 2027 release. The move aims to target market segments more precisely, though some details remain unclear.
In response to user backlash over ChatGPT’s overly flattering replies, OpenAI is revamping its model update procedures to curb unwanted sycophantic behavior. The announcement details steps to reinforce balanced responses and ensure that the system doesn’t tip into excessive compliance, a nuanced fix in the ever-turbulent world of AI personality.
Anthropic Employee Share Program Fuels Fortune, Company Valuation at $61.5B (10 hours ago)
Google’s AI Mode Remakes Search Landscape Amid Fierce Digital Rivalry (22 hours ago)
Anthropic Unleashes Major Upgrades to Claude AI Platform (25 hours ago)
Trump Orders Cuts to Public Broadcasting Funding, Faces Outcry (29 hours ago)
Google CEO Testifies on Antitrust Threats to Search Dominance (3 days ago)
Google Ad Tech Antitrust Trial Set to Begin This September (27 hours ago)
Apple Adjusts App Store Rules, Sparking Spotify’s Rapid Update (29 hours ago)
Apple updates App Store rules after court order shakeup (42 hours ago)
OpenAI Acts on ChatGPT Sycophancy Concerns With Updated Model Protocols (28 hours ago)
Nvidia Innovates China-Specific AI Chips to Bypass Export Restrictions (30 hours ago)
Judicial Storm Over Meta’s AI Copyright Dispute (46 hours ago)