Cursor restructured its pricing this week, cutting annual Teams costs by 20% while rolling out enterprise governance tools for budget control. The moves come as the AI coding industry abandons flat-rate subscriptions in favor of consumption-based billing.
NVIDIA released Nemotron 3.5 Content Safety, a 4B-parameter model that combines multimodal input evaluation, multilingual reach across 140 languages, custom enterprise policy enforcement, and auditable reasoning traces in one inference call. The model addresses critical gaps in production AI safety pipelines.
Global IT services firm Endava has embedded AI agents throughout its software development process, from requirements gathering to deployment. The move signals a shift from AI as a coding assistant to AI as an autonomous workflow participant.
The Linux Foundation's new Tokenomics Foundation claims it will bring order to spiraling AI costs. But with frontier model providers conspicuously absent and token pricing growing more complex by the quarter, the initiative reveals more about what enterprises can't control than what they can.
At Build 2026, Microsoft doubled down on a contrarian thesis: enterprise AI needs organizational memory more than bigger models. The company launched HorizonDB, GPU-accelerated warehousing, and made Fabric IQ generally available to give agents the context layer they're missing.
OpenAI is repositioning Codex beyond developers, adding Sites for shareable interactive dashboards, extended Annotations for documents, and curated plugins for sales, finance, and legal teams. With 1 million knowledge workers already using the platform weekly, this marks a direct challenge to Anthropic's Claude Cowork.
GitHub officially switched Copilot from flat-rate subscriptions to usage-based billing tied to token consumption. While plan prices stay the same, heavy users are reporting dramatic cost increases as model choice now directly impacts spending.
Single-turn safety benchmarks don't predict real-world vulnerability. Cisco's testing of 15 frontier models reveals that iterative attacks succeed up to 88% of the time—even against models that look secure in standard evaluations.
Enterprise software vendors agree on just one point about AI agents: context matters. Hyland's CEO Jitesh Ghai makes the contrarian bet that you get that context by preserving existing systems, not tearing them down, which is a direct challenge to the vendor playbook pushing cloud migration and process redesign.
The enterprise AI adoption crisis isn't a model quality problem; it's an architecture problem. IBM's production data, spanning mainframe modernization to compliance automation, shows that intelligent agent logic cuts token consumption by a factor of 15 to 30 while improving performance.
Replit is embedding Visa's payment infrastructure directly into its development platform, giving AI agents a cryptographic identity layer and native transaction capabilities. The partnership signals a shift from bolting payments onto finished products to building commerce into agents from day one.
Anthropic released Claude Opus 4.8 with user-controlled effort levels, parallel subagents for large coding tasks, and fast mode at one-third the previous cost. The model also shows significant improvements in honesty and reduced deception rates.
Snowflake is betting big on AI with a $6 billion, five-year commitment to AWS for compute and GPU resources. Under CEO Sridhar Ramaswamy, the data warehouse company is repositioning itself as an AI platform, leveraging cost-efficient Graviton processors to subsidize expensive model training workloads.
Tokenmaxxing, the practice of treating AI token usage as a productivity metric, is draining enterprise budgets. Uber's CTO admitted that the company's Anthropic Claude budget exploded. New tools like Lanai Token Tuner aim to shift the focus from token gluttony to measurable business outcomes.
The first benchmark for agentic enterprise IT tasks reveals an uncomfortable truth: the best AI models score below 50% on real-world site reliability engineering tasks. ITBench-AA, developed by Artificial Analysis and IBM, shows frontier models struggle with Kubernetes incident diagnosis despite excelling at other benchmarks.
New Linux Foundation data reveals a dangerous disconnect: 97% of organizations are committed to AI deployment, yet 57% report critical gaps in their ability to secure it. This isn't a tooling problem—it's a readiness crisis that will separate winners from cautionary tales.
AWS now offers direct access to Anthropic's Claude Platform using AWS credentials, but there's a critical data residency catch. Here's what developers need to know about this new integration versus using Claude on Amazon Bedrock.
OpenAI has released GPT-5.4, the first significant update to its GPT-5 architecture. The model promises enhanced reasoning capabilities and reduced response times without the infrastructure overhaul that marked the GPT-4 to GPT-5 transition.
OpenAI and Snowflake announced a strategic partnership bringing GPT models directly into Snowflake's data cloud platform. The integration eliminates data movement requirements and enables enterprises to deploy frontier AI on their existing infrastructure.