AI is ready to take over Python programming, but not much else

Tests of how well 19 large language models (LLMs) complete and perform complicated multi-step tasks has shown that they are both error-prone and, in many cases, unreliable. The findings are contained a preprint paper, LLMs Corrupt Your Documents When You Delegate, written by Microsoft researchers  Philippe Laban, Tobias Schnabel and Jennifer Neville based on a…

Read More

GitLab CEO sees developer tool bill increasing 100-fold

GitLab CEO Bill Staples says enterprises’ monthly bill for developer platform services has risen from tens of dollars per seat to hundreds over the last year, and is headed toward the thousands, signaling a structural change in how they will be billed for AI-enabled software development tools. The increase in cost reflects the volume of…

Read More

Red Hat adds support for agentic AI development

As use of agentic AI accelerates, Red Hat is hoping to position itself as the critical behind-the-scenes plumbing and connective fabric. To this end, the company has unveiled new desktop and developer suite functions, skills bundles, and a rolling Linux release to help enterprises move beyond the experimental phase. Announced at Red Hat Summit today,…

Read More

Oracle NetSuite announces AI coding skills for SuiteCloud developers

Oracle NetSuite is adding AI capabilities to SuiteCloud to help developers customize its ERP platform faster using natural language prompts. In a statement, the company said its NetSuite SuiteCloud Agent Skills “will make it easier for developers to create customized vertical and industry-specific applications by giving AI coding assistants a better understanding of the conventions,…

Read More

Xiaomi releases MIT‑licensed MiMo models for long‑running AI agents

Xiaomi has released and open-sourced MiMo-V2.5 and MiMo-V2.5-Pro under the MIT License, giving developers another potentially lower-cost option for building AI agents that can run longer tasks such as coding and workflow automation. Both models support a 1-million-token context window, the company said. MiMo-V2.5-Pro is designed for complex agent and coding tasks, while MiMo-V2.5 is…

Read More

SpaceX secures option to acquire AI coding startup Cursor for $60B

SpaceX has obtained the right to acquire AI coding startup Cursor for $60 billion later this year, the two companies announced Tuesday. The aerospace company disclosed the arrangement in a post on X. “SpaceXAI and cursor_ai are now working closely together to create the world’s best coding and knowledge work AI.” SpaceX added that the…

Read More

Salesforce launches Headless 360 to support agent‑first enterprise workflows

Salesforce is packaging its developer and AI tooling, including its vibe coding environment Agentforce Vibes, into a new platform named Headless 360, designed to help enterprise teams build agent-first workflows. The CRM software provider defines agent-first workflows as enterprise processes in which software agents, rather than human users, carry out tasks by directly invoking APIs,…

Read More

Anthropic cuts OpenClaw access from Claude subscriptions, offers credits to ease transition

Anthropic has blocked paid Claude subscribers from using the widely used open-source AI agent OpenClaw under their existing subscription plans, a move that took effect April 4 and has drawn pushback from subscribers who question both the cost implications and the company’s stated rationale. In an email to subscribers reviewed by InfoWorld, Anthropic said access…

Read More

Kilo targets shadow AI agents with a managed enterprise platform

Kilo has launched KiloClaw for Organizations, a managed version of its OpenClaw platform aimed at enterprises seeking more control over how employees deploy AI agents for tasks such as repository monitoring, email drafting, and calendar management. Co-founded by GitLab co-founder Sid Sijbrandij and Scott Breitenother, Kilo is building open-source coding and AI agent tools and…

Read More

Meta shows structured prompts can make LLMs more reliable for code review

Meta researchers have developed a structured prompting technique that enables LLMs to verify code patches without executing them, achieving up to 93% accuracy in tests. The method, dubbed semi-formal reasoning, could help reduce reliance on the resource-heavy sandbox environments currently required for automated code validation. The development comes as organizations look to deploy agentic AI…

Read More