Brief
AgentPerf posts put AI-chip efficiency claims on an agent-workload footing
Artificial Analysis said in an X post on June 12 that it was releasing first AA-AgentPerf results, describing the benchmark as a new agentic inference test initially covering DeepSeek V4 Pro across Nvidia Blackwell, Hopper and AMD. Nvidia and Nvidia AI Infrastructure posts framed AgentPerf as a way to measure concurrent AI agents under performance targets, with Nvidia AI Infrastructure claiming GB300 NVL72 delivered 20x more concurrent agents per megawatt than Hopper on DeepSeek V4 Pro. That makes “agents per megawatt” a notable public marketing frame for AI infrastructure, but the supplied evidence does not include a readable Artificial Analysis methodology page or result table, so exact rankings, scores, power accounting and comparison rules remain unverified.
Technology · June 12, 2026