A reproducible TypeScript benchmark comparing MCP-native agents vs mcp-cli, capturing token usage, tool calls, retries, and latency across shared MCP tasks