#llm — Blog | Kunal Ganglani

Abstract purple lines on a black background

I Tested 5 LLM APIs for Latency — Here's the Real Data (March 2026)

I benchmarked 5 LLM APIs across 3 prompt sizes — the fastest TTFT was 597ms, the throughput winner hit 173 tok/s, and the results upend common assumptions about which models are actually fast.