Benchmarking for large customers is back in the spotlight. The core question: should teams publish real performance benchmarks, or keep them private? A discussion about catering to a large customer's benchmark request highlights the problem: there’s no other market reference to anchor claims [1].
Real-world constraints — Benchmarks collide with logs and analytics realities. Real workloads, data locality, and distributed systems make apples-to-apples comparisons hard, so publishing a single benchmark may help some customers but confuse others if environments differ [1].
Cross-engine comparisons — In the wild, teams compare Snowflake and BigQuery against open-source options like DuckDB. A 1T-row aggregation on 63 Azure Standard E64pds v6 nodes finishes in about 5 seconds, underscoring scale and cost tradeoffs [2]. Open-source DataFusion Ballista is cited as a path for cross-engine planning [2].
The cost puzzle — That 63-node run costs about $235.872/hr on demand; a Snowflake 4XL cluster would run around $384/hr. Spot instances could drop to about $45.99/hr, but reliability and data locality come into play [2]. The numbers echo a broader pattern: performance claims live or die by context [2].
What it means for large customers — Benchmarks help, but real-world constraints—logs, analytics, distributed systems—mean no one-size-fits-all. The debate will keep fueling apples-to-apples when possible and transparent disclosures when not [1][2].
Closing thought: watch how this evolves as teams push for practical benchmarks that matter to enterprise buyers.
References
Ask HN: Should I cater to performance benchmark request for a large customer?
Low-code platform debates publishing performance benchmarks; lack of peers; benchmark usefulness questioned; references to KONG and Mulesoft cited
View sourceA sharded DuckDB on 63 nodes runs 1T row aggregation challenge in 5 sec
Discusses cross-DB sharding, DuckDB, DataFusion Ballista, cost/performance tradeoffs, and comparisons with Snowflake/BigQuery, including one-trillion-row challenge datasets and distributed query planning.
View source