Show HN: A new benchmark for testing LLMs for deterministic outputs

(interfaze.ai)

59 points | by khurdula 4 days ago ago

28 comments