fosai

What is wrong with LLM benchmarks, and why are we still using them?

https://sh.itjust.works/post/2247924