What is wrong with LLM benchmarks, and why are we still using them? - sh.itjust.works
Posted by micheal65536@lemmy.micheal65536.duckdns.org to Free Open-Source Artificial Intelligence@lemmy.world · English · 1 year ago