Everyone wants fair benchmarks, but do you even lift?

No stone is left unturned on this episode. As the end of the year approaches, Tom and Nate check in on all the vibes of the machine learning world: torrents, faked demos, alchemy, weightlifting, actual science, and blogs are all not safe in this episode.
Some links for your weekend:
- AI Alliance: https://thealliance.ai/
- Evaluation gaming on Interconnects: https://www.interconnects.ai/p/evals-are-marketing
- Fupi: https://www.youtube.com/watch?v=WtVknbxzn7Q

Creators and Guests

Nathan Lambert
Host
Nathan Lambert
RLHF researcher and author of Interconnects.ai blog
Everyone wants fair benchmarks, but do you even lift?
Broadcast by