Can a Model Keep Its Story Straight?
Matt Handzel
Notes from my first empirical AI safety research project on resampling and multi-sample monitoring.