Sam Bowman

@sleepinyourhat.bsky.social

7.5K followers

166 following

11 posts

AI safety at Anthropic, on leave from a faculty job at NYU. Views not employers'. I think you should join Giving What We Can. cims.nyu.edu/~sbowman

Top posts

Sam Bowman·Dec 18

New work from my team at Anthropic in collaboration with Redwood Research. I think this is plausibly the most important AGI safety result of the year. Cross-posting the thread below:

Title card: Alignment Faking in Large Language Models by Greenblatt et al.

Sam Bowman·Dec 2

If you're potentially interested in transitioning into AI safety research, come collaborate with my team at Anthropic! Funded fellows program for researchers new to the field here: alignment.anthropic.com/2024/anthrop...

Latest posts