Sam Bowman

@sleepinyourhat.bsky.social

7.5K followers
166 following
11 posts

AI safety at Anthropic, on leave from a faculty job at NYU. Views not employers'. I think you should join Giving What We Can. cims.nyu.edu/~sbowman

Top posts

Sam Bowman·Dec 18

New work from my team at Anthropic in collaboration with Redwood Research. I think this is plausibly the most important AGI safety result of the year. Cross-posting the thread below:

Title card: Alignment Faking in Large Language Models by Greenblatt et al.
5
29
126
Sam Bowman·Dec 2

If you're potentially interested in transitioning into AI safety research, come collaborate with my team at Anthropic! Funded fellows program for researchers new to the field here: alignment.anthropic.com/2024/anthrop...

3
16
70
