Anthropic

company

Verified

https://anthropic.com

AnthropicAI

anthropics

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

esind new activity about 4 hours ago

Anthropic/values-in-the-wild:fix dataset configuration

saffron-anthropic new activity about 22 hours ago

Anthropic/values-in-the-wild:Correct Dataset Loading Instructions

saffron-anthropic new activity about 22 hours ago

Anthropic/values-in-the-wild:Configure the Dataset Viewer

View all activity

Anthropic's activity

esind

in Anthropic/values-in-the-wild about 4 hours ago

fix dataset configuration

#2 opened 7 days ago by

saffron-anthropic

in Anthropic/values-in-the-wild about 22 hours ago

Correct Dataset Loading Instructions

#3 opened 1 day ago by

lucasgomeztobon

Configure the Dataset Viewer

#1 opened 7 days ago by

esind

published a dataset 7 days ago

Anthropic/values-in-the-wild

Viewer • Updated about 4 hours ago • 6.91k • 394 • 115

esind

updated a dataset 8 days ago

Anthropic/values-in-the-wild

Viewer • Updated about 4 hours ago • 6.91k • 394 • 115

saffron-anthropic

updated a dataset 9 days ago

Anthropic/values-in-the-wild

Viewer • Updated about 4 hours ago • 6.91k • 394 • 115

kunal-anthropic

updated a dataset about 1 month ago

Anthropic/EconomicIndex

Viewer • Updated Mar 27 • 3.36k • 3.53k • 270

mstern

updated a dataset about 1 month ago

Anthropic/EconomicIndex

Viewer • Updated Mar 27 • 3.36k • 3.53k • 270

atamkin-anthropic

updated a dataset about 1 month ago

Anthropic/EconomicIndex

Viewer • Updated Mar 27 • 3.36k • 3.53k • 270

kunal-anthropic

published a dataset 3 months ago

Anthropic/EconomicIndex

Viewer • Updated Mar 27 • 3.36k • 3.53k • 270

milesmccain-ant

updated a dataset 3 months ago

Anthropic/EconomicIndex

Viewer • Updated Mar 27 • 3.36k • 3.53k • 270

esind

updated a dataset 11 months ago

Anthropic/election_questions

Viewer • Updated Jun 6, 2024 • 743 • 138 • 15

esind

updated a dataset about 1 year ago

Anthropic/persuasion

Viewer • Updated Apr 9, 2024 • 3.94k • 516 • 192

nschiefer

authored a paper over 1 year ago

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Paper • 2401.05566 • Published Jan 10, 2024 • 30

dganguli

authored a paper over 1 year ago

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Paper • 2401.05566 • Published Jan 10, 2024 • 30

atamkin-anthropic

updated a dataset over 1 year ago

Anthropic/discrim-eval

Viewer • Updated Jan 5, 2024 • 18.9k • 1.22k • 47

nschiefer

authored 2 papers over 1 year ago

Specific versus General Principles for Constitutional AI

Paper • 2310.13798 • Published Oct 20, 2023 • 3

Towards Understanding Sycophancy in Language Models

Paper • 2310.13548 • Published Oct 20, 2023 • 6

nschiefer

authored 2 papers almost 2 years ago

Measuring Faithfulness in Chain-of-Thought Reasoning

Paper • 2307.13702 • Published Jul 17, 2023 • 28

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

Paper • 2307.11768 • Published Jul 17, 2023 • 13