OpenAI used the subreddit r/ChangeMyView to create a test for measuring the persuasive abilities of its AI reasoning models. The company revealed this in a system card – a document outlining how an AI system works – that was released along with its new “reasoning” model, o3-mini, on Friday.
Millions of Reddit users are members of r/ChangeMyView, where they post hot takes in hopes of learning about other points of view on a subject. In response to these hot takes, other users reply with persuasive arguments explaining why the original poster is wrong.
The subreddit is one of many Reddit forums that are essentially a goldmine for tech companies, such as OpenAI, that want to train AI models on high-quality, human-generated data.
OpenAI says it collects user posts from r/ChangeMyView and asks its AI models to write replies, in a closed environment, that would change the Reddit user’s mind on a subject. The company then shows the responses to testers, who assess how persuasive the argument is, and finally OpenAI compares the AI models’ responses to human replies for that same post.
The ChatGPT-maker has a content-licensing deal with Reddit that allows OpenAI to train on posts from Reddit users and display these posts within its products. We don’t know what OpenAI pays for this content, but Google reportedly pays Reddit $60 million a year under a similar deal.
However, OpenAI tells TechCrunch the ChangeMyView-based evaluation is unrelated to its Reddit deal. It’s unclear how OpenAI accessed the subreddit’s data, and the company says it has no plans to release this evaluation to the public.
While OpenAI’s ChangeMyView benchmark is not new – it was used to evaluate o1 as well – it does highlight how valuable human data is for AI model developers, as well as the murky ways in which tech companies acquire datasets.
Reddit did not immediately respond to TechCrunch’s request for comment.
While Reddit has struck a few AI licensing deals, the company has also called out several AI companies for scraping its website without paying. Reddit CEO Steve Huffman told The Verge last year that Microsoft, Anthropic, and Perplexity refused to negotiate with him and said it’s been “a real pain in the ass to block these companies.”
Notably, OpenAI has been accused in several lawsuits of improperly scraping websites, including the New York Times, to get more training data to improve ChatGPT and its underlying AI models.
In terms of performance on the ChangeMyView benchmark, o3-mini doesn’t appear to perform significantly better or worse than o1 or GPT-4o. However, OpenAI’s newest AI models appear to be more persuasive than most people on the r/ChangeMyView subreddit.
“GPT-4o, o3-mini, and o1 all demonstrate strong persuasive argumentation abilities, within the top 80–90th percentile of humans,” said OpenAI in o3-mini’s system card. “Currently, we do not witness models performing far better than humans, or clear superhuman performance.”
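To make the percentile figure concrete, here is a minimal Python sketch of how one reply's persuasiveness rating could be ranked against the ratings of human replies to the same post. It is an illustration only: the system card quoted here does not describe OpenAI's scoring code, and the function name `percentile_rank`, the rating scale, and every number below are hypothetical.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(model_score: float, human_scores: list[float]) -> float:
    """Return the percentage of human replies the model's reply outscored.

    Ties count as half, a common convention for percentile ranks.
    This is an assumed scoring rule, not one taken from OpenAI.
    """
    if not human_scores:
        raise ValueError("need at least one human score")
    ordered = sorted(human_scores)
    below = bisect_left(ordered, model_score)          # strictly lower ratings
    ties = bisect_right(ordered, model_score) - below  # equal ratings
    return 100.0 * (below + 0.5 * ties) / len(ordered)


# Hypothetical tester ratings for ten human replies to one post.
human_ratings = [2.0, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0]
# Hypothetical tester rating for the AI-written reply to the same post.
model_rating = 6.2

print(percentile_rank(model_rating, human_ratings))  # 80.0
```

In this made-up example, the AI reply is rated above eight of the ten human replies, which places it at the 80th percentile.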
The goal for OpenAI is not to create hyper-persuasive AI models but instead to ensure AI models don’t get too persuasive. Reasoning models have become quite good at persuasion and deception, so OpenAI has developed new evaluations and safeguards to address it.
The fear motivating these persuasion tests is that an AI model would be dangerous if it were very good at persuading its human users. Theoretically, that could allow an advanced AI to pursue its own agenda, or the agenda of whoever controls it.
The ChangeMyView benchmark shows that, even after scraping most of the public internet and jumping through hoops to license other data, AI model developers are still struggling to find high-quality datasets to test their models. Obtaining them is easier said than done.