“AI companies are either in a position where they need to aggressively filter user prompts and model outputs to make sure that they don’t accidentally show Darth Vader, or strike deals with the rights holders to get permission to make videos and images of Darth Vader,” Sag informed Fortune. “The licensing strategy is much more of a win-win.”
The three-year settlement offers OpenAI the correct to ingest a whole lot of Disney-owned characters into Sora and ChatGPT Picture. Disney will even obtain fairness warrants and grow to be a significant OpenAI buyer, whereas deploying ChatGPT internally.
Sag mentioned the deal itself will likely be a sort of “revenue-sharing.”
“OpenAI hasn’t figured out the revenue model,” Sag mentioned. “So I think making this just an investment deal, in some ways, simplifies it. For Disney … [OpenAI] will figure out a way to make this profitable at some point, and [Disney will] get a cut of that.”
Why this deal issues: The ‘Snoopy problem’
That’s as a result of the case for AI corporations coaching their fashions on unlicensed content material is “very strong,” Sag mentioned. Two latest courtroom rulings involving Anthropic and Meta have strengthened these arguments.
The actual stumbling block, Sag mentioned, has at all times been outputs, not coaching. If a mannequin can unintentionally produce a body that appears an excessive amount of like Darth Vader, Homer Simpson, Snoopy, or Elsa, the honest use protection begins to fray.
“If you do get too much memorization, if that memorization finds its way into outputs, then your fair use case begins to just crumble,” Sag mentioned.
Whereas it’s unimaginable to license sufficient textual content to coach an LLM (“that would take a billion” offers, Sag mentioned), it’s attainable to construct picture or video fashions completely from licensed knowledge if in case you have the correct companions. That is why offers like Disney’s are essential: They flip beforehand unlawful outputs into authorized ones, no matter whether or not the coaching course of itself qualifies as honest use.
“The limiting principle is going to be essentially about whether—in their everyday operation—these models reproduce substantial portions of works from their training data,” Sag mentioned.
That is additionally a sign about the way forward for AI knowledge
In a weblog publish, Sag wrote:
“The low-hanging fruit of the public internet has been picked,” he wrote. “To get better, companies like OpenAI are going to need access to data that no one else has. Google has YouTube; OpenAI now has the Magic Kingdom.”
That is the core of what he calls the “data scarcity thesis.” OpenAI’s subsequent leap in mannequin high quality might require unique content material partnerships, versus extra scraping.
“By entangling itself with the world’s premier IP holder, OpenAI makes itself indispensable to the very industry that threatened to sue it out of existence,” Sag wrote.
AI and Hollywood have spent three years locked in a chilly warfare over coaching knowledge, likeness rights, and infringement. With Disney’s $1 billion funding, that period seems to be ending.
“This is the template for the future,” Sag wrote. “We are moving away from total war between AI and content, toward a negotiated partition of the world.”
