OpenAI’s Ilya Sutskever Has a Plan for Keeping Super-Intelligent AI in Check

By admin, December 15, 2023

OpenAI was founded on a promise to build artificial intelligence that benefits all of humanity—even when that AI becomes considerably smarter than its creators. Since the debut of ChatGPT last year and during the company’s recent governance crisis, its commercial ambitions have been more prominent. Now, the company says a new research group working on wrangling the supersmart AIs of the future is starting to bear fruit.

“AGI is very fast approaching,” says Leopold Aschenbrenner, a researcher at OpenAI involved with the Superalignment research team established in July. “We’re gonna see superhuman models, they’re gonna have vast capabilities, and they could be very, very dangerous, and we don’t yet have the methods to control them.” OpenAI has said it will dedicate a fifth of its available computing power to the Superalignment project.

A research paper released by OpenAI today touts results from experiments designed to test a way to let an inferior AI model guide the behavior of a much smarter one without making it less smart. Although the technology involved is far from surpassing the flexibility of humans, the scenario was designed to stand in for a future time when humans must work with AI systems more intelligent than themselves.

OpenAI’s researchers examined the process, called supervision, which is used to tune systems like GPT-4, the large language model behind ChatGPT, to be more helpful and less harmful. Currently this involves humans giving the AI system feedback on which answers are good and which are bad. As AI advances, researchers are exploring how to automate this process to save time—but also because they think it may become impossible for humans to provide useful feedback as AI becomes more powerful.

In a control experiment that used OpenAI’s GPT-2 text generator, first released in 2019, to teach GPT-4, the more recent system became less capable and more similar to the inferior system. The researchers tested two ideas for fixing this. One involved training progressively larger models to reduce the performance lost at each step. In the other, the team added an algorithmic tweak to GPT-4 that allowed the stronger model to follow the guidance of the weaker model without blunting its performance as much as would normally happen. The second approach was more effective, although the researchers admit that these methods do not guarantee that the stronger model will behave perfectly, and they describe the work as a starting point for further research.
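One way to put a number on "blunting" is to compare three accuracies on the same test set: the weak supervisor's own accuracy, the strong model's accuracy after learning from the weak supervisor, and the strong model's accuracy when trained on correct labels. The sketch below is illustrative only. The function names, the toy labels, and the "gap recovered" ratio are assumptions made for this example and are not taken from the article.

```python
from typing import Sequence


def accuracy(predictions: Sequence[int], truth: Sequence[int]) -> float:
    """Fraction of predictions that match the ground-truth labels."""
    if len(predictions) != len(truth) or len(truth) == 0:
        raise ValueError("predictions and truth must be non-empty and equal in length")
    return sum(p == t for p, t in zip(predictions, truth)) / len(truth)


def gap_recovered(weak_acc: float, student_acc: float, ceiling_acc: float) -> float:
    """Share of the weak-to-ceiling accuracy gap that the student closes.

    0.0 means the strong student is no better than its weak supervisor;
    1.0 means it matches a strong model trained on correct labels.
    """
    gap = ceiling_acc - weak_acc
    if gap <= 0:
        raise ValueError("ceiling accuracy must exceed weak accuracy")
    return (student_acc - weak_acc) / gap


# Hypothetical toy labels, invented for illustration (not real results).
truth = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
weak_preds = [0, 1, 0, 0, 0, 0, 1, 0, 1, 0]     # weak supervisor: 4 mistakes
student_preds = [0, 1, 1, 1, 0, 0, 1, 0, 1, 0]  # strong model taught by weak: 2 mistakes
ceiling_preds = [0, 0, 1, 1, 0, 0, 1, 0, 1, 0]  # strong model, correct labels: 1 mistake

weak_acc = accuracy(weak_preds, truth)
student_acc = accuracy(student_preds, truth)
ceiling_acc = accuracy(ceiling_preds, truth)
recovered = gap_recovered(weak_acc, student_acc, ceiling_acc)
```

In this made-up case the student lands between the weak supervisor and the ceiling, so `recovered` falls between 0 and 1. A student that merely imitated the weak supervisor's mistakes would score close to 0.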

“It’s great to see OpenAI proactively addressing the problem of controlling superhuman AIs,” says Dan Hendrycks, director of the Center for AI Safety, a nonprofit in San Francisco dedicated to managing AI risks. “We’ll need many years of dedicated effort to meet this challenge.”
