OpenAI checked to see whether GPT-4 could take over the world

Enlarge (credit: Ars Technica)

As part of pre-release safety testing for its new GPT-4 AI model, launched Tuesday, OpenAI allowed an AI testing group to assess the potential risks of the model’s emergent capabilities—including “power-seeking behavior,” self-replication, and self-improvement.

While the testing group found that GPT-4 was “ineffective at the autonomous replication task,” the nature of the experiments raises eye-opening questions about the safety of future AI systems.

Raising alarms

“Novel capabilities often emerge in more powerful models,” writes OpenAI in a GPT-4 safety document published yesterday. “Some that are particularly concerning are the ability to create and act on long-term plans, to accrue power and resources (“power-seeking”), and to exhibit behavior that is increasingly ‘agentic.'” In this case, OpenAI clarifies that “agentic” isn’t necessarily meant to humanize the models or declare sentience but simply to denote the ability to accomplish independent goals.

Read 21 remaining paragraphs | Comments

What's your reaction?

Excited

Happy

In Love

Not Sure

Silly

OpenAI checked to see whether GPT-4 could take over the world

Raising alarms

What's your reaction?

Security firm Rubrik is latest to be felled by GoAnywhere vulnerability

The Fear of Mass Unemployment due to Artificial Intelligence and Robotics Is Unfounded

We made a cat drink a beer with Runway’s AI video generator, and it sprouted hands

CrowdStrike blames testing bugs for security update that took down 8.5M Windows PCs

Elon Musk claims he is training “the world’s most powerful AI by every metric”

More in:Editor's Pick

How Russia-linked malware cut heat to 600 Ukrainian buildings in deep winter

The first GPT-4-class AI model anyone can download has arrived: Llama 405B

Microsoft says 8.5M systems hit by CrowdStrike BSOD, releases USB recovery tool

Astronomers discover technique to spot AI fakes using galaxy-measurement tools

Posts List

Friday Feature: Homeschool CPA

Major outages at CrowdStrike, Microsoft leave the world with BSODs and confusion

Maryland Judge Dismisses Baltimore Climate-Change Case

Raising alarms

Share

What's your reaction?

You may also like

More in:Editor's Pick

Posts List

Latest Posts