Editor's Pick

Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

May 13, 20241 view0

Enlarge (credit: Getty Images)

On Monday, OpenAI employee William Fedus confirmed on X that a mysterious chat-topping AI chatbot known as “gpt-chatbot” that had been undergoing testing on LMSYS’s Chatbot Arena and frustrating experts was, in fact, OpenAI’s newly announced GPT-4o AI model. He also revealed that GPT-4o had topped the Chatbot Arena leaderboard, achieving the highest documented score ever.

“GPT-4o is our new state-of-the-art frontier model. We’ve been testing a version on the LMSys arena as im-also-a-good-gpt2-chatbot,” Fedus tweeted.

Chatbot Arena is a website where visitors converse with two random AI language models side by side without knowing which model is which, then choose which model gives the best response. It’s a perfect example of vibe-based AI benchmarking, as AI researcher Simon Willison calls it.

Read 8 remaining paragraphs | Comments

What's your reaction?

Excited

0

Happy

0

In Love

0

Not Sure

0

Silly

0

You may also like

Editor's Pick

97% of CrowdStrike systems are back online; Microsoft suggests Windows changes

By

July 26, 2024

Editor's Pick

At the Olympics, AI is watching you

By

July 26, 2024

Editor's Pick

Hang out with Ars in San Jose and DC this fall for two infrastructure events

By

July 26, 2024

More in:Editor's Pick

Editor's Pick

Google AI earns silver medal equivalent at International Mathematical Olympiad

Enlarge / An illustration provided by Google. (credit: Google) On Thursday, Google DeepMind announced that ...

Editor's Pick

OpenAI hits Google where it hurts with new SearchGPT prototype

Enlarge (credit: Benj Edwards / OpenAI) Arguably, few companies have unintentionally contributed more to the ...

Editor's Pick

Chrome will now prompt some users to send passwords for suspicious files

(credit: Chrome) Google is redesigning Chrome malware detections to include password-protected executable files that users ...

Editor's Pick

Secure Boot is completely broken on 200+ models from 5 big device makers

Enlarge (credit: sasha85ru | Getty Imates) In 2012, an industry-wide coalition of hardware and software ...

Now Reading

Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

2min read

0 %