OpenAI gives ChatGPT an upgrade — reclaims top spot in LLM leaderboard


OpenAI just made ChatGPT a much better writer. The latest update to the underlying GPT-4o model gave the AI a boost in creativity.

This new update also helped it leapfrog Google Gemini and reclaim the top spot in the LLM arena. This is a virtual venue where users rank the output of two models without knowing which is which until the end.

OpenAI didn't share much about the new update other than that its "creative writing ability has leveled up" and that it is now "more natural and engaging" with "more tailored writing to improve relevance and readability".

What's new from ChatGPT?

Exciting News from Chatbot Arena❤️‍🔥 Over the past week, the latest @OpenAI ChatGPT-4o (20241120) competed anonymously as "anonymous-chatbot", gathering 8,000+ community votes. The result? OpenAI reclaims the #1 spot, surpassing Gemini-Exp-1114 with an impressive 1361 score!… https://t.co/Q7q3Uonp94 pic.twitter.com/ogmhhCW7zY November 20, 2024

OpenAI has been on its game recently, adding a range of new features, including bringing Advanced Voice to the web, and hinting at better DALL-E image generation. There are even rumors of a version of Sora coming soon to bring AI video to the rapidly evolving AI platform.


The latest update is more 'behind the scenes' than a new UI or flashy new features like the impressive Canvas. It is a change to how GPT-4o works, making it more of a creativity powerhouse than previous generations.

The new version of GPT-4o, still called GPT-4o, is also better at working with files you've uploaded to ChatGPT and providing deeper insights into their contents.

It was released in secret to the lmarena.ai (formerly LMSys arena) LLM chatbot arena. This is a platform where models compete anonymously to find out which ones score better with human users.
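The article does not say how the arena turns blind head-to-head votes into a leaderboard score. As a rough illustration only, here is a minimal sketch that assumes a classic Elo-style update; the real leaderboard may use a different statistical model. The function names, the starting rating of 1000, the K-factor of 32 and the model labels are all hypothetical.

```python
from typing import Dict, Iterable, Tuple

def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A is preferred over model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))

def update(rating_a: float, rating_b: float, outcome_a: float,
           k: float = 32.0) -> Tuple[float, float]:
    """Apply one vote. outcome_a is 1.0 if A wins, 0.5 for a tie, 0.0 if A loses."""
    delta = k * (outcome_a - expected_score(rating_a, rating_b))
    # Whatever A gains, B loses, so the total rating is conserved.
    return rating_a + delta, rating_b - delta

def rate(votes: Iterable[Tuple[str, str, float]],
         initial: float = 1000.0, k: float = 32.0) -> Dict[str, float]:
    """Replay (model_a, model_b, outcome_a) votes in order and return ratings."""
    ratings: Dict[str, float] = {}
    for model_a, model_b, outcome_a in votes:
        ra = ratings.get(model_a, initial)
        rb = ratings.get(model_b, initial)
        ratings[model_a], ratings[model_b] = update(ra, rb, outcome_a, k)
    return ratings

# Hypothetical votes: "model-a" is preferred twice, then the pair ties once.
votes = [("model-a", "model-b", 1.0),
         ("model-a", "model-b", 1.0),
         ("model-a", "model-b", 0.5)]
print(rate(votes))
```

Because each vote only moves ratings by a bounded amount, a newly added model needs many votes before its position settles, which fits with the anonymous entry collecting thousands of votes before the result was announced.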


OpenAI dropped it in as "anonymous-chatbot" last week and it quickly surpassed Gemini-Exp-1114, the latest model from Google, which had held the crown for about a week. Writing on X, Lmarena described it as a "remarkable improvement" on previous versions, including in creative writing, coding and math.

Have you noticed a change in ChatGPT over the past few days? I've tried it out and it does seem to be more complete and engaging in its responses. This upgrade in creativity might also explain why DALL-E images have improved.


Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?
