- AI for Non-Techies
- Posts
- Grok 3 - does it deliver? š
Grok 3 - does it deliver? š
Will xAIās new launch make our non-techie lives easier?
Happy Tuesday Non-Techies,
The beta version of Grok 3 was released last week. I mentioned Grok a few times in last weekās chatbot crash course, but in case youāve already forgotten (no judgement here), Grok is xAIās (uncensored) horse in the chatbot race.
Is it the scruffy, stubborn mule that likes to play dirty? Or a polished stallion thatās already galloping ahead?*
In todayās newsletter, Iāll try to answer that question. First, a quick poll:
Do you use Grok? |
Okay, letās get started. For those about to Grok, I salute you (sorry, I couldnāt resist).
What makes Grok 3 stand out?
Until now, Grokās rebellious sass was probably its biggest differentiator. If we ranked chatbots based on how likely they are to get caught smoking in the school playground, Grok would win hands down. A solid 9/10 on the snark scale.
That āpersonalityā is still there with Grok 3, but according to xAI, it has other unique selling points too:
š§ Itās now really smart.
š„ Itās fuelled by a giant supercomputer called Colossus.
š¤ It uses a step-by-step reasoning process, kinda like a person might.
š It can act as an academic researcher too.
š It pulls real-time information from the web and from X.
Letās dig furtherā¦
___
š§ How smart is Grok?
If only there was a score to measure thisā¦oh wait, there is! The Chatbot Arena might sound like some sort of 2025 Robot Wars reboot, but disappointingly, itās just a platform that tells you which chatbot is the smartest.
It does that with a score called an ELO. Users blind-vote on their favourite answer produced by anonymous chatbots and points are awarded to the winner.
No chatbot has ever achieved an ELO score of over 1400 beforeā¦until now.
But it canāt take all the credit. You know what they say: behind every great chatbot is an even greater computer. Letās meet Colossus.
š„ Colossus isā¦colossal.
With a name like Colossus, youāve really got to deliver the goods. Well, Colossus does. Itās believed to be the worldās largest AI supercomputer, and boy is it chunky.
Hereās a glamour pic:
Not exactly the sort of computer you can whip out in your local coffee shop.
Researching this thing is a minefield for non-techies, but hereās the big takeaway: it means Grok 3 is 10x more powerful than Grok 2, capable of solving more complex and intricate problems.
Itās kind of weird to remember that this technology has this physical presence. Lest we forget the environmental impact this all has too.
___
š¤ Reasoning like a person.
OpenAIās o1 and o3 models and Deepseek pipped xAI to the post on this one, but itās undeniable that Grok 3 likes to show off. Where other chatbots and earlier versions of ChatGPT will spit out an answer, Grok 3ās Think mode shows you its workings along the way.
Does the world's biggest AI supercomputer really need 5s to answer this??
š Academic-level research.
The same is true for its DeepSearch mode, which is its research function (apparently as clever as a PhD level researcher). Here, rather than just a thought process, youāll see a sequence of search queries, sources and conclusions. Grok 3 wants you to know where it gets its information and the logic itās using to give you an answer. As it says itself, āThis isnāt about getting to an answer, itās about transparency.ā Fancy pants.
Be warned - it isnāt quick - these new research functions can take minutes to get a good answer for you, not seconds.
___
š X marks the spot.
Apparently (and when I say āapparentlyā, I mean āaccording to Googleās AI summaryā), around 500,000,000 posts are published to X every day. It wonāt come as a shock to you that Xās chatbot offering has been given a front-row seat to this trove of breaking news, opinion, sentiment andā¦letās face it, quite a lot of drivel.
The supposed benefits are access to real-time information, social insights, a more diversified pool of knowledge and the ability to cross reference its web sources with X posts. Quite how useful this is, weāre yet to see.
So, is Grok 3 my new squeeze?
Honestly? Maybe Iām uptight (or maybe Iām just not a beer-chugging frat boy) but Iām not really drawn to chatbot that can yell and call me a tw*t and engage in sexy voice messages. That stuff puts me off.
Thatās not to say it doesnāt have serious power - as it now clearly does. Itās easy to use and itās just as strong, if not stronger, than most of the stuff out there.
Itās almost like the more techie you are, the more excited you are about Grok 3. Developers, computer scientists, mathematiciansā¦these demographics seem to be the most impressed by Grok 3ās extra computational power (courtesy of our old pal Colossus).
But for me, as a non-techie, 26 more ELO points than ChatGPT and an extra serving of sass probably isnāt enough to turn me into a Grok 3 fan girl yet.
Iāll keep an eye on it though, for sure.
Okay, hopefully you know a bit more about Grok 3 now. Letās do another poll:
Do you think you'll take Grok 3 for a spin? |
Right then, itās Sunday as I write this newsletter, so Iām off to make an āepic breakfastā for my extended family - Iām talking pancakes, sausages, bacon, pastries, fruit, all the good stuff.
Have a great week!
Heather
*bonus pic of the scruffy mule vs the sleek stallion. Bit creepy? Reply and let me know.
PS Iāll see some of you in todayās 3 day Become an AI Trainer bootcamp, starts at 5pm GMT (tickets still left if you want to join last minute)
Reply