Grok 3 - does it deliver? šŸ‘€

Will xAIā€™s new launch make our non-techie lives easier?

Happy Tuesday Non-Techies,

The beta version of Grok 3 was released last week. I mentioned Grok a few times in last weekā€™s chatbot crash course, but in case youā€™ve already forgotten (no judgement here), Grok is xAIā€™s (uncensored) horse in the chatbot race.

Is it the scruffy, stubborn mule that likes to play dirty? Or a polished stallion thatā€™s already galloping ahead?*

In todayā€™s newsletter, Iā€™ll try to answer that question. First, a quick poll:

Do you use Grok?

Login or Subscribe to participate in polls.

Okay, letā€™s get started. For those about to Grok, I salute you (sorry, I couldnā€™t resist).

What makes Grok 3 stand out?

Until now, Grokā€™s rebellious sass was probably its biggest differentiator. If we ranked chatbots based on how likely they are to get caught smoking in the school playground, Grok would win hands down. A solid 9/10 on the snark scale.

That ā€˜personalityā€™ is still there with Grok 3, but according to xAI, it has other unique selling points too:

šŸ§  Itā€™s now really smart. 
šŸ–„ Itā€™s fuelled by a giant supercomputer called Colossus.
šŸ¤” It uses a step-by-step reasoning process, kinda like a person might.
šŸŽ“ It can act as an academic researcher too.
šŸ†‡ It pulls real-time information from the web and from X.

Letā€™s dig furtherā€¦

___

šŸ§  How smart is Grok?

If only there was a score to measure thisā€¦oh wait, there is! The Chatbot Arena might sound like some sort of 2025 Robot Wars reboot, but disappointingly, itā€™s just a platform that tells you which chatbot is the smartest.

It does that with a score called an ELO. Users blind-vote on their favourite answer produced by anonymous chatbots and points are awarded to the winner.

No chatbot has ever achieved an ELO score of over 1400 beforeā€¦until now.

But it canā€™t take all the credit. You know what they say: behind every great chatbot is an even greater computer. Letā€™s meet Colossus. 

šŸ–„ Colossus isā€¦colossal.

With a name like Colossus, youā€™ve really got to deliver the goods. Well, Colossus does. Itā€™s believed to be the worldā€™s largest AI supercomputer, and boy is it chunky. 

Hereā€™s a glamour pic:

Not exactly the sort of computer you can whip out in your local coffee shop.

Researching this thing is a minefield for non-techies, but hereā€™s the big takeaway: it means Grok 3 is 10x more powerful than Grok 2, capable of solving more complex and intricate problems.

Itā€™s kind of weird to remember that this technology has this physical presence. Lest we forget the environmental impact this all has too. 

___

šŸ¤” Reasoning like a person.

OpenAIā€™s o1 and o3 models and Deepseek pipped xAI to the post on this one, but itā€™s undeniable that Grok 3 likes to show off. Where other chatbots and earlier versions of ChatGPT will spit out an answer, Grok 3ā€™s Think mode shows you its workings along the way.

Does the world's biggest AI supercomputer really need 5s to answer this??

šŸŽ“ Academic-level research.

The same is true for its DeepSearch mode, which is its research function (apparently as clever as a PhD level researcher). Here, rather than just a thought process, youā€™ll see a sequence of search queries, sources and conclusions. Grok 3 wants you to know where it gets its information and the logic itā€™s using to give you an answer. As it says itself, ā€œThis isnā€™t about getting to an answer, itā€™s about transparency.ā€ Fancy pants.

Be warned - it isnā€™t quick - these new research functions can take minutes to get a good answer for you, not seconds. 

___

šŸ†‡ X marks the spot.

Apparently (and when I say ā€˜apparentlyā€™, I mean ā€˜according to Googleā€™s AI summaryā€™), around 500,000,000 posts are published to X every day. It wonā€™t come as a shock to you that Xā€™s chatbot offering has been given a front-row seat to this trove of breaking news, opinion, sentiment andā€¦letā€™s face it, quite a lot of drivel.

The supposed benefits are access to real-time information, social insights, a more diversified pool of knowledge and the ability to cross reference its web sources with X posts. Quite how useful this is, weā€™re yet to see. 

So, is Grok 3 my new squeeze?

Honestly? Maybe Iā€™m uptight (or maybe Iā€™m just not a beer-chugging frat boy) but Iā€™m not really drawn to chatbot that can yell and call me a tw*t and engage in sexy voice messages. That stuff puts me off. 

Thatā€™s not to say it doesnā€™t have serious power - as it now clearly does. Itā€™s easy to use and itā€™s just as strong, if not stronger, than most of the stuff out there. 

Itā€™s almost like the more techie you are, the more excited you are about Grok 3. Developers, computer scientists, mathematiciansā€¦these demographics seem to be the most impressed by Grok 3ā€™s extra computational power (courtesy of our old pal Colossus).

But for me, as a non-techie, 26 more ELO points than ChatGPT and an extra serving of sass probably isnā€™t enough to turn me into a Grok 3 fan girl yet. 

Iā€™ll keep an eye on it though, for sure.

Okay, hopefully you know a bit more about Grok 3 now. Letā€™s do another poll:

Do you think you'll take Grok 3 for a spin?

Login or Subscribe to participate in polls.

Right then, itā€™s Sunday as I write this newsletter, so Iā€™m off to make an ā€œepic breakfastā€ for my extended family - Iā€™m talking pancakes, sausages, bacon, pastries, fruit, all the good stuff. 

Have a great week!

Heather

*bonus pic of the scruffy mule vs the sleek stallion. Bit creepy? Reply and let me know.

PS Iā€™ll see some of you in todayā€™s 3 day Become an AI Trainer bootcamp, starts at 5pm GMT (tickets still left if you want to join last minute)

Reply

or to participate.