Model Drop

The Drops Keep Coming

We’re not saying the set of announcements in recent weeks was explosive. But then again, we felt a bit stunned by all the news. The landscape shifted. Innovation is running at hyper speed, as we reported in our last post. We have had a chance to test some of the capabilities and are still evaluating the results, with an eye on the real business value of these announcements. We’ll get back to you on all of this once we sort through the clutter and identify the gems.

But the wave of announcements is still not over. Here is what Nathan Benaich from Air Street Capital reports:

November saw a run of model releases. In early November, Elon announced Grok, an LLM chatbot built by x.ai that is designed to answer questions with a bit (I’d say a lot!) of wit and a rebellious streak. The system has access to real-time knowledge of the world via X and it will “answer spicy questions that are rejected by most other AI systems”. Built in 2 months, the system used an interesting evaluation (amongst several others) on the 2023 Hungarian national high school finals in mathematics, which was published after the training dataset collection date cutoff. The team showed that Grok passed the exam with a C (same as Claude-2) while GPT-4 got a B.

Google is also releasing updates to DeepMind. It is not yet clear to us what direction Google is taking for the commercialization of its products, but we’re interested in seeing how it ultimately competes with OpenAI (and now Elon).

If you’re trying to navigate this shifting landscape and want to discuss strategy – and avoid a few of the known landmines out there – give us a call. Our engineers are business-focused, and we’re committed to delivering measurable value for every AI project.