xAI Unveils Grok 3, Fine-Tuned LLMs Dominate Text-to-SQL
Fine-Tuned by Genloop - #4
Dear Readers,
Welcome to Edition 4 of Fine-Tuned by Genloop – your go-to guide for the latest in LLM customization. Last week, we released a deep dive on Text-to-SQL, packed with insights from our enterprise experience. The response has been incredible! If you haven’t checked it out yet, we've got a summary waiting for you in our top blogs section.
In this edition, we cover xAI’s launch of Grok 3, Perplexity’s open-sourcing of DeepSeek-R1, OpenAI’s roadmap for GPT-4.5 and 5, and key takeaways from the Paris AI Summit.
GenAI is evolving at lightning speed—let’s dive into the biggest developments from the past two weeks!
🌟 AI Industry Highlights
1. xAI Unveils Grok 3 with Advanced Reasoning
xAI on Monday unveiled its updated Grok 3 artificial intelligence model, as the Elon Musk-led startup pushes to keep pace with competitors' advanced reasoning and search capabilities.
Key developments:
Musk referred to Grok 3 as "kind of a beta" and promised rapid improvements. He also teased an upcoming voice mode similar to conversational features in competing apps. The release comes amid Musk's growing AI ambitions, including his recent $97 billion offer to buy OpenAI and his promise to open-source Grok 2's code when Grok 3 is "mature and stable" in the coming months.
2. Perplexity Open-Sources Uncensored DeepSeek-R1 Model
Perplexity has open-sourced R1 1776, a version of the DeepSeek-R1 model that has been post-trained to provide unbiased, accurate, and factual information. While the original DeepSeek-R1 achieved performance close to state-of-the-art reasoning models like o1 and o3-mini, it was limited by its refusal to respond to sensitive topics, especially those censored by the Chinese Communist Party.
Key points:
This development helps unlock R1's powerful reasoning capabilities while mitigating bias and censorship, making advanced AI reasoning more widely accessible.
3. OpenAI’s GPT-4.5 and GPT-5 Roadmap
OpenAI has revealed plans for its next-generation models, confirming that GPT-4.5 (codename: Orion) will be its last non-chain-of-thought model, paving the way for the upcoming GPT-5, which promises to unify reasoning and language capabilities.
What’s Changing?
This move validates Ilya Sutskever’s earlier prediction that pre-training alone is no longer enough. Scaling compute has reached its limits, and the industry must explore new paradigms. However, what happens to controls and determinism requirements like SLAs in enterprise applications? Given a question, can I not be sure how soon the model will answer? We are yet to see. There is more work to be done.
4. Google Makes Gemini 2.0 Available to All
Google has made its latest AI model, dubbed Gemini 2.0, available to all. The Gemini 2.0 lineup includes three models:
Key Upgrades:
Notably, Google’s experimental “thinking” model saw significant gains, scoring 73.3% on AIME (an advanced math competition) and 74.2% on GPQA Diamond (complex science questions). It is currently the most used model of the week on OpenRouter.
5. World Powers Shift AI Regulation at Paris Summit
The AI Action Summit in Paris highlighted growing global divides over AI governance. Unlike past summits that focused on existential risks, this event saw a pivot toward investment and competition.
Key Takeaways:
Why It Matters:
We are optimistically following how these policies shape the AI landscape.
6. Humane's AI Pin Discontinued as HP Buys Assets for $116M
Humane announced on Tuesday that HP has acquired most of its assets for $116 million, bringing an abrupt end to its short-lived AI Pin. This serves as a stark reminder that just applying AI to anything doesn't automatically make it successful - product-market fit and real utility remain essential.
Key points:
Recommended by LinkedIn
This acquisition marks a dramatic shift from Humane's original aspirations. The company had previously sought between $750 million and $1 billion in acquisition offers last May. The AI Pin faced significant challenges since its April 2024 launch, including disappointing reviews, more returns than sales by last summer, battery fire concerns, and a $200 price drop in October.
📚 Featured Blog Posts
We've got two fascinating reads that showcase how the AI landscape is evolving:
1. Text to SQL: The Ultimate Guide for 2025
Text-to-SQL is a popular GenAI use case, where we see enterprises struggling to achieve high accuracy despite trying multiple approaches. We discovered a more effective solution through fine-tuning.
Key points:
We've compiled a comprehensive comparison of all approaches to help you choose the best solution for your needs. We're happy to discuss specifics in a 1-1 chat. Feel free to schedule a time here.
2. Highlights of NeurIPS 2024
The 38th NeurIPS Conference reaffirmed its position as the leading AI research event, drawing record attendance with over 4,000 accepted papers, 56 workshops, and 14 tutorials at the Vancouver Convention Center. We've documented our key learnings and highlights to share with you. Better late than never!
Key points:
🔬 Research Corner
Our team has been diving deep into groundbreaking research papers, and two particularly caught our attention:
1. SmolLM2 Training Report
Hugging Face's SmolLM2, a 1.7B parameter language model, achieves remarkable performance through a data-centric training strategy. The team placed significant emphasis on data quality, employing 18 customized SLMs for data processing.
Key highlights:
This research highlights how smaller models can remain competitive with strategic data selection and training methodologies. We believe enterprises will soon feasibly train their own SLMs from scratch for domain-adapted advantages.
2. AlphaGeometry2: AI Surpassing Olympiad Gold Medalists
Google DeepMind's AlphaGeometry2 represents a major leap in AI-driven mathematical reasoning. This new version significantly improves on the original, now solving 84% of International Math Olympiad (IMO) geometry problems—outperforming an average IMO gold medalist.
Key highlights that caught our attention:
This work showcases how AI is advancing beyond pattern recognition into structured mathematical reasoning, bringing us closer to AI systems capable of higher-level abstract thinking.
Looking Forward
The AI landscape is experiencing an unprecedented surge in development, and its trajectory promises to become even more captivating in the coming days. We are witnessing remarkable technical advancements. However, the true challenge lies in harnessing domain intelligence on top of general intelligence, in developing models that possess a deep understanding of business domains. Our text-to-SQL study underscores the pivotal role that this aspect will play in putting GenAI to production.
Thank you for reading! Share your thoughts with us, and don't forget to subscribe to stay updated on the latest in LLM customization.
About Genloop
Genloop delivers customized LLMs that provide unmatched cost, control, simplicity, and performance for production enterprise applications. Please visit genloop.ai, catch us on Linkedin, or email founder@genloop.ai for more details.
Stay Curious,
The Genloop Team