2025 is the year of inference. We're thrilled to announce our $75m Series C co-led by IVP and Spark Capital with participation from Greylock, Conviction, basecase capital, South Park Commons and Lachy Groom. We're also excited to add Dick Costolo and Adam Bain from 01 Advisors as new investors. Check out our CEO Tuhin's blog to learn more. It's time to build!
Baseten
Software Development
San Francisco, CA 8,791 followers
Fast, scalable inference in our cloud or yours
About us
At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature.
- Website
-
https://www.baseten.co/
External link for Baseten
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Francisco, CA
- Type
- Privately Held
- Specialties
- developer tools and software engineering
Products
Baseten
Machine Learning Software
At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature.
Locations
-
Primary
San Francisco, CA, US
-
New York, NY, US
Employees at Baseten
Updates
-
Thanks to everyone who stopped by to say hi at #GoogleCloudNext in Vegas last week! If you didn’t catch us there, we’re at AWS Summit London and AWS Summit Hamburg in the coming weeks. Swing by our booths at any time, or book a demo or coffee chat here: 📍#AWSSummit London: https://lnkd.in/g2m3j_hX 📍#AWSSummit Hamburg: https://lnkd.in/gzNpe4_r
-
-
You can now use embedding models on Baseten as part of Chroma's Python SDK! We recently announced Baseten Embeddings Inference (BEI), the fastest embeddings solution with over 2x throughput and lower latency than every other stack. Using BEI on Chroma enables: • Speed and cost savings for embedding large corpora in Chroma's open-source database • Real-time embedding inference that handles large numbers of simultaneous users and requests Check out the guide by Philip Kiely (link below) for the step-by-step on how to use the integration (you can embed and do inference on your data in minutes).
-
-
If you're at #GoogleCloudNext in Vegas, don't miss Bola Malek's talk "Secure and Optimize AI and ML Workloads with the Cross-Cloud Network" today at 5:15 PM. It takes place in Breakout Room BRK2-003 on the Multicloud, Networking track. If you haven't had time to chat with the team yet, you can catch us all day at booth #3341. Get a demo, snag an Artificially Intelligent shirt, or grab some ice cream with one of our execs (we're serving ice cream and fruit bars all day).
-
-
We're thrilled to be included in the #ForbesAI50! 🎉 Congratulations to everyone who made it, it's great to see so many of our customers and partners here too!
-
-
It's Day 1 of #GoogleCloudNext! If you're attending, stop by the booth (#3341) for some ice cream and swag. Say hi to the team, see a demo, and don't miss our happy hour with Google tonight at 6 PM! RSVP 👇
-
-
Meet the Baseten crew at #GoogleCloudNext this week! We have two talks on deck, a happy hour co-hosted with Google, Baseten ice cream at the booth, and much more. 👇 Visit us at booth #3341 for: • Demos from our engineers: https://lnkd.in/d6ciYSJk • Coffee with one of our execs: https://lnkd.in/gDm4RYnC • Baseten ice cream and Artificially Intelligent swag Plus, don't miss: • Secure and optimize AI and ML workloads with the Cross-Cloud Network on Thursday, April 10th (5:15 PM - 6:00 PM) https://lnkd.in/gHYSkhRr • Effortless AI/ML: Accessing GPUs and TPUs on GKE made easy on Friday, April 11th (12:30 PM - 1:15 PM) https://lnkd.in/g3ztUKfW • Happy Hour with Google and Baseten on Wednesday, April 9 (limited availability): https://lu.ma/khe06dww
-
-
Baseten reposted this
The Core Product team at Baseten is growing. We're looking for talented Software Engineers to join our team building cutting-edge AI inference infrastructure. Following our recent $75M Series C, we're accelerating our mission to make AI accessible across all products. As part of our Core Product team, you'll work on groundbreaking features like fine-tuning capabilities, deployment environments for CI/CD workflows, and low-latency inference with websockets. If you're passionate about building developer-focused products and want to shape the future of AI infrastructure, check out our open roles! Software Engineer - Core Product: https://lnkd.in/gDMbhEMG Senior Software Engineer - Core Product: https://lnkd.in/g_-bPiDF #AIInfrastructure #MLOps #TechJobs #SoftwareEngineering #Baseten #NowHiring
-
Baseten reposted this
In 60 seconds, see how to: - Deploy Llama 4 Maverick with vLLM on Baseten - Replace GPT in your application with an OpenAI-compatible open source model - Vibe check the new model with a simple game-making prompt Llama 4 Scout and Maverick are available today on Baseten!