2025 is the year of inference. We're thrilled to announce our $75m Series C co-led by IVP and Spark Capital with participation from Greylock, Conviction, basecase capital, South Park Commons and Lachy Groom. We're also excited to add Dick Costolo and Adam Bain from 01 Advisors as new investors. Check out our CEO Tuhin's blog to learn more. It's time to build!
Baseten
Software Development
San Francisco, CA 8,601 followers
Fast, scalable inference in our cloud or yours
About us
At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature.
- Website
-
https://www.baseten.co/
External link for Baseten
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Francisco, CA
- Type
- Privately Held
- Specialties
- developer tools and software engineering
Products
Baseten
Machine Learning Software
At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature.
Locations
-
Primary
San Francisco, CA, US
-
New York, NY, US
Employees at Baseten
Updates
-
We're thrilled to be included in the #ForbesAI50! 🎉 Congratulations to everyone who made it, it's great to see so many of our customers and partners here too!
-
-
It's Day 1 of #GoogleCloudNext! If you're attending, stop by the booth (#3341) for some ice cream and swag. Say hi to the team, see a demo, and don't miss our happy hour with Google tonight at 6 PM! RSVP 👇
-
-
Meet the Baseten crew at #GoogleCloudNext this week! We have two talks on deck, a happy hour co-hosted with Google, Baseten ice cream at the booth, and much more. 👇 Visit us at booth #3341 for: • Demos from our engineers: https://lnkd.in/d6ciYSJk • Coffee with one of our execs: https://lnkd.in/gDm4RYnC • Baseten ice cream and Artificially Intelligent swag Plus, don't miss: • Secure and optimize AI and ML workloads with the Cross-Cloud Network on Thursday, April 10th (5:15 PM - 6:00 PM) https://lnkd.in/gHYSkhRr • Effortless AI/ML: Accessing GPUs and TPUs on GKE made easy on Friday, April 11th (12:30 PM - 1:15 PM) https://lnkd.in/g3ztUKfW • Happy Hour with Google and Baseten on Wednesday, April 9 (limited availability): https://lu.ma/khe06dww
-
-
Baseten reposted this
The Core Product team at Baseten is growing. We're looking for talented Software Engineers to join our team building cutting-edge AI inference infrastructure. Following our recent $75M Series C, we're accelerating our mission to make AI accessible across all products. As part of our Core Product team, you'll work on groundbreaking features like fine-tuning capabilities, deployment environments for CI/CD workflows, and low-latency inference with websockets. If you're passionate about building developer-focused products and want to shape the future of AI infrastructure, check out our open roles! Software Engineer - Core Product: https://lnkd.in/gDMbhEMG Senior Software Engineer - Core Product: https://lnkd.in/g_-bPiDF #AIInfrastructure #MLOps #TechJobs #SoftwareEngineering #Baseten #NowHiring
-
Baseten reposted this
In 60 seconds, see how to: - Deploy Llama 4 Maverick with vLLM on Baseten - Replace GPT in your application with an OpenAI-compatible open source model - Vibe check the new model with a simple game-making prompt Llama 4 Scout and Maverick are available today on Baseten!
-
Llama 4 is here! 🦙🚀 Scout | 109B Parameters | 10M Context Maverick | 400B Parameters | 1M Context Llama 4 models are natively multimodal, use a MoE architecture, and set a new frontier for performance/cost. We're excited to offer dedicated deployments of Llama 4! Details on Llama 4 Scout: While the model is only 109B parameters, the 10M-token context window benefits from extra compute. You can serve a million tokens with 8xH100, but pushing to the full context window requires multinode, H200, or B200. Details on Llama 4 Maverick: This model is 400B parameters, replacing Llama 3.1 405B, and has a 1M-token context window. You can serve the model on 8xH100 in FP8 with about half the context, or bump up to H200 or B200 for full context and faster speeds. Links in the comments for accessing dedicated deployments!
-
-
We’re having a great time at #KubeCon London. If you haven’t had a chance to visit us, stop by Booth #N651 to grab a Baseten cupcake, get your “Artificially Intelligent" T-shirt and to see a demo from our engineers. It’s not too late to grab a coffee with a team member and learn what Baseten can do for you! Book some time now 👇
-