Baseten

Baseten · 2025-04-02T13:30:10.819Z

We’re having a great time at #KubeCon London. If you haven’t had a chance to visit us, stop by Booth #N651 to grab a Baseten cupcake, get your “Artificially Intelligent" T-shirt and to see a demo from our engineers. It’s not too late to grab a coffee with a team member and learn what Baseten can do for you! Book some time now 👇

Software Development

San Francisco, CA 8,601 followers

Fast, scalable inference in our cloud or yours

See jobs Follow

Discover all 74 employees

About us

At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature.

Website: https://www.baseten.co/
External link for Baseten
Industry: Software Development
Company size: 51-200 employees
Headquarters: San Francisco, CA
Type: Privately Held
Specialties: developer tools and software engineering

Products

Baseten

Machine Learning Software

Locations

Primary

San Francisco, CA, US

Get directions
New York, NY, US

Get directions

Employees at Baseten

See all employees

Updates

Baseten

8,601 followers
1mo Edited
Report this post
2025 is the year of inference. We're thrilled to announce our $75m Series C co-led by IVP and Spark Capital with participation from Greylock, Conviction, basecase capital, South Park Commons and Lachy Groom. We're also excited to add Dick Costolo and Adam Bain from 01 Advisors as new investors. Check out our CEO Tuhin's blog to learn more. It's time to build!

Announcing Baseten’s $75M Series C

55 Comments

Like Comment Share
Baseten

8,601 followers
4h
Report this post
We're thrilled to be included in the #ForbesAI50! 🎉 Congratulations to everyone who made it, it's great to see so many of our customers and partners here too!
3 Comments

Like Comment Share
Baseten

8,601 followers
22h
Report this post
It's Day 1 of #GoogleCloudNext! If you're attending, stop by the booth (#3341) for some ice cream and swag. Say hi to the team, see a demo, and don't miss our happy hour with Google tonight at 6 PM! RSVP 👇
1 Comment

Like Comment Share
Baseten

8,601 followers
1d Edited
Report this post
Meet the Baseten crew at #GoogleCloudNext this week! We have two talks on deck, a happy hour co-hosted with Google, Baseten ice cream at the booth, and much more. 👇 Visit us at booth #3341 for: • Demos from our engineers: https://lnkd.in/d6ciYSJk • Coffee with one of our execs: https://lnkd.in/gDm4RYnC • Baseten ice cream and Artificially Intelligent swag Plus, don't miss: • Secure and optimize AI and ML workloads with the Cross-Cloud Network on Thursday, April 10th (5:15 PM - 6:00 PM) https://lnkd.in/gHYSkhRr • Effortless AI/ML: Accessing GPUs and TPUs on GKE made easy on Friday, April 11th (12:30 PM - 1:15 PM) https://lnkd.in/g3ztUKfW • Happy Hour with Google and Baseten on Wednesday, April 9 (limited availability): https://lu.ma/khe06dww
Like Comment Share
Baseten

8,601 followers
3d
Report this post
New bots for Llama 4 Maverick and Scout are now live on Poe! Get started with an 8M token context window for Scout (yes, you read that right) and 1M for Maverick. We're thrilled to power the fastest open-source models for Quora—more to come!

6 Comments

Like Comment Share
Baseten reposted this
Dustin Cyrus Michaels

building @ baseten
6d Edited
Report this post
The Core Product team at Baseten is growing. We're looking for talented Software Engineers to join our team building cutting-edge AI inference infrastructure. Following our recent $75M Series C, we're accelerating our mission to make AI accessible across all products. As part of our Core Product team, you'll work on groundbreaking features like fine-tuning capabilities, deployment environments for CI/CD workflows, and low-latency inference with websockets. If you're passionate about building developer-focused products and want to shape the future of AI infrastructure, check out our open roles! Software Engineer - Core Product: https://lnkd.in/gDMbhEMG Senior Software Engineer - Core Product: https://lnkd.in/g_-bPiDF #AIInfrastructure #MLOps #TechJobs #SoftwareEngineering #Baseten #NowHiring

Software Engineer - Fullstack (Core Product)

jobs.ashbyhq.com

3 Comments

Like Comment Share
Baseten reposted this
Philip Kiely

DevRel @ Baseten | Not an LLM (yet)
4d
Report this post
In 60 seconds, see how to: - Deploy Llama 4 Maverick with vLLM on Baseten - Replace GPT in your application with an OpenAI-compatible open source model - Vibe check the new model with a simple game-making prompt Llama 4 Scout and Maverick are available today on Baseten!

2 Comments

Like Comment Share
Baseten

8,601 followers
4d
Report this post
Llama 4 is here! 🦙🚀 Scout | 109B Parameters | 10M Context Maverick | 400B Parameters | 1M Context Llama 4 models are natively multimodal, use a MoE architecture, and set a new frontier for performance/cost. We're excited to offer dedicated deployments of Llama 4! Details on Llama 4 Scout: While the model is only 109B parameters, the 10M-token context window benefits from extra compute. You can serve a million tokens with 8xH100, but pushing to the full context window requires multinode, H200, or B200. Details on Llama 4 Maverick: This model is 400B parameters, replacing Llama 3.1 405B, and has a 1M-token context window. You can serve the model on 8xH100 in FP8 with about half the context, or bump up to H200 or B200 for full context and faster speeds. Links in the comments for accessing dedicated deployments!
6 Comments

Like Comment Share
Baseten

8,601 followers
1w Edited
Report this post
Thanks to everyone at #Kubecon London who swung by yesterday to chat with us. If you haven't had time to talk to the team at Kubecon yet, you can still catch us today at booth #N651. Snag an “Artificially Intelligent” T-shirt or grab coffee and a Baseten cupcake with one of our engineers.
1 Comment

Like Comment Share
Baseten

8,601 followers
1w
Report this post
We’re having a great time at #KubeCon London. If you haven’t had a chance to visit us, stop by Booth #N651 to grab a Baseten cupcake, get your “Artificially Intelligent" T-shirt and to see a demo from our engineers. It’s not too late to grab a coffee with a team member and learn what Baseten can do for you! Book some time now 👇
3 Comments

Like Comment Share

Browse jobs

Funding

Baseten 5 total rounds

Last Round

Series C Mar 19, 2025

US$ 75.0M

Investors

IVP Spark Capital + 8 Other investors

See more info on crunchbase

Baseten

Software Development

San Francisco, CA 8,601 followers

Fast, scalable inference in our cloud or yours

About us

Products

Baseten

Machine Learning Software

Locations

Employees at Baseten

William Lau

Amir Haghighat

Co-founder at Baseten

Aaron Relph

Leading design at Baseten

Sarah Guo

startup investor and company-builder

Updates

Announcing Baseten’s $75M Series C

Join now to see what you are missing

Similar pages

Arize AI

Metronome

Sardine

SpecterOps

Phantom

Candid Health

Cyera

Fay

EoT Labs

Chess.com

Browse jobs

Engineer jobs

Machine Learning Engineer jobs

Scientist jobs

Software Engineer jobs

Developer jobs

Marketing Manager jobs

Manager jobs

Senior Software Engineer jobs

Intern jobs

Associate jobs

Analyst jobs

Human Resources Specialist jobs

Executive jobs

Full Stack Engineer jobs

Operational Specialist jobs

Junior Software Engineer jobs

Designer jobs

Human Resources Generalist jobs

Human Resources Manager jobs

Account Executive jobs

Funding