Baseten

Baseten · 2025-04-07T20:46:07.464Z

New bots for Llama 4 Maverick and Scout are now live on Poe! Get started with an 8M token context window for Scout (yes, you read that right) and 1M for Maverick. We're thrilled to power the fastest open-source models for Quora—more to come!

Software Development

San Francisco, CA 8,791 followers

Fast, scalable inference in our cloud or yours

See jobs Follow

View all 76 employees

About us

At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature.

Website: https://www.baseten.co/
External link for Baseten
Industry: Software Development
Company size: 51-200 employees
Headquarters: San Francisco, CA
Type: Privately Held
Specialties: developer tools and software engineering

Products

Baseten

Machine Learning Software

Locations

Primary

San Francisco, CA, US

Get directions
New York, NY, US

Get directions

Employees at Baseten

See all employees

Updates

Baseten

8,791 followers
1mo Edited
Report this post
2025 is the year of inference. We're thrilled to announce our $75m Series C co-led by IVP and Spark Capital with participation from Greylock, Conviction, basecase capital, South Park Commons and Lachy Groom. We're also excited to add Dick Costolo and Adam Bain from 01 Advisors as new investors. Check out our CEO Tuhin's blog to learn more. It's time to build!

Announcing Baseten’s $75M Series C

55 Comments

Like Comment Share
Baseten

8,791 followers
16h
Report this post
Thanks to everyone who stopped by to say hi at #GoogleCloudNext in Vegas last week! If you didn’t catch us there, we’re at AWS Summit London and AWS Summit Hamburg in the coming weeks. Swing by our booths at any time, or book a demo or coffee chat here: 📍#AWSSummit London: https://lnkd.in/g2m3j_hX 📍#AWSSummit Hamburg: https://lnkd.in/gzNpe4_r
4 Comments

Like Comment Share
Baseten

8,791 followers
3d
Report this post
You can now use embedding models on Baseten as part of Chroma's Python SDK! We recently announced Baseten Embeddings Inference (BEI), the fastest embeddings solution with over 2x throughput and lower latency than every other stack. Using BEI on Chroma enables: • Speed and cost savings for embedding large corpora in Chroma's open-source database • Real-time embedding inference that handles large numbers of simultaneous users and requests Check out the guide by Philip Kiely (link below) for the step-by-step on how to use the integration (you can embed and do inference on your data in minutes).
3 Comments

Like Comment Share
Baseten

8,791 followers
4d
Report this post
If you're at #GoogleCloudNext in Vegas, don't miss Bola Malek's talk "Secure and Optimize AI and ML Workloads with the Cross-Cloud Network" today at 5:15 PM. It takes place in Breakout Room BRK2-003 on the Multicloud, Networking track. If you haven't had time to chat with the team yet, you can catch us all day at booth #3341. Get a demo, snag an Artificially Intelligent shirt, or grab some ice cream with one of our execs (we're serving ice cream and fruit bars all day).
1 Comment

Like Comment Share
Baseten

8,791 followers
4d
Report this post
We're thrilled to be included in the #ForbesAI50! 🎉 Congratulations to everyone who made it, it's great to see so many of our customers and partners here too!
6 Comments

Like Comment Share
Baseten

8,791 followers
5d
Report this post
It's Day 1 of #GoogleCloudNext! If you're attending, stop by the booth (#3341) for some ice cream and swag. Say hi to the team, see a demo, and don't miss our happy hour with Google tonight at 6 PM! RSVP 👇
1 Comment

Like Comment Share
Baseten

8,791 followers
6d Edited
Report this post
Meet the Baseten crew at #GoogleCloudNext this week! We have two talks on deck, a happy hour co-hosted with Google, Baseten ice cream at the booth, and much more. 👇 Visit us at booth #3341 for: • Demos from our engineers: https://lnkd.in/d6ciYSJk • Coffee with one of our execs: https://lnkd.in/gDm4RYnC • Baseten ice cream and Artificially Intelligent swag Plus, don't miss: • Secure and optimize AI and ML workloads with the Cross-Cloud Network on Thursday, April 10th (5:15 PM - 6:00 PM) https://lnkd.in/gHYSkhRr • Effortless AI/ML: Accessing GPUs and TPUs on GKE made easy on Friday, April 11th (12:30 PM - 1:15 PM) https://lnkd.in/g3ztUKfW • Happy Hour with Google and Baseten on Wednesday, April 9 (limited availability): https://lu.ma/khe06dww
Like Comment Share
Baseten

8,791 followers
1w
Report this post
New bots for Llama 4 Maverick and Scout are now live on Poe! Get started with an 8M token context window for Scout (yes, you read that right) and 1M for Maverick. We're thrilled to power the fastest open-source models for Quora—more to come!

6 Comments

Like Comment Share
Baseten reposted this
Dustin Cyrus Michaels

building @ baseten
1w Edited
Report this post
The Core Product team at Baseten is growing. We're looking for talented Software Engineers to join our team building cutting-edge AI inference infrastructure. Following our recent $75M Series C, we're accelerating our mission to make AI accessible across all products. As part of our Core Product team, you'll work on groundbreaking features like fine-tuning capabilities, deployment environments for CI/CD workflows, and low-latency inference with websockets. If you're passionate about building developer-focused products and want to shape the future of AI infrastructure, check out our open roles! Software Engineer - Core Product: https://lnkd.in/gDMbhEMG Senior Software Engineer - Core Product: https://lnkd.in/g_-bPiDF #AIInfrastructure #MLOps #TechJobs #SoftwareEngineering #Baseten #NowHiring

Software Engineer - Fullstack (Core Product)

jobs.ashbyhq.com

3 Comments

Like Comment Share
Baseten reposted this
Philip Kiely

DevRel @ Baseten | Not an LLM (yet)
1w
Report this post
In 60 seconds, see how to: - Deploy Llama 4 Maverick with vLLM on Baseten - Replace GPT in your application with an OpenAI-compatible open source model - Vibe check the new model with a simple game-making prompt Llama 4 Scout and Maverick are available today on Baseten!

2 Comments

Like Comment Share

Browse jobs

Funding

Baseten 5 total rounds

Last Round

Series C Mar 19, 2025

US$ 75.0M

Investors

IVP Spark Capital + 9 Other investors

See more info on crunchbase

Baseten

Software Development

San Francisco, CA 8,791 followers

Fast, scalable inference in our cloud or yours

About us

Products

Baseten

Machine Learning Software

Locations

Employees at Baseten

William Lau

Amir Haghighat

Co-founder at Baseten

Aaron Relph

Leading design at Baseten

Sarah Guo

startup investor and company-builder

Updates

Announcing Baseten’s $75M Series C

Join now to see what you are missing

Similar pages

Arize AI

Metronome

Sardine

SpecterOps

Phantom

Candid Health

Cyera

Fay

EoT Labs

Chess.com

Browse jobs

Engineer jobs

Machine Learning Engineer jobs

Scientist jobs

Software Engineer jobs

Developer jobs

Marketing Manager jobs

Manager jobs

Senior Software Engineer jobs

Intern jobs

Associate jobs

Analyst jobs

Human Resources Specialist jobs

Executive jobs

Full Stack Engineer jobs

Operational Specialist jobs

Junior Software Engineer jobs

Designer jobs

Human Resources Generalist jobs

Human Resources Manager jobs

Account Executive jobs

Funding