The Real Reason Your Machine Learning Model Doesn’t Scale

You trained a powerful model. It performed beautifully in testing. But when it hit production... it stalled.

Sound familiar?

The real bottleneck isn’t your algorithm. It’s how you think about scaling.

Let’s break it down.


1. Training ≠ Deployment

In notebooks, we optimize for accuracy. In the real world, we need:

  • Low latency, often a budget of under 100 ms per request
  • Memory efficiency
  • Hardware-aware inference

If you’ve ever had to export your model to ONNX, optimize it with TensorRT, or convert it for edge deployment, you’ve felt this friction.

Lesson: Architect with deployment in mind, not just experimentation.
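Here’s what that deployment-first mindset can look like in code. This is a minimal sketch assuming a small PyTorch model; the class, input shape, and file name are placeholders, not anything from a real project.

```python
# Minimal sketch: export a trained PyTorch model to ONNX so latency and
# memory can be measured in the target runtime, not in the notebook.
import torch

class TinyClassifier(torch.nn.Module):
    """Stand-in for whatever model you actually trained."""
    def __init__(self, n_features: int = 32, n_classes: int = 4):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(n_features, 64),
            torch.nn.ReLU(),
            torch.nn.Linear(64, n_classes),
        )

    def forward(self, x):
        return self.net(x)

model = TinyClassifier().eval()
dummy_input = torch.randn(1, 32)  # batch of 1, 32 features

# Export once, then benchmark in ONNX Runtime / TensorRT rather than
# trusting notebook timings.
torch.onnx.export(
    model,
    dummy_input,
    "classifier.onnx",
    input_names=["features"],
    output_names=["logits"],
    dynamic_axes={"features": {0: "batch"}},  # allow variable batch size
)
```

The point isn’t the export call itself. It’s that the model’s “done” definition now includes how it runs on the hardware it will actually serve from.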


2. Feature Engineering Is Fragile

Most teams build features in pandas, test them in notebooks, and then deploy with a completely different stack.

This leads to:

  • Feature drift
  • Inconsistent logic
  • Models that “break” without a single line of model code changing

Fix it: Use a centralized, versioned feature store. Automate validation across training and production.
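Here’s one way that contract can look. This is an illustrative sketch only: the column names, dtypes, and helper functions (build_features, validate_schema) are invented for the example, and a real feature store would handle versioning and serving far more thoroughly.

```python
# Illustrative sketch: one shared feature definition, plus a schema check
# that runs in both the training pipeline and the serving path.
import pandas as pd

EXPECTED_SCHEMA = {
    "user_age": "int64",
    "days_since_signup": "int64",
    "avg_session_minutes": "float64",
}

def build_features(raw: pd.DataFrame) -> pd.DataFrame:
    """Single source of truth for feature logic, imported by both
    training and serving code."""
    out = pd.DataFrame()
    out["user_age"] = raw["age"].astype("int64")
    out["days_since_signup"] = (
        (pd.Timestamp.now(tz="UTC") - pd.to_datetime(raw["signup_date"], utc=True))
        .dt.days.astype("int64")
    )
    out["avg_session_minutes"] = raw["total_minutes"] / raw["session_count"]
    return out

def validate_schema(features: pd.DataFrame) -> None:
    """Fail loudly if serving features drift from the training contract."""
    for col, dtype in EXPECTED_SCHEMA.items():
        if col not in features.columns:
            raise ValueError(f"Missing feature column: {col}")
        if str(features[col].dtype) != dtype:
            raise TypeError(f"{col}: expected {dtype}, got {features[col].dtype}")
```

When both environments import the same function and run the same check, “it worked in the notebook” stops being a valid excuse.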


3. Real-Time Is a Different Beast

Batch models are predictable. Real-time models introduce chaos:

  • Late-arriving data
  • Incomplete records
  • Input schema changes

If your serving pipeline doesn’t handle these edge cases before inference, scaling will expose every crack.
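A guard like the sketch below can catch most of these before they reach the model. The field names, defaults, and the sanitize helper are hypothetical; the idea is simply that every real-time payload is validated and normalized first.

```python
# Hypothetical guard that runs before every real-time prediction:
# reject unknown fields, fill missing values with explicit defaults,
# and coerce types so the model never sees a malformed payload.
from typing import Any

EXPECTED_FIELDS = {
    "user_age": (int, 0),              # (type, default when missing)
    "avg_session_minutes": (float, 0.0),
    "country": (str, "unknown"),
}

def sanitize(payload: dict[str, Any]) -> dict[str, Any]:
    unknown = set(payload) - set(EXPECTED_FIELDS)
    if unknown:
        # Upstream schema change: surface it instead of silently accepting it.
        raise ValueError(f"Unexpected fields in request: {sorted(unknown)}")

    clean = {}
    for field, (expected_type, default) in EXPECTED_FIELDS.items():
        value = payload.get(field)
        if value is None:              # incomplete or late-arriving record
            clean[field] = default
        else:
            clean[field] = expected_type(value)  # coerce, raising on garbage
    return clean

# Usage: features = sanitize(request_json), then run inference on `features`.
```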


4. Monitoring Isn’t Optional

You can’t improve what you can’t see. High-performing ML systems include:

  • Input/output logging
  • Prediction confidence tracking
  • Alerting on data drift or anomalous predictions

Monitoring turns AI from magic into a measurable system.
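A minimal version of all three might look like the sketch below. The baseline statistic, window size, and threshold are invented for illustration; a production setup would use a proper metrics store and a real drift test rather than a simple mean comparison.

```python
# Minimal monitoring sketch: log every request/prediction pair and warn
# when the live input mean drifts too far from the training baseline.
import json
import logging
import statistics

logger = logging.getLogger("model_monitor")

TRAINING_MEAN = 42.0      # baseline statistic captured at training time
DRIFT_THRESHOLD = 0.25    # warn if the live mean deviates by more than 25%

recent_values: list[float] = []

def log_prediction(features: dict, prediction: float, confidence: float) -> None:
    """Structured input/output logging plus a crude drift check."""
    logger.info(json.dumps({
        "features": features,
        "prediction": prediction,
        "confidence": confidence,
    }))

    recent_values.append(features["avg_session_minutes"])
    if len(recent_values) >= 500:                    # sliding window
        live_mean = statistics.fmean(recent_values)
        drift = abs(live_mean - TRAINING_MEAN) / TRAINING_MEAN
        if drift > DRIFT_THRESHOLD:
            logger.warning("Possible data drift: live mean %.2f vs baseline %.2f",
                           live_mean, TRAINING_MEAN)
        recent_values.clear()
```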


Final Thought

Scaling ML is a team sport. It’s about more than training—it’s about designing systems that perform, adapt, and evolve in production.

Don’t just build for the lab. Build for the real world.
