GCP Deployment: Vertex AI
Triloki Gupta
Contents
• Vertex AI
• Workbench/ JupyterLab
• AutoML
• Docker
• Trained Model Deployment
• Flask API Deployment
• Conclusion
Vertex AI
Vertex AI is a managed machine learning platform that helps you build, deploy, and
scale machine learning models faster and more easily.
Why Vertex AI?
Ease of use: Vertex AI provides a unified experience for managing all aspects of your
machine learning lifecycle, from data preparation to model deployment. This makes it
easy to get started with machine learning, even if you don't have a lot of experience.
Pre-trained models: Vertex AI offers a variety of pre-trained models that can be used
to quickly build and deploy machine learning models for a variety of use cases. This can
save you time and effort in the development process.
Scalability: Vertex AI resources can be scaled up or down to meet your needs, so you
can start small and then scale up as your needs grow.
Security: Vertex AI is a secure platform that meets the highest security standards. This
means that you can be confident that your data is safe.
Workbench/ JupyterLab
Managed notebooks provide JupyterLab services and flexible computing resources
integrated with Google Cloud services.
User-managed notebooks have JupyterLab 3 pre-installed and are configured with
GPU-enabled machine learning frameworks.
Necessary steps for creating a notebook in Vertex AI:
• Choose the us-central1 option in the Region drop-down
• In the Networking section:
  • Choose Network shared with me
  • Uncheck Enable external IP address
  • Check Allow proxy access when it's available
AutoML
• Upload the dataset in the Datasets section
• AutoML steps:
  • Select CREATE from Model Registry
  • Under Training method, choose the Dataset and Objective, and set Model training
    method to AutoML
  • Enter the Name and Target column under Model details
  • Under Training options, add or remove features
  • Under Compute and pricing, enter the number of nodes
• Deploy to Endpoint
  • Enter the Endpoint name under Define your endpoint
  • Add the Machine type under Model settings
  • Leave Model monitoring as it is and click DEPLOY
  • The endpoint will be ready in a few minutes
  • While creating the endpoint, check the Explainability options
Docker
Docker is a software platform that allows you to build, test, and deploy applications
quickly.
Docker packages software into standardized units called containers that have
everything the software needs to run, including libraries, system tools, code, and
runtime.
Basic Docker Commands
• gcloud auth login
• gcloud auth configure-docker us-central1-docker.pkg.dev
• docker build ./ -t my-image
• docker run my-image
• docker tag my-image us-central1-docker.pkg.dev/ford-deeb08c04ecbbaaac14dbfa0/ford-container-images/test-image:version
• docker push us-central1-docker.pkg.dev/ford-deeb08c04ecbbaaac14dbfa0/ford-container-images/test-image:version
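The tag and push commands must use exactly the same image name, which follows the Artifact Registry pattern REGION-docker.pkg.dev/PROJECT/REPOSITORY/IMAGE:TAG. A minimal sketch of building that name once and reusing it is shown below; the helper names and the project, repository, and tag values are hypothetical and used only for illustration.

```python
def artifact_registry_image(region, project, repository, image, tag):
    """Return the full Artifact Registry name for a Docker image."""
    return f"{region}-docker.pkg.dev/{project}/{repository}/{image}:{tag}"


def tag_and_push_commands(local_image, remote_image):
    """Return the docker commands (as argument lists) that publish an image."""
    return [
        ["docker", "tag", local_image, remote_image],
        ["docker", "push", remote_image],
    ]


# Hypothetical values, for illustration only.
remote = artifact_registry_image(
    "us-central1", "my-project", "my-repo", "test-image", "v1"
)
for command in tag_and_push_commands("my-image", remote):
    print(" ".join(command))
```

Building the name in one place avoids the tag and the push drifting apart through a typo.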
Trained Model Deployment
• Deploying with a pre-built container
  • Select IMPORT from Model Registry
  • Enter the Name and Region
  • Under Model settings, select a Pre-built container, choose the Model framework
    that matches the project, and enter the model package location in Model artifact
    location
• Deploying with a custom container
  • Select CREATE from Model Registry
  • Under Training method, set Dataset to No managed dataset
  • Enter the Name in Model details
  • Under Training container, select Custom container and choose a container image
    from Artifact Registry
  • Under Compute and pricing, choose the Machine type that suits the project
Flask API Deployment
• Build and push the Docker image to Artifact Registry
• Model deployment
  • From Model Registry, click IMPORT
  • Enter the name and region under the Name and region section
  • Under Model settings, choose Import an existing custom container and add the
    Prediction route (/predict) and Health route (/health)
  • Click IMPORT; the model will be ready in a few minutes
• Deploy to Endpoint
  • Enter the Endpoint name under Define your endpoint
  • Add the Machine type under Model settings
  • Leave Model monitoring as it is and click DEPLOY
  • The endpoint will be ready in a few minutes
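Once the endpoint is ready, it can be called over REST. The sketch below only builds the request and does not send it. The URL follows the pattern of Vertex AI's public :predict REST method; the helper name build_predict_request and the project, endpoint ID, and token values are hypothetical placeholders.

```python
import json
import urllib.request


def build_predict_request(project, region, endpoint_id, instances, token):
    """Build (but do not send) a POST request to a Vertex AI endpoint."""
    url = (
        f"https://{region}-aiplatform.googleapis.com/v1/"
        f"projects/{project}/locations/{region}/endpoints/{endpoint_id}:predict"
    )
    # The request body is a JSON object with an "instances" list.
    body = json.dumps({"instances": instances}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# Hypothetical values, for illustration only.
req = build_predict_request(
    project="my-project",
    region="us-central1",
    endpoint_id="1234567890",
    instances=[[1.0, 2.0, 3.0]],
    token="ACCESS_TOKEN",
)
print(req.get_method(), req.full_url)
```

Passing the request to urllib.request.urlopen would send it; an access token can be obtained with gcloud auth print-access-token.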
Custom Deployment: Key Points
• Points to consider before a custom deployment:
  • The HTTP server must listen on host 0.0.0.0
  • A /health route is required; it must accept GET requests and respond with status
    code 200 OK
  • There should be a single prediction route (/predict) that accepts POST requests
  • Prediction requests must be in JSON format and 1.5 MB or smaller
  • Every request must contain an instances field
  • The response from the HTTP server is a JSON dictionary with one field,
    predictions
  • Each prediction response must also be 1.5 MB or smaller
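The request and response rules above can be sketched as two framework-independent handlers. This is an illustration written against the standard library only, not the deck's actual Flask code; predict_one is a hypothetical stand-in for real model inference, and the 1.5 MB limit is assumed here to mean 1,500,000 bytes.

```python
import json

# Assumption: "1.5 MB" is taken as 1,500,000 bytes for this sketch.
MAX_BYTES = 1_500_000


def predict_one(instance):
    """Hypothetical stand-in for model inference: sums the features."""
    return sum(instance)


def handle_health():
    """GET /health: report that the server is up with status 200."""
    return 200, {"status": "ok"}


def handle_predict(raw_body):
    """POST /predict: validate the JSON body and return (status, response)."""
    if len(raw_body) > MAX_BYTES:
        return 413, {"error": "request is larger than 1.5 MB"}
    try:
        payload = json.loads(raw_body)
    except ValueError:
        return 400, {"error": "body is not valid JSON"}
    if not isinstance(payload, dict) or not isinstance(payload.get("instances"), list):
        return 400, {"error": "request must contain an 'instances' list"}
    predictions = [predict_one(instance) for instance in payload["instances"]]
    return 200, {"predictions": predictions}


status, response = handle_predict(b'{"instances": [[1, 2], [3, 4]]}')
print(status, response)
```

In a Flask app, the /predict route would pass request.get_data() to handle_predict and return the result as JSON, with the server started on host 0.0.0.0.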
Conclusion
• The biggest advantage of a custom container is that we can add extra workflow steps
outside of the ML model
• Compared to the optimized pre-built containers, custom containers are more
difficult to implement and more complex to set up, but they provide more
flexibility
• Everything except the core logic of model inference, including service management
and auto-scaling, is taken care of by Vertex AI
• Considering all these points, Vertex AI is a good fit for ML models that can be
deployed and served through a single prediction API
• We can't expose multiple APIs, because the endpoint seems dedicated to ML prediction only
• A dataset should have at least 1k data points for AutoML
Thank You
Optimization: Vertex AI
• There is an upper limit of about 60 s on the request timeout (a major drawback
of Vertex AI), as discussed on StackOverflow
• With a Standard machine (2 CPUs and 7.5 GB RAM) and a CPU TensorFlow environment,
it ran for 20 iterations
• With a High-memory machine (16 CPUs and 108 GB RAM) and a GPU TensorFlow
environment, we reached 50 iterations before hitting the timeout
• With a High-GPU machine (96 CPUs and 680 GB RAM) and a GPU TensorFlow
environment, we reached 70-80 iterations before hitting the timeout
• Sometimes it failed even at 20 iterations on a High-memory machine
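One way to live with a fixed request timeout is to stop iterating when a time budget is used up and return what has been computed so far. The sketch below is a generic illustration, not the code behind the measurements above; run_with_budget and its parameters are hypothetical names.

```python
import time


def run_with_budget(step, max_iterations, budget_seconds, clock=time.monotonic):
    """Call step(i) until max_iterations is reached or the time budget is spent.

    Returns the number of completed iterations. The clock is injectable so
    the behaviour can be checked without waiting in real time.
    """
    deadline = clock() + budget_seconds
    completed = 0
    for i in range(max_iterations):
        # Check the deadline before starting each new iteration.
        if clock() >= deadline:
            break
        step(i)
        completed += 1
    return completed


# With a generous budget, all iterations complete.
done = run_with_budget(lambda i: None, max_iterations=5, budget_seconds=50)
print(done)
```

The deadline is checked only between iterations, so the budget should leave headroom for one full iteration plus the time needed to send the response.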
GCP Deployment- Vertex AI

  • 1. GCP Deployment: Vertex AI Triloki Gupta
  • 2. Contents
      • Vertex AI
      • Workbench/ JupyterLab
      • AutoML
      • Docker
      • Trained Model Deployment
      • Flask API Deployment
      • Conclusion
  • 3. Vertex AI
      Vertex AI is a managed machine learning platform that helps you build, deploy, and scale machine learning models faster and easier.
      Why Vertex AI?
      • Ease of use: Vertex AI provides a unified experience for managing all aspects of your machine learning lifecycle, from data preparation to model deployment. This makes it easy to get started with machine learning, even if you don't have a lot of experience.
      • Pre-trained models: Vertex AI offers a variety of pre-trained models that can be used to quickly build and deploy machine learning models for a variety of use cases. This can save you time and effort in the development process.
      • Scalability: Vertex AI is a scalable platform that can easily be scaled up or down to meet your needs. This means that you can start small and then scale up as your needs grow.
      • Security: Vertex AI is a secure platform that meets the highest security standards. This means that you can be confident that your data is safe.
  • 4. Workbench/ JupyterLab
      • Managed notebooks provide JupyterLab services and flexible computing resources integrated with Google Cloud services.
      • User-managed notebooks have JupyterLab 3 pre-installed and are configured with GPU-enabled machine learning frameworks.
      • Necessary steps for creating a notebook in Vertex AI:
          • Choose the us-central1 option in the Region drop-down.
          • In the Networking section:
              • Choose Network shared with me.
              • Uncheck Enable external IP address.
              • Check Allow proxy access when it's available.
  • 5. AutoML
      • Upload the dataset in the Datasets section.
      • AutoML steps:
          • Select CREATE from Model Registry.
          • Under Training method, choose the Dataset and Objective, and set Model training method to AutoML.
          • Under Model details, enter the Name and Target column.
          • Under Training options, add or remove features as needed.
          • Under Compute and pricing, enter the number of nodes.
      • Deploy to Endpoint:
          • Enter the Endpoint name under Define your endpoint.
          • Add the Machine type under Model settings.
          • Leave Model monitoring as it is and click DEPLOY.
          • The endpoint will be ready in a few minutes.
          • While setting up the endpoint, check the Explainability options.
  • 6. Docker
      Docker is a software platform that lets you build, test, and deploy applications quickly. It packages software into standardized units called containers, which have everything the software needs to run, including libraries, system tools, code, and runtime.
      Basic Docker commands:
      • gcloud auth login
      • gcloud auth configure-docker us-central1-docker.pkg.dev
      • docker build ./ -t my-image
      • docker run my-image
      • docker tag my-image us-central1-docker.pkg.dev/ford-deeb08c04ecbbaaac14dbfa0/ford-container-images/test-image:version
      • docker push us-central1-docker.pkg.dev/ford-deeb08c04ecbbaaac14dbfa0/ford-container-images/test-image:version
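The tag and push commands above only work when both use exactly the same image URI, so it can help to build that URI in one place. The following is a minimal sketch, assuming the Artifact Registry layout REGION-docker.pkg.dev/PROJECT/REPOSITORY/IMAGE:TAG seen in the commands; the helper names `image_uri` and `docker_commands` and the values `my-project` and `my-repo` are hypothetical and only illustrate the idea.

```python
import shlex
from typing import List


def image_uri(region: str, project: str, repository: str, image: str, tag: str) -> str:
    """Return REGION-docker.pkg.dev/PROJECT/REPOSITORY/IMAGE:TAG (hypothetical helper)."""
    parts = {"region": region, "project": project, "repository": repository,
             "image": image, "tag": tag}
    for name, value in parts.items():
        # A stray space or an empty part silently produces a different, invalid URI.
        if not value or any(ch.isspace() for ch in value):
            raise ValueError(f"{name} must be non-empty without whitespace: {value!r}")
    return f"{region}-docker.pkg.dev/{project}/{repository}/{image}:{tag}"


def docker_commands(local_image: str, uri: str) -> List[str]:
    """Return the build, tag, and push commands as strings; nothing is executed here."""
    return [
        shlex.join(["docker", "build", "./", "-t", local_image]),
        shlex.join(["docker", "tag", local_image, uri]),
        shlex.join(["docker", "push", uri]),
    ]
```

Calling `docker_commands("my-image", image_uri("us-central1", "my-project", "my-repo", "test-image", "v1"))` returns three command strings that share one URI, so the tag and push targets cannot drift apart.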
  • 7. Trained Model Deployment
      • Deploying with a pre-built container:
          • Select IMPORT from Model Registry.
          • Enter the Name and region.
          • Under Model settings, select a pre-built container, choose the Model framework that matches the project, and enter the model package location in Model artifact location.
      • Deploying with a custom container:
          • Select CREATE from Model Registry.
          • Under Training method, set Dataset to No managed dataset.
          • Enter the Name under Model details.
          • Under Training container, select Custom container and choose a container image from Artifact Registry.
          • Under Compute and pricing, choose the Machine type that suits the project.
  • 8. Flask API Deployment
      • Build the Docker image and push it to Artifact Registry.
      • Model deployment:
          • From Model Registry, click IMPORT.
          • Enter the name and region under the Name and region section.
          • Under Model settings, choose Import an existing custom container, and add the Prediction route (/predict) and the Health route (/health).
          • Click IMPORT; the model will be ready in a few minutes.
      • Deploy to Endpoint:
          • Enter the Endpoint name under Define your endpoint.
          • Add the Machine type under Model settings.
          • Leave Model monitoring as it is and click DEPLOY.
          • The endpoint will be ready in a few minutes.
  • 9. Custom Deployment: Key Points
      Points to consider before a custom deployment:
      • The HTTP server must listen on host 0.0.0.0.
      • A /health route is required; it must accept the GET method and respond with status code 200 OK.
      • The HTTP server accepts prediction requests in JSON format.
      • Each prediction request must be written in JSON and be 1.5 MB or smaller.
      • Prediction must be exposed as a single route (/predict) using the POST method.
      • Every request must contain an instances field.
      • The response from the HTTP server is a JSON dictionary with one field, predictions.
      • Each prediction response must also be 1.5 MB or smaller.
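The key points above can be sketched as a small HTTP server. The slides use Flask; this sketch uses only Python's standard `http.server` module so that it runs without extra packages, and the same route logic carries over to a Flask app. The `predict` function is a hypothetical stand-in for a real model, the 1.5 MB limit is read as 1.5 * 1024 * 1024 bytes, and reading the port from the `AIP_HTTP_PORT` environment variable with a fallback of 8080 is an assumption about the container environment.

```python
import json
import os
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer

# 1.5 MB limit from the key points; 1 MB = 1024 * 1024 bytes is an assumption.
MAX_BYTES = int(1.5 * 1024 * 1024)


def predict(instances):
    """Hypothetical stand-in for a real model: sums the numbers in each instance."""
    return [sum(instance) for instance in instances]


def handle_predict(body: bytes):
    """Validate a raw request body and return (status_code, response_dict)."""
    if len(body) > MAX_BYTES:
        return 413, {"error": "request must be 1.5 MB or smaller"}
    try:
        request = json.loads(body)
    except ValueError:
        return 400, {"error": "request body must be valid JSON"}
    if not isinstance(request, dict) or not isinstance(request.get("instances"), list):
        return 400, {"error": "request must contain an 'instances' list"}
    try:
        response = {"predictions": predict(request["instances"])}
    except TypeError:
        return 400, {"error": "each instance must be a list of numbers"}
    if len(json.dumps(response).encode("utf-8")) > MAX_BYTES:
        return 500, {"error": "response would exceed 1.5 MB"}
    return 200, response


class PredictionHandler(BaseHTTPRequestHandler):
    def _send_json(self, status, payload):
        body = json.dumps(payload).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def do_GET(self):
        # Health check: GET /health answers 200 OK.
        if self.path == "/health":
            self._send_json(200, {"status": "ok"})
        else:
            self._send_json(404, {"error": "not found"})

    def do_POST(self):
        # Single prediction route: POST /predict.
        if self.path != "/predict":
            self._send_json(404, {"error": "not found"})
            return
        length = int(self.headers.get("Content-Length", 0))
        if length > MAX_BYTES:
            # Reject oversized requests without reading the body.
            self._send_json(413, {"error": "request must be 1.5 MB or smaller"})
            return
        self._send_json(*handle_predict(self.rfile.read(length)))

    def log_message(self, format, *args):
        pass  # Keep the sketch quiet; a real service would log requests.


def make_server(host="0.0.0.0", port=None):
    """Build the server on 0.0.0.0; the AIP_HTTP_PORT lookup is an assumption."""
    if port is None:
        port = int(os.environ.get("AIP_HTTP_PORT", "8080"))
    return ThreadingHTTPServer((host, port), PredictionHandler)
```

Calling `make_server().serve_forever()` would serve as the container entry point. Keeping the validation in `handle_predict`, separate from the handler class, makes the size and `instances` checks easy to unit test without starting a server.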
  • 10. Conclusion
      • The biggest advantage of a custom container is that we can add extra workflow steps outside the ML model.
      • Compared with the optimized pre-built containers, custom containers are harder to implement and more complex to set up, but they provide more flexibility.
      • Everything except the core model inference logic, including service management and auto-scaling, is taken care of by Vertex AI.
      • Considering all these points, Vertex AI is a good fit for ML models that can be deployed and served through a single prediction API.
      • We can't expose multiple APIs, because the endpoint appears to be dedicated to ML prediction only.
      • AutoML needs a minimum of 1,000 data points.
  • 12. Optimization: Vertex AI
      • There is an upper limit of about 60 s on the request timeout, a major drawback of Vertex AI (as reported on StackOverflow).
      • With a Standard machine (2 CPUs, 7.5 GB RAM) and a CPU TensorFlow environment, it ran for 20 iterations.
      • With a High-memory machine (16 CPUs, 108 GB RAM) and a GPU TensorFlow environment, we reached 50 iterations before hitting the timeout.
      • With a High-GPU machine (96 CPUs, 680 GB RAM) and a GPU TensorFlow environment, we reached 70-80 iterations before hitting the timeout.
      • On the High-memory machine, runs sometimes failed even at 20 iterations.
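Because the timeout caps how many iterations fit into one request, one way to work within it is to stop iterating shortly before the limit and return the result reached so far. The sketch below is only an illustration under stated assumptions: `run_within_budget` is a hypothetical helper, the 55-second default is an assumed safety margin under the roughly 60 s limit, and `step` stands for one iteration of whatever optimization the request runs. It does not lift the timeout; it only avoids losing the whole request to it.

```python
import time


def run_within_budget(step, state, max_iterations, budget_seconds=55.0, clock=time.monotonic):
    """Apply `step` repeatedly and stop early if the next iteration would
    likely overrun the time budget. Returns (state, completed_iterations)."""
    start = clock()
    completed = 0
    longest = 0.0  # duration of the slowest iteration seen so far
    while completed < max_iterations:
        iteration_start = clock()
        # Stop if an iteration as slow as the slowest one would exceed the budget.
        if (iteration_start - start) + longest > budget_seconds:
            break
        state = step(state)
        completed += 1
        longest = max(longest, clock() - iteration_start)
    return state, completed
```

The `clock` parameter exists so the function can be exercised with a fake clock in tests; in a service, the default `time.monotonic` is used and the number of completed iterations can be returned alongside the prediction.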