SlideShare a Scribd company logo
H2O.ai Confidential
LLM Learning Path -
Level 1
Author: Andreea Turcu
Head of Global Training @H2O.ai
H2O.ai Confidential
Fine-tuning
Refining pre-trained
models using
task-specific data,
enhancing their
performance on
targeted tasks.
Foundation
Powerful language
models trained on
extensive text data,
forming the basis for
various language
tasks.
Building Steps for LLMs
01 03
Eval LLMs
Thoroughly assessing
and comparing LLMs
is increasingly vital
due to their
heightened
significance and
complexity.
04
05
04
03
02
01
DataPrep
Converting
documents into
instruction pairs, like
QA pairs, facilitating
fine-tuning and
tasks.
02
Database
Effectively utilize
company data with a
database that
seamlessly
integrates new PDFs,
eliminating the need
for model retraining.
05
Applications
Elevate interactions
with advanced
language
comprehension and
LLM-driven response
generation for
enriched user
experiences.
06
H2O.ai Confidential
Table of Contents
1. Introduction to Language Models
2. Understanding LLM Architecture /
Foundation Models
3. Getting Started with LLM Data Studio
4. Fine-tuning LLMs
5. Making Your Own GPT and Fine-tuning using
LLM Studio
6. Evaluating and Benchmarking LLMs
7. Practical Applications and Case Studies
H2O.ai Confidential
Contents at a Glance
1. Introduction to Language Models
● What is a Language Model?
● Techniques Commonly Used
● Importance and Applications
H2O.ai Confidential
Foundation
Powerful language
models trained on
extensive text data,
forming the basis for
various language
tasks.
Building Steps for LLMs
01
05
04
03
02
01
Contents at a Glance
1. Introduction to Language Models
2. Understanding LLM Architecture /
Foundation Models
● What are Foundation Models?
● Neural Networks and Deep Learning
● Transformer Architecture vs. LLM Architecture
● Pre-training & fine-tuning of LLMs
● Transfer Learning and Adaptation
H2O.ai Confidential
Generative AI Definitions
Foundation Models
Large Language Models (LLMs)
Unlabeled
Training Data
Additional
Text-Based Data
Transformer
Algorithm
Transformer
Algorithm
Foundation
Model
LLM
Generative AI
Collection of ML algorithms that learn a representation of artifacts
from data and models, and use it to generate brand-new, completely
original artifacts that preserve a likeness to original data or models.
Foundation model
Is a Large machine learning model trained on a large amount of
unlabeled data using a transformer algorithm. This model can be
augmented by a range of fine-tuning (adapter) techniques. The
resulting model can be further adapted to a wide range of
applications.
Large Language Model (LLM)
An LLM is a type of foundation model specifically designed for natural
language processing.
Generative Pre-trained Transformer (GPT)
Is an LLM specifically designed to predict the next token. For example
ChatGPT is a conversational application built on top of an LLM.
Essential topics:
1. Grasping the essence of Foundation Models
2. Delving into Neural Networks and Deep Learning
3. Exploring the intricacies of the Transformer
Architecture
4. Understanding the concepts of pre-training and
fine-tuning in LLMs
5. Navigating Transfer Learning and Adaptation
techniques
Foundation models can be
used for a wide range of tasks:
1. Answering questions
2. Generating human-like text
3. Translating languages
4. Creating chatbots
5. Summarizing articles, and more
Neural Networks
Each node receives input from multiple nodes in
the previous layer, performs a computation, and
passes the output to the next layer. The output of
the last layer represents the final prediction or
decision made by the neural network.
Deep Learning = Neural networks with multiple
layers
Deep learning models are capable of learning
complex patterns and representations from large
amounts of data. The term "deep" refers to the
depth of the network, which signifies the
number of hidden layers between the input and
output layers.
In forward propagation,
input data flows through the
network, transforming into a
meaningful output.
Backpropagation fine-tunes
network parameters by
minimizing prediction errors
through iterative adjustments
based on desired output.
To remember:
● Not all neural networks qualify as deep learning
models.
● Deep learning is distinguished by network depth.
● Depth enables the learning of intricate data
features and relationships.
● This leads to improved performance in tasks like
image recognition and natural language
processing.
Applications of Neural
Networks and Deep Learning
in LLMs:
• Natural Language Processing (NLP)
• Speech Recognition
• Recommendation Systems
• Text Generation
• Language Understanding and Context
• Automation and Efficiency
• User Experience Enhancement
H2O.ai Confidential
v
● The emergence of Large Language Models (LLMs) coincided with
advancements in language understanding and generation.
● LLMs are distinguished by their exceptional size and complexity.
○ These models consist of billions of specialized components.
○ These components enable LLMs to comprehend intricate language
nuances.
● LLMs are capable of generating high-quality text.
H2O.ai Confidential
v
Fine-tune Example:
Learn a Specific Style of Answering and Writing
Fine-tuning training
Hyperparameter tuning
Data Scientist
Fine-Tuned
Large Language Model
Foundation
Large Language Model
Autoregressive, trained on diverse
data (“the whole internet”). Good at
continuing text.
Specialized style: learned
prompt & answer,
instructions
H2O.ai Confidential
Crucial Role in Language Models
1. Enhanced Communication
2. Information Assessment
3. Ethical Implications
4. Prospects for the Future
H2O.ai Confidential
Key Areas where LMs are used:
1. Chatbots and Virtual Assistants
2. Language Translation
3. Content Generation
4. Sentiment Analysis
5. Text Completion and Auto-correction
6. Voice Assistants
H2O.ai Confidential
Distinguishing Characteristics of LLMs
1. Scale
2. Creative Writing
3. Complex Problem Solving
4. Domain Expertise
5. Enhanced Language Understanding
6. Data Efficiency
7. Pre-training and Fine-tuning
8. Contextual Understanding
9. Language Generation
10. Transfer Learning
11. Versatility and Applications
12. Research and Innovation
H2O.ai Confidential
Some important terms related to the
Transformer architecture:
1. Attention
2. Multi-head Attention
3. Encoder
4. Decoder
5. Self-Attention
6. Feed-Forward Neural Network
7. Positional Encoding
8. Masking
H2O.ai Confidential
Reminder
- The Transformer is a specialized neural network architecture
introduced in the research paper "Attention is All You Need."
- Its primary function is to process sequences of data.
- It utilizes self-attention, a distinctive mechanism, to efficiently
capture relationships between words within a sentence.
H2O.ai Confidential
Reminder
- Large Language Models fall within a broader category of models
trained on extensive textual data without human annotations.
- Prominent models such as GPT-3 and BERT are constructed based
on the underlying Transformer architecture.
- These models attain comprehensive language representations by
harnessing the abundant data they encounter during their training
process.
H2O.ai Confidential
Primary objective of LLMs
- The primary aim of Large Language Models is to acquire potent
language representations from extensive text data.
- Once they have gained this expertise, they can undergo
fine-tuning for specific language tasks.
- These tasks may include sentiment analysis, question-answering,
or text classification, among others.
H2O.ai Confidential
H2O.ai Confidential
H2O.ai Confidential
Transfer learning
● Uses a pre-trained model as a foundation for a
new task.
● Instead of starting from scratch, the model
begins with pre-trained weights.
● Fine-tunes on a smaller labeled dataset specific
to the new task.
● Adapts pre-learned representations to the new
data's patterns and characteristics.
● Ideal for tasks with limited labeled data or
resource-intensive training.
Adaptation (domain adaptation)
● Targets domain differences between source
and target domains.
● Its goal is to make a model trained on the
source domain perform well on the target
domain, even with limited labeled data.
● A key challenge is ensuring effective
generalization despite distribution shifts.
● Adaptation techniques align representations
from the source domain with the target
domain to reduce domain discrepancies.
H2O.ai Confidential
Robot Adaptation Approaches
1. Feature-based adaptation: Simplifies the
robot's view by finding common features
between old and new objects.
2. Instance-based adaptation: Adjusts the
robot's focus by prioritizing similar objects in
the new environment.
3. Model-based adaptation: Fine-tunes the
robot's recognition abilities by emphasizing
relevant details in the new environment.
H2O.ai Confidential
● Fine-tuning in LLMs enhances
adaptation.
● Empowers models with styles,
personalities, and domain
knowledge.
● Starts with a pre-trained LLM.
● Pre-training is generic, lacks
specificity.
H2O.ai Confidential
- Knowing LLM architecture
empowers researchers and
practitioners.
- Enables capturing context,
managing long-range connections,
and producing quality results.
- Enhances application design, boosts
model performance, and improves
language-related tasks.
H2O.ai Confidential
Thank you!
Ad

More Related Content

What's hot (20)

AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
DianaGray10
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
Keymate.AI
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
Daniel Zivkovic
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
Data Science Dojo
 
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdfUNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
Hermes Romero
 
ChatGPT and not only: how can you use the power of Generative AI at scale
ChatGPT and not only: how can you use the power of Generative AI at scaleChatGPT and not only: how can you use the power of Generative AI at scale
ChatGPT and not only: how can you use the power of Generative AI at scale
Maxim Salnikov
 
Prompt Engineering
Prompt EngineeringPrompt Engineering
Prompt Engineering
Manjunatha Sai
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
Generative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlowGenerative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlow
Gene Leybzon
 
CHATGPT.pptx
CHATGPT.pptxCHATGPT.pptx
CHATGPT.pptx
SajedRahman2
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language models
AdventureWorld5
 
[BEDROCK] Claude Prompt Engineering Techniques.pptx
[BEDROCK] Claude Prompt Engineering Techniques.pptx[BEDROCK] Claude Prompt Engineering Techniques.pptx
[BEDROCK] Claude Prompt Engineering Techniques.pptx
ssuserdd71c7
 
Generative AI
Generative AIGenerative AI
Generative AI
All Things Open
 
Building and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache Airflow
Kaxil Naik
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
taozen
 
Large Language Models | How Large Language Models Work? | Introduction to LLM...
Large Language Models | How Large Language Models Work? | Introduction to LLM...Large Language Models | How Large Language Models Work? | Introduction to LLM...
Large Language Models | How Large Language Models Work? | Introduction to LLM...
Simplilearn
 
Mother of Language`s Langchain
Mother of Language`s LangchainMother of Language`s Langchain
Mother of Language`s Langchain
Jun-hang Lee
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Ghulam Imaduddin
 
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
DianaGray10
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
Keymate.AI
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
Daniel Zivkovic
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
Data Science Dojo
 
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdfUNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
Hermes Romero
 
ChatGPT and not only: how can you use the power of Generative AI at scale
ChatGPT and not only: how can you use the power of Generative AI at scaleChatGPT and not only: how can you use the power of Generative AI at scale
ChatGPT and not only: how can you use the power of Generative AI at scale
Maxim Salnikov
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
Generative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlowGenerative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlow
Gene Leybzon
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language models
AdventureWorld5
 
[BEDROCK] Claude Prompt Engineering Techniques.pptx
[BEDROCK] Claude Prompt Engineering Techniques.pptx[BEDROCK] Claude Prompt Engineering Techniques.pptx
[BEDROCK] Claude Prompt Engineering Techniques.pptx
ssuserdd71c7
 
Building and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache Airflow
Kaxil Naik
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
taozen
 
Large Language Models | How Large Language Models Work? | Introduction to LLM...
Large Language Models | How Large Language Models Work? | Introduction to LLM...Large Language Models | How Large Language Models Work? | Introduction to LLM...
Large Language Models | How Large Language Models Work? | Introduction to LLM...
Simplilearn
 
Mother of Language`s Langchain
Mother of Language`s LangchainMother of Language`s Langchain
Mother of Language`s Langchain
Jun-hang Lee
 

Similar to LLM Learning Path Level 1 - Presentation Slides (20)

Vectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxVectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptx
SachinAngre3
 
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLMCrafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
ChristopherTHyatt
 
leewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdfleewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdf
alexjohnson7307
 
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdfsolulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
RamayaRam
 
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdfsolulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
RamayaRam
 
Explore the magic of " ChatGPT " .pptx.
Explore the magic of  " ChatGPT " .pptx.Explore the magic of  " ChatGPT " .pptx.
Explore the magic of " ChatGPT " .pptx.
Sanajit Sahoo
 
Top Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdfTop Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdf
SoluLab1231
 
Gpt1 and 2 model review
Gpt1 and 2 model reviewGpt1 and 2 model review
Gpt1 and 2 model review
Seoung-Ho Choi
 
Train foundation model for domain-specific language model
Train foundation model for domain-specific language modelTrain foundation model for domain-specific language model
Train foundation model for domain-specific language model
Benjaminlapid1
 
Top Comparison of Large Language ModelsLLMs Explained (2).pdf
Top Comparison of Large Language ModelsLLMs Explained (2).pdfTop Comparison of Large Language ModelsLLMs Explained (2).pdf
Top Comparison of Large Language ModelsLLMs Explained (2).pdf
imoliviabennett
 
Comparison of Large Language Models The Ultimate Guide.pdf
Comparison of Large Language Models The Ultimate Guide.pdfComparison of Large Language Models The Ultimate Guide.pdf
Comparison of Large Language Models The Ultimate Guide.pdf
imoliviabennett
 
A REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODS
A REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODSA REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODS
A REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODS
kevig
 
International Journal on Natural Language Computing (IJNLC)
International Journal on Natural Language Computing (IJNLC)International Journal on Natural Language Computing (IJNLC)
International Journal on Natural Language Computing (IJNLC)
basindavid68
 
A Review of Prompt-Free Few-Shot Text Classification Methods
A Review of Prompt-Free Few-Shot Text Classification MethodsA Review of Prompt-Free Few-Shot Text Classification Methods
A Review of Prompt-Free Few-Shot Text Classification Methods
kevig
 
Top Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdfTop Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdf
imoliviabennett
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
ChristopherTHyatt
 
LLM.pdf
LLM.pdfLLM.pdf
LLM.pdf
MedBelatrach
 
Mastering Sentiment Analysis with Large Language Models.pptx
Mastering Sentiment Analysis with Large Language Models.pptxMastering Sentiment Analysis with Large Language Models.pptx
Mastering Sentiment Analysis with Large Language Models.pptx
CSGAMER2
 
How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...
How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...
How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...
Nexgits Private Limited
 
How Large Language Models Are Changing the AI Landscape
How Large Language Models Are Changing the AI LandscapeHow Large Language Models Are Changing the AI Landscape
How Large Language Models Are Changing the AI Landscape
Qubited
 
Vectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxVectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptx
SachinAngre3
 
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLMCrafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
ChristopherTHyatt
 
leewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdfleewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdf
alexjohnson7307
 
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdfsolulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
RamayaRam
 
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdfsolulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
solulab.com-Top Comparison of Large Language ModelsLLMs Explained.pdf
RamayaRam
 
Explore the magic of " ChatGPT " .pptx.
Explore the magic of  " ChatGPT " .pptx.Explore the magic of  " ChatGPT " .pptx.
Explore the magic of " ChatGPT " .pptx.
Sanajit Sahoo
 
Top Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdfTop Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdf
SoluLab1231
 
Gpt1 and 2 model review
Gpt1 and 2 model reviewGpt1 and 2 model review
Gpt1 and 2 model review
Seoung-Ho Choi
 
Train foundation model for domain-specific language model
Train foundation model for domain-specific language modelTrain foundation model for domain-specific language model
Train foundation model for domain-specific language model
Benjaminlapid1
 
Top Comparison of Large Language ModelsLLMs Explained (2).pdf
Top Comparison of Large Language ModelsLLMs Explained (2).pdfTop Comparison of Large Language ModelsLLMs Explained (2).pdf
Top Comparison of Large Language ModelsLLMs Explained (2).pdf
imoliviabennett
 
Comparison of Large Language Models The Ultimate Guide.pdf
Comparison of Large Language Models The Ultimate Guide.pdfComparison of Large Language Models The Ultimate Guide.pdf
Comparison of Large Language Models The Ultimate Guide.pdf
imoliviabennett
 
A REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODS
A REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODSA REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODS
A REVIEW OF PROMPT-FREE FEW-SHOT TEXT CLASSIFICATION METHODS
kevig
 
International Journal on Natural Language Computing (IJNLC)
International Journal on Natural Language Computing (IJNLC)International Journal on Natural Language Computing (IJNLC)
International Journal on Natural Language Computing (IJNLC)
basindavid68
 
A Review of Prompt-Free Few-Shot Text Classification Methods
A Review of Prompt-Free Few-Shot Text Classification MethodsA Review of Prompt-Free Few-Shot Text Classification Methods
A Review of Prompt-Free Few-Shot Text Classification Methods
kevig
 
Top Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdfTop Comparison of Large Language ModelsLLMs Explained.pdf
Top Comparison of Large Language ModelsLLMs Explained.pdf
imoliviabennett
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
ChristopherTHyatt
 
Mastering Sentiment Analysis with Large Language Models.pptx
Mastering Sentiment Analysis with Large Language Models.pptxMastering Sentiment Analysis with Large Language Models.pptx
Mastering Sentiment Analysis with Large Language Models.pptx
CSGAMER2
 
How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...
How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...
How to Enhance NLP’s Accuracy with Large Language Models_ A Comprehensive Gui...
Nexgits Private Limited
 
How Large Language Models Are Changing the AI Landscape
How Large Language Models Are Changing the AI LandscapeHow Large Language Models Are Changing the AI Landscape
How Large Language Models Are Changing the AI Landscape
Qubited
 
Ad

More from Sri Ambati (20)

H2O.ai Agents : From Theory to Practice - Support Presentation
H2O.ai Agents : From Theory to Practice - Support PresentationH2O.ai Agents : From Theory to Practice - Support Presentation
H2O.ai Agents : From Theory to Practice - Support Presentation
Sri Ambati
 
H2O Generative AI Starter Track - Support Presentation Slides.pdf
H2O Generative AI Starter Track - Support Presentation Slides.pdfH2O Generative AI Starter Track - Support Presentation Slides.pdf
H2O Generative AI Starter Track - Support Presentation Slides.pdf
Sri Ambati
 
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
H2O Gen AI Ecosystem Overview - Level 1 - Slide DeckH2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
Sri Ambati
 
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
An In-depth Exploration of Enterprise h2oGPTe  Slide DeckAn In-depth Exploration of Enterprise h2oGPTe  Slide Deck
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
Sri Ambati
 
Intro to Enterprise h2oGPTe Presentation Slides
Intro to Enterprise h2oGPTe Presentation SlidesIntro to Enterprise h2oGPTe Presentation Slides
Intro to Enterprise h2oGPTe Presentation Slides
Sri Ambati
 
Enterprise h2o GPTe Learning Path Slide Deck
Enterprise h2o GPTe Learning Path Slide DeckEnterprise h2o GPTe Learning Path Slide Deck
Enterprise h2o GPTe Learning Path Slide Deck
Sri Ambati
 
H2O Wave Course Starter - Presentation Slides
H2O Wave Course Starter - Presentation SlidesH2O Wave Course Starter - Presentation Slides
H2O Wave Course Starter - Presentation Slides
Sri Ambati
 
Large Language Models (LLMs) - Level 3 Slides
Large Language Models (LLMs) - Level 3 SlidesLarge Language Models (LLMs) - Level 3 Slides
Large Language Models (LLMs) - Level 3 Slides
Sri Ambati
 
Data Science and Machine Learning Platforms (2024) Slides
Data Science and Machine Learning Platforms (2024) SlidesData Science and Machine Learning Platforms (2024) Slides
Data Science and Machine Learning Platforms (2024) Slides
Sri Ambati
 
Data Prep for H2O Driverless AI - Slides
Data Prep for H2O Driverless AI - SlidesData Prep for H2O Driverless AI - Slides
Data Prep for H2O Driverless AI - Slides
Sri Ambati
 
H2O Cloud AI Developer Services - Slides (2024)
H2O Cloud AI Developer Services - Slides (2024)H2O Cloud AI Developer Services - Slides (2024)
H2O Cloud AI Developer Services - Slides (2024)
Sri Ambati
 
LLM Learning Path Level 2 - Presentation Slides
LLM Learning Path Level 2 - Presentation SlidesLLM Learning Path Level 2 - Presentation Slides
LLM Learning Path Level 2 - Presentation Slides
Sri Ambati
 
Hydrogen Torch - Starter Course - Presentation Slides
Hydrogen Torch - Starter Course - Presentation SlidesHydrogen Torch - Starter Course - Presentation Slides
Hydrogen Torch - Starter Course - Presentation Slides
Sri Ambati
 
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
Sri Ambati
 
H2O Driverless AI Starter Course - Slides and Assignments
H2O Driverless AI Starter Course - Slides and AssignmentsH2O Driverless AI Starter Course - Slides and Assignments
H2O Driverless AI Starter Course - Slides and Assignments
Sri Ambati
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
Sri Ambati
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptx
Sri Ambati
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek
Sri Ambati
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5th
Sri Ambati
 
H2O.ai Agents : From Theory to Practice - Support Presentation
H2O.ai Agents : From Theory to Practice - Support PresentationH2O.ai Agents : From Theory to Practice - Support Presentation
H2O.ai Agents : From Theory to Practice - Support Presentation
Sri Ambati
 
H2O Generative AI Starter Track - Support Presentation Slides.pdf
H2O Generative AI Starter Track - Support Presentation Slides.pdfH2O Generative AI Starter Track - Support Presentation Slides.pdf
H2O Generative AI Starter Track - Support Presentation Slides.pdf
Sri Ambati
 
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
H2O Gen AI Ecosystem Overview - Level 1 - Slide DeckH2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
Sri Ambati
 
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
An In-depth Exploration of Enterprise h2oGPTe  Slide DeckAn In-depth Exploration of Enterprise h2oGPTe  Slide Deck
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
Sri Ambati
 
Intro to Enterprise h2oGPTe Presentation Slides
Intro to Enterprise h2oGPTe Presentation SlidesIntro to Enterprise h2oGPTe Presentation Slides
Intro to Enterprise h2oGPTe Presentation Slides
Sri Ambati
 
Enterprise h2o GPTe Learning Path Slide Deck
Enterprise h2o GPTe Learning Path Slide DeckEnterprise h2o GPTe Learning Path Slide Deck
Enterprise h2o GPTe Learning Path Slide Deck
Sri Ambati
 
H2O Wave Course Starter - Presentation Slides
H2O Wave Course Starter - Presentation SlidesH2O Wave Course Starter - Presentation Slides
H2O Wave Course Starter - Presentation Slides
Sri Ambati
 
Large Language Models (LLMs) - Level 3 Slides
Large Language Models (LLMs) - Level 3 SlidesLarge Language Models (LLMs) - Level 3 Slides
Large Language Models (LLMs) - Level 3 Slides
Sri Ambati
 
Data Science and Machine Learning Platforms (2024) Slides
Data Science and Machine Learning Platforms (2024) SlidesData Science and Machine Learning Platforms (2024) Slides
Data Science and Machine Learning Platforms (2024) Slides
Sri Ambati
 
Data Prep for H2O Driverless AI - Slides
Data Prep for H2O Driverless AI - SlidesData Prep for H2O Driverless AI - Slides
Data Prep for H2O Driverless AI - Slides
Sri Ambati
 
H2O Cloud AI Developer Services - Slides (2024)
H2O Cloud AI Developer Services - Slides (2024)H2O Cloud AI Developer Services - Slides (2024)
H2O Cloud AI Developer Services - Slides (2024)
Sri Ambati
 
LLM Learning Path Level 2 - Presentation Slides
LLM Learning Path Level 2 - Presentation SlidesLLM Learning Path Level 2 - Presentation Slides
LLM Learning Path Level 2 - Presentation Slides
Sri Ambati
 
Hydrogen Torch - Starter Course - Presentation Slides
Hydrogen Torch - Starter Course - Presentation SlidesHydrogen Torch - Starter Course - Presentation Slides
Hydrogen Torch - Starter Course - Presentation Slides
Sri Ambati
 
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
Sri Ambati
 
H2O Driverless AI Starter Course - Slides and Assignments
H2O Driverless AI Starter Course - Slides and AssignmentsH2O Driverless AI Starter Course - Slides and Assignments
H2O Driverless AI Starter Course - Slides and Assignments
Sri Ambati
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
Sri Ambati
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptx
Sri Ambati
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek
Sri Ambati
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5th
Sri Ambati
 
Ad

Recently uploaded (20)

The History of Kashmir Karkota Dynasty NEP.pptx
The History of Kashmir Karkota Dynasty NEP.pptxThe History of Kashmir Karkota Dynasty NEP.pptx
The History of Kashmir Karkota Dynasty NEP.pptx
Arya Mahila P. G. College, Banaras Hindu University, Varanasi, India.
 
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
Celine George
 
All About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdfAll About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdf
TechSoup
 
The role of wall art in interior designing
The role of wall art in interior designingThe role of wall art in interior designing
The role of wall art in interior designing
meghaark2110
 
LDMMIA Reiki Yoga S5 Daily Living Workshop
LDMMIA Reiki Yoga S5 Daily Living WorkshopLDMMIA Reiki Yoga S5 Daily Living Workshop
LDMMIA Reiki Yoga S5 Daily Living Workshop
LDM Mia eStudios
 
APGAR SCORE BY sweety Tamanna Mahapatra MSc Pediatric
APGAR SCORE  BY sweety Tamanna Mahapatra MSc PediatricAPGAR SCORE  BY sweety Tamanna Mahapatra MSc Pediatric
APGAR SCORE BY sweety Tamanna Mahapatra MSc Pediatric
SweetytamannaMohapat
 
Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)
Mohamed Rizk Khodair
 
TERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptx
TERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptxTERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptx
TERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptx
PoojaSen20
 
Chemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptxChemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptx
Mayuri Chavan
 
How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18
Celine George
 
CNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscessCNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscess
Mohamed Rizk Khodair
 
spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)
Mohamed Rizk Khodair
 
2025 The Senior Landscape and SET plan preparations.pptx
2025 The Senior Landscape and SET plan preparations.pptx2025 The Senior Landscape and SET plan preparations.pptx
2025 The Senior Landscape and SET plan preparations.pptx
mansk2
 
antiquity of writing in ancient India- literary & archaeological evidence
antiquity of writing in ancient India- literary & archaeological evidenceantiquity of writing in ancient India- literary & archaeological evidence
antiquity of writing in ancient India- literary & archaeological evidence
PrachiSontakke5
 
Cultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptxCultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptx
UmeshTimilsina1
 
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and GuestsLDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDM Mia eStudios
 
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptxANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
Mayuri Chavan
 
How to Manage Amounts in Local Currency in Odoo 18 Purchase
How to Manage Amounts in Local Currency in Odoo 18 PurchaseHow to Manage Amounts in Local Currency in Odoo 18 Purchase
How to Manage Amounts in Local Currency in Odoo 18 Purchase
Celine George
 
Overview Well-Being and Creative Careers
Overview Well-Being and Creative CareersOverview Well-Being and Creative Careers
Overview Well-Being and Creative Careers
University of Amsterdam
 
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
Celine George
 
All About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdfAll About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdf
TechSoup
 
The role of wall art in interior designing
The role of wall art in interior designingThe role of wall art in interior designing
The role of wall art in interior designing
meghaark2110
 
LDMMIA Reiki Yoga S5 Daily Living Workshop
LDMMIA Reiki Yoga S5 Daily Living WorkshopLDMMIA Reiki Yoga S5 Daily Living Workshop
LDMMIA Reiki Yoga S5 Daily Living Workshop
LDM Mia eStudios
 
APGAR SCORE BY sweety Tamanna Mahapatra MSc Pediatric
APGAR SCORE  BY sweety Tamanna Mahapatra MSc PediatricAPGAR SCORE  BY sweety Tamanna Mahapatra MSc Pediatric
APGAR SCORE BY sweety Tamanna Mahapatra MSc Pediatric
SweetytamannaMohapat
 
Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)
Mohamed Rizk Khodair
 
TERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptx
TERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptxTERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptx
TERMINOLOGIES,GRIEF PROCESS AND LOSS AMD ITS TYPES .pptx
PoojaSen20
 
Chemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptxChemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptx
Mayuri Chavan
 
How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18
Celine George
 
CNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscessCNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscess
Mohamed Rizk Khodair
 
spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)
Mohamed Rizk Khodair
 
2025 The Senior Landscape and SET plan preparations.pptx
2025 The Senior Landscape and SET plan preparations.pptx2025 The Senior Landscape and SET plan preparations.pptx
2025 The Senior Landscape and SET plan preparations.pptx
mansk2
 
antiquity of writing in ancient India- literary & archaeological evidence
antiquity of writing in ancient India- literary & archaeological evidenceantiquity of writing in ancient India- literary & archaeological evidence
antiquity of writing in ancient India- literary & archaeological evidence
PrachiSontakke5
 
Cultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptxCultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptx
UmeshTimilsina1
 
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and GuestsLDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDM Mia eStudios
 
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptxANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
Mayuri Chavan
 
How to Manage Amounts in Local Currency in Odoo 18 Purchase
How to Manage Amounts in Local Currency in Odoo 18 PurchaseHow to Manage Amounts in Local Currency in Odoo 18 Purchase
How to Manage Amounts in Local Currency in Odoo 18 Purchase
Celine George
 
Overview Well-Being and Creative Careers
Overview Well-Being and Creative CareersOverview Well-Being and Creative Careers
Overview Well-Being and Creative Careers
University of Amsterdam
 

LLM Learning Path Level 1 - Presentation Slides

  • 1. H2O.ai Confidential LLM Learning Path - Level 1 Author: Andreea Turcu Head of Global Training @H2O.ai
  • 2. H2O.ai Confidential Fine-tuning Refining pre-trained models using task-specific data, enhancing their performance on targeted tasks. Foundation Powerful language models trained on extensive text data, forming the basis for various language tasks. Building Steps for LLMs 01 03 Eval LLMs Thoroughly assessing and comparing LLMs is increasingly vital due to their heightened significance and complexity. 04 05 04 03 02 01 DataPrep Converting documents into instruction pairs, like QA pairs, facilitating fine-tuning and tasks. 02 Database Effectively utilize company data with a database that seamlessly integrates new PDFs, eliminating the need for model retraining. 05 Applications Elevate interactions with advanced language comprehension and LLM-driven response generation for enriched user experiences. 06
  • 3. H2O.ai Confidential Table of Contents 1. Introduction to Language Models 2. Understanding LLM Architecture / Foundation Models 3. Getting Started with LLM Data Studio 4. Fine-tuning LLMs 5. Making Your Own GPT and Fine-tuning using LLM Studio 6. Evaluating and Benchmarking LLMs 7. Practical Applications and Case Studies
  • 4. H2O.ai Confidential Contents at a Glance 1. Introduction to Language Models ● What is a Language Model? ● Techniques Commonly Used ● Importance and Applications
  • 5. H2O.ai Confidential Foundation Powerful language models trained on extensive text data, forming the basis for various language tasks. Building Steps for LLMs 01 05 04 03 02 01 Contents at a Glance 1. Introduction to Language Models 2. Understanding LLM Architecture / Foundation Models ● What are Foundation Models? ● Neural Networks and Deep Learning ● Transformer Architecture vs. LLM Architecture ● Pre-training & fine-tuning of LLMs ● Transfer Learning and Adaptation
  • 6. H2O.ai Confidential Generative AI Definitions Foundation Models Large Language Models (LLMs) Unlabeled Training Data Additional Text-Based Data Transformer Algorithm Transformer Algorithm Foundation Model LLM Generative AI Collection of ML algorithms that learn a representation of artifacts from data and models, and use it to generate brand-new, completely original artifacts that preserve a likeness to original data or models. Foundation model Is a Large machine learning model trained on a large amount of unlabeled data using a transformer algorithm. This model can be augmented by a range of fine-tuning (adapter) techniques. The resulting model can be further adapted to a wide range of applications. Large Language Model (LLM) An LLM is a type of foundation model specifically designed for natural language processing. Generative Pre-trained Transformer (GPT) Is an LLM specifically designed to predict the next token. For example ChatGPT is a conversational application built on top of an LLM.
  • 7. Essential topics: 1. Grasping the essence of Foundation Models 2. Delving into Neural Networks and Deep Learning 3. Exploring the intricacies of the Transformer Architecture 4. Understanding the concepts of pre-training and fine-tuning in LLMs 5. Navigating Transfer Learning and Adaptation techniques
  • 8. Foundation models can be used for a wide range of tasks: 1. Answering questions 2. Generating human-like text 3. Translating languages 4. Creating chatbots 5. Summarizing articles, and more
  • 9. Neural Networks Each node receives input from multiple nodes in the previous layer, performs a computation, and passes the output to the next layer. The output of the last layer represents the final prediction or decision made by the neural network.
  • 10. Deep Learning = Neural networks with multiple layers Deep learning models are capable of learning complex patterns and representations from large amounts of data. The term "deep" refers to the depth of the network, which signifies the number of hidden layers between the input and output layers.
  • 11. In forward propagation, input data flows through the network, transforming into a meaningful output.
  • 12. Backpropagation fine-tunes network parameters by minimizing prediction errors through iterative adjustments based on desired output.
  • 13. To remember: ● Not all neural networks qualify as deep learning models. ● Deep learning is distinguished by network depth. ● Depth enables the learning of intricate data features and relationships. ● This leads to improved performance in tasks like image recognition and natural language processing.
  • 14. Applications of Neural Networks and Deep Learning in LLMs: • Natural Language Processing (NLP) • Speech Recognition • Recommendation Systems • Text Generation • Language Understanding and Context • Automation and Efficiency • User Experience Enhancement
  • 15. H2O.ai Confidential v ● The emergence of Large Language Models (LLMs) coincided with advancements in language understanding and generation. ● LLMs are distinguished by their exceptional size and complexity. ○ These models consist of billions of specialized components. ○ These components enable LLMs to comprehend intricate language nuances. ● LLMs are capable of generating high-quality text.
  • 16. H2O.ai Confidential v Fine-tune Example: Learn a Specific Style of Answering and Writing Fine-tuning training Hyperparameter tuning Data Scientist Fine-Tuned Large Language Model Foundation Large Language Model Autoregressive, trained on diverse data (“the whole internet”). Good at continuing text. Specialized style: learned prompt & answer, instructions
  • 17. H2O.ai Confidential Crucial Role in Language Models 1. Enhanced Communication 2. Information Assessment 3. Ethical Implications 4. Prospects for the Future
  • 18. H2O.ai Confidential Key Areas where LMs are used: 1. Chatbots and Virtual Assistants 2. Language Translation 3. Content Generation 4. Sentiment Analysis 5. Text Completion and Auto-correction 6. Voice Assistants
  • 19. H2O.ai Confidential Distinguishing Characteristics of LLMs 1. Scale 2. Creative Writing 3. Complex Problem Solving 4. Domain Expertise 5. Enhanced Language Understanding 6. Data Efficiency 7. Pre-training and Fine-tuning 8. Contextual Understanding 9. Language Generation 10. Transfer Learning 11. Versatility and Applications 12. Research and Innovation
  • 20. H2O.ai Confidential Some important terms related to the Transformer architecture: 1. Attention 2. Multi-head Attention 3. Encoder 4. Decoder 5. Self-Attention 6. Feed-Forward Neural Network 7. Positional Encoding 8. Masking
  • 21. H2O.ai Confidential Reminder - The Transformer is a specialized neural network architecture introduced in the research paper "Attention is All You Need." - Its primary function is to process sequences of data. - It utilizes self-attention, a distinctive mechanism, to efficiently capture relationships between words within a sentence.
  • 22. H2O.ai Confidential Reminder - Large Language Models fall within a broader category of models trained on extensive textual data without human annotations. - Prominent models such as GPT-3 and BERT are constructed based on the underlying Transformer architecture. - These models attain comprehensive language representations by harnessing the abundant data they encounter during their training process.
  • 23. H2O.ai Confidential Primary objective of LLMs - The primary aim of Large Language Models is to acquire potent language representations from extensive text data. - Once they have gained this expertise, they can undergo fine-tuning for specific language tasks. - These tasks may include sentiment analysis, question-answering, or text classification, among others.
  • 27. Transfer learning ● Uses a pre-trained model as a foundation for a new task. ● Instead of starting from scratch, the model begins with pre-trained weights. ● Fine-tunes on a smaller labeled dataset specific to the new task. ● Adapts pre-learned representations to the new data's patterns and characteristics. ● Ideal for tasks with limited labeled data or resource-intensive training.
  • 28. Adaptation (domain adaptation) ● Targets domain differences between source and target domains. ● Its goal is to make a model trained on the source domain perform well on the target domain, even with limited labeled data. ● A key challenge is ensuring effective generalization despite distribution shifts. ● Adaptation techniques align representations from the source domain with the target domain to reduce domain discrepancies.
  • 29. H2O.ai Confidential Robot Adaptation Approaches 1. Feature-based adaptation: Simplifies the robot's view by finding common features between old and new objects. 2. Instance-based adaptation: Adjusts the robot's focus by prioritizing similar objects in the new environment. 3. Model-based adaptation: Fine-tunes the robot's recognition abilities by emphasizing relevant details in the new environment.
  • 30. H2O.ai Confidential ● Fine-tuning in LLMs enhances adaptation. ● Empowers models with styles, personalities, and domain knowledge. ● Starts with a pre-trained LLM. ● Pre-training is generic, lacks specificity.
  • 31. H2O.ai Confidential - Knowing LLM architecture empowers researchers and practitioners. - Enables capturing context, managing long-range connections, and producing quality results. - Enhances application design, boosts model performance, and improves language-related tasks.
  翻译: