Representation Learning for NLP: Deep Dive
Anuj Gupta, Satyam Saxena
• Duration: 6 hrs
• Level: Intermediate to Advanced
• Objective: For each topic, we will dig into the concepts and maths
to build a theoretical understanding, followed by code
(Jupyter notebooks) to understand the implementation details.
Outline/Time Map - 4 Modules
Module 1 (30 mins)
• Introduction to Text Representation (5 mins)
• Old ways of representing text (20 mins)
• Bag-of-Words
• TF-IDF
• Co-occurrence matrix + SVD
• Pros and Cons
• Introduction to Embedding spaces (5 mins)
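The first two classical representations listed for Module 1 can be sketched in a few lines. This is a minimal illustration, not the tutorial's notebook code: the toy corpus, the function names `bag_of_words` and `tf_idf`, and the unsmoothed `log(N / df)` weighting are all assumptions made for the example.

```python
import math
from collections import Counter

# Toy corpus (hypothetical, for illustration only).
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "cats and dogs are pets",
]

tokenized = [d.split() for d in docs]
# Fixed, sorted vocabulary so every vector has the same layout.
vocab = sorted({w for doc in tokenized for w in doc})


def bag_of_words(tokens):
    """Return a raw count vector aligned with `vocab`."""
    counts = Counter(tokens)
    return [counts[w] for w in vocab]


def tf_idf(tokens):
    """Term frequency times unsmoothed log inverse document frequency."""
    counts = Counter(tokens)
    n_docs = len(tokenized)
    vec = []
    for w in vocab:
        tf = counts[w] / len(tokens)
        # Number of documents containing w (always >= 1 here,
        # because the vocabulary is built from the same corpus).
        df = sum(1 for doc in tokenized if w in doc)
        vec.append(tf * math.log(n_docs / df))
    return vec


bow = [bag_of_words(t) for t in tokenized]
tfidf = [tf_idf(t) for t in tokenized]

print(vocab)
print(bow[0])
print([round(x, 3) for x in tfidf[0]])
```

In the first document, "the" occurs twice but also appears in another document, so its TF-IDF weight ends up below that of "cat", which occurs once and only there. Both vectors are as long as the vocabulary and mostly zero, which is one of the drawbacks that motivates embedding spaces.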
Module 2 (160 mins)
• Word-Vectors
• Introduction + Bigram model (25 mins)
• CBOW model (25 mins)
• Skip-gram model (25 mins)
[Efficient estimation of word representations in vector space. Mikolov et al.,
ICLR Workshop, 2013]
• Speed-Up (20 mins)
• Negative Sampling
• Hierarchical Softmax
[Distributed representations of words and phrases and their compositionality.
Mikolov et al., NIPS, 2013]
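CBOW and Skip-gram differ mainly in how training examples are cut from a sliding window over the text. The sketch below only builds those examples; it does not train a model. The function names and the `window` parameter are hypothetical, chosen for illustration.

```python
def skipgram_pairs(tokens, window=2):
    """Skip-gram: each (center, context) pair is one training example."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


def cbow_examples(tokens, window=2):
    """CBOW: the whole context window is the input, the center word the target."""
    examples = []
    for i, center in enumerate(tokens):
        context = tokens[max(0, i - window):i] + tokens[i + 1:i + window + 1]
        examples.append((context, center))
    return examples


sentence = "the quick brown fox".split()
for pair in skipgram_pairs(sentence, window=1):
    print(pair)
for example in cbow_examples(sentence, window=1):
    print(example)
```

Skip-gram yields one example per (center, context) pair, while CBOW yields one example per position. With a full softmax, each example requires a normalisation over the entire vocabulary, which is the cost that negative sampling and hierarchical softmax are meant to reduce.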
• Word-Vectors (contd)
• GloVe model (30 mins)
[GloVe: Global Vectors for Word Representation. Pennington et al., EMNLP,
2014]
• t-SNE (15 mins)
[Visualizing Data using t-SNE. van der Maaten and Hinton, JMLR, 2008]
[How to Use t-SNE Effectively. Wattenberg et al., Distill, 2016]
• Pros and Cons of using pre-trained word vectors (5 mins)
• Q & A (20 mins)
Module 3 (70 mins)
• Sentence2vec/Paragraph2vec/Doc2Vec
• Introduction (5 mins)
• PV-DM model (35 mins)
• PV-DBOW model
[Distributed representations of sentences and documents. Le and Mikolov, ICML,
2014]
• Skip-Thoughts model (20 mins)
[Skip-Thought Vectors. Kiros et al., arXiv preprint, 2015]
• Pros and Cons (10 mins)
Module 4 (70 mins)
• Char2Vec
• Introduction (5 mins)
• Introduction to RNNs, LSTMs (20 mins)
• 1-hot Encoding (30 mins)
[The Unreasonable Effectiveness of Recurrent Neural Networks. Andrej Karpathy, 2015]
• Character Embeddings (20 mins)
[Character-Aware Neural Language Models. Kim et al., AAAI, 2016]
• Pros and Cons (5 mins)
• Q & A (10 mins)
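One-hot encoding of characters, the input format used by character-level RNNs, can be sketched as follows. This is an illustrative example under assumed names (`build_char_index`, `one_hot`) with a toy string, not the code from the tutorial's notebooks.

```python
def build_char_index(text):
    """Map each distinct character to an integer index (sorted for stability)."""
    return {ch: i for i, ch in enumerate(sorted(set(text)))}


def one_hot(text, char_to_idx):
    """Encode each character as a vector with a single 1 at its index."""
    size = len(char_to_idx)
    vectors = []
    for ch in text:
        vec = [0] * size
        vec[char_to_idx[ch]] = 1
        vectors.append(vec)
    return vectors


text = "hello"
char_to_idx = build_char_index(text)
encoded = one_hot(text, char_to_idx)

print(char_to_idx)
for ch, vec in zip(text, encoded):
    print(ch, vec)
```

Each vector is as long as the character set and carries no notion of similarity between characters. Character embeddings replace these sparse vectors with dense, learned ones.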