Sentiment Analysis in Machine Learning

Sentiment Analysis
in
Machine Learning
Prof Pranali V Deshmukh
Department of Information Technology
International Institute of Information Technology, I²IT
www.isquareit.edu.in
1

Predicting sentiment by topic:
An intelligent restaurant
review system

It’s a big day & I want to book a table at
a nice Japanese restaurant
Seattlehas many
★★★★
sushirestaurants
Whatarepeople
sayingabout the
food?
the ambiance?...
3

Positive reviews not positive about everything
Samplereview:
Watching the chefs create
incredible edible artmade the
experience veryunique.
My wife tried their ramen and it
was pretty forgettable.
All the sushi was delicious!
Easilybest sushi in Seattle.
Experience
4

From reviews to topic sentiments
Experience
★★★★
Ramen
★★★
Sushi
★★★★★
Novel intelligent
restaurant review app
Easily best sushi
in Seattle.
Allreviewsfor
restaurant
5

Intelligent restaurant review system
Allreviewsfor
restaurant
Breakall reviews
into sentences
The seaweed salad was just OK,
vegetable salad was just ordinary.
I like the interior decoration and
the blackboard menu on thewall.
6
All the sushi was delicious.
My wife tried their ramen and
it was pretty forgettable.
The sushi was amazing, and
the rice is just outstanding.
The service is somewhat hectic.
Easily best sushi in Seattle.

Core building block
Easilybest sushi in Seattle.
Sentence Sentiment
Classifier
7

Intelligent restaurant review system
Allreviewsfor
restaurant
My wife tried their ramen and
it was pretty forgettable.
The service is somewhat hectic.
BreakSeall e
lc
r
e
t
v
s
i
e
e
n
w
t
e
s
nces
into s
e
a
n
b
t
o
e
u
n
t
c
“
e
s
s
u
s
h
i
”
The seaweed salad was just OK,
vegetable salad was just ordinary.
I like the interior decoration and the
blackboard menu on thewall.
Sentence
Sentiment
Classifier
Sushi
★★★★★
Average
predictions
Easilybest
sushi
in Seattle.
Most
&
8

Machine Learning Specialization
Classifier applications
9 ©2015 Emily Fox & Carlos Guestrin
9

Classifier
Sentence
from
review
Classifier
MODEL
Input: x
Output: y
Predicted
class
10

Example multiclass classifier
Output y has more than 2 categories
Education
Finance
Technology
11
Input: x
Webpage
Output: y

1
Spam filtering
Input: x Output: y
Not spam
Spam
Text of email,
sender, IP,…
1

Image classification
Input: x Image
pixels
13
Output:y
Predicted object

Personalized medical diagnosis
Disease
Classifier
MODEL
Input: x
Healthy
Cold
Flu
Pneumonia
…
Output: y
14

Reading your mind
“Hammer”
“House”
15
1

Representing
classifiers
Sentence
from
review
Classifier
MODEL
Input: x
Output: y
Predicted class
How does itwork???
17

Count positive &negativewords in
sentence
If number of positive words>
number of negative words:
ŷ=
Else:
Listofpositive
words
Listofnegative
words
great,awesome,
good, amazing,…
bad,terrible,
disgusting, sucks,…
ŷ=
18
Sentence
from
review
Input: x
Simple threshold classifier

Count positive &negative words
in sentence
If number of positive words>
number of negative words:
ŷ=
Else:
Listofpositive
words
Listofnegative
words
great,awesome,
good, amazing,…
bad,terrible,
disgusting, sucks,…
Sushi was
great, the
food was
awesome,
but the
servicewas
terrible.
Simple threshold classifier
2
1
ŷ=
19

Problems with threshold classifier
• How do we get list of
positive/negativewords?
• Words havediﬀerent
degreesof sentiment:
- Great >
good
- How do weweigh
diﬀerent words?
• Single words arenot enough:
- Good 

Positive
- Not good 

Negative
Addressed
bylearning
aclassifier
Addressed
bymore
elaborate
features
20

A(linear) classifier
21
• Will usetraining datato learn aweight for
eachword
Word Weight
good 1.0
great 1.5
awesome 2.7
bad -1.0
terrible -2.1
awful -3.3
restaurant,the, we, where, … 0.0
… …

Scoring a
sentence
Word Weight
good 1.0
great 1.2
awesome 1.7
bad -1.0
terrible -2.1
awful -3.3
restaurant,the,
we, where, …
0.0
… …
Input x:
Sushi was great,
the food was awesome, but
the service was terrible.
Called alinear classifier, because output is weighted sum of input.
22

Word Weight
… …
23
Sentence
from
review
Input: x
Simple linear classifier
Score(x) =weighted count of
words in sentence
If Score (x) > 0:
ŷ=
Else:
ŷ=

Decision boundaries
2

Suppose only two words had non-zero weight
Word Weight
awesome 1.0
awful -1.5
awful
3
2
1
4
…
Sushi was awesome, the
food wasawesome,
but the service was awful.
Score(x) =1.0#awesome – 1.5#awful
0
0 1 2 3 4 …
awesome
25

Decision boundary example
Word Weight
awesome 1.0
awful -1.5
awful
1
4
3
2
…
Score(x) =1.0#awesome – 1.5#awful
Score(x)>
0
Score(x)<
0
0
0 1 2 3 4 …
awesome
26

Decision boundary separates
positive & negative predictions
• For linear classifiers:
- When 2weights are non-zero


line
- When 3weights are non-zero


plane
- When manyweights are non-zero


hyperplane
• For more generalclassifiers


morecomplicatedshapes
22

Training and evaluating
a classifier
2

Training a classifier = Learning the weights
Data
(x,y)
(Sentence1, )
(Sentence2, )
…
Training
set
Test
set
Learn
classifier
Evaluate?
Word Weight
good 1.0
awesome 1.7
bad -1.0
awful -3.3
… …
29

Classification error
at,
Test example
(
S
(
F
u
o
s
h
o
i
d
w
w
a
a
s
s
g
O
r
e
K
a
,
t ))
Learnedclassifier
Hide label
Correct
Mistakes
ŷ=
M
Co
is
r
t
r
a
e
k
c
e
t!
0
1
0
1
30

Classification error & accuracy
• Error measuresfraction of mistakes
- Bestpossible valueis0.0
• Often, measureaccuracy
-Fraction of correct predictions
- Bestpossible valueis1.0
error = .
accuracy= .
31

What’s a good
accuracy?
3

What if you ignore the sentence, and just guess?
33
• For binaryclassification:
- Half the time, you’ll get it right! (on average)


accuracy =0.5
• For kclasses,accuracy =1/k
- 0.333 for 3classes, 0.25 for 4 classes,…
Atthe very,very,very least,
you should healthily beatrandom…
Otherwise, it’s(usually) pointless…

2010data shows:
“90% emails sent are spam!”
Predicting everyemail is spam
getsyou 90%accuracy!!!
Majority class prediction
Amazing performance when
there is class imbalance
(butsilly approach)
• One class is more common thanothers
• Beatsrandom (ifyou know the majority class)
Is a classifier with 90% accuracy good? …
34

So, always be digging in and asking the
hard questions about reported accuracies
35
• Is there class imbalance?
• How does it compare to asimple,
baseline approach?
- Random guessing
- Majority class
-…
• Most importantly:
what accuracy does my application need?
- Whatis good enough for myuser’sexperience?
- Whatis the impact of the mistakeswe make?

False positives, false
negatives, and confusion
matrices
3

Types of mistakes
True
label
Predicted label
True
Positive
False
Negative
(FN)
False True
Positive Negative
(FP)
37

Cost of diﬀerent types of mistakes can be
diﬀerent (& high) in some applications
Spam
filtering
Medical
diagnosis
False
negative
False
positive
Annoying
Email lost
Disease not
treated
Wasteful
treatment
38

True
label
Confusion matrix –
binary classification
Predicted label
39

Confusion matrix –
multiclass classification
Healthy Cold Flu
Healthy
Cold
Flu
True
label
Predicted label
40

Learning curves:
How much data do I need?
4

How much data does a model need to
learn?
42
• The more the merrier  
- But dataquality is most important factor
• Theoretical techniques sometimes can
bound how much dataisneeded
- Typically too loose for practicalapplication
- But provide guidance
• In practice:
- More complex models require moredata
- Empirical analysiscan provide guidance

Learning
curves
Amount of trainingdata
Test
error
43

Is there a limit?
Yes, for most
models…
Test
error
Biasof model
44

More complex models tend to have less
bias…Sentiment classifier using single
words can do OK,but…
Never classifies correctly:
“Thesushi wasnot good.”
More complex model:
consider pairsof words(bigrams)
Word Weight
good +1.5
not good -2.1
Lessbias 

potentially more accurate,
needs more datato learn
45

Models with less bias tend to
need more data to learn well,
but do better with suﬃcient data
Test
error
Classifier based
on singlewords
46

Class probabilities
4

4
How confident is your prediction?
• Thus far,we’veoutputted a prediction
• But, how sureareyou about the prediction?
- “The sushi &everything
else were awesome!”
- “The sushi wasgood,
the service was OK.”
©2015 Emily Fox & Carlos Guestrin
Definite
Not sure
Many classifiers provide aconfidence level:
P(y|x)
Extremelyuseful in practice
Output label Input sentence
P(y=+|x)=0.99
P(y=+|x)=0.55
4

Summary of classification
4

5
What you can do now…
• Identify aclassification problem and
some common applications
• Describe decision boundaries andlinear
classifiers
• Train aclassifier
• Measure its error
- Some rules of thumb for goodaccuracy
• Interpret the typesof errorassociated with
classification
• Describe the tradeoﬀs between model bias
anddataset size
• Use class probability to expressdegree of
confidence inprediction
©2015 Emily Fox & Carlos Guestrin
5

Thank You !!
https://www.isquareit.edu.in/
5

Sentiment Analysis in Machine Learning

Recommended

More Related Content

What's hot (20)

Similar to Sentiment Analysis in Machine Learning (20)

More from International Institute of Information Technology (I²IT) (20)

Recently uploaded (20)

Sentiment Analysis in Machine Learning