Multiclass Sentiment Analysis with Restaurant Reviews
Moontae Lee and Patrick Grafe
OpenTable.com Data Set
• Overall Rating (1 to 5 stars)
• Food Rating (1 to 5 stars)
• Ambiance Rating (1 to 5 stars)
• Service Rating (1 to 5 stars)
• Noise Rating (1 to 3)
• Data Set statistics
• Heavily biased toward 5-star ratings
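A skew toward 5-star ratings matters because a classifier that always predicts the most common label can look deceptively accurate. The sketch below shows one way to check the label distribution and the majority-class baseline. The `toy_ratings` list is made up purely for illustration and is not the OpenTable data; the function names are likewise hypothetical.

```python
from collections import Counter

def label_distribution(ratings):
    """Return the fraction of reviews for each star value present."""
    counts = Counter(ratings)
    total = len(ratings)
    return {star: counts[star] / total for star in sorted(counts)}

def majority_baseline_accuracy(ratings):
    """Accuracy obtained by always predicting the most frequent rating."""
    counts = Counter(ratings)
    return counts.most_common(1)[0][1] / len(ratings)

# Made-up ratings for illustration only (NOT the real data set).
toy_ratings = [5, 5, 5, 4, 5, 3, 5, 4, 1, 5]

distribution = label_distribution(toy_ratings)
baseline = majority_baseline_accuracy(toy_ratings)
```

Any multiclass model trained on such data would need to beat `baseline` before its accuracy means much.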
Strategies
• Spell Correction
• POS Tagging
• Unigram/Bigram/Trigram
• Stop Words
• Pruning
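Three of the strategies above (n-gram features, stop-word removal, and pruning) can be sketched in a few lines of plain Python. This is a minimal illustration under assumptions, not the authors' pipeline: the stop-word list, the tokenizer, and the document-frequency threshold `min_doc_freq` are all hypothetical choices.

```python
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not the authors' list).
STOP_WORDS = {"the", "a", "an", "is", "was", "were", "and", "of", "to"}

def tokenize(text):
    """Lowercase, split on whitespace, strip surrounding punctuation."""
    tokens = (tok.strip(".,!?;:\"'()") for tok in text.lower().split())
    return [tok for tok in tokens if tok]

def ngrams(tokens, n):
    """Return all contiguous n-grams as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def extract_features(text, max_n=3, remove_stop_words=True):
    """Unigram through max_n-gram features, optionally without stop words."""
    tokens = tokenize(text)
    if remove_stop_words:
        tokens = [t for t in tokens if t not in STOP_WORDS]
    features = []
    for n in range(1, max_n + 1):
        features.extend(ngrams(tokens, n))
    return features

def prune(docs_features, min_doc_freq=2):
    """Keep only features that occur in at least min_doc_freq documents."""
    doc_freq = Counter()
    for features in docs_features:
        doc_freq.update(set(features))
    return {f for f, count in doc_freq.items() if count >= min_doc_freq}
```

Pruning by document frequency is only one possible criterion; the slides do not say which one was used.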
Spell Correction
Common Spelling Mistakes:
• Restaurant: resturant, restuarant, restaurante
• Waiter: waitor
• Service: sevice, serivce
Distance Metrics:
• Edit Distance
• Levenshtein Distance
• Keyboard Distance
• Sound Distance
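As a rough sketch of how a distance metric can drive correction, the code below computes Levenshtein distance with the standard dynamic-programming recurrence and maps a word to its nearest entry in a small vocabulary. The `VOCABULARY` list, the `correct` helper, and the `max_distance` threshold are assumptions made for illustration; the slides do not describe the actual correction procedure.

```python
def levenshtein(a, b):
    """Minimum number of insertions, deletions, and substitutions turning a into b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ch_a in enumerate(a, start=1):
        curr = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            curr.append(min(prev[j] + 1,          # delete ch_a
                            curr[j - 1] + 1,      # insert ch_b
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]

# Hypothetical vocabulary of correctly spelled words.
VOCABULARY = ["restaurant", "waiter", "service"]

def correct(word, vocabulary=VOCABULARY, max_distance=2):
    """Return the closest vocabulary word, or the word itself if none is close."""
    best = min(vocabulary, key=lambda v: levenshtein(word, v))
    return best if levenshtein(word, best) <= max_distance else word
```

Transpositions such as "serivce" cost two edits under plain Levenshtein distance, which is one reason to consider keyboard-aware or sound-based metrics as well.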
Parsing Problem Sentences:
• The atmosphere is pretty bad and food is quite good
• The food, service, and atmosphere were fantastic!
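One plausible reading of why these sentences are hard is that a bag-of-words view loses which opinion word attaches to which aspect. The sketch below illustrates that reading only; the tiny `ASPECTS`, `POSITIVE`, and `NEGATIVE` lexicons and the `bag_of_words_view` helper are hypothetical and not taken from the slides.

```python
# Hypothetical mini-lexicons, just large enough for the two example sentences.
ASPECTS = {"atmosphere", "food", "service"}
POSITIVE = {"good", "fantastic"}
NEGATIVE = {"bad"}

def bag_of_words_view(sentence):
    """Show what an order-free model sees: aspect and opinion words, unlinked."""
    tokens = {tok.strip(".,!?") for tok in sentence.lower().split()}
    return {
        "aspects": sorted(tokens & ASPECTS),
        "positive": sorted(tokens & POSITIVE),
        "negative": sorted(tokens & NEGATIVE),
    }

mixed = bag_of_words_view("The atmosphere is pretty bad and food is quite good")
shared = bag_of_words_view("The food, service, and atmosphere were fantastic!")
```

In `mixed`, both "bad" and "good" appear next to two aspects with nothing tying each opinion to its target; in `shared`, a single opinion word has to be distributed across three aspects.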
Conclusions
• Inherently Difficult Data Set
• More Advanced Techniques Necessary