1 / 24

Why Watching Movie Tweets Won’t Tell the Whole Story?

Why Watching Movie Tweets Won’t Tell the Whole Story?. Felix Ming-Fai Wong, Soumya Sen , Mung Chiang EE, Princeton University. WOSN’12. August 17, Helsinki. Twitter: A Gold Mine?. Apps: Mombo , TwitCritics , Fflick. Or Too Good to be True?. Representativeness???.

corby
Download Presentation

Why Watching Movie Tweets Won’t Tell the Whole Story?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Why Watching Movie Tweets Won’t Tell the Whole Story? Felix Ming-Fai Wong, Soumya Sen, Mung Chiang EE, Princeton University WOSN’12. August 17, Helsinki.

  2. Twitter: A Gold Mine? Apps: Mombo, TwitCritics, Fflick

  3. Or Too Good to be True? Representativeness???

  4. Why representativeness? • Selection bias • Gender • Age • Education • Computer skills • Ethnicity • Interests • The silentmajority

  5. Why Movies? • Large online interest • Relatively unambiguous • Right in Timing: Oscars

  6. Key Questions • Is there a bias in Twitter movie ratings? • How do Twitters compare to other online site? • Is there a quantifiable difference across types of movies? • Oscar nominated vs. non-nominated commercial movies • Can hype ensure Box-office gains?

  7. Data Stats • 12 Million Tweets • 1.77M valid movie tweets • February 2 – March 12, 2012 • 34 movie titles • New releases (20) • Oscar-nominees (14)

  8. Example Movies Oscar-nominees for Best Picture or Best Animated Film

  9. Rating Comparison • Twitter movie ratings • Binary Classification: Positivity/Negativity • IMDb, Rotten Tomatoes ratings • Rating scale: 0 – 10, 0 – 5 • Benchmark definition • Rotten Tomatoes: Positive - 3.5/5 • IMDb: Positive: 7/10 • Movie score: weighted average • High mutual information

  10. Challenges • Common Words in titles • e.g., “Thanks for the help”, “the grey sky” • No API support for exact matching • e.g., “a grey cat in the box” • Misrepresentations • e.g., “underworld awakening” • Non-standard vocabulary • e.g., “sick movie” • Non-English tweets • Retweets: approves original opinion

  11. Tweet Classification

  12. Steps Preprocessing Training Data Classification Analysis • Preprocessing • Remove usernames • Conversion: • Punctuation marks (e.g., “!” to a meta-word “exclmark”) • Emoticons (e,g,,  or :),  etc. ), URLs, “@” • Feature Vector Conversion • Preprocessed tweet to binary feature vector (using MALLET) • Training & Classification • 11K non-repeated, randomly sampled tweets manually labeled • Trained three classifiers

  13. Classifier Comparison • SVM based classifier performance is significantly better • Numbers in brackets are balanced accuracy rates

  14. Temporal Context Fraction of tweets in joint-classes • Most “current” tweets are “mentions” • LBS • “Before” or “After”tweets are much more positive Woman in Black Safe House The Vow Chronicle

  15. Sentiment: Positive Bias • Twitters are overwhelmingly more positive for almost all movies Woman in Black Star Wars I Chronicle Safe House Haywire The Vow

  16. Rotten Tomatoes vs. IMDb? • Good match between RT and IMDb (Benchmark)

  17. Twitters vs. IMDb vs. RT ratings (1) RT vs. Twitter IMDb vs. Twitter • New(non-nominated) movies score morepositively from Twitter users

  18. Twitters vs. IMDb vs. RT ratings (2) RT vs. Twitter IMDb vs. Twitter • Oscar-nominees rate morepositively on IMDband RT

  19. Quantification Metrics (1) Positiveness (P): • (x*, y*) are the medians of proportion of positive ratings in Twitter and IMDb/RT 1 IMDb/RT score y* B P x* 1 Bias (B): Twitter score

  20. Quantification Metrics (2) Inferrability (I): • If one knows the average rating on Twitter, • how well can he/she predict the ratings on IMDb/RT?

  21. Summary Metrics • Oscar-nominees have higher positiveness • RT-IMDb have high mutual inferrabilityand low bias • Twitter users are more positively biased for non-nominated newreleases

  22. Hype-approval vs. Ratings This Means War This Means War The Vow The Vow • Higher (lower) Hype-approval ≠Higher (lower) IMDb/RT ratings

  23. Hype-approval vs. Box-office • High Hype + High IMDb rating: Box-office success • Other combination: Unpredictable Journey 2

  24. Summary • Tweeters aren’t representative enough • Need to be careful about tweets as online polls • Tweeters’ tend to appear more positive, have specific interests and taste • Need for quantitative metrics Thanks!

More Related