1 / 14

Who is King of the Arena? -- NBA Related Data Analysis

Who is King of the Arena? -- NBA Related Data Analysis. Team: NewBie Aggression 2010. 10. 18. A Brief Project Introduction. Based on Our Love for NBA Orion’s Super analysis power Aimed at Manias who won’t be satisfied with simple statics We can tell

garran
Download Presentation

Who is King of the Arena? -- NBA Related Data Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Who is King of the Arena?--NBA Related Data Analysis Team: NewBie Aggression 2010. 10. 18

  2. A Brief Project Introduction • Based on • Our Love for NBA • Orion’s Superanalysis power • Aimed at • Manias who won’t be satisfied with simple statics • We can tell • Which position is more important in the recent seasons • Which player is likely to win the MVP this year • Which Team is likely to appear in the Playoff • And Much More !

  3. Dashboard Showcase It’s Show Time!

  4. Technical Design • Logical Data Model • Warehouse Design • Project Creation & Management

  5. Logical Data Model • Data Source Available • http://www.databasebasketball.com • Including NBA seasonal team\player\coach statics, playoff teams, final champions and MVP winners from 1946 to 2008 • Lowest Data Level • Season – Team - facts (games won, games lost, pts, ….) • Season – Team - Player - facts (pts, asts,… )

  6. Logical Data Model Region Period City Position Age Coach … M:M Season Team Player Season-Team Facts Season-Player Facts Lowest Level Facts

  7. Warehouse Design • Lookup tables • Season, Player, Coach, Team, City, Region • Lowest level fact tables • Same as data source • Aggregate fact tables • Player Career Facts & Coach Career Facts • Aggregate on “season”, “player”… • For performance consideration

  8. Project Creation & Management • Metadata Schema • Regular Metric & Reports • Widget Usage • Dashboard Layout • Just regular procedures • Time Limit -> details available in Q & A

  9. Project Creation Spotlights – Evaluation Metrics • Examples: • Player credits (value) • Team Offensive/Defensive Efficiency • Team Wining Rate • Compound: • Combination of simple metrics like Asts, Pts,… • Reasonable: • Based on some senior NBA analysis sites • http://www.databasebasketball.com • http://www.nbastuffer.com

  10. Project Creation Spotlights – Training Metric • Player evaluation—All star player prediction • Very difficult • All-star player : all players = 1 : 20 • Training Metric - Logistic regression • Training set: 1950 – 2000 • Test set: 2001 - 2008 • Features • ASTS, BlK, Dreb, Oreb, Pts, Reb, Stl, VI, Gameplayed, Credits, FTA, FGA • Result • Precision: 69.7% • Recall: 60.5% • F Score: 64.8%

  11. Project Creation Spotlights – Training Metric • Player evaluation—MVP player prediction • Much more difficult than All-Star prediction • MVP player : all players = 1 : 450 • Training metric is no longer effective because of the extremely unbalanced dataset • Instead we used an empirical metric “player credits” to predict the MVP • Player Credits= PTS+REB+AST+STL+BLK-FG MISSED-FT MISSED-TO • Result • Top 1 credits player: 37.5% • Top 3 credits player: 62.5%

  12. Project Creation Spotlights – Training Metric • Team evaluation - Predict playoff teams • Much Easier • Training Metric - Logistic regression • Training set: 1950 – 2000 • Test set: 2001 - 2008 • Features • Offense Efficient, Defense Efficient, D Reb, D Pts, O Reb, O Pts, Turn Over • Result • Precision: 89.7% Recall: 95.3% F Score: 92.4%

  13. Division of Work Yang Fan Cao Lei Tang Yang Logical Data Model & Warehouse Design Metadata Schema ETL Processing Metadata & Warehouse Check Metric & Report Metric & Report Metric & Report Dashboard Training Metric Presentation

  14. Q & A Thank You!

More Related