1 / 23

Collating Social Network Profiles

Collating Social Network Profiles. Objective. System. <Twitter Profile, Facebook Profile, G+ Profile, …>. <Twitter Profile, Facebook Profile, G+ Profile, …>. <Company Name>. Objective. System. Input. Output. <Twitter Profile, Facebook Profile, G+ Profile, …>. Social Network Profiles.

Download Presentation

Collating Social Network Profiles

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Collating Social Network Profiles

  2. Objective System <Twitter Profile, Facebook Profile, G+ Profile, …> <Twitter Profile, Facebook Profile, G+ Profile, …> <Company Name>

  3. Objective System Input Output <Twitter Profile, Facebook Profile, G+ Profile, …> Social Network Profiles Company Name

  4. Record Linkage + Identity

  5. Agenda

  6. Baseline System

  7. Ground Truth • Two networks: Facebook and Twitter • Top seventy 2013 Fortune 500 companies

  8. Baseline Algorithm • Take company name. • Search Facebook/Twitter API using it. • Return first result from each.

  9. Baseline Performance

  10. Individual Network Approach

  11. New Approach Score profiles based on • Edit Distance • Company Name – Username • Company Name – Display Name • Relative Popularity

  12. Display Name Username

  13. New Approach Score profiles based on • Edit Distance • Company Name – Username • Company Name – Display Name • Relative Popularity

  14. Scoring Edit Distance Score: Popularity Score:

  15. Best Performing Combination

  16. Machine Learning Experiments

  17. Freebase Ground Truth

  18. Training Set

  19. Cross Validation Results

  20. Next Steps • Improve training set: provide harder examples

  21. Next Steps • Improve training set: provide harder examples • Incorporate more profile data

  22. Next Steps • Improve training set: provide harder examples • Incorporate more profile data • Build system around classifiers

  23. Agenda

More Related