
Business Identification: Spatial Detection

Alexander Darino, Week 5. Outline: Recap of Previous Work; Business Name Detection; Business Name Matching; Business Spatial Detection; Weaknesses to Current Approach; Alternatives to Current Approach; Acknowledgements.


Presentation Transcript


  1. Business Identification: Spatial Detection Alexander Darino Week 5

  2. Outline • Recap of Previous Work • Business Name Detection • Business Name Matching • Business Spatial Detection • Weaknesses to Current Approach • Alternatives to Current Approach • Acknowledgements

  3. Outline Week 4 Week 5 Business Name Matching Business Spatial Detection Latitude Longitude Geocoding Reverse Geocoding Nearby Businesses Business Identification Image OCR Detected Text

  4. Previous Work Image Where Am I? Latitude, Longitude 65 George S Aiken Co Winghart's Burger & Whiskey Bar Market Square Bella Sera On the Square Chipotle NOLA Las Velas … Latitude, Longitude Geocoding Reverse Geocoding Nearby Businesses
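The "Nearby Businesses" step above narrows the candidate list to businesses close to the photo's latitude and longitude. The slides do not say how that list is obtained, so the following is only a minimal sketch, assuming a local list of business records and a great-circle (haversine) distance filter. The function names, the record layout, the 150 m radius, and the placeholder coordinates are all hypothetical.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in metres


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def nearby(businesses, lat, lon, radius_m=150.0):
    """Return names of businesses within radius_m of (lat, lon), closest first.

    `businesses` is assumed to be a list of dicts with "name", "lat", "lon".
    """
    hits = []
    for b in businesses:
        d = haversine_m(lat, lon, b["lat"], b["lon"])
        if d <= radius_m:
            hits.append((d, b["name"]))
    return [name for _, name in sorted(hits)]


# Placeholder records; names and coordinates are made up for illustration.
CANDIDATES = [
    {"name": "Business A", "lat": 40.001, "lon": -80.0},
    {"name": "Business B", "lat": 40.01, "lon": -80.0},
]
print(nearby(CANDIDATES, 40.0, -80.0))
```

Bounding the candidate list this way is what lets the later matching steps compare against a handful of names rather than every business in the city.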

  5. Business Name Detection Business Name Matching Business Spatial Detection Latitude Longitude Geocoding Reverse Geocoding Nearby Businesses Business Identification Image OCR Detected Text

  6. Business Name Detection

  7. Business Name Detection … <line dy="95" dx="1573" y="420" x="11" value="1"> <space dy="26" dx="9" y="379" x="11"/> <box dy="26" dx="9" y="379" x="11" value="0" weights="96" numac="1"/> <box dy="25" dx="6" y="406" x="11" value="J" weights="98,62" numac="2" achars="p"/> <box dy="19" dx="5" y="382" x="19" value="n" weights="96" numac="1"/> <space dy="5" dx="30" y="441" x="25"/> <box dy="5" dx="7" y="441" x="56" value="."/> <box dy="24" dx="5" y="401" x="57" value="."/> <box dy="13" dx="8" y="429" x="58" value="v" weights="98" numac="1"/> <box dy="26" dx="9" y="402" x="60" value="." weights="94" numac="1"/> <box dy="22" dx="5" y="406" x="67" value="0" weights="96" numac="1"/> <box dy="10" dx="12" y="444" x="71" value="."/> </line> …
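The XML above is per-character OCR output: each `box` carries a bounding box (`x`, `y`, `dx`, `dy`) and a recognized `value`, and each `space` marks a gap. A minimal sketch of turning one such `line` element into text is shown below. Treating the first entry of `weights` as a confidence score is an assumption, and the sample is a shortened copy of the slide's markup.

```python
import xml.etree.ElementTree as ET

# Shortened sample based on the markup shown on the slide.
SAMPLE = """<line dy="95" dx="1573" y="420" x="11" value="1">
<space dy="26" dx="9" y="379" x="11"/>
<box dy="25" dx="6" y="406" x="11" value="J" weights="98,62" numac="2" achars="p"/>
<box dy="19" dx="5" y="382" x="19" value="n" weights="96" numac="1"/>
<space dy="5" dx="30" y="441" x="25"/>
<box dy="5" dx="7" y="441" x="56" value="."/>
</line>"""


def parse_line(xml_text):
    """Return a list of (character, confidence) pairs for one <line> element.

    Confidence is the first value in `weights` (an assumed interpretation),
    or None when the attribute is missing.
    """
    chars = []
    for el in ET.fromstring(xml_text):
        if el.tag == "space":
            chars.append((" ", None))
        elif el.tag == "box":
            weights = el.get("weights")
            conf = int(weights.split(",")[0]) if weights else None
            chars.append((el.get("value", ""), conf))
    return chars


def line_text(chars):
    """Join the recognized characters into a single string."""
    return "".join(c for c, _ in chars).strip()


print(line_text(parse_line(SAMPLE)))
```

The resulting tokens are noisy, which is why the matching stage scores them against known business names instead of trusting them directly.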

  8. Business Name Matching Business Name Matching Business Spatial Detection Latitude Longitude Geocoding Reverse Geocoding Nearby Businesses Business Identification Image OCR Detected Text

  9. Business Name Matching • Developed Confidence Attribution Algorithm • Confidence of OCR Token being Name Token • Example: Confidence of “ESTUANT” representing “RESTAURANT” • Point-based system • Confidence of Name appearing in Image • Sum of points of matching OCR Text • Use logarithmically-normalized points to determine business inclusion threshold
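The slide names the ingredients of the confidence attribution algorithm (a per-token confidence, a point sum per business name, and a logarithmically normalized threshold) but not its formulas. The sketch below is one possible reading, not the author's method: token confidence is approximated with `difflib.SequenceMatcher`, points are confidence weighted by token length, and scores are normalized with `log1p` against the best-scoring candidate. The function names, the 0.6 token cutoff, and the 0.5 inclusion threshold are all assumptions.

```python
import math
from difflib import SequenceMatcher


def token_confidence(ocr_token, name_token):
    """Similarity in [0, 1] between an OCR token and a business-name token."""
    return SequenceMatcher(None, ocr_token.upper(), name_token.upper()).ratio()


def name_points(ocr_tokens, business_name, min_conf=0.6):
    """Sum points over the name's tokens that some OCR token plausibly matches."""
    points = 0.0
    for name_token in business_name.split():
        best = max((token_confidence(t, name_token) for t in ocr_tokens), default=0.0)
        if best >= min_conf:
            points += best * len(name_token)  # longer matched tokens earn more
    return points


def normalized_score(points, max_points):
    """Log-normalize points to [0, 1] relative to the best candidate."""
    if max_points <= 0:
        return 0.0
    return math.log1p(points) / math.log1p(max_points)


def match_businesses(ocr_tokens, nearby_names, threshold=0.5):
    """Return the nearby business names whose normalized score passes the threshold."""
    scored = {name: name_points(ocr_tokens, name) for name in nearby_names}
    max_points = max(scored.values(), default=0.0)
    return [n for n, p in scored.items() if normalized_score(p, max_points) >= threshold]


print(match_businesses(["ESTUANT", "CHIPOTLE"], ["Chipotle", "Las Velas", "NOLA"]))
```

Under this reading, a damaged token such as "ESTUANT" still earns a high confidence against "RESTAURANT", so partial OCR hits can contribute to a name's total.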

  10. Business Name Matching

  11. Business Name Matching

  12. Business Name Matching

  13. Business Name Matching Note: k is usually 2 or 3

  14. Business Name Matching

  15. Business Name Matching Note: This originally did not appear because it did not exceed the confidence threshold. It now appears because it contributes to the Business Name Identification

  16. Business Spatial Identification Business Name Matching Business Spatial Detection Latitude Longitude Geocoding Reverse Geocoding Nearby Businesses Business Identification Image OCR Detected Text

  17. Business Spatial Identification

  18. Business Spatial Identification Aiken George S Co Category: Food, Grocery Address: 218 Forbes Ave Pittsburgh, PA 15222 Phone: (412) 391-6358 Rating: 4.5/5 (2 Reviews)

  19. Business Spatial Identification

  20. Business Spatial Identification

  21. Business Spatial Identification Bruegger's Bagels Category: Bagels Address: Market Sq Pittsburgh, PA 15222 Phone: (412) 281-2515 Rating: Not Rated

  22. Weaknesses to Current Approach Business Name Matching Business Spatial Detection Latitude Longitude Geocoding Reverse Geocoding Nearby Businesses Business Identification Image OCR Detected Text

  23. Weaknesses to Current Approach Lots of Garbage

  24. Weaknesses to Current Approach Fragmented Word Detection

  25. Weaknesses to Current Approach Fails with non-orthogonal perspective Did I already mention lots of garbage?

  26. Weaknesses to Current Approach Fails with non-Roman text Not scale-invariant

  27. Two different Alternative Approaches

  28. Alternative #1: Image Matching Match to Storefront Image Business Spatial Detection Latitude Longitude Geocoding Reverse Geocoding Nearby Businesses BusinessIdentification Image

  29. Alternative #1: Image Matching

  30. Alternative #1: Image Matching • Weaknesses • Storefront images aren’t always available for matching • Computationally Expensive • Hundreds of images to compare to • Nothing new • Boring!

  31. Alternative #2: Template Matching Latitude Longitude Geocoding Reverse Geocoding Render Templates of Business Names in Different Fonts Nearby Businesses Template Images Image Image Matching (e.g. SIFT, HAAR) Business Identification Business Spatial Detection
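The flow on this slide is a bounded search: only templates for nearby business names are compared against the image, with one template per name and font. The slide proposes SIFT or HAAR features for the comparison. The sketch below deliberately swaps in plain normalized cross-correlation on tiny grayscale grids, because it runs without any imaging library and still shows the search structure. The function names are hypothetical, and the toy grids stand in for rendered name templates rather than real rendered text.

```python
import math


def ncc(patch, template):
    """Normalized cross-correlation of two equal-sized 2D grids, in [-1, 1]."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    den = math.sqrt(sum((x - ma) ** 2 for x in a) * sum((y - mb) ** 2 for y in b))
    return num / den if den else 0.0  # flat patches carry no signal


def best_match(image, template):
    """Slide the template over the image and return the best NCC score."""
    th, tw = len(template), len(template[0])
    best = -1.0
    for y in range(len(image) - th + 1):
        for x in range(len(image[0]) - tw + 1):
            patch = [row[x:x + tw] for row in image[y:y + th]]
            best = max(best, ncc(patch, template))
    return best


def identify(image, templates):
    """templates maps a business name to a list of grids, one per font."""
    scores = {
        name: max(best_match(image, grid) for grid in grids)
        for name, grids in templates.items()
    }
    return max(scores, key=scores.get), scores


# Toy data: 0/1 grids standing in for a photo and rendered name templates.
IMAGE = [
    [0, 0, 0, 0, 0],
    [0, 1, 0, 1, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
TEMPLATES = {
    "Tambellini": [[[1, 0, 1], [0, 1, 0]]],
    "Chipotle": [[[1, 1, 1], [1, 0, 1]]],
}
print(identify(IMAGE, TEMPLATES)[0])
```

Unlike SIFT-style features, this correlation is not scale-invariant, so a real implementation of the slide's proposal would need the feature-based matching it names.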

  32. Alternative #2: Template Matching • Tambellini • Tambellini • Tambellini • Tambellini • Tambellini • Tambellini • Tambellini • Tambellini

  33. Alternative #2: Template Matching • OCR: Not Scale Invariant; Unbounded Search; Fragmented Recognition; Roman-only font • Alternative #2: Scale Invariant; Bounded Search; Whole-word recognition; All fonts

  34. Acknowledgements • Subh • Provided several ideas regarding template matching using SIFT, HAAR features, etc.

  35. Thank You
