1 / 19

Local Business Ambience Characterization Through Mobile Audio Sensing

Local Business Ambience Characterization Through Mobile Audio Sensing. He Wang, Dimitrios Lymberopoulos , Jie L iu. Local Search Experience. c rowded bar playing loud pop music. quiet Thai restaurant outdoor seating. Is it crowded now ?. Static/Stale Experience Ratings/Reviews Images

halil
Download Presentation

Local Business Ambience Characterization Through Mobile Audio Sensing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Local Business Ambience Characterization Through Mobile Audio Sensing He Wang, Dimitrios Lymberopoulos, Jie Liu

  2. Local Search Experience crowded bar playing loud pop music quiet Thai restaurant outdoor seating Is it crowded now? • Static/Stale Experience • Ratings/Reviews • Images • Location/Phone • URL What music is it playing now? How loud is the music now? Is it noisy now? Outdoor seating now?

  3. 65 Users online Study Which information would you like to have? What is the single most important piece of information? Zhou et al., IODetector: A Generic Service for Indoor/Outdoor Detection, Sensys 2012

  4. Goals: reliable detection of levels for Human Chatter Music Noise Occupancy

  5. How? Unique Business Ambience Extraction Phone In User Hands Phone’s Sensors … Audio-based approach

  6. Examples of Real data occupancy: normal chat: high music: low noise: normal near-phone talking: no occupancy: high chat: high music: high noise: normal near-phone talking: yes occupancy: normal chat: high music: very high noise: low near-phone talking: no occupancy: normal chat: normal music: very high noise: high near-phone talking: yes occupancy: low chat: low music: low noise: high near-phone talking: no

  7. Challenges • Multiple sound sources are simultaneously recorded • Music / Human chatter / Environmental noise • Sound source separation is hard • Near-phone talking • phone owner / nearby TV or speaker • saturates audio recording hiding business’ conditions • A single model across multiple businesses • Scalability

  8. Architecture 32K features 40 features tiny window time and frequency domain features 1s Model Training Statistics Near-Phone Talking … Occupancy Chat 7-15s Feature Extraction Music Noise … … … Feature Extraction Majority voting over all 1s model outputs It takes 1 second to process 1 second of audio Occupancy: High Chatter: High Music: Very High Noise: Normal Privacy: Everything runs on the phone!

  9. Feature Extraction: Temporal Letter “S” Letter “O”

  10. Feature Extraction: Frequency

  11. Feature Extraction: frequency

  12. Data Collector on Windows Phone User selects a business 15s sensor data collection User provides ground truth

  13. 150 Real Business Audio traces High Normal 3-fold cross-validation

  14. Classification Accuracy

  15. Audio recording time 4-level Classification 2-level Classification

  16. Near-phone talking – 97% Accuracy Near-phone talking audio segments are ignored

  17. Device Variability After Calibration Before Calibration

  18. Device Variability 2-level classification test on 20 traces

  19. Summary • Real-time Business Ambience metadata • Change the way users interact with local search results • Change the way search engines rank local businesses • Mobile audio sensing approach • Crowd-source ambience info from real users visiting businesses • >80% accuracy • Robust to near-phone talking

More Related