490 likes | 656 Views
Automatic Organization for Digital Photographs with Geographic Coordinates. Mor Naaman, Yee Jiun Song, Andreas Paepcke, Hector Garcia-Molina. Digital Library Project, Database Group Stanford University. April 8 th , 2004 1:20:02pm. Latitude: N34.3121 Longitude: W122.234.
E N D
Automatic Organization for Digital Photographs with Geographic Coordinates Mor Naaman, Yee Jiun Song, Andreas Paepcke, Hector Garcia-Molina Digital Library Project, Database Group Stanford University
April 8th, 20041:20:02pm Latitude: N34.3121 Longitude: W122.234 Geo-Referenced Photos JCDL 2004
Geo-Photography Technology 1) 2) + JCDL 2004
Personal Photo Libraries • Searching/browsing very difficult • Little discernible structure to photo collections JCDL 2004
Managing Personal Photos • Content-based retrieval • Basic, primitive (far from semantic) • Manual labeling • Improved, yet cumbersome • Visual methods for fast scanning (Zoom) • Don’t scale well JCDL 2004
Our Approach • Absolutely no human effort required • Utilize time and location • Automatically captured • Easy to get JCDL 2004
Automatic Organization JCDL 2004
Automatic Organization JCDL 2004
Automatic Organization JCDL 2004
Automatic Organization JCDL 2004
Outline • Requirements and challenges • The algorithms • Sample output • Experiment results JCDL 2004
Browsing by Location/Time • Use a map/calendar • wwmx.org from MSR: • Map issues • Lots of screen space • Sparse • Limited interaction? • Not intuitive for some JCDL 2004
Time: 2003-01-01: Yosemite N.P. (2 Days) 2003-01-18: San Francisco (1 hour) 2003-01-18: San Francisco (1 hour) Using Hierarchies Location: United States Yosemite N.P, Yosemite Valley, CA Around: San Francisco, Berkeley, Sonoma CA Seattle, WA … San Francisco, Golden Gate Park, CA Berkeley, Oakland CA … … … Time
Challenges • Locations should be intuitive • Events are tricky • 3-days trip to NYC • The kid’s soccer game, followed by a birthday party • Good names are important. JCDL 2004
Outline • Requirements and challenges • The algorithms • Sample output • Experiment results JCDL 2004
Process Diagram JCDL 2004
Initial Event Segmentation Discovering Structure Automatic Organization Event Hierarchy Final Event Segmentation Location Hierarchy Initial Event Segmentation Location Clustering JCDL 2004
Initial Event Segmentation • Photos occur in bursts • Identify bursts: semantically “connected” JCDL 2004
Initial Event Segmentation Stream of photos • More details: • Graham et al, JCDL 2002 • Tomorrow • Proceedings JCDL 2004
Location Clustering Discovering Structure Automatic Organization Event Hierarchy Final Event Segmentation Location Hierarchy Initial Event Segmentation JCDL 2004
Location Clusters • Cluster the bursts into locations • A. Gionis and H. Mannila. Finding recurrent sources in sequences. InProceedings, Computational molecular biology 2003. • Minimize: number of clusters • Minimize: error (distance to cluster centers) JCDL 2004
Photo location Location Clusters: 2-D View
Location4- Location3- Location2- Location1- Location Clusters JCDL 2004
Location4- Location3- San Francisco Location2- Location1- Location Clusters (breakdown) • Some clusters may be overloaded: • Many bursts / picture-taking days in one location
Final Event Segmentation Discovering Structure Automatic Organization Event Hierarchy Location Hierarchy Initial Event Segmentation Location Clustering JCDL 2004
Final Event Segmentation • Again scan sequence, new events detected: • Whenever location context changes • In the same location, use adaptive time threshold JCDL 2004
Overnight trip to Yosemite Soccer game and dinner Final Event Segmentation JCDL 2004
Next - names • Detected location and event structure • Need to choose names for each node JCDL 2004
Palo Alto City Park Butano State Park Palo Alto Stanford Photo location Assigning Names
San Francisco, 30 miles San Jose, 20 miles Assigning Names – Nearby? What if photos occur sparsely within cities or parks?
Assigning Names - Nearby Which city has stronger “gravity”? JCDL 2004
Assigning Names - Nearby San Jose is Closer JCDL 2004
Assigning Names - Nearby San Jose is bigger* *larger population JCDL 2004
Assigning Names - Nearby But San Fran is more important!* *greater Google count Final name for location cluster: “Stanford, 30 miles South of SF” JCDL 2004
Assigning Names - Alexandria • Using polygon-based dataset of administrative areas • Alexandria gazetteer can be used for other prominent geographic features JCDL 2004
Outline • The requirement and challenges of automatic organization • The algorithms • Sample output • Experiment results JCDL 2004
Location Hierarchy Photoshop Album (at least 4 man-hours) Our system (about 0 man-seconds) JCDL 2004
Location Hierarchy (US) • San Francisco, Berkeley, Sonoma, CA • -Stanford, Mountain View, Monterey, CA • Monterey (58 miles S of San Jose) • Mountain View (4 miles NW of San Jose) • Stanford • -Colorado (219 miles W of Denver) • -Long Beach (35 miles S of Los Angeles, CA) • -Philadelphia, PA • -Seattle, WA • -Sequoia N.P. (153 miles E of Fresno, CA) • -South lake Tahoe; Bear Valley, CA • -Yosemite N.P.; Yosemite Valley, CA
Events Photoshop Album (at least 4 man-hours) ... 2003-06-28: Long Beach,CA (3 days) 2003-07-04: San Francisco,CA (3 hours) 2003-07-10: Colorado (3 days) 2003-07-15: San Francisco,CA(1 hours) 2003-07-18: Mountain View,CA (5 hours) 2003-07-27: San Francisco,CA (1 hours) 2003-09-28: Philadelphia,PA (1 hours) 2003-10-03: Sequoia NP (3 days) ... about 0 man-seconds:
Event Names • LOCALE: share automatically • Check personal calendar • Event Gazetteer • Easy interface JCDL 2004
Experiment • Tested on 3 real-world geo-referenced photo collections • Our system automatically generated the structure and names • Tested with the owners JCDL 2004
Experiment - Locations • Accepted the automatic hierarchy • Only minor edits requested • Merge/split few of the locations JCDL 2004
Experiment - Events • Compared to events as annotated by users • 80-85% in both recall and precision • Other metrics proposed (see paper) JCDL 2004
Experiment - Naming • Naming location clusters • For 76% of clusters, system and users pick at least one name in common • For the rest, “automatic” name was useful JCDL 2004
Not yet published: • Paid 13 participants to “geo-reference” their photos • Loaded to WWMX and our browser • Most liked the map better, but… • Performed the same for search/browse tasks • Event notion helps overcome location handicap • Organization “made sense” P.S. Some didn’t touch the map, yet used our location hierarchy. P.S.2 This was on a BIG screen!
Thank You! More details: Proceedings Google: Mor Naaman mor@cs.stanford.edu http://www-db.stanford.edu/~mor/ JCDL 2004
Future Work • User interface • PDA • Integrate with map • Global photo libraries JCDL 2004
Remember The Bursts? JCDL 2004