
Clustering Internet Search Results with Google App Engine

An overview of Google App Engine for Java applications: its features, sandboxing, request handling, and architecture, illustrated by an application that clusters Internet search results for a person.

cyazzie

Presentation Transcript


  1. Google App Engine and Java Application: Clustering Internet search results for a person
     Aleksandar Kartelj (kartelj@matf.bg.ac.rs), Faculty of Mathematics, University of Belgrade

  2. Google App Engine
     • Web application hosting service
     • Designed for real-time dynamic apps
     • Many simultaneous users
     • Scalable
     • Resources are billed monthly
     • Free account (5 million page views a month)

  3. Sandboxing
     • An app can only read its own files
     • An app cannot manipulate environment variables
     • An app cannot access network facilities directly
     • Requests are distributed non-deterministically
     • Consequence: multiple apps can safely run on the same server, and one app can safely run on multiple servers

  4. Architecture overview

  5. Features
     • The static file servers
     • The Datastore
         • Not a join-query database
         • Most closely resembles an object database
     • Entities and properties
         • An entity has 1..* properties
         • Not to be confused with rows in an RDBMS
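The difference between an entity and an RDBMS row can be sketched in plain Java. This is a minimal illustration, not the App Engine Datastore API: `SketchEntity`, its `kind` and `key` fields, and the property names used below are all hypothetical. The point it shows is that each entity carries its own set of named properties, so two entities of the same kind need not share a fixed set of columns.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical stand-in for an entity: a kind, a key, and a free-form set of properties. */
final class SketchEntity {
    private final String kind;
    private final String key;
    // Properties are stored per entity; there is no shared table schema.
    private final Map<String, Object> properties = new LinkedHashMap<>();

    SketchEntity(String kind, String key) {
        this.kind = kind;
        this.key = key;
    }

    /** Sets (or overwrites) one property and returns this entity for chaining. */
    SketchEntity set(String name, Object value) {
        properties.put(name, value);
        return this;
    }

    Object get(String name) { return properties.get(name); }
    boolean has(String name) { return properties.containsKey(name); }
    String kind() { return kind; }
    String key() { return key; }
    int propertyCount() { return properties.size(); }
}

final class EntitySketchDemo {
    public static void main(String[] args) {
        // Two entities of the same (assumed) kind with different property sets.
        SketchEntity a = new SketchEntity("SearchResult", "r1")
            .set("title", "Home page")
            .set("url", "http://example.org/");
        SketchEntity b = new SketchEntity("SearchResult", "r2")
            .set("title", "Profile")
            .set("cluster", 3);
        System.out.println(a.has("cluster") + " " + b.has("cluster")); // prints "false true"
    }
}
```

In a relational table both rows would have to carry a `cluster` column, possibly NULL; here the property simply does not exist on the first entity.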

  6. Features
     • Queries and indexes
         • An index is made for every simple query
         • Query performance is affected only by the size of the result set
     • Transactions
         • Optimistic concurrency control
         • Entity groups (entities updated in one transaction)
     • Services: memcache, URL fetch, Mail, …
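Optimistic concurrency control, as named on the slide above, can be illustrated with a small in-memory sketch. It is not how the Datastore is implemented; `OptimisticCounter`, `Versioned`, and the retry limit are hypothetical names chosen for illustration. The idea shown is the general one: read a snapshot without locking, compute the new value, and commit only if nobody else committed in the meantime, retrying otherwise.

```java
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.IntUnaryOperator;

/** Hypothetical in-memory value guarded by optimistic concurrency control. */
final class OptimisticCounter {
    /** Immutable snapshot: the stored value plus the version it was written at. */
    static final class Versioned {
        final int value;
        final long version;
        Versioned(int value, long version) { this.value = value; this.version = version; }
    }

    private final AtomicReference<Versioned> state =
        new AtomicReference<>(new Versioned(0, 0));

    /** Reads the current snapshot; no lock is taken. */
    Versioned read() { return state.get(); }

    /** Commits only if the stored snapshot is still the one that was read. */
    boolean tryCommit(Versioned seen, int newValue) {
        return state.compareAndSet(seen, new Versioned(newValue, seen.version + 1));
    }

    /** Read, compute, try to commit; on conflict start again from a fresh read. */
    int update(IntUnaryOperator change, int maxRetries) {
        for (int attempt = 0; attempt <= maxRetries; attempt++) {
            Versioned seen = read();
            int next = change.applyAsInt(seen.value);
            if (tryCommit(seen, next)) return next;
        }
        throw new IllegalStateException("too much contention");
    }
}

final class OptimisticDemo {
    public static void main(String[] args) {
        OptimisticCounter c = new OptimisticCounter();
        OptimisticCounter.Versioned stale = c.read(); // first writer takes a snapshot
        c.update(v -> v + 1, 3);                      // second writer commits first
        boolean ok = c.tryCommit(stale, 42);          // first writer's commit is rejected
        System.out.println(ok + " " + c.read().value); // prints "false 1"
    }
}
```

The rejected writer loses no data: it rereads and retries, which is what `update` does in its loop. This approach suits workloads where conflicts on the same data are rare.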

  7. App Engine’s request handling

  8. Building applications

  9. App Engine dashboard

  10. App Engine dashboard

  11. Clustering search results

  12. Clustering search results

  13. EC2 vs GAE vs Azure

  14. EC2 vs GAE vs Azure

  15. Thank you. Aleksandar Kartelj Faculty of Mathematics, University of Belgrade kartelj@matf.bg.ac.rs www.matf.bg.ac.rs/~kartelj
