100 likes | 229 Views
Apartment Cloud. Noah Callaway Zac Fleischmann Zak Nelson Brandon Zahl. Aggregate apartments listings from all across the internet to create a… … simple, one-stop, apartment search Aggregate apartment listings from top sites. (Washington state only)
E N D
Apartment Cloud Noah Callaway Zac Fleischmann Zak Nelson Brandon Zahl
Aggregate apartments listings from all across the internet to create a… …simple, one-stop, apartment search Aggregate apartment listings from top sites. (Washington state only) …mostly one-stop apartment search. …mostly simple. Aspirations / Reality
Brandon – Site specific extractors Statistics Noah – Server configuration Front-end development Zac – Site specific extractors Advanced Search Zak – Crawler / Aggregator Commute distance feature Building It
Much higher accuracy on the structured pages versus unstructured craigslist • Craigslist is candidate for machine learning • Machine learning likely worse on others Experiment Conclusion
How to configure Amazon Web Services with a LAMP stack • How to create a web application with AJAX • How to use Jobo and Nutch for web crawling • How to parse HTML for pertinent data • The considerations of starting a web business What we learned
Amazon Web Services was slower than a $7/month virtual server • Most of the large listing sites were surprisingly easy to extract data from • Aggregating information from the web is legally tricky Unexpected Outcomes
Better version control • More pre-coding design • More quality control and testing • More extensible extractors (Maybe an existing HTML parser) Things We’d Do Differently