130 likes | 237 Views
Not Even Funny [Property Details Team]. Matt Dixon Architect Candace Remaly Project Manager Spencer Smith Business Analyst Bryan Linthicum Developer Adam Sternfeld Tester. Proposed Project Timeline. Tasks Completed. Installed necessary software Created a prototype
E N D
Not Even Funny[Property Details Team] Matt Dixon Architect Candace RemalyProject Manager Spencer Smith Business Analyst Bryan Linthicum Developer Adam SternfeldTester
Tasks Completed Installed necessary software Created a prototype Researched data mining techniques Successfully extracted data from Auditor’s website
Data Mining • Decided to use iRobot web scraper to extract data from Franklin County Auditor website • Obtained a list of every street in Franklin County (http://www.fceo.co.franklin.oh.us/) • iRobot uses the list as search criteria • Able to output data to XML file or database
<Variables Name="" Date="2009/04/13 23:53:41"> <VariableData Name="AbstractURL">http://franklincountyoh.metacama.com/do/selectDisplay?parcelid=56022655500&select=SUMMARY&curpage=*</VariableData> <VariableData Name="parcelid">560-226555-00</VariableData> <VariableData Name="MapRoutingNumber">560-N042KKK -068-00</VariableData> <VariableData Name="location">834 MACARRAN CT</VariableData> </Variables>
Problems Encountered Searching by street name can only return a maximum of 400 results (some addresses are being left out) Auditor’s website not always reliable for street name matching May take several days to extract information for all of Franklin County
Determining Foreclosures Can’t use foreclosure.com … doesn’t have full addresses Most likely use Fannie Mae (homepath.com) & Freddie Mac (homesteps.com) to get as much Foreclosure information as possible
Goals for Next Week Begin mapping out specifications and use cases Begin database design Refine web scraping for Franklin County Auditor’s website Create web scraper for homesteps.com and homepath.com