Project Omniglean
Team Members: Kenny Trytek, Derek Woods, Abby Birkett, Joe Briggie
Advisor: Simanta Mitra
Client: Kingland Systems
Problem statement
• Large companies have many layers of corporate hierarchy.
• Financial and data records sometimes conflict between various layers/entities.
• Accurate and comprehensive company records are needed.
• There is a need for “Data Mastering”: taking multiple conflicting sources of data and determining the reality of the matter in conflict.
Concept Sketch
[Figure 2.1.1: Concept Sketch. Labels: Analyst User, Client User, Omniglean, Internet (WWW), Harvest & store.]
Functional requirements
• System shall autonomously traverse publicly available websites.
• System shall parse information from downloaded files in Portable Document Format (PDF).
• System shall store parsed information in a flat file.
• System shall allow the user to create, modify, and delete records.
• System shall maintain a normalized database.
• System shall expose functionality through web services.
Non-functional requirements
• System shall support up to 250 concurrent users.
• A single run of the system shall complete execution in less than six hours.
• System shall be easily extensible to include more websites than originally specified.
• System shall be completed by May of 2011.
Constraints and technical considerations
• System shall interact with a third-party library to facilitate database interaction.
• The database may not be available at all times.
• Using SVN to manage code.
• Using the spiral design process.
Market Survey
• Omniglean provides a unique combination of access to freely available FDIC and FFIEC data through a data mastering suite.
• Omniglean provides access to the mastering capability through web services, enabling rapid delivery of functionality to customers of Kingland Systems as well as to analysts located in different geographical areas.
Potential risks and mitigation
• External format change
• External availability
• New technologies
• Web services
• Not enough time for testing and debugging
• Team members
Project milestones
• Complete modules related to harvesting and transforming the data.
• Complete web services and user interface modules.
• Integrate all modules successfully.
Functional decomposition
• Harvester – Gathers data
• ETL – Transforms data
• DAL – Database access layer
• Web services – Exposes data to external users
• User interfaces
System Diagram
[Figure: System Diagram. Component labels: WWW Data, Scraper Tool, HTML Parser, PDF Parser, Flat File, ETL Tool, DAL, Database (Normalized), Web Svcs., External Client UI, Kingland Data Analyst UI. Other labels: No Conflicts?; Create, Read, Update, Delete.]
Harvester
[Diagram labels: World Wide Web, Scraper, Parser (PDF Parser, HTML Parser), Flat File (XML).]
• startGatheringData() - Returns the XML document populated with data from the site.
• getLogFile() - Returns the log file that is either being written to or has been written to this session.
• stopGatheringData() - Stops all current harvester operations and writes an error to the log file.
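The three harvester operations could be sketched roughly as follows. This is only an illustration in Python, not the team's implementation: the `fetch_page` callable, the `<organizations>` XML layout, and the in-memory log are all assumptions made so that the example runs without network access.

```python
import xml.etree.ElementTree as ET


class Harvester:
    """Toy harvester exposing the three operations named on the slide."""

    def __init__(self, fetch_page, urls):
        # fetch_page is a hypothetical callable: url -> dict of field values.
        self._fetch_page = fetch_page
        self._urls = list(urls)
        self._log = []          # in-memory stand-in for the log file
        self._stopped = False

    def start_gathering_data(self):
        """Visit each URL and return an XML document with the parsed data."""
        self._stopped = False
        root = ET.Element("organizations")   # assumed XML layout
        for url in self._urls:
            if self._stopped:
                break
            try:
                fields = self._fetch_page(url)
            except Exception as exc:
                # A failing site is logged and skipped, not fatal.
                self._log.append(f"ERROR {url}: {exc}")
                continue
            org = ET.SubElement(root, "organization", source=url)
            for name, value in fields.items():
                ET.SubElement(org, name).text = str(value)
            self._log.append(f"OK {url}")
        return ET.ElementTree(root)

    def get_log_file(self):
        """Return the log contents written so far in this session."""
        return "\n".join(self._log)

    def stop_gathering_data(self):
        """Stop the current run and record an error entry in the log."""
        self._stopped = True
        self._log.append("ERROR harvest stopped by request")
```

Passing the page fetcher in as a parameter keeps the sketch testable offline and mirrors the extensibility requirement: supporting another website would mean supplying another fetcher rather than changing the harvester.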
ETL Tool
[Diagram labels: Flat File (XML), ETL Tool, DAL.]
• loadFFIEC() - Loads the data from the XML file into the FFIEC table.
• loadFDIC() - Loads the data from the XML file into the FDIC table.
• createORGANIZATION() - Takes the information from both the FFIEC table and the FDIC table and puts it into the ORGANIZATION table.
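A minimal sketch of these three ETL steps, under several assumptions: SQLite stands in for SQL Server, the XML layout (`<organization>` elements with `<id>` and `<name>` children) is hypothetical, and the merge rule (prefer the FFIEC value, flag disagreements in a `conflict` column) is invented for illustration. The slides do not say how conflicts are actually resolved.

```python
import sqlite3
import xml.etree.ElementTree as ET


def make_schema(conn):
    """Create the two staging tables and the merged table (assumed columns)."""
    conn.executescript("""
        CREATE TABLE FFIEC (id TEXT PRIMARY KEY, name TEXT);
        CREATE TABLE FDIC (id TEXT PRIMARY KEY, name TEXT);
        CREATE TABLE ORGANIZATION (id TEXT PRIMARY KEY, name TEXT, conflict INTEGER);
    """)


def _load(conn, table, xml_text):
    """Insert one row per <organization> element into a staging table."""
    rows = [
        (org.findtext("id"), org.findtext("name"))
        for org in ET.fromstring(xml_text).iter("organization")
    ]
    # The table name only ever comes from the two wrappers below.
    conn.executemany(f"INSERT INTO {table} (id, name) VALUES (?, ?)", rows)


def load_ffiec(conn, xml_text):
    _load(conn, "FFIEC", xml_text)


def load_fdic(conn, xml_text):
    _load(conn, "FDIC", xml_text)


def create_organization(conn):
    """Merge both staging tables into ORGANIZATION, one row per id.

    Assumed rule: the FFIEC name wins; conflict = 1 when the sources disagree.
    """
    conn.execute("""
        INSERT INTO ORGANIZATION (id, name, conflict)
        SELECT ids.id,
               COALESCE(f.name, d.name),
               f.name IS NOT NULL AND d.name IS NOT NULL AND f.name <> d.name
        FROM (SELECT id FROM FFIEC UNION SELECT id FROM FDIC) AS ids
        LEFT JOIN FFIEC AS f ON f.id = ids.id
        LEFT JOIN FDIC AS d ON d.id = ids.id
    """)
```

Keeping the conflict flag in the merged table, rather than silently picking a winner, would leave the final decision to an analyst, which is in the spirit of the "Data Mastering" goal in the problem statement.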
Data Access Layer
[Diagram labels: ETL Tool, User Interface, DAL (Add(), Find(), Update(), Delete()), Database.]
Organization - A class that creates and maintains a connection to the ORGANIZATION table.
• GetConnection()
• CloseConnection()
OrganizationService - A class that allows CRUD functionality with an Organization object.
• Find(String organizationId)
• Add(...)
• Delete(String organizationId)
• Update(String organizationId, ...)
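The split between the two DAL classes might look roughly like this. It is a sketch only: SQLite replaces SQL Server, the table has just an `id` and a `name` column, and the arguments elided as "..." on the slide are reduced to a single `name` parameter for illustration.

```python
import sqlite3


class Organization:
    """Creates and maintains the connection to the ORGANIZATION table."""

    def __init__(self, database=":memory:"):
        self._database = database
        self._conn = None

    def get_connection(self):
        # Open lazily and reuse the same connection afterwards.
        if self._conn is None:
            self._conn = sqlite3.connect(self._database)
            self._conn.execute(
                "CREATE TABLE IF NOT EXISTS ORGANIZATION "
                "(id TEXT PRIMARY KEY, name TEXT NOT NULL)"
            )
        return self._conn

    def close_connection(self):
        if self._conn is not None:
            self._conn.close()
            self._conn = None


class OrganizationService:
    """CRUD operations that go through an Organization connection holder."""

    def __init__(self, organization):
        self._organization = organization

    def add(self, organization_id, name):
        conn = self._organization.get_connection()
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO ORGANIZATION (id, name) VALUES (?, ?)",
                (organization_id, name),
            )

    def find(self, organization_id):
        """Return the (id, name) row, or None when the id is unknown."""
        conn = self._organization.get_connection()
        return conn.execute(
            "SELECT id, name FROM ORGANIZATION WHERE id = ?", (organization_id,)
        ).fetchone()

    def update(self, organization_id, name):
        """Return True when exactly one row was changed."""
        conn = self._organization.get_connection()
        with conn:
            cur = conn.execute(
                "UPDATE ORGANIZATION SET name = ? WHERE id = ?",
                (name, organization_id),
            )
        return cur.rowcount == 1

    def delete(self, organization_id):
        """Return True when exactly one row was removed."""
        conn = self._organization.get_connection()
        with conn:
            cur = conn.execute(
                "DELETE FROM ORGANIZATION WHERE id = ?", (organization_id,)
            )
        return cur.rowcount == 1
```

Because every statement is parameterized and routed through one connection holder, both the ETL tool and the user interface could share the same service without building SQL strings themselves.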
Web services
Allows remote users to access the database through the Internet.
[Diagram labels: Unauthenticated, Authenticated, Read(), Write(), Update(), Delete(), LogOut().]
• Create()
• Read()
• Update()
• Delete()
• login()
• logout()
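One way to picture the authenticated and unauthenticated states is a facade that hands out a session token on login and checks it on every CRUD call. The sketch below is an assumption-heavy illustration: it uses an in-memory dictionary instead of the DAL, plain-text demo credentials, and it assumes that all four CRUD operations require authentication, which the slides do not state. The real system is described as using WSDL and SOAP, which this example does not attempt to model.

```python
import secrets


class AuthError(Exception):
    """Raised for bad credentials or a missing/expired session token."""


class WebServiceFacade:
    def __init__(self, credentials):
        # username -> password; plain text here for demonstration only.
        self._credentials = dict(credentials)
        self._sessions = set()   # currently valid session tokens
        self._records = {}       # in-memory stand-in for the DAL

    def login(self, username, password):
        """Return a new session token when the credentials match."""
        if username not in self._credentials or self._credentials[username] != password:
            raise AuthError("bad credentials")
        token = secrets.token_hex(16)
        self._sessions.add(token)
        return token

    def logout(self, token):
        """Invalidate the token; unknown tokens are ignored."""
        self._sessions.discard(token)

    def _require(self, token):
        if token not in self._sessions:
            raise AuthError("not authenticated")

    def create(self, token, record_id, data):
        self._require(token)
        if record_id in self._records:
            raise KeyError(f"record {record_id} already exists")
        self._records[record_id] = dict(data)

    def read(self, token, record_id):
        self._require(token)
        return dict(self._records[record_id])

    def update(self, token, record_id, data):
        self._require(token)
        self._records[record_id].update(data)

    def delete(self, token, record_id):
        self._require(token)
        del self._records[record_id]
```

A caller would first call `login()`, pass the returned token to each operation, and call `logout()` when finished; after that, the same token is rejected.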
Technology platform
• SQL Server 2008
• Visual Studio 2010 development environment
• Microsoft Windows operating system
• WSDL and SOAP for web services
Test plan
The team will test the system in three phases: first the individual modules, then the integration of the modules, and finally the system as a whole.
Prototyping
We have begun prototyping. The harvester can traverse the necessary websites, the ETL tool can read in XML files, and the user interface has been mocked up.
Task responsibility
• Kenny Trytek – Team Leader, responsible for the harvester
• Derek Woods – Developer, responsible for the ETL and UI
• Abby Birkett – Developer, responsible for the database and DAL
• Joe Briggie – Developer, responsible for web services
Plan for next semester
• Continue prototyping.
• Continue to meet with the client to be sure we are meeting expectations.
• Develop a more thorough test plan.