140 likes | 434 Views
Data Analytics Architectures and Infrastructure. Daniel Silver March, 2014. The KDD Process. Interpretation and Evaluation. Data Mining. Knowledge. Selection and Preprocessing. p(x)=0.02. Data Consolidation. Patterns & Models. Prepared Data. Data Warehouse. Consolidated
E N D
Data Analytics Architectures and Infrastructure Daniel Silver March, 2014
The KDD Process Interpretation and Evaluation Data Mining Knowledge Selection and Preprocessing p(x)=0.02 Data Consolidation Patterns & Models Prepared Data Data Warehouse Consolidated Data Data Sources
The KDD Process The Architecture of a KDD System Graphical User Interface Data Mining Interpretation and Evaluation Selection and Preprocessing Data Consolidation Knowledge Warehouse Data Sources
Big Data Analytics Infrastructure • Requires: • DM: Data Management • DW: Data Warehouse • BI/DS: Business Intelligence / Data Science • DAA: Data Analytics Architecture
Big Data Analytics Infrastructure DW BI/DS DAA DM
Relationship between DW and DM? Strategic Tactical Rationale for data consolidation Analysis Query/Reporting OLAP Data Mining Data Warehousing Source of consolidated data
Data Warehousing OLAP Knowledge Workers “The Ideal Picture” Stats IDT Data Marts & Analytical Pocessors ANN One or more central repositories Data Warehouse Operational feedback from analytics Extraction Transformation Load Operational Data Store (ODS) Source Systems and Operational Users
Top-Down – traditional DA Architecture Bottom-up – Big DA Architecture
Hadoop and MapReduce • http://www.youtube.com/watch?v=9s-vSeWej1U • http://www.youtube.com/watch?v=4DgTLaFNQq0 • http://www.youtube.com/watch?v=RQr0qd8gxW8 • http://hci.stanford.edu/courses/cs448g/a2/files/map_reduce_tutorial.pdf
References • http://www.datasciencecentral.com/profiles/blogs/big-data-analytics-infrastructure • Reference Architecture for Big Data and DW • http://www.youtube.com/watch?v=L_s-x1HAi5k • http://www.youtube.com/watch?v=bETjVsWJsAs