380 likes | 482 Views
Chapter 9 – Business Intelligence. Announcement. Thursday Night we will begin at 5:30. Why do organizations need BI?. Why do organizations need BI?. Tons of data out there! In 2002, 2 exabytes were created In 2008, 70 exabytes 14x words spoken by human beings ever
E N D
Announcement • Thursday Night we will begin at 5:30
Why do organizations need BI? • Tons of data out there! • In 2002, 2 exabytes were created • In 2008, 70 exabytes • 14x words spoken by human beings ever • Business Intelligence – information containing patterns, relationships, and trends • How do you get it out???? BI Systems
What BI Systems are available? • BI System – Information system that employs BI tools to produce and deliver information • Type of systems depend on tools in use • Categories of tools • Reporting - Simple • read, process, format, deliver • Used to assess results – What happened? • Data mining - Sophisticated • Searching for patterns or relationships • Used to make predictions – What will happen? • Knowledge management • Used to store employee knowledge and make it available to others • Source of data – humans • How do you handle what is happening?
Tools vs. Applications vs. Systems • Tool – one or more computer programs that implement the logic of a particular procedure • Example: Decision tree analysis • Application – use of a tool on a particular type of data for a particular purpose • Example: Assess risk for a loan to default • System – has all 5 components (hardware, software, data, people, procedures) delivering results of a BI application • Example: delivers results to loan officer who makes final decision
Reporting Applications • Reporting application inputs data from one or more sources and applies a reporting tool to that data to produce information. This is then delivered to users by reporting system. • Operations commonly used: • Sorting • Grouping • Calculating • Filtering • Formatting
Some Dashboards to see • http://dashboard.virginiadot.org/ • http://dashboard.imamuseum.org/ • http://buildingdashboard.com/clients/jmu/
Analytical Tools • RFM Analysis – ranks information according to purchasing behavior – gives customers an RFM Score (1 – 5, 1 being the top 20%) • How Recently? • How Frequently? • How much Money?
In Class Exercise • Review the data. • Sort the data • Split into 20% increments for R, F, and M • 1 for Most Recent, 5 for Least Recent • 1 for Most Frequent, 5 for Least Frequent • 1 for Most Money, 5 for Least Money • Assign scores to each customer
OLAP – Online Analytical Processing • More generic than RFM • Dynamic – viewer can change the format • Measures and Dimensions • Measures – data item of interest • Total sales, average sales, average cost, etc. • Dimension – characteristic of a measure • Purchase date, customer location, etc.
Example – An OLAP Cube or report • Users can alter the format • Possible to drill down into the data • Requirements • Computing power • Tools may be costly Measure Dimension
A Demo of a Tool • http://www.tableausoftware.com/products/tour
Data Mining • Statistical techniques to find patterns and relationships among data and use it for classification and prediction • Data mining techniques are a blend of statistics and mathematics, and artificial intelligence and machine-learning
What’s the difference between supervised and unsupervised data mining?
Supervised vs. Unsupervised data mining • Unsupervised data-mining characteristics: • No model or hypothesis exists before running the analysis • Analysts apply data-mining techniques and then observe the results • Analysts create a hypothesis after analysis is completed • Cluster analysis, a common technique in this category groups entities together that have similar characteristics • Supervised data-mining characteristics: • Analysts develop a model prior to their analysis • Apply statistical techniques to estimate parameters of a model • Regression analysis is a technique in this category that measures the impact of a set of variables on another variable • Neural networks predict values and make classifications
Market-Basket Analysis • Data mining tool for determining sales patterns • Helps businesses create cross-selling opportunities • Terms used with this type of analysis • Support—the probability that two items will be purchased together • Confidence—a conditional probability estimate • Lift – ratio of confidence to support • Complex, requires analytical tools
Decision Trees • Hierarchical arrangement of criteria that predicts a classification or value • Unsupervised data-mining technique that selects the most useful attributes for classifying entities on some criterion • If…then rules
Example • Select attributes that are most useful for classifying • Predicting Grades for Students in COB 204 • What are some attributes/characteristics we should consider? How do businesses use decision trees?
College Admissions Decision TreeGroup Assignment – Ethics p.303
Data Warehouses and Data Marts • Address the problems companies have with missing data values and inconsistent data • Help standardize data formats between operational data and data purchased from third-party vendors • Prepare, store, and manage data specifically for data mining and analyses.
The Curse of Dimensionality • The more attributes there are, the easier it is to build a model that is worthless
Data Marts vs. Data Warehouses • Data mart is smaller than a warehouse • Data mart addresses a particular component or function
Knowledge Management Applications • KM – process of creating value from intellectual capital and sharing with others who need it • Data mining and reporting create new information • KM shares known information
Benefits of KM • Fosters innovation – free flow of ideas • Improves customer service – faster response time • Boosts revenues – get product to market faster • Enhances retention – recognize/reward knowledge • Streamlines operations – eliminates/reduces redundant or unnecessary operations • Preserves organizational memory
Sharing Document Content • Indexing • Need to be able to easily access information • Need keyword searchability • Need quick response • RSS – Real simple syndication • Think of it as an email system for content • Subscribe to magazines, blogs, websites, and other sources – RSS Feeds
Expert Systems • Rule based systems using if…then logic • Created by interviewing experts and codifying their decisioning (vs. decision trees that review past data and performance) • Can have hundreds of thousands of rules (vs. <12 in decision trees)
Expert System Problems • Difficult and expensive to manage • Difficult to maintain • Implications of rule changes • Difficult to perform at same level as real experts • Example - medicine
Mgt Functions of BI Servers • Maintains metadata about the authorized allocation of BI results to users • Tracks what results are available, who is authorized to view them, and when the results are provided to users • Options for managing results • Users can pull their results from a Web site using a portal server with a customizable user interface • A server can automatically push information to users through alerts which are messages announcing events as they occur • Portal servers – allow for customization of the interface • A report server, a special server dedicated to reports, can supply users with information.
Delivery Functions • Characteristics of the delivery function of a BI server: • Tracks authorized users. • Tracks the schedule for providing results to users. • Uses exception alerts that notify users of an exceptional event. • Procedures used depends on the nature of the BI system. • Procedures tend to be more flexible than those in an operational system because users of a BI system tend to be engaged in work that is neither structured nor routine. • Procedures are determined by unique requirements of users.