160 likes | 270 Views
Application of Apriori Algorithm to Derive Association Rules Over Finance Data Set. Presented By Kallepalli Vijay. Agenda. Introduction Motivation Background Solution Results Conclusion and Future work. Introduction. Stock exchanges maintain a log of events
E N D
Application of Apriori Algorithm to Derive Association Rules Over Finance Data Set Presented By Kallepalli Vijay
Agenda • Introduction • Motivation • Background • Solution • Results • Conclusion and Future work
Introduction • Stock exchanges maintain a log of events • rise in stock price, option value, number of stocks sold • Investors predict the market trends based on available log information • Stock market is highly unpredictable • The values in the log change very drastically with time
Introduction (Con…) • If investors are given adequate information • regarding stock market trends • Investors can invest money accordingly • maximum profits.
Motivation • Data in the log is very huge • Data contains hidden details • Manually identifying the hidden details • cumbersome process • Apply Apriori to retrieve hidden details • Prior deriving large item data has to be classified
Background • Data mining: A process of extracting unknown patterns, facts and relations from large database • Data mining means knowledge discovery from large databases • Association rules in data mining involves in detecting which items tend to occur together in transactions
Background (Con…) • Association rules in data mining was first proposed by Agrawal, Imielinski and Swami in 1993 • Ex: Customer who purchase one item are likely to purchase another item. • Consider A transaction is a set of items • T = {i1, i2,……it} • T I, where I is the set of all possible items {i1, i2,……in} • P Q, Where P I, Q I, and PQ
Solution • Finance data is quantitative • Classify data into regular intervals • Map the classified data to an index value • Index values range from 0 through 143 • 0 through 35 represent stocks opening value • 36 through 71 represent stocks day high value • 72 through 107 represent stocks day low value • 108 through 143 represent stocks closing value
Solution (Con…) • Finance data of the Apple Computers Inc is read from a text file • Eg1: Stock opening value ranging between 10.0 and 10.5 is mapped to an index value 0 • Eg2: Stock high value ranging between 10.0 and 10.5 is mapped to an index value 36 • Apply Apriori algorithm on the mapped indices to derive association rules
Conclusions & Future work • Manually identifying hidden details is a tedious process • Classified the collected data into regular intervals • Applied apriori algorithm to derive large item sets • Derived large item sets and projected to the user in user readable form
Conclusions & Future work (Con…) • Classification of data plays important role • Correctness of association rules depends on the classification of the data • Selecting the length of the interval for classification is difficult • Fuzzy logic can applied on the data for classification