190 likes | 303 Views
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 . Discovering the constraint-based association rules in an archive for unique Bulgarian bells. Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics and Informatics,
E N D
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Discovering the constraint-based association rules in an archive for unique Bulgarian bells Tihomir Trifonov, Tsvetanka Georgieva Department of Mathematics and Informatics, University of Veliko Tarnovo “St. Cyril and St. Methodius”, Bulgaria The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 • This paper presents an application that discovers the constraint-based association rules. • It allows the association analysis of the different characteristics of the bells. • Detailed information about the examined bells is maintained in an audio and video archive of unique Bulgarian bells. • The application is realized with Java and SQL and provides the possibility for finding the association rules of the data obtained after applying the methods of digital processing of signals for analysis of bell sounds. 2 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Tenth International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2008 Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 • Discovering the association rules is a data mining task [3, 7] in which the goal is to find interesting relationships between the attributes of the analyzed data. • Once found, the association rules can be used for supporting decision making in different areas. • In numerous cases the algorithms generate a large number of association rules, often thousands or even millions. • It is almost impossible for the end users to encompass or validate such a large number of association rules, limiting the results of the data mining is therefore helpful. 3 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Association Rules • Let I = {I1, I2, … , In} be a set of ndifferent values of attributes. Let Rbe a relation, where each tuple thas a unique identifier and contains a set of items, such that tI. An association rule is an implication of the form X → Y, where X, YIare sets of items with XY = . The set Xis called an antecedent, and Y –a consequent. • There are two parameters associated with a rule: • the support of the association rule X → Yis the proportion (in percentages) of the number of the tuples in R, which contain X Yto the total number of the tuples in the relation; • the confidenceof the association rule X → Yis the proportion (in percentages) of the number of the tuples in R, which contain X Yto the number of the tuples, which contain X. 4 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Association Rules • The task of association rules mining is to generate all association rules which have values of the parameters support and confidence, exceeding the previously given respectively minimal support min_suppand minimal confidence min_conf. • Therefore the discovery of the association rule requires finding the sets of items, which have a support, more than the previously defined minimal thresholdmin_supp. These sets are called frequent itemsets. 5 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Constraint-based Association Rules • To increase the efficiency of existing algorithms for data mining, during the mining process constraints are applied with the goal for these association rules, of which only those interesting to the user are generated, instead of all association rules. • The constraint-based association rule mining aims to find all rules from given dataset, which satisfy the constraints required from the users. • For discovering only the rules corresponding to the specific patterns, in [4] the meta-rules are applied. The format of the interesting rules is defined by using a template, the algorithm generates only these rules, which correspond to this template. 6 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Constraint-based Association Rules A meta-rule [4] is a rule template from the following type P1P2 … Pm → Q1Q2 … Ql where Pi (i = 1, …, m) and Qj (j = 1, …, l) are instantiatedpredicates or predicate variables, p = m + lis the number of the predicates in the rule. This paper presents an application, which allows the user to set constraints for searched rules and finds constraint-based association rules. The application is used for performing the association analysis on the different characteristics of the bells, the information for which is kept in an archive produced for the goal. 7 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Discovering the constraint-based association rules in an archive for unique Bulgarian bells Detailed data about the analysis of bells is stored in an audio and video archive of the unique Bulgarian bells [10]. The data of the archive is accessible from http://www.math.bas.bg/bells/belleng.html For each bell, information is maintained for its unique identifier, location, type, geometrical dimensions, weight, material, condition, creator, year or period of creation, description, estimation of its historical value, digital photos, sound and video files, spectrograms. 8 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Discovering the constraint-based association rules in an archive for unique Bulgarian bells A program is realized with MatLab [11] for analysis of the sounds of the bells by using various methods for digital signal processing (DSP) – spectral analysis by means of the Discrete Fourier Transform (DFT), digital filter, wavelet analysis. The partials of the sounds of the bells are found by applying the digital filter. The information about the previously calculated partials of the sounds of the different bells is stored in the archive. The represented application discovers the constraint-based association rules in an archive for unique Bulgarian bells. It is realized by using the languages Java and SQL. 9 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Discovering the constraint-based association rules in an archive for unique Bulgarian bells To the user that starts the application, the following possibilities, are provided: Setting the attributes, being subject to analysis; Setting the minimal value of the support min_suppand the minimal value of the confidence min_conf; Setting the conditions (Boolean expression) for the values of the attributes, which can participate or not in the antecedent and the consequence of the searched rules. 10 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Choosing the attributes; defining the minimal support, the minimal confidence, the conditions for the values of the attributes; outputting the found rules 13 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Discovering the constraint-based association rules in an archive for unique Bulgarian bells The figure shows an example result from the execution of the realized program with given values of the minimal support, minimal confidence and conditions for the values of the attributes. For instance, let the following rule be generated from the archive for unique Bulgarian bells: SecondParial(“600 Hz”) → FirstPartial(“320 Hz”) with values of the support s = 0.025 and the confidence c = 0.5. This rule means, that for monastery bells with second partial 600 Hz one of the most frequent values of the first partial is 320 Hz (with 50.00% confidence) and the monastery bells with value 600 Hz of their second partial represent 2.5% from all bells, included in the study. 14 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Discovering the constraint-based association rules in an archive for unique Bulgarian bells When representing the association rules in a tabular view all found rules are exposed in a table, where each row matches to a rule and provides information for the support and the confidence of this rule. All rules can be showed in different orders – according to the values of the attributes, participating in the antecedents and the consequents of the discovered rules; according to the values of the parameters support and confidence in ascending or descending order. In this manner the user has a more clear and complete view of the rules and can more easily locate a special rule. The tabular view facilitates adopting a large number of rules. 15 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Monastery “St. Transfiguration” near the town of Veliko Tarnovo 16 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Church “St. Nikolay” in Veliko Tarnovo, View close to the bell 17 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 The connection between BellDB and MatLab provides possibility for analyzing the sounds of the bells and to search a concrete bell by a sample of its sound. Real sound of the given bell and its 3D spectrogram, computed in MatLab 18 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Data acquisition of the experimental data (PULSE 11, B&K) 19 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 References • R. Agrawal, T. Imielinski, A. Swami, Mining Association Rules between Sets of Items in Large Databases, In Proc. of the ACM SIGMOD International Conference on Management of Data,1993. • R. Agrawal, R. Srikant, Fast Algorithms for Mining Association Rules, Proc. of the Int. Conf. on Very Large Databases, 1994. • A. A. Barsegyan, M. S. Kupriyanov, V. V. Stepanenko, I. I. Holod, Technologies for data analysis: Data Mining, Visual Mining, Text Mining, OLAP, BHV-Peterburg, 2008 (in Russian). • Y. Fu and J. Han, Meta-rule-guided mining of association rules in relational databases, In Proc. of the Int. Workshop on Integration of Knowledge Discovery with Deductive and Object-Oriented Databases, 1995. • T. Georgieva, Discovering Branching and Fractional Dependencies in Databases, Data and Knowledge Engineering, Elsevier, vol. 66, № 2, 2008. • M. Kamber, J. Han, J. Chiang, Using Data Cubes for Metarule-Guided Mining of Multi-Dimensional Association Rules, Technical Report, CMPT–TR–97–10, School of Computing Sciences, Simon Fraser University, 1997. • M. Kantardzic, Data Mining: Concepts, Models, Methods, and Algorithms, John Wiley & Sons, 2003. • S. Kotsiantis and D. Kanellopoulos, Association Rules Mining: A Recent Overview, GESTS International Transactions on Computer Science and Engineering, Vol.32 (1), 2006. • R. Ng, L. Lakshmanan, J. Han, A. Pang, Exploratory Mining and Pruning Optimizations of Constrained Association Rules, In Proceedings of the ACM SIGMOD Conference on Management of Data, 1998. • T. Trifonov ,T. Georgieva, Web based approach to managing audio and video archive for unique Bulgarian bells, In Proc. of the Tenth Int. Conf. on science and technology "System Analysis and Information Technologies", Kiev, 2008. • T. Trifonov ,T. Georgieva, The bell chime – an acoustical, mathematical and technological challenge, In Proceedings of the National Scientific Conference on Acoustics, Varna, 2008. • The Bell Project “Research and Identification of Valuable Bells of the Historic and Culture Heritage of Bulgaria and Development of Audio and Video Archive with Advanced Technologies” Website, http://www.math.bas.bg/bells/belleng.html 20 The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006
Eleventh International Conference on System Analysis and Information Technologies, Kyiv, Ukraine, 2009 Благодаря за вниманието! Thank youfor your attention! Спасибо за внимание! The work was supported partially by the Bulgarian National Science Fund under Grant KIN-1009/2006 21