250 likes | 383 Views
Technical Session I MCRDR Classifier & Database Design. School of Computing. Presenter Yangsok Kim. Scope. Explain MCRDR classifier installation process Explain MCRDR classifier database table structure Explain relationship between classification process and database table structure.
E N D
Technical Session IMCRDR Classifier & Database Design School of Computing Presenter Yangsok Kim
Scope • Explain MCRDR classifier installation process • Explain MCRDR classifier database table structure • Explain relationship between classification process and database table structure School of Computing, University of Tasmania
MCRDR classifier Installation Click this icon • Download MCRDR classifier from the Unit Web page (http://www.comp.utas.edu.au/iWeb/iPrj/kxa459/2007/project.html) • Create a folder for MCRDR (e.g., mcrdr_classifier) and unzip the download file • Execute MonClassifier_KXA4592007.exe • Configure database setup Select “Database” Fill in database information (host, userid, password, database name) Select username and fill in username and passowrd Click reconnect button School of Computing, University of Tasmania
Client (Classifier) Tables Server Tables Database • You can see your database via phpMyAdmin interface (http://www.comp.utas.edu.au/iWeb/phpMyAdmin/) • 9 tables are client side (classifier table) and 7 tables are server side table. School of Computing, University of Tasmania
tbmonservsite tbmonservsitetree tbmonservarticle Database – Server Side Tables Join tbmonservsite and tbmonservsitetree SELECT a.MonSiteID, b.MonSitePID, a.MonSiteURL, b.MonSiteName, b,nTime FROM tbmonservsite a, tbmonservsitetree b WHERE a.MonSiteID=b.MonSiteID School of Computing, University of Tasmania
Import Monitoring Webpages Click this icon to start import web sites Select Checkbox to import monitoring web sites School of Computing, University of Tasmania
tbclassify_user_article tbmonservarticle Import Monitoring Webpages If you want to article details, you need to join tbclassify_USERNAME_article and tbmonservarticle e.g. SELECT b.ArticleID, b.Title, b.Body FROM tbclassify_user_article a, tbmonservarticle b WHERE a.ArticleID=b.ArticleID This table date is filled when you import monitoring Web pages. OR After registering Web pages, the MonServer automatically feeds newly collected articles. School of Computing, University of Tasmania
tbclassify_user_article Import Monitoring Webpages This field illustrates whether the article is read or not TypeID illustrates current status of Article 0:Not Classified 1:Classified 2:Deleted ClassifiedDate This time information is saved when the article classified School of Computing, University of Tasmania
tbclassify_user_monsitekwdtree tbmonservsite tbmonservsitetree Import Monitoring Webpages Monitoring Site ID Monitoring site name School of Computing, University of Tasmania
Classifier Main UI Monitoring Site List tbclassify_user_monsitekwdtree Monitored Article List tbclassify_USERNAME_article and tbmonservarticle Inference Results tbclassify_user_articlefolder, tbclassify_user_ruletree tbclassify_user_rulecondition Classification structure tbclassify_user_foldertree Selected Article tbclassify_user_article and tbmonservarticle School of Computing, University of Tasmania
Rule base • MCRDR classifier maintains rule base for classification • Rule base is an n-ary tree • Each node is a rule • Rule consists of condition parts and conclusion • Condition Attributes are Title, Body, and All of a article • Each condition evaluates whether the condition words is included in the articles • Conclusion is a folder of the classification structure School of Computing, University of Tasmania
Rule Base Rule Base Root Classification Structure Rule 1 Article IF THEN Root Title: a,b,c,g,f Body: c, d, k,m Title includes {a, b} AND Body include {c, d} A Classify into folder C B Rule 2 C Rule 3 Rule 4 School of Computing, University of Tasmania
tbclassify_user_ruletree tbclassify_user_rulecondition Rule Base Each rule has parent rule id There are two types of rules(refine rule(0), stop rule(1) Conclusion ID is a folder ID CornerstoneCaseID is an ArticleID that is used for rule creation Each rule has multiple conditions Condition word, which is case sensitive Three condition types – 0:Title, 1:Body, and 2:ALL School of Computing, University of Tasmania
Rule base This table stores classification results tbclassify_user_articlefolder School of Computing, University of Tasmania
Classification Structure • There are predefined classification (folder) structure. Don’t change any folders!! • Each folder is used as the conclusion of rule • Classification structure is a tree. School of Computing, University of Tasmania
Classification Structure Each folder has unique ID tbclassify_user_foldertree Each folder’s parent folder ID School of Computing, University of Tasmania
Rule Creation • With MCRDR classifier, user incrementally creates rules that classify documents. • Rule creation initiated • 1) when the classifier doesn’t provide recommendation or • 2)when the classifier gives wrong recommendation • Rule can be refine rule or stop rule School of Computing, University of Tasmania
Rule Creation - Basic Click Knowledge Acquisition icon Select a folder that this article is classified Select Article form the Article List Condition word is added here New rule added to the rule tree Click this button to create a rule System shows keywords that are not found in the excluded cases. Select condition word Display articles that satisfies current rule. We need to validate these cases Select condition type System place the selected article in this pane Select cases that should not classified by current rule If you want to finish click this button School of Computing, University of Tasmania
Rule Creation - Basic tbclassify_user_ruletree tbclassify_user_rulecondition tbclassify_user_articlefolder School of Computing, University of Tasmania
Refine Rule Select article that should be refined Click knowledge acquisition icon New rule is added under the parent rule Select a folder under the current recommended folder View current recommendation The following process is same the basic rule creation process School of Computing, University of Tasmania
Rule Creation – Refine Rule tbclassify_user_ruletree tbclassify_user_rulecondition New Rule is added tbclassify_user_articlefolder Classification result has been changed School of Computing, University of Tasmania
Rule Creation – Stop rule Select an article that should not be classified this folder Select current recommended folder and choose deselect menu from the shortcut menu Stop rule is added under the current fired rule After selecting “Deselect” menu, follow the basic rule creation process School of Computing, University of Tasmania
Rule Creation – Stop rule This rule is added. Note the RuleType and ConclusionID This conditions are added tbclassify_user_articlefolder tbclassify_user_articlefolder This result is deleted School of Computing, University of Tasmania
Inference School of Computing, University of Tasmania
Q&A School of Computing, University of Tasmania