190 likes | 270 Views
Extended Keyword Index & Improved Search for Semantic e-Catalog. 이동주. Contents. Motivation Semantic e-Catalog Search In e-Catalog Search Strategy Keyword Index Scoring Fucntion CatOnt Conclusion & Future Work. Motivation. Keyword Search
E N D
Extended Keyword Index & Improved Searchfor Semantic e-Catalog 이동주 IDS
Contents • Motivation • Semantic e-Catalog • Search In e-Catalog • Search Strategy • Keyword Index • Scoring Fucntion • CatOnt • Conclusion & Future Work IDS
Motivation • Keyword Search • e-Catalog take a very important role in e-Business • many people want to search product information using simple keyword • Semantic e-Catalog • legacy e-Catalog couldn’t fully express the various and complex product information and relationship • semantic e-Catalog system needs • suitable search strategy needs IDS
Classification Scheme3 Classification Scheme2 Classification Scheme1 …… …… …… … P3 P4 P2 P1 P4 v v v v v Semantic e-Catalog (1) Attribute Product Data …… IDS
Semantic e-Catalog (2) EC = {E, R}, E = {P, C, A, U} ME ∈ {C, A, U}, MA = {α1, α2, ..., αm} me = {(α, v)| α ∈ MA, v ∈ VALUE} p = { (a, v)| a ∈ A, v ∈ VALUE} R = { (e1, e2, r)| e1 ∈ E1, e2 ∈ E2, E1 ∈ E, E2 ∈ E, r ∈ DR} EC : Electronic Catalog E : Entity R : Relationship DR : Definition of Relationship ME : Meta Entity, MA : Meta Attribute P : Product , C : Classification Scheme A : Attribute, U : Unit Of Measure IDS
Search In e-Catalog Search Query Search Engine Sorted List Query Analyzer Ranker DB Interface e-Catalog DB IDS
Search Strategy • use simple keyword • use semantics implied in e-Catalog • relationship between entities • construct keyword index of entity’s information (values of attributes) • construct extended keyword index with tagging • use semantics implied in search query • extract useful keyword and tag meaning IDS
Extended Keyword Index • extended keyword • (voc, tag1, tag2, …, tagt) • extend the definition of semantic e-Catalog with extended keyword index e = { (a, v)| a ∈ ATT, v ∈ VALUE} if e is Product ATT is A else ATT is MA ivoc = (voc, tag1, tag2, …, tagt) tag1 is a’s identifier e = {ivoc1, ivoc2, …, ivocv} VOC : Vocabulary IDS
Attribute Classification Scheme G2B Attribute Group UOM Classification Scheme GUNGB UOM Group Product (ComAtt) Product (IndAtt) Classification Scheme UNSPSC VOC RDB Structure for Semantic e-Catalog e-Catalog DB IDS
Extracting Keyword Indexes • different extracting mechanism according to attributes • name • description • numeral • just use original IDS
it’s different according to attribute Process of Keyword Index Extraction Analyze Morpheme Structure use KLT module Select possible result Extends the word using dictionaries Eliminate the useless word Count frequency and mark order Eliminate duplicated word Do tagging and return Keyword List IDS
Tags IDS
Scoring Function Score(Q, e) extend the query Q = {q1, q2, …, qi, …, qn} qi = {voc, tag1, tag2, …, tags} from extended definition with extended keyword index e = {ivoc1, ivoc2, …, ivoca} Score(Q, e) = ∑I,jScore(qi, ivocj) generalize with relationship r related e Score(Q, e) = ∑I,jScore(qi, ivocj) + ∑k,lwrk*Score(Q,e’l) wrk : weight of relation rk e’l : related entity using rk Score(qi, ivocj), wr dominate total score IDS
CatOnt • Parser • Loader • easily extensible semi-automated loading tool using XML specification • Searcher • not implemented yet IDS
Loading Process Specification - Keyword Index Construction (1) IDS
Loading Process Specification - Keyword Index Construction (2) IDS
Conclusion & Future Work • Conclusion • propose extended keyword index using various tag for semantic e-Catalog • implement semi-automated converting tool from legacy e-Catalog to semantic e-Catalog with easily extensible XML specification • propose scoring function which extended keyword index is applicable • Future work • contrive feasible scoring function and methods to assign weights of each relationship • implement Searcher • extend this motel to general E-R model IDS