
Technology of Semantic Structuring of the Digital Library Content

Technology of Semantic Structuring of the Digital Library Content. V International Conference "Distributed Computing and Grid-technologies in Science and Education". I. Filozova, LIT JINR, Dubna, July 18, 2012.


Presentation Transcript


  1. Technology of Semantic Structuring of the Digital Library Content. V International Conference "Distributed Computing and Grid-technologies in Science and Education". I. Filozova, LIT JINR, Dubna, July 18, 2012

  2. Contents: Current Trends; Problematic Situation; Research Lines; Realization Ideas; QA-System on the Logic-Semantic Network Basis; Summary

  3. CURRENT TRENDS • Traditional publishing → digital-archive-based approach; • The scientific community is accumulating extensive digital information arrays → content integration at the metadata level → common data and information spaces; • Growth in the number of open-access institutional repositories. Number of repositories: 2,900; number of records: ~40,000,000, according to ROAR statistics (ROAR: http://roar.eprints.org)

  4. HOW TO FIND

  5. PROBLEMATIC SITUATION. Creating effective mechanisms for finding answers to questions in digital information funds is a pressing problem. [Diagram labels: QUESTION (V); DIGITAL INFORMATION FUND (information sources); INFORMATION LAWS; FIND THE INFORMATION (the information source and/or the information itself); ANSWER SET (QV), where QV = QVR ∪ QVN; PERTINENCE (P), P = ?; METHODS AND MECHANISMS FOR EFFECTIVE SEARCH (search technology)]

  6. Cognitive Function of the Question. Question: a thought or query expressed as an interrogative sentence. Answer: a realization of the cognitive function of the question as a newly obtained judgment. [Diagram labels: Cognitive Indeterminacy; KNOWN; Question; UNKNOWN; to develop the knowledge (to extend it, to produce new knowledge); to refine the knowledge; to supplement the knowledge]

  7. Process of Asking Questions and Searching for Answers. [Diagram labels: The Object and Subject of Research; Search Scope; Datum Question; Ask Question (Technology of Question Asking) → Question; Find Answer (Search Technology) → Answer; Set Question-Answer Adequacy by Conformity Rules (Technology of Conformity Setting)]

  8. RESEARCH LINES • Development of a method and mechanism for effectively searching for the set of relevant answers to questions. • Development of a technology for creating and supporting the catalog service of an information fund, so that answers to questions can be found efficiently. • Development of cataloguer workstation software for structuring the information fund.

  9. REALIZATION IDEAS OF RESEARCH LINES

  10. The basis of the method is a way of describing scientific and technical information by a set of logic-semantic networks Question-Answer-Reaction (LSN QAR). The basis of the search engine is: • movement along the LSN, controlled by the user; • choice of LSN nodes (questions or answers) based on an ontological model of the user's question. The basis of the technology is a way of describing the subject domain by a set of LSN QAR. The mechanism of the technology is the cataloguer's workstation (for the LSN QAR developer).
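The slides do not specify how the ontological model selects nodes, so the following is only a minimal sketch that swaps in a much simpler stand-in: ranking question nodes by term overlap (Jaccard similarity) with the user's question. The node ids, the sample questions and the function names are all hypothetical.

```python
import re

# Hypothetical question nodes of one LSN (not taken from the presentation).
QUESTIONS = {
    "Q1": "What is Java?",
    "Q2": "Who developed the Java platform?",
    "Q3": "Where is the island of Java?",
}


def terms(text):
    """Lower-cased alphabetic tokens of a sentence, as a set."""
    return set(re.findall(r"[a-z]+", text.lower()))


def rank_nodes(user_question, questions):
    """Order node ids by Jaccard similarity to the user question, best first.

    Term overlap is a stand-in for the ontological model named on the slide.
    """
    asked = terms(user_question)

    def score(node_id):
        node_terms = terms(questions[node_id])
        union = asked | node_terms
        return len(asked & node_terms) / len(union) if union else 0.0

    return sorted(questions, key=score, reverse=True)


ranking = rank_nodes("Where is Java island located?", QUESTIONS)
print(ranking)
```

A real ontological model would presumably match concepts rather than surface words; the sketch only shows where such a ranking would plug into node selection.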

  11. Formal Structure of Question, Answer, Reaction. The logical structure of the question (Q): QUESTION = {QUESTION THEME (QT), QUESTION CONTENT (QC), QUESTION VOLUME (QV)}. The logical structure of the answer (A): ANSWER = {ANSWER THEME (AT), ANSWER CONTENT (AC), ANSWER VOLUME (AV)}. The logical structure of the reaction (R): REACTION = {REACTION THEME (RT), REACTION CONTENT (RC), REACTION VOLUME (RV)}
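The three-part structure above maps naturally onto a small record type. The sketch below is only an illustration, assuming that theme, content and volume can each be held as plain text; the class name `LogicalUnit`, its field names and the sample volume values are hypothetical and do not come from the presentation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogicalUnit:
    """Theme / content / volume triple shared by Q, A and R (hypothetical type)."""

    kind: str     # "Q", "A" or "R"
    theme: str    # QT / AT / RT
    content: str  # QC / AC / RC
    volume: str   # QV / AV / RV; kept as free text here (an assumption)

    def __post_init__(self):
        # Reject anything that is not a question, an answer or a reaction.
        if self.kind not in ("Q", "A", "R"):
            raise ValueError(f"unknown kind: {self.kind!r}")


# Sample values; "definition" as a volume is invented for illustration.
question = LogicalUnit(kind="Q", theme="Java", content="What is Java?", volume="definition")
answer = LogicalUnit(
    kind="A",
    theme="Java",
    content="An object-oriented programming language developed by Sun Microsystems.",
    volume="definition",
)
```

Using one type for all three kinds reflects that the slide gives questions, answers and reactions the same theme/content/volume shape.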

  12. Logic-Semantic Network Question-Answer-Reaction. Logic-semantic network: a set of questions, answers and the relationships between them that forms a unified system. Question: a query expressed as an interrogative sentence and aimed at developing, refining or supplementing knowledge. Answer: a realization of the cognitive function of the question in the form of a newly obtained judgment. An answer must be built in accordance with the content and structure of the question asked; only then is it regarded as relevant. Reaction: a semantic description of a question or an answer. Types of reactions: 1. Question Reaction: a description of the datum question (to understand the environment and causes of the question and to establish semantic adequacy with the answer scope). 2. Answer Reaction: a description of the answer scope (to understand the question semantics and its relationship with the answer).

  13. Reaction Example (1). Logical unit Question-Answer-Reaction: Question 1 (Q1). What is JAVA? Question 1 Reaction 1 (QR11). Two pronunciation standards have formed: /ˈdʒɑːvə/, borrowed from English, and the traditional «Ява» (in Russian), which matches the traditional pronunciation of the name of the island of Java. Question 1 Reaction 2 (QR12). Java (Indonesian: Jawa) is an island of Indonesia with a population of 135 million. Area: 132,000 km²… Question 1 Reaction 3 (QR13). Slide show, photo collage with views of the island of Java.

  14. Reaction Example (2). Answer 1 to Question 1 (A11). Java is an object-oriented programming language developed by Sun Microsystems. Reaction 1 of Answer 1 to Question 1 (R1A11). Why is the language called JAVA? According to one version, the language got its name from the coffee grown on the island of the same name. As you know, this drink is hot, like some programmers. That is why a cup of steaming coffee appears on the logo.

  15. Reaction Example (3). Reaction 2 of Answer 1 to Question 1 (R2A11). Sun Microsystems, Inc. (now part of Oracle Corporation) is a U.S. company that produces software and hardware… Answer 2 to Question 1. Java is not only the language itself, but also a platform for developing and running applications based on this language.
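To make the worked example concrete, the logical unit from the three slides above can be written down as nested data. This is a sketch only: the dictionary layout, the key names and the `render_card` helper are hypothetical, and the texts are shortened paraphrases of the slide content.

```python
# Hypothetical layout for one logical unit Question-Answer-Reaction.
unit = {
    "question": "What is Java?",
    "question_reactions": {
        "QR11": "Pronunciation of the word",
        "QR12": "Java, the island of Indonesia",
        "QR13": "Slide show with views of the island",
    },
    "answers": [
        {
            "text": "An object-oriented programming language developed by Sun Microsystems.",
            "reactions": ["Why is the language called Java?", "About Sun Microsystems, Inc."],
        },
        {
            "text": "Also a platform for developing and running applications based on the language.",
            "reactions": [],
        },
    ],
}


def render_card(unit):
    """Return the unit as indented text lines, question first."""
    lines = [f"Q: {unit['question']}"]
    for label, text in unit["question_reactions"].items():
        lines.append(f"  {label}: {text}")
    for number, answer in enumerate(unit["answers"], start=1):
        lines.append(f"A{number}: {answer['text']}")
        for reaction in answer["reactions"]:
            lines.append(f"  R: {reaction}")
    return lines


card = render_card(unit)
print("\n".join(card))
```

Reactions hang off the question or off a specific answer, which mirrors the two reaction types defined earlier in the talk.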

  16. Graph LSN QAR [Figure: graph with nodes Q10, R10; A21, A22, A23, R21, R23; Q31, Q32, Q33, Q34; A41, A42, A43; numeric labels 1 to 17]
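A graph of this kind can be held as a plain adjacency mapping. In the sketch below the node names are taken from the figure, but the links between them are assumed for illustration, because the transcript does not preserve which nodes each link joins; the helper names are hypothetical.

```python
# Node names come from the figure; every link below is an assumption.
EDGES = [
    ("Q10", "R10"),
    ("Q10", "A21"), ("Q10", "A22"), ("Q10", "A23"),
    ("A21", "R21"), ("A23", "R23"),
    ("A21", "Q31"), ("A22", "Q32"), ("A22", "Q33"), ("A23", "Q34"),
    ("Q31", "A41"), ("Q32", "A42"), ("Q34", "A43"),
]


def build_graph(edges):
    """Build a directed adjacency mapping; leaf nodes get an empty list."""
    graph = {}
    for source, target in edges:
        graph.setdefault(source, []).append(target)
        graph.setdefault(target, [])
    return graph


def node_kind(name):
    """Classify a node by the first letter of its name (Q, A or R)."""
    return {"Q": "question", "A": "answer", "R": "reaction"}[name[0]]


graph = build_graph(EDGES)
# Answers directly reachable from the root question.
next_answers = [node for node in graph["Q10"] if node_kind(node) == "answer"]
print(next_answers)
```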

  17. Analysis Method of Scientific Texts. The expert studies the document in terms of: • semantic match between title and content; • a set of filters: Filter 1 (F1), General Part: analysis of the problem, its history, an overview, topicality. Filter 2 (F2), Author's Concept: new terms introduced by the authors, traditional terms in the author's interpretation, narrowing of semantics. Filter 3 (F3), Examples and Illustrations: used to clarify difficult places in the text and to reduce the text size under stringent volume restrictions. Filter 4 (F4), The Author's Idea: describes and explains the author's main idea; • text markup (formulation of the basic questions, answers and reactions).

  18. Navigation on LSN [Figure: the same graph with nodes Q10, R10; A21, A22, A23, R21, R23; Q31, Q32, Q33, Q34; A41, A42, A43; numeric labels 1 to 17]
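User-controlled movement along the network can be sketched as a walk in which a callback picks the next node at every step. This is a minimal illustration, not the system's actual navigator: the network fragment, its links and the function names are hypothetical, and a scripted chooser stands in for an interactive user.

```python
# Hypothetical fragment of an LSN; node names follow the figure, links are assumed.
LSN = {
    "Q10": ["R10", "A21", "A22"],
    "A21": ["R21", "Q31"],
    "A22": ["Q32"],
    "Q31": ["A41"],
    "Q32": ["A42"],
    "R10": [], "R21": [], "A41": [], "A42": [],
}


def navigate(lsn, start, choose):
    """Walk the network from `start`; `choose` picks the next node or None to stop."""
    path = [start]
    current = start
    while lsn[current]:
        selected = choose(current, lsn[current])
        if selected is None:
            break
        if selected not in lsn[current]:
            raise ValueError(f"{selected!r} is not reachable from {current!r}")
        path.append(selected)
        current = selected
    return path


def scripted(choices):
    """Stand-in for an interactive user: replays a fixed list of selections."""
    remaining = iter(choices)
    return lambda current, options: next(remaining, None)


path = navigate(LSN, "Q10", scripted(["A21", "Q31", "A41"]))
print(" -> ".join(path))
```

Keeping the chooser separate from the walk means the same loop could be driven by a UI, by a script, or by a ranking of candidate nodes.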

  19. Multilayer Related Set of Graphs

  20. List of Available Questions and Card of Selected Question (fragment) [Screenshot labels: Question Reaction; Answer Reaction 1; Question; Answer Reaction 2 ("This is interesting …"); Answer 1; Answer 2; Next Level Questions]

  21. Card of Question Reaction

  22. LSN + Visualization [Figure labels: Answers; Questions]

  23. Summary • It is proposed: • to create and support a "Catalog Service" for funds (corpora); • to create a Question-Answer Navigator that provides: - the ability to refine and deepen the understanding of the meaning of a question; - the ability to refine, deepen and expand knowledge, or to obtain new knowledge, while searching for the answer to a question. • Implementing such a "Catalog Service" and Navigator makes it possible to study DL content in the mode natural for a human, the question-answer mode: refinement, generalization and obtaining new knowledge. • The main challenge for the proposed question-answer system is maximal automation of creating and supporting the fund's service catalog.

  24. "Even the most foolish idea can be implemented masterfully." Leszek Kumor
