300 likes | 412 Views
How to survive the document & data tsunami?. Lambda Verdonckt Business Analyst TenForce. 1. We know how to handle large data , regardless of the technology used. Semantic Technology. 2. The only purpose-built technology, to survive a tsunami of doc and data. Semantic Technology. 3.
E N D
How to survive the document & data tsunami? Lambda Verdonckt Business Analyst TenForce
1 We know how to handle large data, regardless of the technology used.
Semantic Technology 2 The only purpose-built technology, to survive a tsunami of doc and data.
Semantic Technology 3 Leveraging information in old systems, no need to change current way of working.
Semantic Technology Turns the web of documents into a web of data. Turns the web as a virtual library into a virtual database. TenForce applies these technologies in corporate environments.
How to survive the document & data tsunami? Semantic Technology • State-of-the-art • Examples • Future
Semantic Technology The meaning of the data is encoded separately The only purpose-built technology for handling a tsunami of data, in a flexible way. data model (JohnDoe, type, Customer) (JohnDoe, owns, Account123) (Account123, type, BankingAccount) Account owns Customer type Person => ontology, thesaurus, taxonomy etc. Software understands the data and can reason about it
Semantic Technology Standards A set of standards & tools to work with large data sets
TenForce Semantic Offering Training Consultancy Projects Products Semantic Technology • Assesment • Architectures • Modeling • Validation • Standard compliancy • End-to-end projects • mixed teams • research projects • EU framework • Unique Training Offer • Introduction • Modeling • Programming and manyothers…
How to survive the document & data tsunami? Semantic Technology • State-of-the-art • Examples • Future
Semantic Technology Solutions The ‘semantic web’ is an application of semantic technology Corporate solutions built with semantic technology include: • Knowledge Bases • Automatic Categorization & Archiving • Natural Language Processing in documents • …
Semantic Technology SolutionsTenForce projects • Publications Office of the EU – a thesaurus of European activities • Wolters Kluwer Globally – building a multilingual publishing bus • DG Employment of the EC – a taxonomy of European Skills, Competences & Occupations
Semantic Technology SolutionsAdvanced examples • New York Times – automatic categorization & archiving with Linked Data • Amdocs – telecom solutions for pro-active decision support • Audi – modeling behaviour to make testing less error-prone
How to survive the document & data tsunami? Semantic Technology • State-of-the-art • Examples • Future
Industry Analysts Gartner: high benefit rating (2010) “ Semantic technologies offer … options that now are difficult or impossible “ HP: top 10 trend in BI (2010) “New approaches are needed, and semantic technologies hold part of the solution.”
A vision of the data web LOD2 – a European FP7 project • Build the infrastructure for the web of data • Opportunities & challenges for all of us!
Future We know the tsunami is coming, the question is – who will be ready to survive?
www.tenforce.com lambda.verdonckt@tenforce.com twitter.com/LambdaVerdonckt
Semantic Technology SolutionsKnowledge Bases • Knowledge is captured in a model, making the DB a KB • Allows to manage & share knowledge i.s.o. mere storage >50% of companies indicate the need to share stored knowledge (VALUE-IT) • Better & faster retrieval of information for decision support • Human-readable: typical CRM with search functionality Machine-readable: expert systems, incl. reasoning eg. clinical decision support • Rules are part of the data, i.s.o. hard-coded: more readily adaptable to changing needs, while interoperable with existing DB’s
Semantic Technology SolutionsAutomatic Categorization & Archiving Categorization based on controlled vocabularies (taxonomies, thesauri, ontologies) • makes content more searchable: better! • eliminates cost of labour-intensive processes: cheaper! vs. user-driven categorization & tagging (web 2.0) Remark: Look at Evrias an online example!
Semantic Technology SolutionsNatural Language Processing Software that analyzes the structure and meaning of textual information • analyze texts, • identify terms & concepts, • extract information, • understand meaning • Automatic categorization & archiving based on NLP Tools: Alchemy, OpenCalais, PoolParty
Wolters Kluwer Global Multilingual publishing system in a EU context for Legal, Tax & Regulatory TenForce
DG Employment of the EU Commission ESCO, a taxonomy of European Skills, Competences & Occupations TenForce
DG Employment of the EU Commission A Semantic Job Portal to leverage the information in ESCO and other information on the web TenForce
Advanced examplesPublishing New York Times • in-house developed vocabulary • automatic categorization & archiving • published as Linked Data (open to the world!) http://data.nytimes.com/
Advanced examplesTelecom RDF Amdocs Knowing why a customer is calling, saves 3’ per call (or € 0,30)! call center logs billing Pro-active decision support social fora ... advanced inference
Advanced examplesManufacturing Audi (Ontoprise) Testing electronic systems in cars using simulations • huge amounts of data are recorded • to be collected and analyzed • time-consuming & error-prone Need for a standardized way to describe • desired system behaviour • known error-cases Solution: ontology-driven & visualized