150 likes | 405 Views
Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both!. Doug Lenat Cycorp. Our content What we’d want a good host to provide
E N D
Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp • Our content • What we’d want a good host to provide • Given the other, funded, open ontology repository projects going on in the world (e.g. OKKAM), does it need one more?
Our Content • OpenCyc (www.opencyc.org) • The Cyc Ontology made 100% freely available (yes, 100% free even for commercial purposes) • Available for download on SourceForge • Over 30,000 “users” • ResearchCyc (researchcyc.cyc.com) • OpenCyc + millions of hand-engineered assertions • Free for R&D purposes • Current users: 300 research groups (1/2 academic)
What are people doing with it? • USAF 45th Space Wing: Decision Support • USNavy: Threat Scenario Detection • US Forest Service: Regulatory Compliance • LarKC: Large Knowledge Collider • Medical Research Center: Clinical Trial Cohort Selection • (doctors can now directly formulate complex FOPC • queries via interactive clarification dialogue; DBs) • Glaxo: semi-automatic ontology alignment • across multiple large domain-specific info sources
What’s in OpenCyc Explicitly: 300k terms; 14k predicates; 57k classes; 2 million assertions; infin. more nonatomic terms and inferred assertions • (#$isa 596215) • (#$genls 99198) • (#$disjointWith 6114) • (#$resultIsa 4277) • (#$resultGenl 1206) • (#$argIsa 35617 • (#$argGenl 5398) • (#$arg1Isa 16748) • (#$arg1Genl 2354) • (#$arg2Isa 14114 • (#$arg2Genl 2283) • (#$arg3Isa 3486) • (#$argFormat 5493) • (#$arg2Format 3320) • (#$functionalInArgs 1427) • (#$arity 16416) • (#$arityMin 958) • (#$comment 57305) • (#$genlPreds 7440) • (#$negationInverse 990) • (#$genlMt 26078) • (#$denotationInEnglish 409745) • (#$synonymousExternalConcept 13916)
Systems and Processes eventOccursAt resource conveyer energy source ‘lifetime’ of system providerOfMotiveForce resource synthesizer doneBy boundary transporter
Specializations Ecosystem agentInEcosystem FunctionalSystem Organization hasMembers componentInSystem Organism anatomicalParts EcologicalProcess AutocatalyticProcess Culture-Practice Metabolism
Ecosystem Classes Ecosystem Aquatic Life Zone Biome genls genls genls Desert Ecosystem Tropical Rainforest Ecosystem Chaparral Ecosystem Tundra Ecosystem Taiga Ecosystem Grassland Ecosystem
Eco- system Mediterranean Climate Cycle Chaparral Ecosystem genls climateOfEcosystemType hasClimateType terrainClimateType Geographical Region Mediterranean Scrub Territory Of Santa Barbara, CA genls
What We’d Want a Good Host to Provide • A commitment to use – to have contributors all provide content under – some Creative Commons license, as opposed to e.g. a GNU license • Retention of the provenance/lineage of contributed ontological content • Agreement on some of the most fundamental ontological relations • Agreement on a small set of inter-ontology alignment relations
Given the other, funded, open ontology repository projects going on in the world (e.g. OKKAM), does it need one more? • OKKAM is already a funded UE FP7 project (~$10M, 3-years) that started 2 months ago. Ontologizing individuals (including organizations such as the USArmy and IBM as individuals), providing a unique identifier and agreed-on set of properties for each individual • DBpedia extracted the content of fact boxes from Wikipedia + 35 open-source ontologies; KBpedia EU STREP ($3M) follow-on and will include true ontology-merging • Lots of other projects which other speakers in this panel will no doubt mention
…and, coming to a lab near you in Feb 2008… The Large Knowledge Collider
massive distributed incomplete reasoning zillions of assertions