120 likes | 334 Views
A Methodology for Developing a Taxonomy – A Subject Oriented Approach. Presented to:. International Symposium on Ontology-Metamodeling State Key Laboratory of Software Engineering, Wuhan University.
E N D
A Methodology for Developing a Taxonomy – A Subject Oriented Approach Presented to: International Symposium on Ontology-Metamodeling State Key Laboratory of Software Engineering, Wuhan University Richard Jordan, Computer Specialist, Office of the Chief Information Officer, Federal Aviation Administration United States of America (USA) With contributions from Kirk Lutz, IBM Corporation By: Date: March 2006
Objectives/Outline • Our Context: Introduction & Drivers • Deriving an FAA Taxonomy • Functional Taxonomies
1. Taxonomy for Organizations: Introduction & Drivers • Providing a classification of information stored in many different forms – relational data, documents, digital assets, XML, web pages, web services, discussion groups, etc. • By tagging such assets with relevant terms from the taxonomy, we enable search and retrieval of those information assets • Getting users to the content they need – quickly • Taxonomies are: • often hierarchical, sometimes a network structure • Used often for web content management • Considered important for having “semantic” web capabilities
Strategic Drivers and Context for Taxonomies in Government • E-Government – making government more accessible to citizens through the Internet and automated capabilities • Enterprise Content and Data management – • Growing needs for these capabilities including metadata management and effective access to data & web resources • Often viewed as separate disciplines • Data Sharing is a driver – part of data management • Making the Internet a better resource – “The Semantic Web”
USA - Federal Data Reference Model (DRM) • One of the Reference Models making up the framework for the Federal Enterprise Architecture (FEA) • DRM Version 2 – three parts: • Data Description – entities, attributes, and relationships • Data Sharing - Information Exchange Packages • Data Context – Taxonomy, Ontology, Classification • Data Context part calls for U.S. government agencies to have a method, such as a taxonomy or ontology, to enable its customers to search for and retrieve information
2. Deriving an FAA Taxonomy • Corporate Data Architecture: A model of the data objects that are relevant to an enterprise, their relationship to each other, and the principles and guidelines governing their design and evolution over time. • Scope: FAA-wide • A part of the FAA Enterprise Architecture • In Entity-Relationship format
Methodology • Form a logical subject area centered on a kernel entity • A kernel entity represents a business object that stands alone and is not dependent on any other entity • Examples: Flight Event, Person, & Course • Each subject area is named for a kernel entity (based on information engineering methodology) • Some subtype entities under a kernel entity are so complex, we separate them into their own sub-subject area • The subject areas make up a logical data model • Collect similar subject areas into higher level subject areas where needed • Example: Parties is a higher level subject area that encompasses Person and Organization • Iterate top-down and bottom-up to complete the analysis • This represents a subject-oriented (data centric) hierarchical taxonomy of an organization. • Some data instances are categorized using valid values for reference entities (for example, an instance of Aircraft Type is glider, balloon, blimp/dirigible, fixed wing single engine, rotorcraft, etc.)
Portions of FAA Taxonomy • Parties • Organizations • Organization Positions • Persons • Events • Flight Events • Flight Plan Events • Weather Observations
This Kind of Taxonomy • In our subject oriented taxonomy, the terms are: • Complete – to the extent that logical data modeling is complete, then the taxonomy is complete • Either non-redundant or are subtype of higher terms • Consistent kinds of terms – all nouns – may facilitate end user usage and hit rate • This kind of taxonomy: • Not especially designed for web search, retrieval or navigation • Provides completeness • Enables metadata management including data classification • Aliases can augment this kind of taxonomy
3. Functionally Oriented Taxonomies • Many taxonomies for accessing web resources use functional terms (rather than nouns or entities) to approximate the purpose or need of end users • These are often service oriented and “citizen-centric” – examples • Finding a national park with swimming • Applying for a pilot’s license • Finding known pollution sites near my address • U.S. government is calling for use of the process part of its Federal Enterprise Architecture (called the Business Reference Model or BRM) to be used by federal agencies as a taxonomy
USA’s Business Reference Model (BRM) • Organized into 3 tiers • Business Area • Line of Business • Sub-function • Example: • Business Area: Transportation • Line of Business: Air Transportation • Sub-function: Air Traffic Control
Functional vs. Subject Oriented Taxonomies • Data should be defined in a stand alone fashion, independent of function, in order for it to be useful to multiple functions or purposes • Having multiple taxonomies is acceptable but: • Functional taxonomies should not lead to defining or structuring the data in a functional way • Each taxonomy that an organization creates must be maintained – including any necessary mapping - overhead • Subject-oriented taxonomies offer: • Potential for completeness • Stability