740 likes | 941 Views
Knowledge Management Systems: Development and Applications Part I: Overview and Related Fields. Hsinchun Chen, Ph.D. McClelland Professor, Director, Artificial Intelligence Lab The University of Arizona Founder, Knowledge Computing Corporation.
E N D
Knowledge Management Systems: Development and ApplicationsPart I: Overview and Related Fields Hsinchun Chen, Ph.D. McClelland Professor, Director, Artificial Intelligence Lab The University of Arizona Founder, Knowledge Computing Corporation Acknowledgement: NSF DLI1, DLI2, NSDL, DG, ITR, IDM, CSS, NIH/NLM, NCI, NIJ, CIA, DHS, NCSA, HP, SAP 美國亞歷桑那大學, 陳炘鈞博士
My Background: ( A Mixed Bag!) BS NCTU Management Science, 1981 MBA SUNY Buffalo Finance, MS, MIS Ph.D. NYU Information System, Minor: CS, 1989 Dissertation: “An AI Approach to the Design Of Online Information Retrieval Systems” (GEAC Online Cataloging System) Assistant/Associate/Full/Chair Professor, University of Arizona, MIS Department Scientific Counselor, National Library of Medicine USA), National Library of China, Academia Sinica
My Background: (A Mixed Bag!) Founder/Director, Artificial Intelligent Lab, 1990 Founder/Director, Hoffman eCommerce Lab, 2000 PIs: NSF CISE DLI-1 DLI-2, NSDL, DG, DARPA, NIJ, NIH, CIA, DHS Associate Editors: JASIST, DSS, ACM TOIS, IEEE SMC, IEEE ITS Conference/program Co-hairs: ICADL 1998-2004, China DL 2002/2004, NSF/NIJ ISI 2003-2006, JCDL 2004 Industry Consulting: HP, IBM, AT&T, SGI, Microsoft, SAP Founder, Knowledge Computing Corporation, 2000
Knowledge Management: Overview
Knowledge Management Overview What is Knowledge Management Data, Information, and Knowledge Why Knowledge Management? Knowledge Management Processes
Data: 1980s Factual Structured, numeric Oracle, Sybase, DB2 Information: 1990s Factual Yahoo!, Excalibur, Unstructured, textual Verity, Documentum Knowledge: 2000s Inferential, sensemaking, decision making Multimedia ??? Unit of Analysis
According to Alter (1996), Tobin (1996), and Beckman (1999): Data: Facts, images, or sounds (+interpretation+meaning =) Information: Formatted, filtered, and summarized data (+action+application =) Knowledge: Instincts, ideas, rules, and procedures that guide actions and decisions Data, Information and Knowledge:
Ontologies, hierarchies, and subject headings Knowledge management systems and practices: knowledge maps Digital libraries, search engines, web mining, text mining, data mining, CRM, eCommerce Semantic web, multilingual web, multimedia web, and wireless web Application and Societal Relevance :
The Third Wave of Net Evolution 2010 ARPANET Internet “SemanticWeb” Function Server Access Info Access Knowledge Access 1995 Unit Server File/Homepage Concepts 1975 2000 Example Email WWW: “World Wide Wait” Concept Protocols 1985 1965 Company IBM Microsoft/Netscape ???
Knowledge Management Definition “The system and managerial approach to collecting, processing, and organizing enterprise-specific knowledge assets for business functions and decision making.”
“… making high-value corporate information and knowledge easily available to support decision making at the lowest, broadest possible levels …” Personnel Turn-over Organizational Resistance Manual Top-down Knowledge Creation Information Overload Knowledge Management Challenges
Research Community NSF / DARPA / NASA, Digital Library Initiative I & II, NSDL ($120M) NSF, Digital Government Initiative ($60M) NSF, Knowledge Networking Initiative ($50M) NSF, Information Technology Research ($300M) Business Community Intellectual Capital, Corporate Memory, Knowledge Chain, Competitive Intelligence Knowledge Management Landscape
Enabling Technologies: Information Retrieval (Excalibur, Verity, Oracle Context) Electronic Document Management (Documentum, PC DOCS) Internet/Intranet (Yahoo!, Google) Groupware (Lotus Notes, MS Exchange) Consulting and System Integration: Best practices, human resources, organizational development, performance metrics, methodology, framework, ontology (Delphi, E&Y, Arthur Andersen, AMS, KPMG) Knowledge Management Foundations
Process perspective (management and behavior): consulting practices, methodology, best practices, e-learning, culture/reward, existing IT new information, old IT, new but manual process Information perspective (information and library sciences): content management, manual ontologies new information, manual process Knowledge Computing perspective (text mining, artificial intelligence): automated knowledge extraction, thesauri, knowledge maps new IT, new knowledge, automated process Knowledge Management Perspectives:
Human Resources Cultural Best Practices Databases Learning / Education ePortals Consulting Methodology Tech Foundation Infrastructure Content/Info Structure Content Mgmt Email KMS Analysis Ontology Notes User Modeling Search Engine Data/Text Mining Web Mining KM Perspectives
Influences from three disciplines: Management and Policy (40%), Computer Science (30%), Information/Library Science (20%) Continuous, steady growth since 1990: academic publications and industry articles; not a fad (unlike BPR, TQM) Seminal books and articles in Knowledge Management (e.g., Drucker, Davenport, Nonaka): the 50 most-cited KM articles KM, Emergence of a Discipline (Ponzi, 2004):
Future organizations are information-based organizations of knowledge workers; Specialization, cross-discipline task teams, disappearance of middle managers (Drucker, “The Coming of the New Organization”) The Japanese Management Style: Tacit knowledge, redundancy, slogans, metaphors; the “Ba”; the SECI Model – Socialization, Externalization, Combination, and Internalization (Nonaka, “The Knowledge-Creating Company) KM Thoughts and Thinkers:
Knowledge generation (acquisition, dedicated resources, fusion, adaptation, knowledge networking); Knowledge codification (mapping and modeling knowledge); Knowledge transfer; Technologies for KM; Learning from experiments (Davenport, “Working Knowledge”) Deep Smart: Seeing the big picture and knowing the skills; learning from experience (Leonard, “Deep Smart”) KM Thoughts and Thinkers: (cont’d)
Teaching smart people how to learn; Defensive reasoning and doom loop; Learning how to reason productively (Argyris, “Teaching Smart People How to Learn”) Technology gets in the way; Research on work practices; Harvesting local innovation and innovating with customer; PARC anthropologists (John Seely Brown, “Research that Reinvents the Corporation”) Inverting organizations (individual professionals leading); Creating intellectual webs (Quinn, “Managing Professional Intellect”) KM Thoughts and Thinkers: (cont’d)
Knowledge Management: The Industry and Status
Anderson Consulting (Accenture) (1) Acquire (2) Create (3) Synthesize (4) Share (5) Use to Achieve Organizational Goals (6) Environment Conducive to Knowledge Sharing
Ernst & Young (1) Knowledge Generation (2) Knowledge Representation (3) Knowledge Codification (4) Knowledge Application
Reason for Adopting KM Retain expertise of personnel 51.9% Increase customer satisfaction 43.1% Improve profits, grow revenues 37.5% Support e-business initiatives 24.7% Shorten product development cycles 23% Provide project workspace 11.7% Knowledge Management and IDC May 2001
Business Uses Of KM Initiative Capture and share best practices 77.7% Provide training, corporate learning 62.4% Manage customer relationships 58% Deliver competitive intelligence 55.7% Provide project workspace 31.4% Manage legal, intellectual property 31.4% Continue
Leader Of KM Initiative Knowledge Management and IDC May 2001
Implementation Challenges Employees have no time for KM 41% Current culture does not encourage sharing 36.6% Lack of understanding of KM and Benefits 29.5% Inability to measure financial benefits of KM 24.5% Lack of Skill in KM techniques 22.7% Organization’s processes are not designed for KM 22.2% Continue
Implementation Challenges Lack of funding for KM 21.8% Lack of incentives, rewards to share 19.9% Have not yet begun implementing KM 18.7% Lack of appropriate technology 17.4% Lack of commitment from senior management 13.9% No challenges encountered 4.3% Knowledge Management and IDC May 2001
Types of Software Purchased Messaging e-mail 44.7% Knowledge base, repository 40.7% Document management 39.2% Data warehousing 34.6% Groupware 33.1% Search engines 32.3% Continue
Types of Software Purchased Web-based training 23.8% Workflow 23.8% Enterprise information portal 23.2% Business rules management 11.6% Knowledge Management and IDC May 2001
Spending On IT Services For KM 15.3% Training 27.8% Consulting Planning 13.7% Maintenance 27% Implementation 15.3% Operations, outsourcing Knowledge Management and IDC May 2001
Software Budget Allotments Enterprise information portal 35.6% Document management 26.2% Groupware 24.4% Workflow 22.9% Data warehousing 19.3% Search engines 13.0% Continue
Software Budget Allotments Web-based training 11.4% Messaging e-mail 10.8% Other 29.2% Knowledge Management and IDC May 2001
Knowledge Management Systems: Overview
Knowledge Management Systems (KMS) Characteristics of KMS The Industry and the Market Major Vendors and Systems
Knowledge Management Systems Definition KMSs are computer-based information systems that: • can help an enterprise acquire, manage, retain, analyze, and retrieve mission-critical information; and help turn enterprise information into well-organized, abstract, and actionable knowledge; and • can help an enterprise identify and inter-connect experts, managers, and knowledge workers; and help extract, retain, and disseminate their knowledge in an organization.
KM Architecture (Source: GartnerGroup) Web UI Web Browser Knowledge Maps Enterprise Knowledge Architecture Knowledge Retrieval Conceptual Physical KR Functions Text and Database Drivers Application Index Database Indexes Text Indexes “Workgroup” Applications Databases Applications Distributed Object Models Intranet and Extranet Network Services Platform Services
KR Functions Knowledge Retrieval Level (Source: GartnerGroup) Concept “Yellow Pages” Retrieved Knowledge • Clustering — categorization “table of contents” • Semantic Networks “index” • Dictionaries • Thesauri • Linguistic analysis • Data extraction • Collaborative filters • Communities • Trusted advisor • Expert identification Semantic Value “Recommendation” Collaboration
Knowledge Retrieval Vendor Direction(Source: GartnerGroup) Market Target Newbies: IR Leaders: • grapeVINE • Sovereign Hill • CompassWare • Intraspect • KnowledgeX • WiseWire • Lycos • Autonomy • Perspecta • Verity • Fulcrum • Excalibur • Dataware Knowledge Retrieval NewBies IR Leaders Niche Players: • IDI • Oracle • Open Text • Folio • IBM • InText • PCDOCS • Documentum Lotus Netscape* Technology Innovation Microsoft Niche Players * Not yet marketed Content Experience
KM Software Vendors Challengers Leaders Lotus * Microsoft * Dataware * Autonomy* * Verity * IBM * Excalibur Ability to Execute Netscape * Documentum* PCDOCS/* Fulcrum IDI* Inference* OpenText* Lycos/InMagic* CompassWare* GrapeVINE* KnowledgeX* * InXight WiseWire* SovereignHill* Semio* *Intraspect Visionaries Niche Players Completeness of Vision
Structured Manual Human-driven Unstructured System-aided Data/Info-driven Two Approaches to Codify Knowledge Top-Down Approach Bottom-Up Approach
Sample KMS: • Search Engine and Web Portal • Data Mining • Text Mining • Web Mining
Managing Information: Search Engine and Web Portal (Source: Jan Peterson and William Chang, Excite)
Basic Architectures: Search Log 20M queries/day Spider Web SE Spam Index SE Browser SE Freshness 24x7 Quality results 800M pages?
Basic Architectures: Directory Url submission Surfing Ontology Web SE Browser SE SE Reviewed Urls
Web HTML data Hyperlinked Directed, disconnected graph Dynamic and static data Estimated 2 billion indexible pages Freshness How often are pages revisited? Spidering
Size from 50M to 150M to 3B urls 50 to 100% indexing overhead 200 to 400GB indices Representation Fields, meta-tags and content NLP: stemming? Indexing
Augmented Vector-space Ranked results with Boolean filtering Quality-based re-ranking Based on hyperlink data or user behavior Spam Manipulation of content to improve placement Search
Short expressions of information need 2.3 words on average Relevance overload is a key issue Users typically only view top results Search is a high volume business Yahoo! 50M queries/day Excite 30M queries/day Infoseek 15M queries/day Queries
Manual categorization and rating Labor intensive 20 to 50 editors High quality, but low coverage 200-500K urls Browsable ontology Open Directory is a distributed solution Directory