1 / 0

Build DoD Vocabularies in the Cloud

Build DoD Vocabularies in the Cloud. 3 rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr . Brand Niemann, Director and Senior Data Scientist, Semantic Community July 13 th Competency Track - 11:55am-12:30pm July 13-14, 2011

mitch
Download Presentation

Build DoD Vocabularies in the Cloud

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Build DoD Vocabularies in the Cloud

    3rd Annual SOA & Semantic Technology Symposium: Interoperable Business Operations Through Shared Understanding Dr. Brand Niemann, Director and Senior Data Scientist, Semantic Community July 13th Competency Track - 11:55am-12:30pm July 13-14, 2011 Waterford, Springfield, Virginia
  2. Semantic Community So far in 2011, Semantic Community has built Knowledge-Centric Systems in the Cloud for: Data Science and Journalism: Data.gov and Federal Computer Week, Ongoing Since January 2011. 1105 Government Information Group/FOSE Institute’s KM 2011 Conference, May 4, 2011, and Geospatial Summit, September 13, 2011. AOL Government “Show Me The Data” Due to Launch July 11, 2011. The Open Group’s TOGAF and UDEF: The Open Group San Diego Conference, February 7, 2011. The Open Group London Conference, May 11, 2011. Semantic Interoperability: Keynote at SEMIC.EU Annual Conference, May 18, 2011. Conference Presentation at SemTech 2011, June 7, 2011. Federal Data Architecture Subcommittee, June 9, 2011. “Big Health Data”: One of the Top Submissions for HealthyPeople.gov Challenge, March 14, 2011. Finalist in the Health Data Initiative Forum, June 9, 2011. DoD: RFI for Data Analysis and Collaboration Tool to Support the DoD OIG, June 28, 2011. 3rd Annual DoD SOA and Semantic Technology Symposium, July 13, 2011. This presentation will show examples from simple (e.g. Air Force One Source) to complex (DOD Office of the Inspector General) DoD Vocabularies.
  3. Take-Home Message Competency: Creating Competency for Shared Understanding and Interoperable Business Operations. This track focuses on the development of knowledge and skills for SOA & Semantic projects, the handling of organizational change management, and the governance needed for and associated with such projects and initiatives. Semantic Community Knowledge-Centric Systems: We take the data (and metadata) directly to information modeling and mashup tools where we then can apply stronger semantic analytics tools. We keep the data (structured and unstructured) and metadata (ontology) together in the knowledgebase in cloud computing tools. We use effective standards-based approaches for real-world case studies. This presentation could also be in the other two tracks!
  4. Abstract Several DoD vocabularies have been harvested into the cloud computing tools used by the author to produce data science products. Those are Air Force OneSource and the DoD Common Vocabulary with two vocabularies, one for the HR community and one for UCORE-SL. The purpose of the Semantic Community’s data science products are to show when/where it is practical to insert semantic technologies in support of cross-domain process and analysis, and the value/ease of using other more mature technologies for certain tasks. The practical boundaries we have found supporting data fusion and analysis for information sharing, and when in the process to maximize the value from applying semantic technologies, are discussed. Note: Credit due to Robert Damashek for suggesting this topic to me.
  5. Bio Dr. Brand Niemann is the Director and Senior Data Scientist of the Semantic Community. He was the former Senior Enterprise Architect and Data Scientist at the U.S. Environmental Protection Agency and co-led the Federal CIO Council’s Semantic Interoperability Community of Practice (SICOP) with Mills Davis from 2003-2008. He is currently authoring a series of Editorials for Federal Computer Week on his work and recently made Spotfire's Twitter list for his cool visualizations on government data to produce more transparent, open and collaborative business analytics applications. http://semanticommunity.info/A_Gov_2.0_spin_on_archiving_2.0_data http://spotfireblog.tibco.com/?p=5328 He is working as a data journalist for AOL Government due to launch July 11th. http://semanticommunity.info/AOL_Government He is also helping organize the 12th SOA for eGov Conference, October 11th. http://semanticommunity.info/Federal_SOA
  6. Overview 1. Introductions 2. Background 3. Semantic Community Apps 4. DoD Common Vocabulary 5. Data Analysis and Collaboration Tool to Support the DoD OIG 6. Questions and Answers 7. Supplemental Slides Recreating Other People’s App the Semantic Community Way!
  7. 1. Introductions Purpose: To help build your professional network especially for doing SOA and Semantic Technologies going forward. Source: http://en.wikipedia.org/wiki/Five_Ws
  8. 2. Background My Experience with “Handling of organizational change management, and the governance needed for and associated with such projects and initiatives”: I tried to change EPA from the inside (1980-1996). I served a detail to the Department of Interior where I was able to start a new organization (1997-2001). I tried to change the Federal Government in my Federal CIO Council (2002-2008) Roles. I also tried to change EPA from the outside at the same time. I am now enjoying being free to do what I think is best to support the Semantic Web/Linked Open Data and Semantic Technologies, but in an easier and simpler way!
  9. 2. Background Federal Semantic Interoperability Community of Practice (SICoP) 2003-2008: Five Annual Conferences and Four Special Conferences. Federal SOA Community of Practices (SOA CoP) 2006-Present: Eleven Semi-Annual Conferences. 12th October 11th . Only Special Recognition for Outstanding Contributions to Both SICoP and SOA CoP: ArunMajumdar, Cutter Consortium/VivoMind Intelligence for Operationalizing SOA-Lessons Learned (Take Home Message: Multi-Level Model-Driven Architecture & First Order Logic). Now from the pilots at these conference come powerful new semantic analytics tools like VivoMind'sTextrium and PrologIKS and Semantic Insights Research Assistant (SIRA) that can be used to mine content to produce data science products that support data journalism!
  10. 2. Background My Experience with “development of knowledge and skills for SOA & Semantic projects”.
  11. 2. Background Integrate and then automatically semantify with sitemap.org and schema.org, respectively, in the MindTouch Technical Communication Suite. See Build SemTech 2011 and SemWeb.com Archives in the Cloud. “My Triple” is: Web (Hendler) vs Data (Fox) Science, Metadata (Hendler) vsMetadata+Data (Niemann), and Little (Hendler) vs Stronger (Sowa) Semantics: (Subject) Data Science (Predicate) is (Object) Metadata+Data and Stronger Semantics. Our Next Events will Focus on Building Knowledge-Centric Systems for “Big Data”. See Build Health Data Infrastructure and Apps in the Cloud for the recent Health Data Initiative Forum.
  12. 2. Background Ontology 101: An Introduction to Knowledge Representation, the Web Ontology Language (OWL)& Ontology Development, Elisa Kendall, Thematix, and Deborah McGuinness, RPI/McGuinness Associates. SemTech 2011, June 6, 2011. Knowledge bases, databases, & ontology An ontology is a conceptual model of some aspect of a particular universe of discourse (or a domain of discourse). Typically ontologies contain only "rarified" or "special" individuals, metadata, representing elemental concepts critical to the domain. A knowledge base is a persistent repository for: Ontology & metadata representing individuals, facts, & rules about how they can be combined or relate to one another. Metadata, facts & rules only - in some applications and frameworks the ontology is separately maintained. Most inference engines require in-memory deductive databases for efficient reasoning (including commercially available reasoners). A knowledge base may be implemented in a physical, external databases, such as a relational database, but reasoning is typically done on a subset (partition) of that knowledge base in memory.
  13. 2. Background Detailed Roadmap: Reuse Content: Text from Desktop, CMS, Web, etc. Tables from Excel, RDBs, etc. “Big Data” from Google, Cray, etc. Exploit Expertise and Training: SemTech 2011 Interviews, Sponsors, Exhibitors, Vendors, Participants, & Presentations. Reproduce Other’s Demos: New York Data Mine, Library of Congress, DoD, Top Quadrant, Cambridge Semantics, Revelytx, etc. Work in the Semantic Community Data Science Product Way: Started with NCOIC Semantic Wiki (Semantic Media Wiki, Visual Knowledge, and MindTouch): http://semanticommunity.info/Network_Centricity/Semantic_Wikis Do the data applications while the semantic technologies are maturing then semantifywith SIRA, Be Informed, etc. Data Integration with Linked Open Data and Semantics (Five Star Government Data) instead of Semantic Data Integration (Jim Hendler, George Thomas, etc.).
  14. 2. Background Detailed Roadmap (continued): Build Knowledge-Centric Systems: The concepts, definitions, and instances are together: Typically instances are not in the ontology, but in the knowledgebase-We put them together - see My EPA Ontology with the sitemap and the schema to support search and inference. Information Architecture, then Data Architecture: Text as is, give structure, or extract entities. Tables with URLs and key fields for joins. Other Resources with URLs.
  15. 2. Background The Ontology and Knowledgebase are Together in a Knowledge-Centric System! http://semanticommunity.info/EPA/EPA_Ontology
  16. 2. Background Recent Example from SemTech 2011 Sounds Like the Semantic Community Way: How Hollywood Learned to Love the Semantic Web, Chris Testa, Ad.ly, Inc.: This 5 step process to integrating linked data sets you up to do long term advanced Semantic Analysis with Reasoning, Machine Learning, and so much more: Understand what your linked “things” are, Choose a linked dataset, Reconcile your things, Build business intelligence, and Feedback and maintenance. http://www.slideshare.net/testac/how-hollywood-learned-to-love-the-semantic-web
  17. 2. Background Another Good Lesson from SemTech 2011 That Supports the Semantic Community Way: When Good Semantic Data and Tools Turn Bad, Baden Hughes, Technology Strategist, Researcher, Entrepreneur and Investor: Much of the value proposition of semantic technology centers on the idea that data is more accessible, interlinked and open for re-use, and that as a result, gains can be made through increased information leverage, better informed decision making, lower operational latency etc. However, with the rapid emergence of large public or semi-public data collections, and the increased prevalence of semantic machinery by which this data can be collected, integrated and analyzed, there is also the risk that the very same semantic data and technologies can be applied for other less desirable purposes. This presentation will consider a number of case studies where availability of enterprise and personal information in the public or semi-public web, combined with freely available semantic analytics tools can be used to determine ‘interesting’ facts about organizations and individuals, at a level of confidence that will be alarming. I Agree! http://semtech2011.semanticweb.com/sessionPop.cfm?confid=62&proposalid=3807
  18. 3. Semantic Community Apps Some Best Practice Examples of Semantic Interoperability Interfaces* Key: See next slide for Key. *The term "interoperable interface" comes from the recent Report to the President and Congress "Designing a Digital Future: Federally Funded Research and Development in Networking and Information Technology", Executive Office of the President and the President's Council of Advisors on Science and Technology, December 2010 (see excerpts in the wiki).  Source: http://semanticommunity.info/Build_SEMIC.EU_in_the_Cloud
  19. 3. Semantic Community Apps Key: http://usa.gov http://semanticommunity.net http://semanticommunity.info/2010_Annual_Statistical_Abstract http://semanticommunity.info/EPA/EPA_Ontology No longer operational – see http://www.sdi.gov The Open Group Architecture Framework - http://pubs.opengroup.org/architecture/togaf9-doc/arch/index.html http://eaprincipals.com/index.htm http://semanticommunity.info/Build_TOGAF_in_the_Cloud http://semanticommunity.info/Build_TOGAF_in_the_Cloud#Alternative_enterprise_architecture_frameworks Semantic Interoperability Centre Europe http://www.semic.eu/ http://ec.europa.eu/eurostat http://eea.europa.eu http://semanticommunity.info/Data.gov/An_Open_Data_Public_Dataset_Catalogs_Faceted_Browser http://semanticommunity.info/Build_SEMIC.EU_in_the_Cloud
  20. 3. Semantic Community Apps A Best Practice Examples of Semantic Interoperability Interfaces Everything within four clicks: Click 1-See your data Click 2: Sort/Search your data Click 3: Download your data Click 4: Share your data (e.g. iPad) Spotfire Web Player
  21. 3. Semantic Community Apps Participants in This Conference Semantic Community Apps Build EPA in the Cloud Build Data.gov in the Cloud Build NIEM and UCORE Semantic Layer in the Cloud Build the Federal Data Architecture Subcommittee in the Cloud Build FAA NextGen SWIM in the Cloud Build Alion Science in the Cloud Build Air Force One Source in the Cloud Build UDEF in the Cloud Build TOGAF in the Cloud Build SEMIC.EU in the Cloud Etc. DoD-SemTech 2011 Oracle-SemTech 2011 Binary-Robert Damashek gave me the idea for this! Spry-SemTech 2011 iTKO-SOA for e-Gov at MITRE IBM-SemTech 2011 & SOA for e-Gov at MITRE ManTech- ASC-SemTech 2011 BBN-SemTech 2011 BMA-?? (DoD??) SRA-?? HHS-SemTech 2011 (Supplemental Slides) SPAWAR-July 26-28 Conference AMDOCS-SemTech 2011 Alion-Example App Expert Systems-SemTech 2011 IMSC-SemTech 2011 Modus 21-?? JPDO-Example App DovelTech- IDA-SemTech 2011 ORBIS-SemTech 2011 MITRE-SemTech 2011 NATO-NCOIC SOA Systems-Previous Conferences Give us your data and we will reproduce your app!
  22. 3. Semantic Community Apps Spotfire: About 100 Semantic Community: About 100 http://semanticommunity.info/Build_DoD_Vocabularies_in_the_Cloud/Proposal_Demo#Step_5._Inventory_and_Visualization_of_Spotfire_Assets
  23. 3. Semantic Community Apps DEFENSE INTELLIGENCE AGENCY SELECTS TIBCO SPOTFIRE ANALYTICS SOLUTIONS FOR DEPARTMENT OF DEFENSE INTELLIGENCE INFORMATION SYSTEM COMMUNITY DIA/DoDIISAcquires the Power of Spotfire Analytics and Data Visualization Capabilities Somerville, Mass., May 12, 2011 – TIBCO Software Inc. (NASDAQ: TIBX) today announced that the Defense Intelligence Agency (DIA) has selected the TIBCO Spotfire® Analytics Platform for the Department of Defense Intelligence Information System (DoDIIS). By leveraging predictive analytics of Spotfire, users like the DIA will gain the power to make quick, real-time decisions that will in turn give them an advantage over others. With more than 16,500 military and civilian employees worldwide, DIA is a major producer and manager of foreign military intelligence. Within the Department of Defense and the Intelligence Community, DIA provides military intelligence to personnel in the field, defense policymakers, and force planners in support of U.S. military planning and operations, and weapon systems acquisition. “Years ago, the Abrams tank marked a new technological era on the warfront but tomorrow’s battlefield will be fought in the information realm,” said Rock Gnatovich, senior vice president and chief operating officer for Spotfire, TIBCO Software Inc. “Achieving the concept of ‘the two-second advantage’ – or the ability to capture the right information at the right time and act on it preemptively for a competitive advantage – is absolutely critical to the time-sensitive nature of today’s global intelligence environment.” http://spotfire.tibco.com/about-spotfire/news-room/press-releases/2011/05_12_11-Defense-Intelligence-Agency.aspx
  24. 3. Semantic Community Apps Smart Mapping: Automatic Creation of Information Models: Spotfire3.3 Information Services users can automatically generate 1-to-1 mappings of the existing tables and columns in their Data Sources. Just generate a Data Source in Spotfire, then right click it and select “Create Default Information Model…” This helps a lot when the work has already been done to nicely model and expose tables for business applications such as Spotfire, so the mapping step is more about transparency than transformation. For example, if you use Spotfire Application Data Services, you do the work in ADS to expose Spotfire-ready tables and columns, so a simple transparent mapping of those elements through Spotfire Information Services can now be accomplished in one click. Note that the automated creation will work through nested levels of data objects in the data source you supply. The result is a folder structure that matches the catalogs, schemas etc. that were selected with a column element for each column and an information link for each table containing those column elements. Procedures will get a procedure element and an information link of their own if they return data. See next slide. http://semanticommunity.info/@api/deki/files/10975/=Whats_New_in_Spotfire_3.3.pdf
  25. 3. Semantic Community Apps
  26. 3. Semantic Community Apps The Open Group San Diego Conference, February 7-11, 2011 - Enabling Semantic Interoperability Through Next Generation UDEF (joint session with IEEE), Monday, February 7, 6:30 - 9:00 pm: The Open Group's Universal Data Element Framework (UDEF) standard provides a framework for categorizing, naming, and indexing enterprise data elements. The current version of UDEF is based on an international data management standard (ISO/IEC 11179) and supports enterprise architecture frameworks such as The Open Group Architecture Framework (TOGAF), as well as, recent architecture approaches such as Service Oriented Architecture (SOA) and Cloud Computing. The UDEF standard is also consistent with the "Data-Information-Knowledge-Wisdom (DIKW)" knowledge hierarchy model and related DIKW-oriented data management standards (e.g. TechAmerica/ANSI GEIA-859-2009 and DMG-1, DAMA Data-Management Book-of-Knowledge). UDEF has also made considerable progress as an emerging international standard for providing element level semantic interoperability. For example, the UDEF standard currently provides multi-language support for 11179 compliant object and property UDEF taxonomies (English, Dutch, and French) with additional language support in progress (Chinese, German, Spanish). Based on the lessons learned from the development and use of the current version of UDEF, the Semantic Interoperability Working Group (SIWG) within The Open Group has begun to explore how to address next generation semantic interoperability needs across different existing information exchange standards, vocabularies, and web resources (e.g. UCore 2.0, DISA Metadata Registry, Air Force Vocab OneSource). The briefing will primarily focus on initial progress of this SIWG effort. In addition to highlighting the goals and objectives for a "Next Generation UDEF" capability, the briefing will also highlight the need, benefits, and potential payoff for a variety of applications, such as rapidly deployable sensor-networks, inter-operable unmanned sensor systems, and mobile platforms (e.g. robotics). (NOTE: BOLDING IS MINE) https://www.opengroup.org/sandiego2011/udef-ieee-joint-session.htm
  27. 3. Semantic Community Apps http://semanticommunity.info/UDEF
  28. 3. Semantic Community Apps I did not find the triple store! http://semanticommunity.info/Air_Force_OneSource
  29. 3. Semantic Community Apps http://semanticommunity.info/Universal_Core_Semantic_Layer
  30. 4. DoD Common Vocabulary Sample Content Included in Next Section The mission of the Enterprise Information Web (EIW) project is to create an extensible analytical capability built on top of a federation of information systems across the Department of Defense and provide information visibility and access: Archives: All wikis and vocabularies relevant to the HR EIW project. Business Process Area: Semantic models for the HRM Domain. CHRIS Reference Ontology: ?. Retirements and Separations: DIMHRS Ontology. HR Analytics: Queries the HR Domain Ontology. HR Domain Ontology: Central Knowledgebasefor Concepts and Terminology within the DoD HR Domain. Knowledge Center: EIW Training Materials ODSE Sample Database: Multiple Vocabularies. Ontology Repository: An important contribution in the overall goal of data integration across the HR domain. https://www.commonvocabulary.army.mil/ui/groups/HR_EIW
  31. 5. Data Analysis and Collaboration Tool to Support the DoD OIG The mission of the Department of Defense, Office of the Inspector General (DOD OIG) is to promote integrity, accountability, and improvement of Department of Defense personnel, programs, and operations to support the Department’s mission and serve the public interest.  Each goal of the DOD OIG requires personnel to perform analysis using structured and unstructured data, both government and non-government sources, and in a wide variety of file formats.  Personnel and data sources are spread throughout the globe, requiring teams to acquire data in a remote access storage system for use.  Personnel access analysis tools remotely using laptops running Windows XP (SP3) with dual core processors, 3GB RAM, and 50GB memory. The DOD OIG has recognized a need to improve the efficiency and effectiveness of how data is ingested, shared and analyzed across the organization.  As well as the need to explore advanced analysis capabilities to better assist personnel in identifying fraud, waste, and abuse in the Department. Note: Bolding is mine. http://semanticommunity.info/Build_DoD_Vocabularies_in_the_Cloud/Proposal_Demo#BACKGROUND
  32. 5. Data Analysis and Collaboration Tool to Support the DoD OIG Insert for Report to Congress 2011. http://semanticommunity.info/@api/deki/files/12766/=SAR_Insert_MAR_11.pdf
  33. 5. Data Analysis and Collaboration Tool to Support the DoD OIG Semantic Community Workflow: 5.1 Information Architecture of Public Web Pages in Spreadsheets as Linked Open Data. 5.2 Public Reports (Web and PDF) in Wiki as Linked Open Data. 5.3 Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format. 5.4 Spreadsheets in Spotfire as Linked Open Data. 5.5 Spreadsheets in Semantic Insights Research Assistant for Semantic Search, Report Writing, and Ontology Development.
  34. 5. Data Analysis and Collaboration Tool to Support the DoD OIG http://www.dodig.mil/
  35. 5. Data Analysis and Collaboration Tool to Support the DoD OIG 5.1 Information Architecture of Public Web Pages in Spreadsheets as Linked Open Data. Tabs (12): Cover Page Press Room Publications 2011 DoD IG Appendices A, F, & I Report to Congress Statistical Highlights Table 3.1 & Figures 3.1 & 3.2 http://semanticommunity.info/@api/deki/files/12769/=DoDOIG.xlsx
  36. 5. Data Analysis and Collaboration Tool to Support the DoD OIG MindTouch makes the world's most respected social knowledge base. They power purpose-built help 2.0 communities that connect companies with their customers. Millions use their software every day. Many of the world's most respected brands rely on MindTouch including NASA, SAIC, Booz Allen, Microsoft, Cisco, Washington Post, Viacom, the New York Times, AXA, Timberland and HCA. Innovative companies like RightScale, ExactTarget and Mozilla have standardized on MindTouch for their documentation strategy. The open source .NET Web Oriented Architecture Framework (WOAF) is redefining how enterprise software is built. MindTouch is a recognized expert in both open source and Enterprise 2.0 technologies. The MindTouch Productivity Tools bridge Microsoft office and your desktop for all Windows applications. Have your users continue to work with the applications they're familiar with, instead of forcing them to learn a new tool with our document management solution. With the MindTouch Desktop Suite, you'll save time and money by not having to train users on a new system. http://www.mindtouch.com/
  37. 5. Data Analysis and Collaboration Tool to Support the DoD OIG 5.2 Public Reports (Web and PDF) in Wiki as Linked Open Data. http://semanticommunity.info/Build_DoD_Vocabularies_in_the_Cloud/2011_DOD_IG_Semiannual_Report_to_Congress
  38. 5. Data Analysis and Collaboration Tool to Support the DoD OIG 5.3 Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format. http://www.mindtouch.com/add-ons/desktop_suite?product-refer=desktop-suite
  39. 5. Data Analysis and Collaboration Tool to Support the DoD OIG 5.3 Desktop and Network Databases in Wiki and Spreadsheets in Linked Open Data Format. http://semanticommunity.info/Build_TOGAF_in_the_Cloud/TOGAF_9_Course_Directory
  40. 5. Data Analysis and Collaboration Tool to Support the DoD OIG http://help.mindtouch.us/MindTouch_Pro_Member_Guide/MindTouch_Desktop_Suite
  41. 5. Data Analysis and Collaboration Tool to Support the DoD OIG 5.4 Spreadsheets in Spotfire as Linked Open Data. Spreadsheets in Spotfire as Linked Open Data. PC Desktop Spotfire
  42. 5. Data Analysis and Collaboration Tool to Support the DoD OIG 5.4 Spreadsheets in Spotfire as Linked Open Data. Recall Section 4. DoD Common Vocabulary SpotfireWeb Player
  43. 5. Data Analysis and Collaboration Tool to Support the DoD OIG . 5.5 Spreadsheets in Semantic Insights Research Assistant for Semantic Search, Report Writing, and Ontology Development. http://www.semanticinsights.com/company/presentation.htm
  44. 5. Data Analysis and Collaboration Tool to Support the DoD OIG SIRA can be used to find similarity between current and past events that are expressed or hinted at in text. SIRA can be used to find relationships of people, places, things and activities that may be expressed or hinted at in text. http://www.semanticinsights.com/company/SI%20Fact%20Sheet.pdf
  45. 6. Questions and Answers Sound Byte: Bring the data and the metadata back together and do the data science first to accomplish a business need and lay a solid foundation for integration and application of semantic technologies. Questions about the steps I followed? Questions about the results I produced? See Supplemental Slides for the Data Science Approach to Semantic Web/Technology Pilots.
  46. 7. Supplemental Slides 7.1 Semantic Technology Training: Building Knowledge-Centric Systems KM 2011 SemTech 2011 7.2 W3C Government Linked Data Working Group Clinical Quality Linked Data on Health.data.gov Build Clinical Quality Linked Data on Health.data.gov in the Cloud Hospital Compare Downloadable Database Example of "5 Star Government Data“ 7.3 Library of Congress Project Recollection and Digital Preservation Initiative 7.4 Elsevier/TetherlessWorld Health and Life Sciences Hackathon (27-28 June 2011) Build TWC in the Cloud Build NCI CLASS in the Cloud Build the NYC Data Mine Health in the Cloud Build SciVerse Apps in the Cloud (IN PROCESS) 7.5 Be Informed (IN PROCESS)
  47. 7.1 Semantic Technology Training: Building Knowledge-Centric Systems http://semanticommunity.info/FOSE_Institute/Knowledge_Management
  48. 7.1 Semantic Technology Training: Building Knowledge-Centric Systems http://semanticommunity.info/Semantic_Technology_Conferences
  49. 7.2 W3C Government Linked Data Working Group The mission of the Government Linked Data (GLD) Working Group is to provide standards and other information which help governments around the world publish their data as effective and usable Linked Data using Semantic Web technologies. This group will develop standards-track documents and maintain a community website in order to help governments at all levels (from small towns to nations) share their data as high quality ("five-star") linked data. The Working Group will construct and maintain an online directory of the government linked data community. "Cookbook" Advice Site The group will produce Best Practices for Publishing Linked Data. The group will develop Standard Vocabularies. First Face-to-Face Meeting, June 29-30th, NSF, Arlington, VA. http://www.w3.org/2011/gld/charter
  50. 7.2 Open Public Dataset Catalogs Faceted Browser http://semanticommunity.info/Data.gov/An_Open_Data_Public_Dataset_Catalogs_Faceted_Browser
  51. 7.2 Linked Data Cookbook Linked Data is an evolving set of techniques for publishing and consuming data on the Web. Learn how Linked Data can turn the Web into a distributed database and how you can participate. In this session, Bernadette Hyland takes the mystery out of Linked Data by summarizing seven steps to prepare your data sets as Linked Data and announce it so others will use it. Model without context: There is a Process: Identify, Model, Name, Describe, Convert, Publish, and Maintain. I Disagree! Participants will understand the actual steps to produce high quality, useful data sets that can be modeled, transformed, documented and available on the Linked Data cloud. We'll discuss a recent government agency that did just this in less than 12 weeks. Best practices for data publishing as well as the "social contract" one makes as a publisher will be discussed. Better to make progress with something rather than do nothing because we cannot be comprehensive and complete. I Disagree! Bernadette oversees strategy for Talis‘ North American clients. She brings a strong background in commercial and government data management strategies, coupled with expertise in leading high-growth software organizations. Prior to joining Talis, Bernadette was CEO of several profitable Internet companies delivering scalable Web-based solutions for the enterprise, including Zepheira LLC and Tucana Technologies Inc., a pioneer in the emerging semantic technology community. http://semtech2011.semanticweb.com/sessionPop.cfm?confid=62&proposalid=3822
  52. 7.2 Linked Data Cookbook 1. Leverage what exists. Obtain data extracts (i.e., databases and/or spreadsheets) or create data in a way that can be replicated. 2. Model data without context to allow for reuse and easier merging of data sets. With LD, application logic does not drive the data schema, concepts, etc. 3. Look for real world objects of interest (e.g., people, places, things, locations, etc.) and model them. Use common sense to decide whether or not to make link. I Disagree! 4. Connect data from different sources and authoritative vocabularies (see list of popular vocabularies below). Put aside immediate needs of any application. I Disagree! Don’t think about how an application will use your data. I Disagree! 5. Write a script or process to convert the data set repeatedly. 6. Publish to the Web and announce it! (more details shortly). 7. Maintenance strategy (more details in the social contract at the end). http://www.slideshare.net/bhylandwood/bernadette-hyland-semtech-2011-west-linked-data-cookbook
  53. 7.2 Linked Data Cookbook Guidelines for merging: URIs name the resources we are describing. Two people using the same URI are describing the same thing. The same URI in two datasets means the same thing. Graphs from several different sources can be merged. Resources with the same URI are considered identical. No limitations on which graphs can be merged. For a government agency ... a data policy is “a must”: specify data quality and retention, treatment of data thru secondary sources, restrictions for use, frequency of updates, public participation, and applicability of this data policy. I Agree! http://www.slideshare.net/bhylandwood/bernadette-hyland-semtech-2011-west-linked-data-cookbook
  54. 7.2 Linked Data Cookbook http://www.slideshare.net/bhylandwood/bernadette-hyland-semtech-2011-west-linked-data-cookbook
  55. 7.2 Clinical Quality Linked Data on Health.data.gov http://www.data.gov/communities/node/81/blogs/4920 See Next Slide
  56. 7.2 Clinical Quality Linked Data on Health.data.gov http://health.data.gov/def/hospital/Hospital
  57. 7.2 Clinical Quality Linked Data on Health.data.gov http://health.data.gov/doc/hospital/393303.csv
  58. 7.2 Clinical Quality Linked Data on Health.data.gov http://www.slideshare.net/george.thomas.name/clinical-quality-linked-data-on-healthdatagov
  59. 7.2 Health data innovation 'at a crawl' The health care data community should step up its efforts to innovate to help improve the nation’s health outcomes and reduce costs, Health and Human Services Secretary Kathleen Sebelius said at the department’s second Health Data Initiative Forum on June 9. “Use tools and use data,” Sebelius said at the forum held at the National Institute of Medicine campus in Bethesda, Md. “Do it more, do it better and do it faster.” Sebelius said Americans experience a “triple loss” due to having the highest public health care costs, highest private health care costs, and only mediocre health outcomes. The goal of the conference was to present 45 winning health care IT applications developed with HHS’ newly-available data sets within the last several months. HHS CTO Todd Park called the event a “Health Data Palooza” that would showcase innovation in health IT. PerlDiverInc and Semantic Community were one of the finalists! http://fcw.com/articles/2011/06/09/nation-needs-more-health-data-innovation-sebelius-says-at-forum.aspx
  60. PearlDiver Data Engine & Semantic Community Data Visualization Health Data Initiative Forum Submission Medicare Zombie Hunter Benjamin Young Brand Niemann PearlDiver Technologies Inc. Semantic Community
  61. 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud http://semanticommunity.info/Semantic_Technology_Conferences/Clinical_Quality_Linked_Data_on_Health.data.gov
  62. 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud http://semanticommunity.info/Semantic_Technology_Conferences/Clinical_Quality_Linked_Data_on_Health.data.gov/Hospital_Compare_Downloadable_Database_Metadata
  63. 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud PC Desktop Spotfire
  64. 7.2 Build Clinical Quality Linked Data on Health.data.gov in the Cloud SpotfireWeb Player
  65. 7.3 Library of Congress Project Recollection and Digital Preservation Initiative The Libraries of Congress & MIT are developing a Semantic Web Browser (Exhibit and now Exhibit 3) to do essentially what Spotfire already does!
  66. 7.3 Library of Congress Project Recollection and Digital Preservation Initiative PC Desktop Spotfire
  67. 7.3 Library of Congress Project Recollection and Digital Preservation Initiative http://semanticommunity.info/Semantic_Technology_Conferences/Library_of_Congress
  68. 7.3 Library of Congress Project Recollection and Digital Preservation Initiative Interoperability Interface! SpotfireWeb Player
  69. 7.4 Elsevier/Tetherless World Health and Life Sciences Hackathon (27-28 June 2011) http://semanticommunity.info/Build_TWC_in_the_Cloud
  70. 7.4 NYC Data Web http://knoodl.com/ui/groups/NYC_Homepage
  71. 7.4 NYC Data Web Quote: Ontology architecture is a new aspect of system architecture and development, to our knowledge it has not been employed anywhere else in DOD. http://semanticommunity.info/Semantic_Technology_Conferences/NY_Data_Mine/Revelytix
  72. 7.4 NYC Data Web http://semanticommunity.info/Semantic_Technology_Conferences/NY_Data_Mine/Revelytix#Dashboard
  73. 7.4 NYC Data Web PC Desktop Spotfire
  74. 7.4 NYC Data Web PC Desktop Spotfire
  75. 7.5 Be Informed A recent paper describes the formalism and rationale that Be Informed applies to business process modeling. It explains how and why goal-oriented modeling differs from more conventional business process modeling which is procedural. In the near-term, there is applicability for many government agencies, especially for those exploring semantic approaches. For example, Dennis Wisnoskiadvocates semantic web (RDF & OWL) standards for modeling data integration, and a dialect of BPMN for modeling processes. The metaphor for processes is an electronic circuit specification that uses standard building blocks. "We all know what those primitives mean." Previous, costly attempts at business process modeling were failures in part because there was no standard at the primitive level. However, as this paper makes clear, just having unambiguous primitives is only part of what is needed to specify and manage complex and dynamic business processes. Modeling flow in swim lanes is less agile than modeling goals, activities, and pre and post conditions. Source: Mills Davis, Project10x, July 5, 2011.
  76. 7.5 Be Informed Fig. 1. Summary of the Meta Model for Capturing Business Processes Source: Specifying Flexible Business Processes using Pre and Post Conditions, Jeroen van Grondelle and Menno Gulpers, Be Informed BV, Apeldoorn, The Netherlands, 13 pp.
More Related