1 / 46

Introduction to Clinical Terminology and Classification Clinical Decision Support L4

Introduction to Clinical Terminology and Classification Clinical Decision Support L4. AL Rector Open GALEN TopThing UK The Medical Informatics Group, U of Manchester www.cs.man.ac.uk/mig/galen www.opengalen.org www.topthing.com rector@cs.man.ac.uk. Clinical Terminology. Data Entry.

jana
Download Presentation

Introduction to Clinical Terminology and Classification Clinical Decision Support L4

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to Clinical Terminology and Classification Clinical Decision SupportL4 AL Rector OpenGALENTopThing UKThe Medical Informatics Group, U of Manchester www.cs.man.ac.uk/mig/galenwww.opengalen.orgwww.topthing.comrector@cs.man.ac.uk

  2. Clinical Terminology Data Entry Clinical Record Decision Support Data Entry Electronic Health Records Decision Support &Aggregated Data GALENClinicalTerminology HealthCard Mr Ivor Bigun Dun Roamin Anytown Any country 4431 3654 90273 The Vision Best Practice Best Practice

  3. OpenGALEN: Philosophy Terminology is software Terminology is the interface between people and machines Re-use is the key Patient-centred information Terminology must have a purpose Always ask: “What’s it for?” Not art for art’s sake Terminology supports clinical applications - not vice versa Applications for someone to do something for somebody Keep the ‘Horse before the Cart’ Always ask: “How will we know if it works?” “How will we know if it fails?”

  4. OpenGALEN: Key ideas • Separation of kinds of knowledge • Terminology, medical record and information system schemas • Concepts, language, Coding, Indexing, Pragmatics • Machine level, User level • Knowledge is fractal! • There will always be more detail to be added • Therefore terminologies must be extensible • Formal logical Support • Too big and complicated to maintain by hand • Extensibility requires rules • Software needs logical rigour

  5. Axes for kinds of Knowledge • Terminology • Medical Records/Information systems • Concepts • Language • Coding • Indexing • Pragmatics & User Interface • Machine level • Human Level

  6. Uses of Terminology • Clinical • Epidemiology and quality assurance • Reproducibility / Comparability • Indexing • Software • Re-use ! • Integration and Messaging between systems • Authoring and configuring systems • Data capture and presentation (user interface) • Indexing information and knowledge (meta-data, The Web)

  7. History:Origins of existing terminologies • Epidemiology • ICD - Farr in 1860s to ICD9 in 1979 • International reporting of morbidity/mortality • ICPC - 1980s • Clinically validated epidemiology in primary care • Now expanded for use in Dutch GP software • Librarianship • MeSH - NLM from around 1900 - Index Medicus & Medline • EMTree - from Elsevier in 1950s - EMBase • Remumeration • ICD9-CM (Clinical Modification) 1980 • 10 x larger than ICD; aimed at US insurance reimbursement

  8. Traditional Systems • Built by people for interpretation by people (Coding clerks) • Most knowledge implicit in rubrics • Must understand medicine to use intelligently • Not built for software • On paper for use on paper • Enumerated - top down all possibilities listed • Serial - Single use - Single View • Hierarchical Thesauri • Traditional terminological techniques from librarianship • ‘Broader than’ / ‘Narrower than’ (ISO 1087) • no logical foundation • Focused on ‘terms’ • Language and concepts mixed • Synonyms, preferred terms, etc caused confusion

  9. History (2) • Pathology indexing • SNOMED 1970s to 1990 (SNOMED International) • First faceted or combinatorial system • Topology, morphology, aetiology, function • Plus diseases cross referenced to ICD9 • Specialty Systems • Mostly similar hierarchical systems • ACRNEMA/SDM - Radiology • NANDA, ICNP… - Nursing • …

  10. History (3) • Early computer systems • Read I (4 digit Read) • Aimed at saving space on early computers • 1-5 Mbyte / 10,000 patients • Hierarchical modelled on ICD9 • Detailed signs and symptoms for primary care • Purchased by UK government in 1990 • Single use • Morbidity indexing • Medical Entities Dictionary (MED) • Jim Cimino

  11. History (4) • Aspirations for electronic patient records (EPRs) • Weed’s Problem Oriented Medical Record • Direct entry by health care professionals • Aspirations for decision support • Ted Shortliffe (MYCIN), Clem McDonald (Computer based reminders), Perry Miller (Critiquing),.. • Aspirations for re-use • Patient centred information • Needed common multi-use multi-purpose terminology • None worked

  12. Summary of Changes at end of 1st Generation • From terminologies for people to terminologies for machines • From paper to software • From single use to multiple re-use for patient centred systems • From entry by coding clerks to direct entry by health care professionals • From pre-defined reporting for statistics to reliable indexing for decision support

  13. Problems with‘First Generation’Enumerated Systems in coping with these changes

  14. Problems (1) • Scaling !!! • More detail and more specialities required scaling up, but... • The combinatorial explosion • Example: Burns: • 100 sites x 3 depths  404 codes • 5 subsites/site x chemical or thermal  7272 • x 3 extents x 3 durations  116,352 • ‘The Persian chessboard’ • 264  1019 • 1019grains of rice  100 billion tonnes of rice • 1019 nanoseconds  10,000 years • Read II grew from 20,000 to 250,000 terms in ~100 staff-years • still too small to be useful • but too big to use

  15. Problems (2) • Information implicit in the rubrics • “Hypertension excluding pregancy” • Computers can’t read! • Invisible to software • No explicit information except the hierarchy • Minimal support for software • No opportunity to use softwre to help • Language and concepts confused • Synonyms • Preferred terms • Homonyms • Only simple look up and spelling correction

  16. Problems (3) • Mixed Organisation • ‘Heart diseases’ in 13 of 19 chapters of ICD • Tumours, infections, congenital abnormalities, toxic, … • ‘Steroids’ in five chapters of standard drug classifications • Anti-inflammatories, anthi-asthmatics, … • Unreliable for indexing or Abstractions • How to say something about ‘all heart diseases’? • Fixed organisation • Single hierarchy - Single use • Where to put ‘gout’ - arthritis or metabolic disease? • Back and forth in each edition of ICD • No re-use

  17. A Mixed Hierarchy A correct kind-of (subsumption) hierarchy Problems 3bThesauri rather than Classifications

  18. Problems (4) • ‘Semantic identifiers’ • Codes really paths - moving a concept meant changing its code • 3 Cardiovascular disorders …3.4 Disorders of Artery... ...3.4.2 Disorders of coronary artery... …3.4.2.3 Coronary thrombosis … • Easy to process but... • Reorganisation requires changing codes • Codes cannot be permanent

  19. Problems (5) • Maintenance • 20 Years from ICD9 to ICD10 • ~100 person-years from Read 1 to Read 3 • Mega francs/guilders/crowns/marks on European coding schemes • Thousands of unpaid hours of committee time • Impossible / meaningless decisions take longest • You can search forever for something that is not there • Multiple uses compete - • Must choose one use • Most successful were clear about their purpose - ICD, ICPC, MeSH • Codes change meaning with version changes • Old data misleading!

  20. Problems (6) • Version specific artefacts • “Not otherwise specified” (NOS) • Used to move a general concept ‘down’ • Not elsewhere classified (NEC) • Catch all - Nowhere else in coding system e.g. ‘Tumour not elsewhere classified’ • dependent on version, • “Other” • Catch all - Not listed below, e.g. “Other diseases of the cardiovascular system” • dependent on version • Not used consistsently

  21. Problem (7): Language is slippery:Two hands or Four?

  22. Language/Concepts are slippery • Human cognition makes it look easy • Logic fails to capture it • Classification is easy until you try to do it • Trying since Aristotle in the West and Ancient Chinese in the East • Words/Concepts mean what a community decides they mean • Does a chimpanzee have four hands? • Is a prion alive? • Is surgery on the ovary a kind of ‘Endocrine surgery’? • Easier to agree on the concrete than the abstract • Easy to agree on useful abstractions and generalisations • Harder to agree on how to name them

  23. Problems (8) • There is no re-use - there is no standard • The ‘grand challenge’: A common controlled vocabulary for medicine • But re-use requires multiple different views • People’s needs differ / People do and find different things • By profession • Doctors and specialties, nurses, physiotherapiests, dentists… • By situation • Inpatient, outpatient, primary care, community… • By task • Diagnosis, management, prescribing, • patient care, public health, quality assurance, management, planning • By country and community • US, UK, France, Germany, Japan, Korea, ...

  24. Summary of Problems1st Generation Enumerated Systems • Enumerated Single Hierarchies • List all possibilities in advance • Cannot cope with fractal knowledge • Most knowledge implicit • Invisible to software • Can’t agree on common concepts and classification • Unreliable for indexing • Difficult to use for healthcare professionals • No support for user interface • Can’t build and maintain big classifications • Language and concepts don’t translate easily to logic and software

  25. Cimino’s Desiderata (1) • Concept orientation • Separate language (terms) and concepts (codes) • Concept permanence • Never re-use a code (‘retire’ it) • Nonsemantic concept identifiers • Separate the code from the path • Polyhierarchy • Allow one concept to be classified in multiple ways • Gout can be both a metabolic disease and an arthritis

  26. Cimino’s Desiderata (2) • Formal Definitions • i.e ‘Be compositional’ • Reject ‘Not elsewhere classified’ • concept permanence and NEC • Multiple granularities • Organ, tissue, cellular, molecular • Grades, types, classes of diseases • Special clinical criteria • Multiple consistsent views • Allow different organisations • e.g. functional, anatomical, pathological

  27. Cimino’s Desiderata (3) • Represent context • Family history, risk, source of information • Evolve gracefully • Allow controlled changes • Recognise redundancy (equivalence) • ‘Carcinoma’ + ‘Lung’ ?=? ‘Carcinoma of the lung’ • How would we know? • How could a machine know?

  28. Coding & Classification Decision support MeSH MEGA-TERM UMLS Medical Records SNOMED Axes ICPC ACRNEMA READ OPCS Data entry ICD-9 ICD-10 Solution Generation 1Megaterm + Crossmapping = UMLS Decision support Clinical Applications Medical Records Data entry

  29. Solution 1 Cross-mapping & UMLS • Unified Medical Language System (UMLS) from US National Library of Medicine • Defacto common registry for vocabularies • Concept Unique Identifiers (CUIs) and Lexical Unique Identifiers (LUIs) are defacto the common nomenclature

  30. Solution 1 Cross-mapping & UMLS • An invaluable resource, but... • No better than the vocabularies which are mapped • Limited detail for patient care • Unreliable for indexing or abstraction of knowledge • Best for relating everything to MeSH for indexing literature • Still limited by combinatorial explosion • Still can’t cope with fractal knowledge • Not extensible - no help in building or extending terminologiese • No help in reorganising existing terminologies to re-use for new purposes • Top down • Information still implicit • Minimal help with software • No help with data capture, user interfaces

  31. Solutions Generations 2-3Compositional Systems • Beat the combinatorial explosion • Build concepts out of pieces - leggo • Dictionary and grammar rather than phrasebook • But hard

  32. Solution Generation 1.5: Faceted • Faceted systems: SNOMED International • Inflammation + Lung + Infection + Pneumococcus  Pneumoccal pneumonia • Limit combinatorial explosion, but… • Rigid - a limited number of axes / facets / chapters • Each facet has the problems of a first generation enumerated system • Much knowledge still implicit • No way to know how identifiers relate • No explicit relations, only ‘+’ • No way to recognise redundancy / equivalence • No help with data capture or user interface / No way to recognise nonsense • Carcinoma + Hair + Donkey + Emotional  ???? • Still can’t cope with fractal knowledge • Limited extensibility: limited help with building, extending or reorganising • Still Top Down

  33. Generation 2: Enumerated Compositional • Read III with qualifiers • Inflammation: site: lung, cause: pneumococcus  Pnemococcal Pneumonia • More semantics but… • Limited qualifiers - limited views - limited re-use • Limited help with data capture - User interface difficult • Much information still implicit - limited software support • No way to recognise redundancy / equivalence / errors • Organisation still mixed - indexing better but still unreliable • Limited separation of language and concepts • Still can’t cope with fractal knowledge • Limited extensibility; limited help with building and reorganising terminologies • Top down

  34. CT Vocabulary • “Reference Terminology” vs “Interface Terminologies” • Reference terminology = enumerated hierarchy of formally defined terms • Interface terminology = navigation structure for user interface • Explicitly excluded from SNOMED-RT • “Terming”, “Coding”, and “Grouping” • Terming - finding the lexical string • Coding - finding the correct unique code (concept) • Grouping - putting codes into groupers for epidmiological or other purposes

  35. Generation 2.5 Pre-coordinatedFormal Compositions • SNOMED-RT (SNOMED-CT?) • Formal logical model for classifying a fixed list of definitions • Simple fixed ontology (7 links) • GALEN derived terminologies • UK Drug Ontology • Procedure classifications

  36. Generation 2.5 Pre-coordinatedFormal Compositions More semantics • Limited ability to cope with combinatorial explosion • Any one pre-coordinated terminology of fixed size • but arbitrarily many terminologies might be derived • Limited ability to cope with fractal knowledge • Limited extensibility • Extensibility requires access to ‘Workbench’ • Bottom up / middle out • More explicit information • Logical criteria for correctness / redundancy / equivalence • Based on knowledge representation (ontologies) and description logics • Limited support for data capture and user interface

  37. Generation 3: Post-Coordinated Formal Concept Model with Constraints delivered as Software Services • OpenGALEN Reference Model - PEN&PAD/Clinergy™ • Inflammation which hasCause (Infection which hasCause Pneumococcus)  PneumococcalPneumonia  “Pneumococcal Pneumonia” • A dictionary and grammar rather than a phrase book • Software rather than data • A sound logical and ontological foundation

  38. Generation 3: Post-Coordinated Formal Concept Models • Copes with combinatorial explosion • Indefinitely many compositions possible • Lists not pre-enumerated • Copes with fractal knowledge • Easily extensible to add more detail • Most information explicit • More comprehensive ontology (50-250 links) • Good support for data capture / user interface • But requires additional pragmatic knowledge layer • Separates user view and machine view • Intermediate representation vs GRAIL

  39. Case Study 1: The exploding bicycle • ICD-9 (E826) 8 • READ-2 (T30..) 81 • READ-3 87 • ICD-10 (V10-19) 587 • V31.22 Occupant of three-wheeled motor vehicle injured in collision with pedal cycle, person on outside of vehicle, nontraffic accident, while working for income • W65.40 Drowning and submersion while in bath-tub, street and highway, while engaged in sports activity • X35.44 Victim of volcanic eruption, street and highway, while resting, sleeping, eating or engaging in other vital activities

  40. Defusing the exploding bicycle:500 codes in pieces • 10 things to hit… • Pedestrian / cycle / motorbike / car / HGV / train / unpowered vehicle / a tree / other • 5 roles for the injured… • Driving / passenger / cyclist / getting in / other • 5 activities when injured… • resting / at work / sporting / at leisure / other • 2 contexts… • In traffic / not in traffic V12.24 Pedal cyclist injured in collision with two- or three-wheeled motor vehicle, unspecified pedal cyclist, nontraffic accident, while resting, sleeping, eating or engaging in other vital activities

  41. Structured Data Entry File Edit Help Goodbye to picking lists… Cycling Accident What you hit Your Role Activity Location

  42. Other important links and initiatives • HL7 Vocabulary group • See HL7 web site • Or join list server • SNOMED-DICOM-Microglossary (Radiology) • Nursing initiatives - see Nick Hardiker papers • ISO TC215 WG2 / CEN TC251 WG3 • See web sites

  43. Criteria for success • Re-use • A recognised growing library of common decsision support modules • Stop starting from scratch! • Integration • 2+ independently developed DSSs integrated with2+ independently developed EPRS withoutexponentially increasing effort.

  44. Criteria for success • Authoring • No individual invests in their own terminology • enterprise-wide terminology servers • Indexing • Simplification of systems • a sharp drop in special cases and exceptions • a sharp increase in authors’ productivity

  45. Criteria for success • User interfaces • Real systems in real use with real patients by real clinicians • transparent systems

  46. OpenGALEN www.opengalen.org

More Related