240 likes | 259 Views
Explore how voice recognition technology is revolutionizing pathology, from artificial intelligence to linguistic variability. Discover various systems like SpeechMagic, IBM MedSpeak, and Dragon, offering high accuracy in pathology dictation. Learn about obstacles in speech recognition and the importance of training and language models in achieving precision. Unveil how Philips, Lernout & Hauspie, Dictaphone Corporation, and Dragon Systems are paving the way for efficient and accurate voice-controlled systems in pathology.
E N D
XIX Spanish Congress of PathologySpanish Club of Applied Computing in PathologyVoice recognition in Pathology Marcial Garcia Rojo, M.D., Ph.D. Pathology Dept. Complejo Hospitalario de Ciudad Real Barcelona, September 18th, 1999
New perspectives en LIS(LIS = Laboratory Information Systems) • Graphic Interface • Workstation: electronic signature, coding, cytology Q/C, technique requests, etc. • Clerical work: Registry, Archive, Consultations, etc. • Computer Aided Diagnosis • Unique Clinic History • “Outreach”: Primary care and remote labs
Artificial Intelligence & Cytometry • Decision Trees: tumoral markers (Decaestecker, 1997) • Expert systems (semantic networks) breast carcinoma classification (van Diest, 1994) • Artificial Neural Networks: Essential in cancer • Gynecological cytological Screening: PAPNET • Dg/Pro Breast carcinoma (O'Leary, 1992; (Mat-Sakim, 1998) • Dg. Gastric lesions benign/malignant (Karakitsos, 1998) • Liver Displasia & Carcinoma (An, 1997) • FNA breast (Cross, 1997), Thyroid (Karakitsos, 1996)
Main obstacles in Speech recognition • Linguistic Variability: Phonetic, Phonology, Syntax, Semantics, Discourse • User V. : Tempo, Pronunciation, Inflexion, Fatigue, stress, coarseness... • Channel Variability: Noise, transmission channel • Coarticulation: Phonemes context
Acoustic Models • Phonetic-Acoustic Model (phoneme units)Speaker independent systems. Too unreliable • Pattern-Template. Methods to build them: • Vector Quantization • Hidden Markov Models • Neural Networks
Language Models • Provides the knowledge source for the engine. It helps to predict the next word. Components: • LEXICON (VOCABULARY): Context (Professional) • GRAMMAR: Structure and format (Markov Models)
Training • Crucial to all systems: Speaker dependant/independent • Read Text - Vocabulary - Pronunciations • Neural Network Training (iterative process -5- of speech) & Hidden Markov Model building • To develop grammar for phoneme sequences • Greater training, less word error rate (plateau)
Philips Speechwww.speech.philips.com • No end-user solution (Cortex) • Recognition Server • Speech Magic64.000 wd(270K) • Synchronized edition/ playback • First Pathology continuous system • Accuracy >= 95% • SpeechMike (Micro&Trackball&Spk) • SpeechPad (software & hardware) • FreeSpeech98 30.000 words
IBM MedSpeak for Pathology • 25.000 word vocabulary of pathology • Template customization • Integration (Active-X) with LIS; HL-7 interfaces • IBM ViaVoice98 (Español) - Medical vocab. (38.000) • Voice allows total control. Vocab: 33.900 ptas. • 95% Accuracy. 30 minutes de training • Dynamic Healthcare Technologies (CoPath) & Talk Technologies (TalkStation-Pathology™)
Lernout & Hauspie • L&H Kurzweil Clinical Reporter • Medical vocabulary. Templates. Total control • Excellent knowledge base (Rosai Cancer Checklist) • License cost: $6000 - $8000 per user (1 million ptas.) • L&H Voice Xpress for Medicine • Integration with Microsoft Word • Medicine and Pathology vocabulary. Text Macros. • Cost: $499 (80.000 ptas.) Pathology: $1499 (250.000 ptas.) • Up to 640.000 words vocabulary • Accuracy > 95%. 140 words / minute • Requires Pentium II 266
Dictaphone CorporationEnterprise Express Voice System Integrated Voice and Data Management System • Explorer (SpeechMagic de Philips & Cortex path. sys.) • Client Viewer, Job Lister, Reporter (SQLQ)... • HIS integration, Fax distribution, E-mail, etc. • Boomerang™:Software to send voice files (IBM Viavoice) • Pathology context - 15,000 basic pathology words • >95% Accuracy after speaker adaptation
Dragon Systems • They have no product for Pathology but others (Voice Automated & Articulate Systems) use Naturally Speaking (Español) • Naturally Speaking Medical Suite • Active medical Vocabulary up to 55.000 (240.000) • Vocabulary Builder: Customization of medical specific vocab. • Dragon BestMatch: uses expanded speech and vocabulary models to get greater accuracy. • Dragon Command Wizard (macros, scripting language...) • 160 words/minute. 95-98% accuracy • Pentium 200 y 64 MB RAM
Broca: Philips engine. • Surgical Pathology and Autopsy context • First LIS for Pathology system with continuous dictation system (The Gold Standard™) • Accuracy 98-99%. 13.000 pathology words • Text & Formatting Macros • For example, saying "Begin Microscopic" will indicate dictation for the microscopic section of the report and informs the system that the information eventually needs to upload to the Microscopic field in the LIS database.
Macros • Text Macros (E3 -> Multiple fragments ....) • Command Macros (keyboard/mouse) • Program Macros: Executes DLL, objets... • Structured Macros (templates)
Borowitz, StephenMPediatric Research 1999; 45:120A. • Computerized dictation systems are practical and fast enough to replace human transcription. • Differences in the types of errors: Human transcriptionists make spelling errors whereas computerized transcription makes word substitutions • Don't be surprised if your programs first rendition of 'cystic fibrosis' is something similar to 'sixty-five roses.’ (Kovesi, T)
Other solutions • Only 7,5% pediatricians (Spooner, 1998) • Medical Center contract Transcription Services to provide medical transcription at a cost of $0.9 to $0.19 per line. Local telephone call to dictate and the transcribed copy is delivered by modem. • Wireless Computer SystemDragon NaturallySpeaking Mobile • Voice activation of instruments (Margossian et al, 1998)
Interesting References • http://home.nycap.rr.com/voice/ Voice Recognition For Pathology Michael RibenMRIBEN1@NYCAP.rr.com • Teplitz C, Cipriani M, Dicostanzo D, Sarlin J. Automated speech-recognition anatomic pathology (ASAP) reporting.Sem Diagn Pathol 1994; 11: 245-252 • Leslie, KO, Rosai J. Standarization of the Surgical pathology report: formats, templates, and synoptic reports.Sem Diagn Pathol 1994; 11: 253-257 • Tischler AS, Martin MR. Generation of surgical pathology reports using a 5,000-word speech recognizer. Am J Clin Pathol 1989; 92 (Suppl 1): S44-S47. • Borowitz, SM. Computer-Based Speech Recognition as a Replacement for Medical Transcription. Pediatric Research 1999; 45:120A. • Hogan, R.Dragon NaturallySpeaking.JAMA Volume 280(15) 21 October 1998 pp 1369-1370