
The German Competence Center for Language Technology: Offerings, Results, Future

The German Competence Center for Language Technology: Offerings, Results, Future. Hans Uszkoreit. PROJECT COLLATE.


Presentation Transcript


  1. The German Competence Center for Language Technology: Offerings, Results, Future. Hans Uszkoreit

  2. PROJECT COLLATE • Theme: Computational Linguistics and Language Technology for Real World Applications • Support: A grant by the German Federal Ministry for Education and Research for RTD strengthening the position of Saarbrücken as a Competence Center for Language Technology • PIs: Hans Uszkoreit, Manfred Pinkal and Wolfgang Wahlster • Duration: COLLATE I 4/2001 - 10/2003; COLLATE II 11/2003 - 12/2005

  3. The Concept: Competence Center for Language Technology in Saarbruecken (diagram; labels as extracted) • Research Resources in Saarbruecken • Services for (potential) users of LT in industry and public administration • LT Research Resources Saarland University • Internet Services • R&D Project DFKI • Transfer Center • DFKI, State of Saarland, Share Holders, University • proactive R&D • accompanying R&D • prototypical demonstration for transfer • Personnel • Consumables • IT Investments • Information & Awareness Events • LT Research Resources DFKI • Consulting • Personnel • Rooms • IT Investments • Support of Spin-offs • BMBF • R&D Project Saarland University • applied research • Offers for temporary work assignments in the center

  4. Project Structure COLLATE I (diagram; labels as extracted) • Coordination, Controlling, Reporting, WWW Presence, PR, Events • Saarland University • basic functionalities of LT-based knowledge management • competence center of language technology • information extraction and fusion • dialogue for knowledge access and acquisition • virtual information center • information management and retrieval • demonstration center • evaluation center

  5. Scientific Advisory Board • Steven Bird, Melbourne, Australia • Martin Kay, Stanford, USA • Donia Scott, Brighton, UK • Sharon Oviatt, Denver, USA • Oliviero Stock, Trento, Italy • Hans Tillmann, Munich, Germany

  6. Results • LT-World: the largest, most comprehensive and most widely used information service in the discipline • A model for other technology disciplines • Demonstration Center: unique worldwide in kind and scope • High visibility and political impact • Methodology for deployment-specific system evaluation • Several studies and system evaluations

  7. Further Research Results • Large Scale Multilevel Corpus Annotation • SPROUT: Unification-Based Information Extraction System • M3I: Mobile, Multimodal, Modular Interface • Information Access Dialogue Systems • AnswerBus: Large-Scale Question Answering System

  8. International Collaboration I • OLAC (Open Language Archives Community): exchange of ontologies and metadata schemas; H. Uszkoreit serves on Advisory Board • Nordic Documentation Center for Language Technology: joint initiative by Denmark, Finland, Iceland, Norway, Sweden; Nordoknet took over LT-World ontologies and metadata schemas; data exchange • France (TechnoLangue): French project in TechnoLangue adopted COLLATE approach

  9. International Collaboration II • Italy: CELCT, joint center of IRST and DFKI for language technology evaluation; starting point was the COLLATE model • Europe: plan to extend the collaboration of DFKI and IRST in CELCT to France and, in a second step, to the European level • China: initiative to establish a Chinese LT Competence Center in Shanghai • Japan: a study is currently evaluating LT-World (and other information services)

  10. Roadmapping • The Competence Center took the lead in the roadmapping tasks of the European Network of Excellence in Language and Speech (ELSNET) • We built an interactive roadmap viewer and editor for the web • We organized several roadmap workshops • The results are accessible on the Web

  11. CELCT: Centre for the Evaluation of Human Language and Multimodal Communication Technologies • Multilingual Q/A (CLEF) • Word sense disambiguation (SENSEVAL) • Cross-language spoken document retrieval • Speech-to-speech translation • Expression of emotion in virtual characters • Multilingual information extraction • Multimodal information presentation • Emotion recognition in multimodal communication • CELCT: PAT 800k Euro • Yearly budget: 1 M Euro • Staffing: 12-15 researchers • IRST: Pianesi, Lazzari, Stock • DFKI: Uszkoreit, Wahlster

  12. A European Infrastructure Network for Human Language and Multimodal Technologies (diagram) • COLLATE, Saarbrücken, Germany: headquarters for virtual information centre, demonstration centre, user-oriented application evaluation; with satellites • CELCT, Trento, Italy: headquarters for evaluation of technologies; with satellites • ELDA, Paris, France: headquarters for acquisition and distribution of multimodal resources; with satellites

  13. LangTech 2002 • A new forum for language technology solutions • September 2002 in Berlin • German Competence Center for LT was Co-Organizer • Initiative and Program Chair: Hans Uszkoreit • Keynote by Wolfgang Wahlster • "Window on the Future Session" by Reinhard Karger • 336 participants from 33 countries (2/3 from industry) • More than 70 companies from 20 countries exhibited and presented

  14. LangTech 2003 • In Paris, organized by ELDA • strong showing of German companies in the exhibition • weaker presence of German industry in the program • COLLATE presented in the program and in the demos • Dr. Reuse (BMBF) presented German R&D on a panel dedicated to strategic directions (organized by Uszkoreit)

  15. Strengthening the Center • Computational Psycholinguistics (in Comp. Linguistics) • Matt Crocker • Usability Lab • Speech Technology (in Electrical Engineering) • Dietrich Klakow (newly appointed coming from Philips) • Speech Lab • General Linguistics (in the University Development Plan)

  16. LT-World • Language Technology World is a virtual information center on language technology R&D • it is more than a simple portal • it integrates a number of ambitious services • it is based on advanced technologies: ontologies (XML, RDF, OIL), automatic hyperlinking

  17. LT-World

  18. Shanghai Jiao Tong University Virtual Information Center for Computational Linguistics

  19. LT World Stats • LT World has become the most comprehensive information source of our discipline • Technologies: 110 • Systems: 250 • Patents: 873 • Experts: 2295 • Organisations: 1661 • Projects: 657 • News & Events: 1002 • Highlights: HLT Survey, ACL Software Registry, NORFA Exchange, Glossary

  20. Usage Stats • Per month we now have an average of 37,000 hits and 16,000 page views • The number of unique visitors keeps increasing • Jan 2003: 800 unique visitors, 1500 visits • Dec 2003: 1150 unique visitors, 2420 visits • Apr 2004: 1561 unique visitors, 2570 visits • Maximum: 1620 unique visitors (Mar03), 2949 visits (Mar03)
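The growth behind the usage figures on this slide can be checked with a little arithmetic. The sketch below is only an illustration: the function name `percent_growth` and the dictionary layout are assumptions made here, and the numbers are copied from the slide.

```python
# Monthly usage figures as reported on the slide: (unique visitors, visits).
usage = {
    "Jan 2003": (800, 1500),
    "Dec 2003": (1150, 2420),
    "Apr 2004": (1561, 2570),
}


def percent_growth(old, new):
    """Relative change from old to new, expressed in percent."""
    return (new - old) / old * 100.0


first_visitors, first_visits = usage["Jan 2003"]
last_visitors, last_visits = usage["Apr 2004"]

visitor_growth = percent_growth(first_visitors, last_visitors)
visit_growth = percent_growth(first_visits, last_visits)

print(f"Unique visitors, Jan 2003 to Apr 2004: {visitor_growth:+.1f}%")
print(f"Visits, Jan 2003 to Apr 2004: {visit_growth:+.1f}%")
```

Under these figures, unique visitors nearly doubled over the period, while visits grew more slowly.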

  21. Integration with other Info-Services • Included but autonomous resources: ACL Software Registry, Language Technology Survey • Systematically cross-linked and cross-searchable resources: all OLAC resources, such as LDC, SIL, ACL SR, and OLAC Home • Systematically cross-linked resources: HLT Central, ELSNET, EACL ACL NLP Universe, COLIBRI • Linked resources: all other resources relevant for LT

  22. Impact Several other information services have copied approaches, methods and architectures from LT-World • Swedish Documentation Center • French Technolangue Portal • Chinese LT-World

  23. Demo Center • Service for information, promotion, dissemination, education • 5 demo stations with I/O peripherals • Demo equipment in a combined demo and meeting room • Powerful server • ... and many LT systems to be shown

  24. The Demo Center (floor plan; labels: Demo Stations, 19-inch Rack, Elevator; dimensions: 14 m, 12.50 m)

  25. The Demo Center (visitors) • Angela Merkel, Chairwoman of CDU Germany, 22.5.02 • Johannes Rau, President of Germany, Mrs. Rau, Prime Minister (Saar) Peter Müller, Mrs. Müller, Jürgen Schreier, Education, Culture and Science Minister (Saar), 26.8.02 • Josef Brauner, Senior Vice President Deutsche Telekom, Hanspeter Georgi, Minister for Economic Affairs (Saar)

  26. Demo Center • Appearances at trade fairs, conferences, exhibitions • 135 demos with more than 900 visitors at DFKI • Companies of all sizes and sectors • Politicians and journalists • Highlights from this year: 12.02.04 Swedish ambassador; 08.03.04 delegation from NTT; 22.04.04 Girls' Day; 13.05.04 ambassadors of Malta, Estonia, Uruguay, Egypt, Ireland, Portugal

  27. Demo Center Stats • The demo center is the largest concentration of language technology and a truly unique service for promotion and education • Systems installed: 51 • Products: 29 • Multilinguality: 15 • Speech Systems: 16 • Events: 135 • Visitors: 900 • Highlights: President J. Rau, A. Merkel, VP German Telekom, ambassadors, ministers

  28. Evaluation Center Goals • To advance understanding of methodologies for empirical studies of the usability of LT systems • To solve key usability problems that arise with particular types of LT systems • To assist LT R&D and the LT industry in the selection and utilization of effective evaluation methods

  29. Points of Emphasis • Studying the usability of an entire system that has an LT component as opposed to evaluating just the LT component • Detailed understanding of how users deal with the system – as opposed to evaluation of their overall success and satisfaction • Use of advanced logging, monitoring and eye tracking technology to achieve detailed understanding

  30. Types of Systems Studied • Systems that make some use of language technology, such as: • (Multilingual) information retrieval systems • Systems for extracting information from texts • Spoken dialog systems

  31. Demo Center: Mobile Eye Tracker

  32. Automatic Hyperlinking • How should hyperlinks be marked? • Trade-off between distraction and link awareness • Increased relevance of link redundancy • Which occurrences of a term should be hyperlinked? • All occurrences – too distracting? • Linguistic selection criteria? • Which occurrences do users actually use? • Trade-off between distraction and accessibility of linked information
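The question of which occurrences of a term to hyperlink can be made concrete with a small sketch. The code below is not the LT-World implementation; the function name `link_occurrences`, the two policy names, and the bracketed output marker are assumptions chosen only to contrast linking every occurrence with linking just the first one.

```python
import re


def link_occurrences(text, term, url, policy="all"):
    """Mark occurrences of `term` in `text` as links of the form [term](url).

    policy="all"   links every occurrence (more link awareness, more distraction)
    policy="first" links only the first occurrence (less distraction)
    """
    if policy not in ("all", "first"):
        raise ValueError(f"unknown policy: {policy}")
    # Match the term as a whole word, ignoring case.
    pattern = re.compile(r"\b" + re.escape(term) + r"\b", flags=re.IGNORECASE)
    # In re, count=0 means "replace all matches"; count=1 replaces only the first.
    count = 0 if policy == "all" else 1
    return pattern.sub(lambda m: f"[{m.group(0)}]({url})", text, count=count)


sample = "Parsing is hard. Good parsing helps."
print(link_occurrences(sample, "parsing", "URL", policy="all"))
print(link_occurrences(sample, "parsing", "URL", policy="first"))
```

A linguistic selection criterion, as raised on the slide, would replace the simple first-occurrence rule with a filter over the individual matches.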

  33. Multitasking with Xybernaut • Small handheld PC with a basic indoor navigation system based on IR sensors • REAL software developed by the CS Dept., Saarbrücken University • Directions via maps are visible to the user from the system screen • The system is configured for a planned route in the Computer Science department at Saarbrücken University Questions: • How can a user manage multitasking with a mobile system? • How do users combine the tasks walking, scrolling, viewing? • Study performed with the ASL Model 501 mobile eye tracker

  34. Latest Activities • German national TV magazine Planetopia, January 11, 2004

  35. Latest Activities • Poster presentation at the CHI 2004 conference, and usability studies and models for multitasking: presentation in the CHI 2004 workshop "time design" • Ontoprise and the Evaluation Center are in the final selection round (3 applicants remaining) for the project HALO (http://www.projecthalo.com/), with the Evaluation Center as usability experts.

  36. Dissemination • 85 international publications • 51 talks (keynotes and conference talks) • successful exhibition at CeBIT 2002 • exhibition and talk at LREC 2002 in Las Palmas • exhibition and talk at LangTech 2002 in Berlin • exhibition at ACL 2003 • talk at ACL 2003 in the workshop Informational Resources • exhibition at LangTech 2003 in Paris

  37. From the Reviewers' Comments • ...The results were very impressive, the results are very good, we want you to keep going on ... • ...This project could be a model for other larger action, for example for initiatives embracing the whole of Europe... • ...Work is driven by applications and exhibits an innovative view of what an application could be. But it is not driven by fashion!... • We are very impressed by the sheer quality of the presentations ... it could not have been better presented ... we were blown away by that ...

  38. Program for the Second Phase • Virtual Information Center LT World: • Patent information • Glossary and terminology • Question answering system • Integration of the new edition of the survey book • Demo Center • Mobile demo center for presentations at companies • Development of application scenarios • Evaluation Center • Joint center with IRST

  39. Information Events • Sector-specific information events • Language technology in the security sector • UMTS applications • Multilingual document production • Customer relationship management

  40. Further Plans for the Second Phase • Expansion of the Competence Center • Stronger inclusion of neighboring technologies that are often deployed together with language technology: • Multimedia technologies • Knowledge technologies • Web technologies • Internationalization • Realization of the European network already envisaged

  41. Conclusion and Outlook • We have created an offering that does not exist in this form in any other country. • The Competence Center will not wear out through heavier use but will improve further. • We hope that your suggestions and proposals will help us tailor our offering even better to your needs.
