380 likes | 595 Views
The IATE Database. NPLD Seminar, Dublin. Maria José Palos Caravina Translation Centre for the Bodies of the European Union. 1. Project background and organisation IATE Features IATE as a collaborative workspace Data Structure Data Entry and Validation Communication mechanisms
E N D
The IATE Database NPLD Seminar, Dublin Maria José Palos Caravina Translation Centre for the Bodies of the European Union
1. Project background and organisation • IATE Features • IATE as a collaborative workspace • Data Structure • Data Entry and Validation • Communication mechanisms • Import/export functionalities • Cooperation and Common Guidelines • Best practice guide • Interinstitutional consolidation projects • IATE functionalities in a nutshell
3.IATE goes public • Terminology Workflow: from IATE to IATE Public • 4. Content issues • Nr. terms, entries, languages, etc. • 5. Current developments
The Project • IATE = “Inter Active Terminology for Europe” • Objective: creation of an interactive terminology database for the consultation, update and joint management of terminological data from all the European Institutions, Bodies and Agencies. • Project partners: European Commission, Council, Parliament, Court of Auditors, Economic and Social Committee/Committee of the Regions, Court of Justice (as observer), Translation centre, European Investment bank, European Central Bank. • The development phase started in January 2000. • Operational since the summer 2004. • Available to the public since June 2007.
1. Project organisation • 2000 – 2004 • Project overseen by the Interinstitutional Committee for Translation • Top level group: Expert Group (Institutions, Agencies, Member States) • Steering Committee (Institutions, Agencies, chaired by DG ENTR) • Several technical work groups (data structure, workflow, implementation support) • Since 2004 • An interinstitutional work group oversees the requests for improvements and reports to the Interinstitutional Committee for Translation and Interpretation • IATE Management Group • IATE Support and Development Team at the Translations Centre carries out corrective and evolutive maintenance
IATE in a nutshell • IATE merged all existing terminology databases of the EU’s translation services into a single database Several user friendly interfaces were implemented to cater for the needs of translators, terminologists and other EU staff. TIS Council Euterpe Parliament Others Eurodicautom COM
EURODICAUTOM (Commission) • Coverage: 11 EU official languages plus Latin • over 1.240.000 entries (c. 5 million terms) and 325.000 abbreviations and acronyms (July 1999) • Lenoch classification • Web interfaces on intranet/internet • Fed from work of Terminology Unit (based in Brussels and Luxembourg), contributions from translators systematized by Eurodicautom team and contributions supplied under contract by private companies and field experts • Updated weekly • Translators had access to unit-level MultiTerm databases (110) • Low interactivity
TIS (Council) • 11 EU languages plus Latin and Irish. • 200 000 records (600 000 terms) 45% contain 3+ languages. 25 000 records contain 5+ languages • Terminology reflects translation problems that have arisen in Council texts. • 170 subject codes • Feeding: New terms entered directly by Council's terminologists (5/6 per language division). Other users may enter comments or suggestions. New data searchable immediately (text search on all fields) • Growth rate: 4 000 translations per month.
Euterpe (EP) • 11 EU official languages plus Latin • 171 000 records • bilingual entries: 92 887 • 9-language entries: 13 556 • 11-language entries: 5 623 • Some with notes or definitions or the Latin equivalent (botany and zoology) many with the corresponding abbreviations or acronyms • A few terms (titles of political parties, ...) in non-EU languages • The approach is descriptive rather than normative. • IT platform: MultiTerm '95+ database
Euroterms (CdT) • 11 EU official languages plus Norwegian, Latin and Russian • 180,000 entries • MultiTerm database • Client interface • Interactive access for all in-house translators • Distributed on CD-ROM to freelance translators • Data validated, where possible, by domain experts in Agencies
2. IATE: Features 2.1. IATE as a collaborative workspace • On-line search • On-line data editing • Validation system • Communication mechanisms • User management • Statistics IATE Database
2.2. IATE Data Structure E.g. domain, domain note, origin, problem language, cross reference ... Language Independent Information E.g. definition, comment... Language 1 EN Language 2 FR E.g. term, term type, context, grammatical information... Term 1 Term 2 Term 1 Term 2
2.3. Data Entry and Validation • All institutional IATE users can add terminology to the database. • IATE provides a validation mechanism to ensure that all new terminology is proof-read. • The process takes into consideration the specific competencies of the people involved and caters for a review of terminological entries on different levels: spelling, content, coherence, exhaustiveness, etc. New Term Validation Stage 1 Validation Stage 2 Validation Stage ? Validated Term
2.4. Communication Mechanisms • “Marks”: to comment on specific database entries. Marks are attached to an entry and are addressed to individual users or user groups (e.g. the terminology group of a specific language division). Usually the content of the marks is an exchange of information and opinions.
2.5. Import/export functionalities • IATE has an XML exchange format based on SALT/MARTIF standards • Problem: It is too complicated to be used by linguists. • 3 new import functionalities use a simple MS Excel file, namely to: • modify existing entries (following an export) • add terms to existing entries • create new entries
2.6. Cooperation and Common Guidelines An interinstitutional work group started to organise the practical cooperation between the IATE project partners by… • Regular interinstitutional communication; • Establishing list of contact points for the terminological questions in each participating service per language; • Defining “writing rules” for the creation of new terminology; • Creation of a “Best Practice” guide establishing general principles for IATE use.
2.7. Best Practice: Examples • Usefulness of a term for multilingual drafting, translation and interpreting in the EU context • Reliability of the proposed term. • Relevance of the proposed term. • Information fed into IATE must have an added value over and above what can be found in documentary databases or on the Internet. • Responsibility of each service to apply the guidelines.
3. IATE Goes Public • Implementation of public “read-only” site started in 2005/2006 based on indexing technology used for Eurodicautom; • Hosted at the Data Centre of the European Commission • 04/2007: the Eurodicautom site was shut down, redirecting its complete user community to IATE. IATE Database Fulcrum Index
3.2. IATE Public • Opened on 28 June 2007 • Provides free of charge access to EU terminology. • IATE Public URL: • http://iate.europa.eu • IATE Mailbox: • iate@cdt.europa.eu
3.1. Terminology Workflow: from IATE to IATE Public Add Modify Validate Publish Internal http://iate.europa.eu
The average number of queries submitted per day is about 200.000 • Being 304.034 the highest number of queries for one day, recorded in April 2007 3.3. Is IATE Public actually used?
How many people use IATE? • As IATE can be used without login or a registration procedure it is not straight forward to answer this question. However, since April 2008 IATE keeps track of the internet (IP) addresses of the computers that are used to access the database. These figures indicate that the IATE user community amounts to about 220.000 people. Where is IATE used? • The answer is: around the world. As any internet service IATE is accessible around the globe and usage figures show that the database receives queries from 170 different countries.
IATE in Figures (3) Number of terms in IATE by language:
5. Current developments: IATE Web Services Basic problem: Find a way to integrate IATE in the Office environment. Solution: web services...which means: Web Service Web Service
IATE Web Services: Status 2 web services have been implemented: • Look-up of terms in IATE from a Word document; • Adding terms to (Pre-)IATE from a Word document. • Left to be done: • Finalisation of the macro prototypes for both services; • Testing in production environment.
5.1. Possible developments: term checking for Quality Assurance (QA) • A Term Checker functionality in IATE would allow us to use IATE for text analysis in the context of QA through controlled language. • Example: you have a specialised and validated set of terminology in IATE and would like to make sure this terminology has been used throughout a text. So you cross-check your text against this filtered set of data in IATE and you will get the source text marked (see example) and a report, by language, stating the terms that differ from the ones in IATE.
Does IATE solve all problems? IATE provides the technical facilities for cooperation between the EU’s translation services. It does not solve management issues, such as: • What type of data should be added to the database? • Is it possible - and desirable - to define an EU core terminology? • How can a common “culture” of terminology be created? • How can the cooperation between services be organised most efficiently? • What is the role of terminology in the EU’s translation services today, as other technologies (e.g. translation memories, internet etc.) have become available?
Main achievements • IATE is today a source of reference for terminology management systems (e.g. UN agencies) • The „added value“ of IATE – and terminology work in general – will be based on the provision of reliable multilingual informationon the EU‘s core activities.
Show IATE Interface: https://iate.eu-admin.net/iatenew/login.jsp