260 likes | 387 Views
Different communities with different historical traditions TC 37: linguistics, terminology studies, translation, information science, engineering SC 32/WG 2: computer science, information systems, engineering, information science Differences in scopes and thematic coverage
E N D
Different communities with different historical traditions TC 37: linguistics, terminology studies, translation, information science, engineering SC 32/WG 2: computer science, information systems, engineering, information science Differences in scopes and thematic coverage Shared interests and goals Terminological precision and transparence in communication and information processes Interoperability in heterogeneous information envirionments Commonalities and differences
Variant usage between TC 37 and JTC 1/SC 32 with regard to data category and data element
Computer Assisted Terminology—Data Categories Too many data categories Commonly used data categories scattered through the standard Application areas Official term banks, language planning organizations, standards organizations Major corporations, small and medium businesses Individual terminologists, translators, and technical writers Localization industry Government agencies ISO 12620:1999
Need for clear guidance on application-specific use of the data categories Thematic categorization in 12620:1999 No recommended sample sets Issue of data modeling variance Alphabetical ordering in the new on-line version Addition of data categories for other language resources Accessibility Issues
Difficulty in achieving consensus on logical ordering Addition of new working environments in TC 37 Lexicography Morpho-syntactic markup Semantic markup NLP lexicons SKOS environment Global Data Category Registry (DCR)
Core standard: ISO 12620 Rules for establishing a Data Category Registry as a metadata registry (DCR) Registration authority Maintenance of the DCR on the Web Need for Guide to the data categories in order to facilitate clear and easy usability Web Accessibility
National, regional, and local groups establishing databases and data banks to serve the public interest on a number of levels Standardizers and other domain-specific experts Companies, enterprises, and governmental agencies Individual terminologists, translators, technical writers, and students creating their own terminology resources Terminology Management Environments and Aims
Monolingual and multilingual Specialized design or off-the-shelf software Constraints involving data input, storage, retrieval, and information output Needs of various stakeholders Related resources Thesauri, taxonomies, and ontologies Semantic Web environments Classification Criteria
Replacement for ISO/TR 12618:1994, Computer aides in terminology – Creation and use of terminological databases Variation in database structures Simple glosses Translation-oriented terminology management Complex multi-application approaches Criteria for a Guide to Terminological Data Categories
terms (including all source-language synonyms and equivalents in other languages) classification of term types (e.g., synonyms, variants, full and abbreviated forms) term-related information (grammar, etymology, register, status) descriptive information definitions, contexts examples, notes, and graphic information administrative information identifiers of various sorts dates, responsibility entry status sources, combinable with terms themselves, definitions, contexts, notes, and other pieces of information Basic Data Category Groups
ISO 10241:1992 (and revision), Preparation and layout of international terminology standards Primarily standards-oriented ISO 12616:2002, Translation-oriented terminography Traditional format, main-entry-term-oriented Not focused on the principle of term autonomy Issues with granularity and data element elementarity Other Guides to Data Category Selection
Data modeling variance and term autonomy n conjunction with the application of the closed data category termType Locating data categories throughout the collection Locating non-standard synonymic data category names for standardized names (use for…) Providing sample “common” data category layouts Overcoming “presentational distance” between different data category groups Issues for a New Guide
False friend Degree of equivalence Transfer comment Anchor at language or term-section level Directionality Translation-Related Dat-Cats
Term status (preferred, admitted, deprecated) Regulated terms (legal regulations) Suggested terms Process status (of the term entry itself) Administrative status (of term planning phase) Standardization & Language Planning
Terminological concept systems Controlled vocabularies (thesauri, etc.) Classification systems Ontologies and taxonomies Expression in SKOS (Simple Knowledge Organization System markup) Distinction between generic systems Subsumation, is-a relations … and non-generic, non-subsumation systems Meronymic (partitive) relations All other relations Links and Relations
Internal links to other entries Antonym Entailed term See See also Concept system links Links to other objects Corpus trace (link to term in context) Concordance links Shared resource links, etc. Sources & source identifiers Links to Other Entries & Objects
XML:lang as a language identifier XML:lang applied as a common locale identifier (UTS#35) Transcription, transliteration, Romanization Source & target language identification Language-Related Links
Word/term-related data categories Standard “lexicography” categories (e.g., etymology, definition, etc.) Administrative categories Shared Data Categories