260 likes | 415 Views
視窗外的天空?. CALIS/HKCAN beyond Chinese cataloging by Charlene Chou May 23, 2005 at Peking University. 值得深思的問題. 在 Internet 技術加快“全球化圖書館”發展的情境下,當前的圖書館在現在和未來應當扮演何種角色? Unicode 是否可能解決所有公共目錄檢索系統( OPAC )中多語種檢索的問題? 圖書館和互聯網絡的互通點( crosswalk )在哪裏? CALIS 系統在這趨勢中的定位點爲何?. 五個層次的觀點.
E N D
視窗外的天空? CALIS/HKCAN beyond Chinese cataloging by Charlene Chou May 23, 2005 at Peking University
值得深思的問題 • 在Internet技術加快“全球化圖書館”發展的情境下,當前的圖書館在現在和未來應當扮演何種角色? • Unicode是否可能解決所有公共目錄檢索系統(OPAC)中多語種檢索的問題? • 圖書館和互聯網絡的互通點(crosswalk)在哪裏? • CALIS系統在這趨勢中的定位點爲何?
五個層次的觀點 • 中文編目與權威檔的資源共享(例如HKCAN和CALIS); • 當前美國與國際上編目規則與標準的發展狀況; • 公共目錄檢索系統(OPAC)和互聯網絡上多語種顯示和檢索的技術發展; • 圖書館和互聯網絡的互通點(crosswalk) • “全球化圖書館”—多語種抑或多文化的數字化圖書館?
中文目錄檔案的區域資源共享: HKCAN—XML Version • 中文公共目錄和姓名權威檔的交叉檢索(Cross Searching) • 通過一次性檢索,NLC,CNAD,LC/NAF和HKCAN實現了中文公共目錄與權威檔案的共享; • VIAF/LEAF模型的實現; • 雙語化的橋梁作用:彌補了LC/NAF和NLC單語檢索的局限性; • 通過OCLC,HKCAN的使用範圍得以擴展; • CALIS will be the next very soon, right?
HKCAN對LC/NAF的影響 • 處理non-unique names工作流程的調整 • Verifying Chinese names in vernacular scripts at LC and Columbia University • LC breaking undifferentiated record based on vernacular script in HKCAN • Romanized/English form in HKCAN with the qualifier of birth date • CUL adding 4XX field for variant form • Is CALIS next, esp. with OCLC access?
多語種編目系統和權威檔:發展現狀(美國) • OCLC/Connexion and RLIN • Parallel fields are available in bibliographic records for years • The key is if ILS can display vernacular scripts • LC/NAF • To include vernacular scripts in authority records in the near future? • ILS problem for implementing Unicode and others • Cataloging policies and economic perceptions to be dealt with
多語種編目系統和權威檔:發展現狀(美國) • MARBI • MARC21 has the standards for bibliographic and authority records • Currently has no further plans after 2001-DP05 • Can be communicated through the full Unicode UTF-8 character sets or limited to MARC-8 characters?
權威檔控制:當前發展與挑戰 • Economic concerns in USA • LC/PCC access records for Web resources • Subject vs. descriptive access • Costly for authority control • Multi-level policy: controlled vs. uncontrolled headings • FRANAR—international efforts & standards • UBCIM—Minimal Level • Shift from ISADN to VIAF & INTERPARTY • VIAF: linking is key to retain flexibility and local autonomy
編目系統的整合性發展的時代 • ISSN: international harmonization for continuing resources • FRBR and AACR3 (now called RDA after JSC meeting in May) • FRBR for serials: MFHD/ISSN/FRBR->super record or authority record for serials • FRBRized display in OPAC: collocation • Contents vs. carrier • Grouping multi-manifestation records (only 20% records in OCLC WorldCat)
數字化項目的整體性協調(美國) • Digital Library Federation • Clearinghouse for all DL projects in US • Sharing and communicating among projects • Digitization: Registry of Digital Master • CONSER will seek communication for cataloging policy
Unicode在公共目錄檢索系統(OPAC)中的顯示 • ILS vendors: active interest in recent years • MARC21 spec now support Universal Coded Character Set • Mapping to Unicode from MARC character sets completed • Unicode fonts: more characters available now • Newer operating system: support Unicode now
Unicode: 在ILS系統中實現的障礙 • Which scripts have been implemented • ILS implement language by language/Which scripts are supported, e.g. Arabic, but may not support the additional extended Arabic characters in such languages as Persian • How sorting of particular language is handled • Depending on the language • Which encoding formats are supported to import/export records in Unicode
Unicode • Not complete solution for language support in library system • A foundation to build and will make development for multiple language much easier • A solution for now, maybe new technology will have a better solution in future?
搜索引擎對OPAC學科檢索的影響 • CSULA transaction logs’ analysis • Next generation of the OPAC interface • Concept-based search: allow natural-language searching with keyword search first, and focus on quick-search need • Relevance feedback: bring back related pages • Spelling correction • Relevance-ranked output
互聯網絡上的跨語言檢索 • Multilingual dictionary: • Google definitions are multilingual, e.g. definitions of voip on the Web in all languages--Chinese (Simplified), English, French, German, Italian, Russian, and Spanish • Translation tool for website • Babel Fish, etc. • Translate entire or partial website by user’s choice • User’s choice of language on website
互聯網絡上的跨語言檢索 • Google--Multilingualwebmaster.com • How to implement the translation process of the search engine? • From Translation vendors in multiple languages to direct communication with the translators--Individual translators specialized in a given language • Self-service translation application • Localize the content of its multilingual site • Currently support: native interface in14 languages (+English) & search in 25 languages (+ English)
互聯網絡上的跨語言檢索 • Open Language Archives Community (OLAC)--Specialized OAI sub-domain • Features of the search engine include: • A thesaurus of alternative language names • Language code searching • Keyword-in-context display in search results • Search for similarly spelled words • Search for similar items • Support for standard string search operators and domain-specific inline syntax • Automatically derived search links for other web search engines • Contribution: the inclusion in the search engine results of a metadata quality-centric sorting algorithm
多語種檢索與翻譯的比較 • Currently Web browser displays multi-scripts and fonts; Babel Fish for translating website; Website has language choice; Speech recognition software available • Translation machine is just one component of complex multilingual system (John White) • Multilingual metadata is a cross-cultural retrieval--far more than mere cross-language searching (Cliff Lynch)
圖書館門戶(library portals)和搜索引擎的交叉點(Crosswalk) • Why Google is courting libraries for scanning the text of collections? • Web OPAC, Hot-link to resources, OpenURL, etc. linked to Google, and Amazon, etc? • Searching: precision vs. recall • Library and Google Scholar • Future OPAC: Topic maps, controlled vocabulary, preferred languages, available translation and edition (RLG & Google), and online seller
基於FRBR聯合目錄的語義網( Semantic Web ) • IFLA’s FRBR: a semantic expression of the relationships between items in the library catalog • Berners-Lee’s ultimate vision for XML: the Semantic Web—layering a logic layer over the top—info on WWW to have logical or semantic relationships • RDF—triples—two nodes (one subject & one object), connected with a predicate relationship->DAML+OIL; Topic Maps: triples—topics, associations and scopes • MARC XML open to web harvesters, such as OAI • FRBR, RDF, DAML+OIL, or Topic Maps: all used to set up a FRBR 本體論(ontology) • RDF: can serve as a semantic layer to make FRBR expressive and be layered over MARC XML
權威檔控制:can be 語義圖(semantic map) 的building block and beyond • Tillett’s perspectives on the future: • VIAF: an integral part of a future Semantic Web • Ontology: infrastructure and controlled vocabularies; can be used to enable displays in the user’s own language and script • Help users of the Web for collocation and search precision • Tools to connect: biographical dictionaries, telephone directories, A&I, etc.
全球化圖書館 • Language is an important perspective or a major barrier, but there are much more… • Multilingual searching across diverse collections • Five layers of technology (Noriko Kando) • The pragmatic layer: cultural and social aspects, convention; the semantic layer: concept mapping; the lexical layer: language identification, indexing; the symbolic layer: encoding issues; the physical layer: the network
全球化圖書館 • Multilingual, multicultural and multimedia digital libraries • HKCAN is bilingual but may be multicultural in certain contexts; CALIS can be more diverse • MARBI 2001-DP05: rename as “context-sensitive” authority record • From multilingual to multi-context? • Global vs. localized library • VIAF/LEAF—”glocatization” movement? • Google: multilingual vs. localized site
CALIS未來發展方向 • OCLC in China or much more… • Sharing Chinese bibliographic and authority records from Asia to other continents • Wider range of libraries: beyond academic libraries • Multilingual choice for international users • Wider range of users: from librarians to students • Hotlinks to extended resources
CALIS未來發展方向 • Leading research in library and information science, such as FRBR projects in OCLC • Digitizing more historical materials • Supporting use of rare book collections • Expanding from subject-oriented collections to Semantic Maps for Chinese collections and information • What is the vision of CALIS to participate in the global library?