500 likes | 683 Views
FRBR and RDA -- What they are How we got to where we are now How they may affect our future A presentation for the MOBIUS Conference June 6, 2012, Columbia, Missouri Charles Croissant, Senior Catalog Librarian Pius XII Memorial Library, Saint Louis University croisscr@slu.edu.
E N D
FRBR and RDA -- • What they are • How we got to where we are now • How they may affect our future A presentation for the MOBIUS Conference June 6, 2012, Columbia, Missouri Charles Croissant, Senior Catalog Librarian Pius XII Memorial Library, Saint Louis University croisscr@slu.edu
Functional Requirements for Bibliographic Records (FRBR) an overview • An attempt to develop an conceptual model that could express a shared (international) understanding of what bibliographic records should beand what they should be expected to accomplish.
Impetus: recognition of the need to make optimal use of all the opportunities for shared cataloging in the new online environment, plus a need to have a rational response to all the economic pressures libraries were facing to reduce the cost of cataloging • In its beginning, an attempt to form consensus at the international level on just how minimal a record could be and still perform its necessary functions. • Now seen as a theory of cataloging.
An initiative of IFLA, the International Federation of Library Associations • Began life at the 1990 Stockholm Seminar on Bibliographic Records • Years of research by a task force of IFLA’s Section on Cataloguing. • Some “big names” on the Task Force: John Byrum (Library of Congress), Dorothy McGarry (UCLA), Tom Delsey (National Library of Canada), Elaine Svenonius (UCLA), Barbara Tillett (Library of Congress) • Presented to IFLA and approved in 1997. • Has become our new theoretical model for cataloging, and the basis for the forthcoming cataloging code, Resource Description and Access (RDA)
Starting point: The user tasks • Find (a resource that meets certain criteria) • Identify (make sure the resource is the one you want and not some other, similar resource) • Select (from a number of possible resources, select the one that is most useful to you in your particular situation) • Obtain (get it from its current location to a place where you can use it – gaining access to the resource) • (navigate) (being able to make your way through a catalog, a search engine, a Web site to find what you want)
FRBR poses the question: What data and what kinds of data structures are required so that a user can successfully perform the user tasks?
The FRBR Model • Is an “entity-relationship” model • STEP 1:identify all the possible entities in the process you are analyzing. • STEP 2:Identify all the attributes your entities can possess. • STEP 3:Identify all the possible relationships that can exist between any two of your entities.
The groups of entities: • Group 1: The “bibliographic” entities – Work, expression, manifestation, item. • Group 2: The “agent” entities – individual persons, corporate bodies. • Group 3: The “subject” entities – concepts, events, places, etc., plus any of the Group 1 and Group 2 entities.
WEMI (“work, expression, manifestation, item”) The official definitions of the Group 1 bibliographic entities (quoting from the IFLA document): • Work = “a distinct intellectual or artistic creation.” • Expression = “the intellectual or artistic realization of a workin the form of alpha-numeric, musical, or choreographic notation, sound, image, object, movement, etc., or any combination of such forms.”
Manifestation= “the physical embodiment of an expressionof a work.” • Item = “a single exemplar of a manifestation... a single physical object.” • See examples in the FRBR text: FRBR, Section 3.2
The Group 1 entities exist between two poles: • Entirely abstract: the Work (think “Platonic ideal”) • “A work is an abstract entity; there is no single material object one can point to as the work”—IFLA’s FRBR document, section 3.2.1. • Entirely concrete: the Item
Group 1 entities and the Primary Relationships • Work is realized through • Expressionis embodied in • Manifestationis exemplified by • Item
The “responsibility” relationships • These are the relationships that link: • an entity from Group 2 (person or corporate body) to • an entity from Group 1 (work, expression, manifestation, item)
The responsibility relationships include: • Created by (authorship) • Relationship between a person and a work • Realized by • Relationship between a person and an expression:stage designer, translator, performer of a musical work
Produced by – • Relationship between a person or corporate body and a manifestation:publishing house, sound recording label • Owned by – • Relationship between a person or corporate body and an item:library and a copy of a book; person and a book they own
Attributes Each of the entities in the FRBR model is defined by the attributes it possesses. A workhas some or all of these attributes (among others): • Title of the work • Form of work (i.e. genre; the class to which the work belongs: novel, play, poem, concerto, map, etc.) • Date of the work (date of its creation) • Intended termination (finite? Or continuing on?) • Intended audience • Medium of performance (i.e. for a musical work)
Attributes, con’t. An expression has some or all of these attributes (among others): • Title of the expression (may differ from the title of the work, e.g. in the case of translation into a new language) • Form of the expression (e.g. alpha-numeric, musical notation, musical sound, photographic image, dance) • Date of the expression • Language of the expression
Attributes, con’t. Attributes of a manifestation: A manifestation is a physical object, so its attributes correspond to the elements of a bibliographic description: • Title of the manifestation, statement of responsibility. • Edition statement. • Place of publication, publisher, date of publication. • Form of carrier (book, CD, DVD, microfilm) • Extent of carrier (e.g. for a book, number of pages, height of book) • Mode of capture (e.g. for a sound recording, analog or digital)
Attributes, con’t. Attributes of an item: The item is a specific copy of a manifestation. Its attributes can include: • Item identifier (a number or code uniquely associated with the item, like a call number or bar code) • Provenance (who are the current and previous owners of the item?) • Physical condition of the item (undamaged? damaged in some specific way? brittle paper?) • Restrictions on access (may it only be used by particular persons, or in a particular location?)
Leo Tolstoy’s novel War and Peacein the FRBR model AGroup 2 entity, a person: The person Leo Tolstoy Has a relationship as creator to A Group 1 entity, a work(w¹ ): his novel War and Peace, in its abstract form as Tolstoy’s own distinct intellectual creation Expressions: e¹ = Tolstoy’s own manuscript of the novel e² = The Russian text prepared for the publication of the first edition e³ = The English translation made by Constance Garnett
A Group 1 entity, the person Constance Garnett Has a relationship as translator to e³ Manifestations: m¹ = the first Russian publication of the complete novel, in Moscow in 1869 under the title Война и мир (Voyna i mir). m² = Constance Garnett’s English translation, as published by W. Heinemann in London in 1904. Items: i¹ = a copy of m¹ held by the Harvard University Library. i² = a copy of m² held by the British Library.
Compare and contrast –FRBR vs. our current data model Current data model is based on a “flat” data structure All the information that applies to a particular manifestation is contained in a single record – information that pertains to a whole set of records is essentially re-keyed every time it is needed. FRBR points us toward a “decentralized” data structure based on entities and relationships Data is distributed among a number of records that are then linked together as required to describe a particular manifestation.
Author Tolstoy, Leo, graf, 1828-1910. Uniform Title Voĭna i mir.English Title War and peace / Leo Tolstoy ; translated from the Russian by Richard Pevear and Larissa Volokhonsky. Edition 1st ed. Published New York : Alfred A. Knopf, 2007. Description xviii, 1273 p. ; 25 cm. Bibliography Includes bibliographical references (p. [1223]-1247) and index. Subjects Russia -- History -- Alexander I, 1801-1825 -- Fiction. Napoleonic Wars, 1800-1815 -- Campaigns -- Russia -- Fiction. Related Author Pevear, Richard, 1943- Volokhonsky, Larissa. ISBN 9780307266934
The “exploded record” concept • “Disassembling” or “deconstructing” a bibliographic record into a set of separate data elements • Data elements that represent a particular entity are each recorded in separate records; the possible relationships between entities are also treated as data elements • To create a bibliographic record, the cataloger (or some other intelligent system) chooses a set of entity records and defines the relationships between these entities, such that a computer system can assemble all the various data elements into a record that a user can read and interpret.
Each record for a data element has a one-to-one relationship to the entity or the relationship it represents: • information about the entity or relationship can be added or revised, but the one-to-one correspondence between record and entity/relationship remains stable. • Each data element record can be linked with others in a multitude of ways so that each item in the bibliographic universe can be described by a unique combination of data elements.
“War and Peace” in the “exploded record” model (cascading records) • A person record for Tolstoy links to a record for his work, War and peace, with a link that expresses “Creator” • The workrecord links to an expressionrecord for Constance Garnett’s English translation (which includes a link to a person record for Constance Garnett, where the link defines Garnett as “Translator”) • The expression record links to a manifestation record for the book published by W. Heinemann in London in 1904 • The manifestation record links to an item record for the copy in Harvard’s library
Created/Created By WORK Anna Karenina Novel PERSON Tolstoy, Leo 1828-1910 Russian novelist Created/Created by WORK Voĭna i mir Novel Is realized through EXPRESSION War and peace (English translation) Translated PERSON Constance Garnett 1861-1946 Role: literary translator Is embodied in MANIFESTATION 1904 English edition of War and peace Is exemplified by ITEM Copy of Heinemann’s 1904 edition held by Harvard Library Publishes CORPORATE BODY William Heinemann Ltd. English publishing house
FRBR has the potential to move our library data out to where it can be accessed in the Semantic Web – right now so much of our library data is difficult to access because of the data formats we use (and no one else uses) • Once our bibliographic data is recorded as data elements linked by defined relationships, it is much easier for a computer to interpret.
Example of machine interpretation: a “related work” situation: • work1 -- Jane Austen’s novel, Pride and prejudice • work2 --The BBC video with Colin Firth and Jennifer Ehle, released in 1995
Our bibliographic record for the BBC video contains the note: Based on the novel by Jane Austen. Meaningful to a human, meaningless to a machine. BUT: In FRBR environment:
Work record 1for Jane Austen’s novel • Work record 2 for the BBC video • the 2 records are connected by a link with a defined value: • HasAdaptation/IsAdaptationOf This can be understood and used by a computer. Eventually this model could allow patrons to pose questions to our catalogs in natural language.
Why adopt the FRBR model? • It should better serve the user tasks • Especially helping the user to better find and identify: • works of prolific authors, and all their expressions • works that exist in many versions (expressions or related works) • Increased efficiency – move beyond the re-keying of data
Why adopt FRBR, continued • Increased potential for international sharing of data • Language neutrality – entities and relationships would have stable ID numbers; each entity and relationship can be assigned any number of language forms that can be switched in and out depending on the language environment of the user. • Increased potential for machine interpretation and manipulation of data – more in tune with formats used by other metadata communities – more in tune with the Web environment and the coming Semantic Web.
FRBR and the Semantic Web RDF = Resource Description Framework The foundation document for the Semantic Web. Like FRBR, it is an entity-relationship model.
A basic piece of RDF: the triple subject – predicate – object Natural language: “The sky is blue.” Triple statement: subject: resource (entity) = “sky” + predicate: has color + object: value = blue Each element of the triple is identified by a URI (Uniform Resource Identifier)
RDF: subject – predicate – object FRBR: Subject: Entity 1 (person) = Leo Tolstoy + Predicate: is creator of + Object: Entity 2 (work) = War and peace (each element of this triple would be identified by a URI)
RDA (Resource Description and Access) • an attempt to build a cataloging code on the FRBR model • departs completely from the model used in AACR2, which was based on the International Standard Bibliographic Description (ISBD) • Areas of the Description, Material Types, Access Points
RDA follows FRBR: • Identifies all the entities concerned. • Decides which attributes of those entities should be recorded. • Instructs the cataloger how to record each of the chosen attributes. • Defines the possible relationships between its entities. • Instructs the cataloger how to record the chosen relationships.
Sections 1 through 4 for recording Attributes of: • Manifestation and item • Work and expression • Person, family, and corporate body • Concept, object, event & place(N.B. Section 4 – dealing with the Group 3 Subject entities – hasn’t been written yet) Section 5 for recording Primary Relationships between work, expression, manifestation and item.
Section 6 for recording Relationships to persons, families and corporate bodies. Section 7 for recording Relationships to concepts, objects, events, and places (this covers the area of subject access -- like Section 4, this section has yet to be written) Section 8 for recording between works, expressions, manifestions, and items (i.e. relationships beyond the primary ones recorded in Section 5 – these are “related work” situations, etc.)
Section 9 for recording Relationships between persons, families, and corporate bodies. Section 10 for recording Relationships between concepts, objects, events, and places (like Sections 4 and 7, this section has yet to be written)
Controversies: • The testing and adoption process: Timeline and decision-making process, the national test • Cost – online publishing model – technological haves and have-nots • Retraining • Who’s going to develop software that would allow the linking of data elements that RDA expects?
The “Three scenarios” for adopting RDA • Scenario 1: The “catalog of the future”: fully FRBR-ized; fully realized linking of records for all data elements; utilizes object-oriented database structures. • Scenario 2: A “middle way”: utilizes linking between authority records and bibliographic records, but doesn’t extend as far as the “full FRBR-ization” of Scenario 1 (this scenario is already available in some ILS vendors’ products; is also present in European systems) • Scenario 3: The U.S. status quo: using RDA, but continuing to catalog in the flat record structure of our existing MARC format. Headings in bibliographic records are backed up by authority records, but there are no links between them.
Current timeline: Library of Congress announced in March, 2012, that its target day for implementation of RDA is March 31, 2013. Most US libraries will adopt this target day as well. Library of Congress has a task force charged with making plans for a new data format to replace MARC – stay tuned for further developments.
In the meantime, we’ll still be using MARC as our data format for years to come. Initial differences between AACR2 and RDA are minor and cosmetic: No more abbreviating No more [S.l.] – instead: [Place of publication not identified] No more p. or ill. – instead: 324 pages : illustrations No more use of [sic] – record what you see, make a note as needed.
RDA drops the use of the General Material Designation (GMD), that is, subfield h in the 245. Instead, there are 3 attributes that should be recorded in RDA records. New MARC fields (336, 337, 338) have been created for these data elements: 336 = Content type – text, performed music, two-dimensional moving image, etc. 337 = Media type – indicates the type of equipment needed to access the resource, for example: audio, video, unmediated (i.e. a book – no equipment required) 338 = Carrier type – volume, audio disc, audio cassette, video disc, microfilm, etc.
Why drop the GMD? In the FRBR context, the conventional terms we have used as our designators are ambiguous and can’t be consistently mapped to a single level of the bibliographic hierarchy. Content type is an attribute of the expression. Media type and Carrier type are attributes of the manifestation. However, a GMD like [Sound recording] can be understood to refer to content type, but it could equally be meant to refer to media type or carrier type.
Log in to RDA Toolkit • http://access.rdatoolkit.org/ RDA record in SLU’s catalog: OCLC 689548773 Questions and Discussion Contact: croisscr@slu.edu