240 likes | 254 Views
I nfoscience EPFL’s Institutional Repository … and much more. Objectives Means Content & Services Infoscience vs OAI PhDTheses@epfl Next steps forward. David Aymonin Directeur de l’Information Scientifique et des Bibliothèques. Objectives.
E N D
InfoscienceEPFL’s Institutional Repository… and much more Objectives Means Content & Services Infoscience vs OAI PhDTheses@epfl Next steps forward David Aymonin Directeur de l’Information Scientifique et des Bibliothèques
Objectives • Collect and make known the EPFL intellectual heritage, i.e. its scientific and teaching output • Make researchers and their skills more visible • Make the scientific data collected more accessible and legible,by structuring them • Allow their long term preservation • Allow their processing for the assessment needs of the institution
Means • Human resources • Full-time project leader, new position • Ad-hoc team : EPFL staff, when needed • Technical choice • Based on CDSWare, XMLMARC,Python language • Official partnershipwithCERN for CDSWare software development
Content, 1st june 2005 Searches : 793/day, ±24 000/month Exports : 188/day, ± 5 600/month Fulltexts : 114/day, ±3 500/month Infoscience Scientific outputs Theses People@epfl Union catalogue 4 799 references 1 500 fulltext 26 laboratories 40 000 references. 11 libraries * 3 274 references 733 fulltext 755 profiles of researchers * : 400 000 references from 11 EPFL libraries, members of NEBIS Union catalogue will be added on october 2005
Local database Services : Data re-use Old version. LANOS Lab. Website
infoscience Services : Data re-use New version. LANOS Lab. Website
Services : People@epfl Old version. LANOS Lab. Directory
Services : People@epfl New version. LANOS Lab. Directory
Infoscience & OAI • Freely accessible PhD theses are declared in OAIster • Already tried to declare Infoscience in Scirus, Google scholar… Not as simple as it should be • Next : ISI Web citation indexhttp://scientific.thomson.com/news/newsletter/2005-02/8264025/ • Regarding OA, EPFL attitude is « moderate » • Variable from one lab. to another, Open access still frightens • Advocacy OAI will be done in 2005 • Official statement from Conférence des Universités Suisses awaited and could help
PhDTheses@epfl • 1920 Paper archiving at the Central library. 3000 PhD theses • 2003 Electronic archiving made possible • June 2004 Electronic archiving compulsory. 200 PhD theses /year • Retrodigitalization of all PhD theses, started end 2004 600 000 pages. 300 dpi, B&W or 150 dpi, grey levels for color pages TIFF provided, PDF 1.4 image online OCR for liminary pages of 2000-2004 theses Total amount of data 15 Gb
Workflow , PhDTheses@epfl Information: Final version released PhD Student Academic registration service EPFL THESE File FINAL VERSION PDF or Postscript Asking for Authorisation to put PhD these on the Internet, if NOT, then on Intranet Printing service EPFL Print version N copies PDF file Libraries : National, ETHZ Central Library EPFL File processing, Putting online Swiss National Library Metadata
Metadata processing, PhDTheses@epfl 2 Catalogues (unfortunately) Cataloguing in NEBIS OAI-PMH copy and paste INFOSCIENCE CDSWare OAI enabled Filemaker Web database DTD RERO Data enrichment Loading XML records With abstracts Link to Filemaker web record Abstract Fulltext Abstract
PDF files processing, PhDTheses@epfl Made by the central library for each these Creation of final • Frontpage (in PDF) • Abstract (in PDF + HTML) • TOC (in PDF) Optimization of heavy PDF files Security • Modification not allowed • Printing, searching , copy allowed File metadata • Size of file, links to abstracts and fulltext, • Number of pages
The future is now, PhDTheses@epfl • Main issues • Intellectual Property Rights (IPR) • Metadata and file formats • Harvesting and visibility • Master dissertations
The future is now, PhDTheses@epfl • IPR • Belongs to the Author • Electronic archiving made compulsory upon student registration • Changes in swiss law in 2005 Putting theses on the Internet Is not considered as prior publication
The future is now, PhDTheses@epfl • Metadata and file format • Metadata • Swiss « DTD » ? (Convention in 2003) • DTD-MS from NDLTD ? (already exists) • TEF from AFNOR ? (awaited for 2005) • PDF • PDF/A ? (ISO Standard in 2005?) Standards could appear in 2005
The future is now, PhDTheses@epfl • Harvesting and visibility • OAIster • RERO, EPFL, ETHZ already included • Good Search interface • Not specific to theses • Should we join NDLTD ? • 195 members : UK, B, S, D, E, CN, AU, China, India • Very poor Search interface, for the moment • European Thesis On Line (ETOL, Europe) • Just at its begining Militate in favour of Switzerland member of NDLTD
The future is now, PhDTheses@epfl • Master dissertations • The next step • Will require along and difficult institutional agreement to bep fully set up • In each of the 12 academic sections of EPFL collaboration with librarians, who are in contact with the teachers The Infoscience tool is robust And allows us to work with voluntary people !
Long live OAI ! Thank you for your attention http://infoscience.epfl.ch David.Aymonin@epfl.ch