Emerging Technologies for Knowledge
Resource Management
CHANDOS
INFORMATION PROFESSIONAL SERIES
Series Editor: Ruth Rikowski
(email: [email protected])
Chandos’ new series of books are aimed at the busy information professional. They have
been specially commissioned to provide the reader with an authoritative view of current
thinking. They are designed to provide easy-to-read and (most importantly) practical
coverage of topics that are of interest to librarians and other information professionals. If
you would like a full listing of current and forthcoming titles, please visit our web site
www.chandospublishing.com or contact Hannah Grace-Williams on email
[email protected] or telephone number +44 (0) 1865 884447.
New authors: we are always pleased to receive ideas for new titles; if you would like to write
a book for Chandos, please contact Dr Glyn Jones on email [email protected]
or telephone number +44 (0) 1865 884447.
Bulk orders: some organisations buy a number of copies of our books. If you are
interested in doing this, we would be pleased to discuss a discount. Please contact
Hannah Grace-Williams on email [email protected] or telephone number
+44 (0) 1865 884447.
Emerging Technologies for
Knowledge Resource
Management
M. PAUL PANDIAN
AND
C.R. KARISIDDAPPA
Chandos Publishing
Oxford · England
Chandos Publishing (Oxford) Limited
Chandos House
5 & 6 Steadys Lane
Stanton Harcourt
Oxford OX29 5RL
UK
Tel: +44 (0) 1865 884447 Fax: +44 (0) 1865 884448
Email: [email protected]
www.chandospublishing.com
First published in Great Britain in 2007
ISBN:
978 1 84334 370 7 (paperback)
978 1 84334 371 4 (hardback)
1 84334 370 3 (paperback)
1 84334 371 1 (hardback)
© M. Paul Pandian and C.R. Karisiddappa, 2007
British Library Cataloguing-in-Publication Data.
A catalogue record for this book is available from the British Library.
All rights reserved. No part of this publication may be reproduced, stored in or introduced into
a retrieval system, or transmitted, in any form, or by any means (electronic, mechanical,
photocopying, recording or otherwise) without the prior written permission of the Publishers.
This publication may not be lent, resold, hired out or otherwise disposed of by way of trade in
any form of binding or cover other than that in which it is published without the prior consent
of the Publishers. Any person who does any unauthorised act in relation to this publication may
be liable to criminal prosecution and civil claims for damages.
The Publishers make no representation, express or implied, with regard to the accuracy of the
information contained in this publication and cannot accept any legal responsibility or liability
for any errors or omissions.
The material contained in this publication constitutes general guidelines only and does not
represent to be advice on any particular matter. No reader or purchaser should act on the basis
of material contained in this publication without first taking professional advice appropriate to
their particular circumstances.
Typeset by Domex e-Data Pvt. Ltd.
Printed and bound in Great Britain by 4edge Ltd, Hockley. www.4edge.co.uk
List of figures and tables
Figures
1.1 An ideal distributed library environment 8
2.1 Growth of electronic information 20
2.2 Conceptual framework for digital libraries 27
2.3 Functional components of a digital library 28
2.4 Major system components of a digital library 29
3.1 Shibboleth authentication system 51
3.2 Resource description process 52
3.3 Multiple service providers (OAI-PMH) 64
3.4 Aggregators (OAI-PMH) 65
3.5 Harvesting combined with searching 65
3.6 Z39.50 session 70
3.7 Z39.50 web-based session 71
3.8 OpenURL process flow 74
4.1 Complex information environment 80
4.2 Traditional digital collection model 83
4.3 Ideal digital collection model 83
4.4 Distributed heterogeneous library environment 85
4.5 Harvesting vs federation 91
4.6 Importing metadata into the repository using OAI-PMH 94
4.7 Exporting metadata from the repository 94
4.8 Overview of Daffodil architecture 107
4.9 Overview of Decomate II architecture 109
4.10 Overview of Marian architecture 111
4.11 Overview of TEL architecture 113
4.12 Informia’s internal architecture 115
4.13 Informia’s three-tier mediated architecture 115
4.14 Architecture of the Arc system 118
4.15 OpenSiteSearch architecture 121
5.1 Unified portal system environment 128
5.2 UPS system components 130
5.3 Overview of UPS architecture 139
5.4 UPS–user interface environment 141
5.5 UPS–user interface flow chart 143
5.6 UPS–librarian interface environment 144
5.7 UPS–system interface environment 146
5.8 UPS–search interface flow chart 149
5.9 UPS metadata repository 150
5.10 UPS metadata harvesting process 150
5.11 Harvesting metadata from diverse sources 151
5.12 UPS federated search system 152
5.13 UPS Z39.50 model of information retrieval 153
5.14 UPS Z39.50-based search session 153
5.15 Schematic diagram of OpenURL environment 155
5.16 UPS OpenURL search session 156
5.17 UPS resource control system 158
Tables
2.1 Digital library environment 26
3.1 Dublin Core element set 57–9
3.2 Dublin Core qualifiers (types) 60–1
3.3 OAI-PMH requests (verbs) 64
4.1 Metadata harvesting tools 96–100
List of abbreviations
A&I Abstracting and Indexing (service)
AACR2 Anglo-American Cataloguing Rules, 2nd edition
AFS/NFS Andrew File System/Network File System
ANSI American National Standards Institute
API Application Program Interface
ARL Association of Research Libraries
CAI Common Access Interface
Caltech California Institute of Technology
CDL California Digital Library
CENL Conference of European National Libraries
CERL Consortium of European Research Libraries
CIMI Consortium for the Computer Interchange of Museum Information
CORBA Common Object Request Broker Architecture
DBLP Digital Bibliography and Library Project
DC Dublin Core
DCMI Dublin Core Metadata Initiative
DL Digital Library
DN Distinguished Name
DNS Domain Name System
DOI Digital Object Identifier
DSA Directory System Agent
DSTC Distributed Systems Technology Center
DTD Document Type Definition
EAD Encoded Archival Description
FDL Federated Digital Library
FTP File Transfer Protocol
GIL GALILEO Interconnected Libraries
GIS Geographic Information Systems
HPSS High Performance Storage System
HTML Hypertext Markup Language
HTTP Hypertext Transfer Protocol
IATLIS Indian Association of Teachers of Library and Information Science
IFLA International Federation of Library Associations
IIM Indian Institute of Management
ILL Inter-Library Lending
ILS Integrated Library System
INFLIBNET Information and Library Network
IP Internet Protocol
IR Information Resource
ISO International Organization for Standardization
IST Information Society Technologies
KOBV Cooperative Library Network for Berlin and Brandenburg
LCSH Library of Congress Subject Headings
LDAP Lightweight Directory Access Protocol
LIS Library and Information Science
LOC Library of Congress
MACE Middleware Architecture Committee for Education
MAP Millennium Access Plus
MARC Machine-Readable Cataloguing
METS Metadata Encoding and Transmission Standard
MIR Meta-Information Repository
NCIP NISO Circulation Interchange Protocol
NISO National Information Standards Organization
NLM National Library of Medicine
OAI Open Archives Initiative
OAI-PMH Open Archives Initiative Protocol for Metadata Harvesting
OCLC Online Computer Library Center (formerly Ohio College Library Center)
ODBC Open Database Connectivity
OPAC Online Public Access Catalogue
RDF Resource Description Framework
RFID Radio Frequency Identification
ROI Return on Investment
SDLIP Simple Digital Library Interoperability Protocol
SGML Standard Generalized Markup Language
SIP2 Standard Interchange Protocol Version 2
SOAP Simple Object Access Protocol
SP Service Providers
SQL Structured Query Language
SRU Search and Retrieval via URL
SRW Search and Retrieve Web Service
SSI System Service Interface
TEL The European Library Project
TKL The Keystone (Digital) Library
UCP Universal Computer Protocol
UPS Unified Portal System
URI Uniform Resource Identifier
URL Uniform Resource Locator
URN Uniform Resource Name
W3C World Wide Web Consortium
WWW World Wide Web
XML eXtensible Markup Language
XQL XML Query Language
XSL Extensible Stylesheet Language
XSLT Extensible Stylesheet Language Transformations
About the authors
M. Paul Pandian obtained his PhD from Karnatak University, Dharwad,
India, and an Associateship in Documentation and Information Science
from the Documentation Research and Training Centre, Indian
Statistical Institute, Bangalore, India. He is currently Head of the Library
and Information Resource Centre at the Institute of Mathematical
Sciences, Department of Atomic Energy, Chennai, India, where he is
implementing an RFID-based system for the library. He was previously
the Head of the Library and Information Resource Centre at the Indian
Institute of Management, Indore, India, and a member of the core team
responsible for setting up a campus-wide information
system for the IIM. He has also worked as a scientist at the INFLIBNET
Centre, University Grants Commission, India, where he was responsible
for developing the online union catalogues of participating libraries at
INFLIBNET. As a course coordinator at INFLIBNET, he designed
and developed course materials for a six-week residential course on the
applications of computer and communication technologies in libraries
for library executives and information scientists. He has also
contributed several research articles on library and
information science to a number of journals and presented papers at
national and international conferences.
The author may be contacted at:
Dr M. Paul Pandian
Head, Library and Information Resource Centre
Institute of Mathematical Sciences
CIT Campus, Taramani
Chennai 600 113
Tamil Nadu
India
E-mail: [email protected]