Table Of ContentAAddvvaanncceess iinn
IInnffoorrmmaattiioonn
SSyysstteemmss SScciieennccee
VVoolluummee 88
CCoonnttrriibbuuttoorrss
KKaarreell CCuulliikk IIII DDeeppaarrttmmeenntt oojj CCoommppuutteerr SScciieeccee
UUnniivveerrssiittyy oojj WWaatteerrlloooo
WWaatteerrlloooo,, OOnnttaarriioo,, CCaannaaddaa
MMaannssoouurr FFaarraahh DDeeppaarrttmmeenntt ddee PPhhyyssiiqquuee--MMaatthheemmaattiiqquueess
UUnniivveerrssiittee ddee MMoonnccttoonn
MMoonnccttoonn,, NN.. BB..,, CCaannaaddaa
SS.. GGaanneesshh CCoommppuutteerr SScciieennccee DDeeppaarrttmmeenntt
TThhee PPeennnnssyyllvvaanniiaa SSttaattee UUnniivveerrssiittyy
UUnniivveerrssiittyy PPaarrkk,, PPAA
EEhhuuddGGuuddeess CCoommppuutteerr SScciieennccee DDeeppaarrttmmeenntt
TThhee PPeennnnssyyllvvaanniiaa SSttaattee UUnniivveerrssiittyy
UUnniivveerrssiittyy PPaarrkk,, PPAA
RR.. CC.. TT.. LLeeee IInnssttiittuuttee oojj CCoommppuutteerr aanndd DDeecciissiioonn SScciieenncceess
NNaattiioonnaall TTssiinngg HHuuaa UUnniivveerrssiittyy
HHssiinncchhuu,, TTaaiiwwaann
RReeppuubblliicc oojj CChhiinnaa
AA.. AA.. GG.. RReeqquuiicchhaa PPrroodduuccttiioonn AAuuttoommaattiioonn PPrroojjeecctt
TThhee UUnniivveerrssiittyy oojj RRoocchheesstteerr
RRoocchheesstteerr,, NNYY
RRaannddaallll PPaauull SShhuummaakkeerr NNaavvaall AAiirr SSyysstteemmss CCoommmmaanndd
WWaasshhiinnggttoonn,, DD..CC..
HH.. BB.. VVooeellcckkeerr PPrroodduuccttiioonn AAuuttoommaattiioonn PPrroojjeecctt
TThhee UUnniivveerrssiittyy ooffRR oocchheesstteerr
RRoocchheesstteerr,, NNYY
AA CCoonnttiinnuuaattiioonn OOrrddeerr PPllaann iiss aavvaaiillaabbllee ffoorr tthhiiss sseerriieess.. AA ccoonnttiinnuuaattiioonn oorrddeerr wwiillll bbrriinngg
ddeelliivveerryy ooff eeaacchh nneeww vvoolluummee iimmmmeeddiiaatteellyy uuppoonn ppuubblliiccaattiioonn.. VVoolluummeess aarree bbiilllleedd oonnllyy uuppoonn
aaccttuuaall sshhiippmmeenntt.. FFoorr ffuurrtthheerr iinnffoorrmmaattiioonn pplleeaassee ccoonnttaacctt tthhee ppuubblliisshheerr..
VVoolluummee 88
..AAddvvaanncceess iinn
IInnffoorrmmaattiioonn
SSyysstteemmss SScciieennccee
EEddiitteedd bbyy
JJuulliiuuss TT..lloouu
CCeenntteerr ffoorr IInnffoorrmmaattiioonn RReesseeaarrcchh
UUnniivveerrssiittyy ooff FFlloorriiddaa
GGaaiinneessvviillllee,, FFlloorriiddaa
PPLLEENNUUMM PPRREESSSS·· NNEEWW YYOORRKK --LLOONNDDOONN
The Library of Congress cataloged the first volume of this title as follows:
Advances in information systems science. v. 1 -
New York, Plenum Press, 1969.
v. illus. 24 cm.
Editorv.1-1. T. Tou.
1. Information science - Collections. I. Tou, Tsu-lieh. ed.
Z699.A1A36 029.7 69-12544
ISBN-13: 978-1-4613-9885-1 e-ISBN-13: 978-1-4613-9883-7
DOl: 10.1007/978-1-4613-9883-7
© 1981 Plenum Press, New York
Softcover reprint of the hardcover 1s t edition 1981
A Division of Plenum Publishing Corporation
233 Spring Street, New York, N.Y. 10013
All rights reserved
No part of this book may be reproduced, stored in a retrieval system, ortransmitted,
in any form or by any means, electronic, mechanical, photocopying, microfilming,
recording, or otherwise, without written permission from the Publisher
Contents of Earlier Volumes
Volume 1 Chapter 1: Theory of Algorithms and Discrete Processors
V. M. Glushkov and A. A. Letichevskii
Chapter 2: Programming Languages
Alfonso Caracciolo di Forino
Chapter 3: Formula Manipulation - The User's Point of View
M. E. Engeli
Chapter 4: Engineering Principles of Pattern Recognition
Julius T. Tou
Chapter 5: Learning Control Systems
K.S. Fu
Volume 2 Chapter 1: Learning Systems
Ya. Z. Tsypkin
Chapter 2: Stochastic Computing Systems
B. R. Gaines
Chapter 3: Semantics of Programming Languages
J. W. de Bakker
Chapter 4: Information Storage and Retrieval Systems
Michael E. Senko
Chapter 5: Some Characteristics of Human Information Processing
Earl Hunt and Walter Makous
Volume 3 Chapter 1: Pattern Recognition: Heuristics or Science?
V. A. Kovalevsky
Chapter 2: Feature Compression
Satosi Watanabe
Chapter 3: Image Processing Principles and Techniques
J. K. Hawkins
Chapter 4: Computer Graphics
R. J. Pankhurst
Chapter 5: Logical Design of Optimal Digital Networks by Integer Programming
Saburo Muroga
Volume 4 Chapter 1: Augmentation of Machine Language Level
V. K. Smimov
Chapter 2: On the Relation between Grammars and Automata
Michael A. Harrison
Chapter 3: An Introduction to Information Structures and Paging
Considerations for On-Line Text Editing Systems
David E. Ri~e and Andries van Dam
Chapter 4: An Introduction to the Structure of Time-Shared Computers
C. Gordon Bell and Michael M. Gold
Chapter 5: Error-Correcting Codes in Computer Arithmetic
James L. Massey and Oscar N. Garda
Volume 5 Chapter 1: Data Organization and Access Methods
P. C. Patton
Chapter 2: Design of Software for On-Line Minicomputer Applications
James D. Schoeffler
Chapter 3: A Survey of the Status of Microprogramming
C. V. Ramamoorthy
Chapter 4: Some Grammars and Recognizers for Formal
and Natural Languages
J. A. Moyne
Volume 6 Chapter 1: Theory of Parallel Programming
Part I: Survey of Practical Aspects
V. E. Kotov
Chapter 2: Theory of Parallel Programming
Part II: Survey of Formal Models
A. S. Narin'yani
Chapter 3: Data Structure Models in Information
System Design
W. M. Turshi
Chapter 4: The Mathematical Theory of L Systems
G. Rozenberg and A. Salomaa
Volume 7 Chapter 1: Diagnosis Techniques and Methodologies
for Di'gital Systems
Takuo Kitamura, Shunji Tashiro, and Masayuki Inagaki
Chapter 2: Design for a Distributed Information Network
P.A.D. de Maine and D. E. Whitten
Chapter 3: Modeling and Design of Distributed Information Systems
S. K. Chang and C. N. Liu
Chapter 4: Hardware, Firmware, Software Technology in Microcomputer
Systems
T. G. Lewis
Chapter 5: Data Structures and Pattern Recognition
Allen Klinger
Articles Planned/or Future Volumes
I. Kobayashi An Overview 0/ Database Technology
M. G. Thomason and Data Structure and Database in Digital Scene
R. C. Gonzalez Analysis
Y. H. Pao Machine Intelligence Techniques/or System
Monitoring and Control
H. Y. Chang Computer-Aided Design
J. T. Tou Knowledge-Based Pattern Recognition
Preface
This volume, the eighth of a continuing series on information systems
science, presents five timely topics which are of current interest in this
growing field. In each chapter, an attempt is made to familiarize the reader
with some basic background information on the advances discussed, so
that this volume may be used independently or in conjunction with the
previous volumes. The emphasis in this volume is centered upon file or
ganization and performance evaluation, computational semantics, digital
control, clustering analysis, and geometric modeling.
Chapter I presents a comprehensive survey of file organization. In
this chapter, Gudes and Ganesh discuss performance evaluation, imple
mentation considerations, relationship to database models, and performance
tradeoffs of various file organizations. The semantics of programming
languages was discussed in Volume 2 of this series. This topic is revisited
by Culik II and Farah from a different point of view. Chapter 2 is concerned
with linked forest manipulation and models for formal description of a
programming language. Linked trees reveal syntactical properties of a
programming language and provide data structures for describing com
putational semantics.
The availability of inexpensive computing power in microcomputers
and the continued cost reduction and performance improvement of mini
computers have stimulated widespread applications of digital control in
industries. The key to the design of a great variety of control applications
is the software development. Shumaker presents the software development
for digital control in Chapter 3. Computer-based data analysis has been
recognized as an important function of information systems. A frequently
used technique of data analysis in information system design is automatic
cluster seeking. Chapter 4 presents a comprehensive review of clustering
techniques and applications. In this chapter, Lee covers a variety of im
portant topics including distance function measures, graph theoretical
approach, nearest neighbor searching, partial match searching, storage
reduction, and program restructuring.
In recent years, CAD jCAM has become a very important aspect in
ix
x Preface
information system design. A fundamental technique in CAD/CAM design
is geometric modeling, which is the subject of Chapter 5. Geometric mod
eling is regarded as a basic notion in automated design and production of
discrete rigid objects. In this chapter, Requicha and Voelcker introduce
geometric modeling techniques, review some properties of solid objects,
and discuss several object-modifying processes.
The editor wishes to express sincere thanks to the authors of this volume
for their cooperation and for the timely completion of their manuscripts.
In fact, many more of our colleagues contributed to the book than those
whose names appear in the contents. Much credit is due to our reviewers
of the articles for their invaluable advice and constructive criticism.
Gainesville, Florida Julius T. Tou
Contents
Chapter 1 A Survey of File Organizations and
Performance
Ehud Gudes and S. Ganesh
1. Introduction . . . . . . . . . .
1.1. Basic Concepts. . . 4
1.2. General Performance Measures 6
1.3. Current Literature . 8
2. Primary Key File Organizations 8
2.1. Introduction. . . . . . 8
2.2. Sequential Organization. . 10
2.3. Direct Organization 13
2.4. Indexed Sequential Organization 16
2.5. Index Tree Structures. 20
2.5.1. B-trees . . . . . . . . 23
2.5.2. VSAM . . . . . . . . 28
2.6. Comparative Evaluation and Simulation 31
2.7. Summary ........ . 34
3. Secondary Key File Organizations 34
3.1. Introduction. 34
3.2. Organizations 36
3.2.1. Inverted File 36
3.2.2. Multilist . . 39
3.2.3. Generalized File Structure 40
3.2.4. Cellular Serial Organization 43
3.2.5. Canonical File Structure . . 44
3.2.6. Combined Indices . . . . . 46
3.2.7. Multiple Key Hashing Model. 48
3.2.8. Superimposed Coding . . . . 49
3.2.9. Combinatorial Filing System . 50
3.2.10. Consecutive Retrieval Organization 50
xi