Cataloging & Classification Quarterly

Volume 37, no. 3-4, 2004

 


 

The Thesaurus: Review, Renaissance and Revision

 

Sandra K. Roe, Alan R. Thomas

special issue guest editors

 

INTRODUCTION

The thesaurus within the context of information retrieval was developed approximately 50 years ago in an environment quite different from the one we find ourselves in today. While paper-based formats like printed books and printed serials remain, new formats continue not only to emerge but to proliferate. A recent OCLC report tells us that annual production projections for digital materials by the year 2007 are too large to estimate [1]. Beyond the challenge of new formats and the quantities and interrelationships of those formats, additional layers of complexity are added as national libraries, bibliographic utilities, and other private and governmental entities work toward bilingual and multilingual subject access within and across online collections and catalogs. Despite all that distances us from 50 years ago, there is this commonality: an interest in thesauri. There is an unprecedented interest in thesauri and related types of vocabulary switching tools not just within the library community but also from the business community, information architects, and linguists. For instance, Rosenfeld and Morville in Information Architecture for the World Wide Web predict that “thesauri [will] become a key tool for dealing with the growing size and importance of web sites and intranets [2].”

In this time of renewed interest in thesauri, this collection of papers is designed to serve three purposes. The first purpose is to acquaint or remind the Library and Information Science (LIS) community of the history of the development of the thesaurus and the standards for thesaurus construction. Aitchison and Dextre Clarke, both active in standards development, provide a rich historical piece that takes the reader from the first use of the word ‘thesaurus’ in the context of information retrieval to the present day challenges of interoperability and end-user usability.

The second purpose of this collection is to provide bibliographies and tutorials from which any reader can become more grounded in her or his understanding of thesaurus construction, use, and evaluation. In “Teach Yourself Thesaurus,” Thomas outlines the skills one needs to create a thesaurus and supplies the reader with sources of additional readings for each. To provide an opportunity to practice these skills, Shearer presents a series of exercises intended to teach the process of building a thesaurus from an alphabetical list of terms. In contrast to Thomas’ paper, Nielsen’s presentation of readings is organized by the technical processes involved in thesaurus construction and the particular problems related to each, including methodologies to gain knowledge about an information environment through the study of information seeking behavior. Do you want to hire a thesaurus consultant or become one yourself? Will’s article, “Thesaurus Consultancy,” outlines the steps in a thesaurus project and reviews the roles and functions of both the consultant and the client. Owens and Cochrane describe evaluation methods which can be applied to a thesaurus as it is being created or used to evaluate an existing one.

The third purpose is to address topics related to thesauri but that are unique to the current digital environment, or network of networks. Greenberg presents the results of a query-expansion study that explores user comprehension and application of the thesaurus in the ABI/Info database. Johnson proposes a distributed thesaurus Web service while Riesland presents criteria for evaluating vocabulary management software packages. In “Multilingual Subject Access,” Landry describes the MACS project which links equivalent headings in English, French, and German subject heading lists so that users can access online catalogs in the language of their choice. To close, Amy Warner describes NISO’s current initiative [3] to revise Z39.19 Guidelines for the Construction, Format, and Management of Monolingual Thesauri in an interview with the editors.

Taken together, these papers are intended to inform the reader about thesaurus construction, its history and development, and speculate on its future. While it seems clear that the skills involved in thesaurus construction will continue to serve us well, it is less certain whether the thesaurus will be our next generation vocabulary switching tool of choice. That function may be carried out by some form of taxonomy, ontology, mind map, or something as yet unnamed and undefined. Among developments to monitor in this rapidly changing information environment will be the work of various standards making bodies and the new audiences and applications for their work.

Sandra K. Roe
Alan R. Thomas

The editors would like to express their appreciation to Jeff A. Jenson for his assistant in verifying the citations throughout this collection of articles.

 
[1]. Five-Year Information Format Trends (Dublin: OCLC Library and Information Center, 2003) p. 1. Available online at URL: http://www.oclc.org/info/trends/.


[2]. Rosenfeld, L. and P. Morville. 2002. Information Architecture for the World Wide Web. 2nd ed. Sebastopol, CA: O’Reilly, p. 188.
 

[3]. National Information Standards Organization. “Developing the Next Generation of Standards for Controlled Vocabularies and Thesauri” (Bethesda: NISO, 2003). Available online at URL: http://www.niso.org/committees/MT-info.html.
 


Back to Vol. 37 Nr.3-4 | CCQ Homepage | Tables of Contents | Informaworld |

Comments to: Jeffrey Beall at
© Haworth Press, Inc.