Building the “next generation” library catalog
September 1st, 2006 by Eric Lease MorganHow will we, the library profession, build the “next generation” library catalog, and to what degree will the process include vendor support and open source software?
I must admit that there are few things that do not succeed over time without some sort of commercial interest. Think OCLC. JSTOR. Even NOTIS. The only exception to the rule seems to be when government subsidizes the process.
Be that as it may, I will still advocate a large dose of grass roots efforts lead by the library community exploiting open source software over something created by a commercial institution. At least for now. Moreover, when your fellow librarians say things like, “We tried those ‘homegrown’ systems a long time ago, and where are they now? We need vendor-supported software”, I can give you a number of reasons why this is not necessarily the case in today’s environment:
1. Computer hardware & software - Twenty or more years ago, when the library profession was supporting “homegrown” systems, the hardware used was vendor-specific. Maybe you had a Prime. A Unisys. An IBM 360. A DEC Watchamacallit. A Sun Something. Etc. These computers had less RAM, less disk space, and less processing power than the computer you have on your desktop right now. Each of these computers had their own operating system and set of programming languages used to create applications. The applications created for these systems was not sharable between computers, and consequently is was difficult, if not impossible, to share code between libraries. Now-a-days the applications will be written for Unix/Linux or Java — platforms that are not computer hardware specific. (If someone creates a Microsoft-based “next generation” library catalog, run the other way, very fast.) The code written for one computer will run on the next computer (no puns intended) without much modification, and this will enable the library community to collaborate to a greater degree.
2. Relational databases - Relational databases and the technology used to implement them was embryonic when libraries were supporting their “homegrown” systems. There were few, if any, well-supported best practices for managing large sets of information. And even when you did you sat around worrying whether or not you should allocate two bytes of disk space to denote the name of a state or twelve. These problems are far less challenging now with the cost of disk space and the availability of any number of relational database applications. The problems of storing the data is much less limiting than it was twenty years ago.
3. Indexing technology - Databases are great for storing and manipulating information. Ironically, they are poor on searching. To search a database you must know the underlying structure of the data. Indexes remove this problem. They invert the content of the database creating lists of words and pointers to records. No knowledge of the database’s structure is necessary. Couple this with statistical analysis and indexing technology begins to appear “smart” — think relevance ranking. Indexing technology has matured to a very large degree in the past twenty years, and there are a large number of freely available indexers. How many indexers were available twenty years ago? One, maybe. BRS.
4. Skills - Computers twenty or more years ago were expensive, very expensive. Much fewer people had access to computers and a proportional number of fewer people had computer expertise. Now-a-days hackers abound. [1] If they didn’t we wouldn’t have the email, Web servers, MySQL, Perl, PHP, Linux, or just about anything related to the Internet. Put another way, there are many many more people now-a-days who know how to make computers do the things they do. There are computer programmers around, they just don’t work in libraries to a large degree. “Libraries are about books. Right?”
5. Communication - Communication via the telephone is dirt cheap. You can make long distance telephone calls for pennies. From my workplace here in Indiana I can talk on the telephone with people in the United Kingdom for .02ยข/minute. At those rates it is silly not to pick up the telephone. The biggest thing the Internet does is facilitate communication. People-to-people communication. People-to-computer communication. Computer-to-computer communication. Twenty years ago the story was much different. You were lucky to have a 2400 baud modem and you dared not make a long-distance telephone call. Because of our increasingly seamless ability to communicate across long distances, it will be easier for libraries to coordinate their effort and create something from the community.
In short, don’t let people write you off when you say, “We can built it ourselves.” Explain to them how the computer environment is substantially different from previous times. Enumerate the things outlined above. Yes, the human challenges still exist. Building consensus. Setting priorities. Keeping things on schedule. Creating communities. Bringing people physically together. Allocating time, space, people, and money. But are those the things you want to pay a vendor for? The other things are “as free as a free kitten.”
Food for thought on a Friday afternoon.
[1] Hackers in this context are contrasted with “crackers”. Hackers are the good guys. They look at source code and figure out ways to improve it or modify it for their own purposes. Crackers, on the other hand, are malicious. They look for ways to exploit software for immoral purposes.

September 1st, 2006 at 5:06 pm
The library world is now very different place from the world of PDPs and System/3s that gave birth to the ILS and the first generation OPACs, for the reasons you noted as well as for the fact that it used to be feasible for a vendor to control all the data the library staff and patrons would need to see all on one computer. “Integrated” systems may not have done everything well, but they at least tried to do everything.
Now the key task we as librarians face is in presenting data from dozens or hundreds or thousands of sources to our patrons in a simple to use, seemlessly integrated way. The job is bigger than one vender or one library; the communities that create standards as well as those that guide open source software products are where our profession can push the industry to that “next generation”.
September 11th, 2006 at 10:30 pm
Nicely said, but I believe strongly that the next iteration of catalog will be an externally managed one (or at least one where the content/database/information resides remotely). It will look more like LibraryThing than any of the local library catalogs.
I actually asked a Google representative at ALA this year when they were going to purchase OCLC…that’s all it would take for there to be a Google Catalog.
September 15th, 2006 at 4:51 am
The modern libraries have laks of bibliographic datas in different categories (books,video recordings,sound recordings etc.).Searching all these things with an apt keyword is very important. An integrated system can satisfy the clients to retrieve thier specific information.Time spending for searching is an important thing . This is the time to develop a system to search the catalogue with extra speed and accuracy . To raise the cooperation of experts in the field of information is a must.
October 7th, 2006 at 12:57 am
Information Technology is a wonderful tool to enhance the services of a library. As mentioned in the article, there are several opensource softwares are available, we (librarians) can develop our own systems to meet the requirements of users. Sometime, we may not succeed on the first attempt, but if we keep on trying, we will succeed one day.