This panel featured six speakers who are involved in a new project to digitize books and make them available both online and print-on-demand via Amazon. Two of the speakers, Lotfi Belkhir and Robin Asbury, work for the companies that are behind the projectâ€”Kirtas Technologies and BookSurge, respectivelyâ€”and the other four speakers are with institutions that are digitizing books: Martin Halbert and Lisa Macklin, from Emory University; Joyce Rumery, from the University of Maine, and Linda McKenzie from the Toronto Public Library.
This project differentiates itself from Google’s scanning project by focusing on quality control. As Lotfi explained in his presentation, Google and their partner libraries are privileging quantityâ€”digitizing the most books possible in the shortest period of timeâ€”over qualityâ€”creating the most complete, accurate, and usable digital copies of books possible. (To demonstrate the problems in the Google method, he showed a set of images of one book that Google scanned that contained a very intricately manicured set of fingernails, and, in some of the images, the entire hand, earning some chuckles from the audience.) In his view, there’s no point in doing a project with such low quality control. The cost of scanning books is only a tiny fraction of the total cost of a digitization process; most of the cost will come in the following years as storage costs. In Lotfi’s opinion, there is no point in scrimping on the scanning and then spending all of that money to store a low-quality productâ€”especially since the institution is unlikely to be able to afford to scan books again any time in the near future.
Robin’s company, BookSurge, is a print-on-demand publisher that was recently purchased by Amazon. BookSurge partners with Kirtas and the libraries to make scanned books available through Amazon, complete with the ability to search inside the book. The book shows up as in stock on Amazon, but no â€œstockâ€ actually existsâ€”BookSurge prints a copy of the book only when it is ordered. This printed book is branded as being from the library, with such features as the library’s logo on the front cover and images of and text about the library on the back cover. When the book is sold to the customer, Kirtas distributes some of the profits from the sale back to the library, helping to defray the cost of digitization.
Emory University, the University of Maine, and the Toronto Public Library are all doing their digitization work with their own staffs and their own purchased Kirtas automated scanners, giving them complete control over the process. The libraries are able to keep a preservation copy of the digital files, separate from the digital files used to print the books on demand. The libraries maintain control of the digitized books as far as dissemination, access, search, organization, etc. are concerned, and the libraries maintain the right to give the public full access to digital representations of the book.
Martin, the director for digital programs and systems at Emory University, spoke about the process that Emory uses in its digitization project. Emory is focused on digitizing collections, not just books. They have a Digital Collections Steering Committee that identifies and prioritizes collections for digitization (beginning with the Southern Methodist collection), and they provide scholarly contextualizations of the digitized materials. Emory is also planning to work with the Kirtas/BookSurge partnership to publish new digital peer-reviewed scholarly monographs and, in the future, some theses and dissertations written at Emory.
Joyce and Linda both spoke about the collections that are being digitized at their institutions. At the University of Maine (which has partnered with the Maine State Library), they first digitized the old University yearbooks, then moved on to town reports from Maine dating back to the early 1880s. Their next projects are going to be pre-1923 Maine history books, biographies of Maine citizens, and Maine travel books. The Toronto Public Library is going begin by digitizing its Canadiana collection, which contains around 11,000 items, and then move on to other special collections.
Lisa, from the intellectual property rights office at Emory, covered the legal considerations that must be considered when launching a digitization project. The biggest legal risk in a digitization project is being sued for accidentally digitizing something that is still under copyright. Having the digitized books available for sale on Amazon heightens this risk, because that makes it very easy for the copyright owner to discover that his or her work has been digitized. The easiest way to lessen this risk is to digitize only books that were published in the United States before 1923 and that have copyright dates printed on the items, since such books are almost certainly in the public domain. Lisa also emphasized the importance of keeping good metadata about how the institution determined that the item was in the public domain, since the penalties for copyright infringement can vary widely depending on whether the judge determines that the copyright was willfully infringed (damages up to $150,000) or that the infringer was acting on a good faith belief that the copy was not infringing (damages as low as $200).
For any libraries that are interested in joining this book digitization project, the person to contact for more information is Lisa Stasevich at Kirtas: [email protected]
This panel was taped, so I assume that there will be a video of it available on the Web shortly.
I was supposed to blog a third session, The Future of Information Retrieval, but due to the fact that it takes just shy of forever to get from Bethesda (where I’m staying) into downtown DC during the morning rush hour and the popularity of that session, I was unable to do so. By the time I got there, not only were all of the seats taken, but so was all of the standing room both inside the room and in the hallway outside of it but close enough to the door to hear the speakers. So this will be my last entry on the LITA Blog for ALA 2007. But please feel free to check out my own newly-launched blog at folksweb.blogspot.com.! I’m going to be blogging about the Semantic Web, folksonomies, Wikipedia, Freebase, and all of the other innovative new ways coming out to organize information on the Web.