Open Access News

News from the open access movement

Tuesday, October 23, 2007

Boston libraries chose OCA over Google for its openness

Katie Hafner, Libraries Shun Deals to Place Books on Web, New York Times, October 22, 2007. Excerpt:

Several major research libraries have rebuffed offers from Google and Microsoft to scan their books into computer databases, saying they are put off by restrictions these companies want to place on the new digital collections.

The research libraries, including a large consortium in the Boston area, are instead signing on with the Open Content Alliance, a nonprofit effort aimed at making their materials broadly available.

Libraries that agree to work with Google must agree to a set of terms, which include making the material unavailable to other commercial search services. Microsoft places a similar restriction on the books it converts to electronic form. The Open Content Alliance, by contrast, is making the material available to any search service.

Google pays to scan the books.... [But] it costs the Open Content Alliance as much as $30 to scan each book, a cost shared by the group’s members and benefactors, so there are obvious financial benefits to libraries of Google’s wide-ranging offer, started in 2004....

But the resistance from some libraries, like the Boston Public Library and the Smithsonian Institution, suggests that many in the academic and nonprofit world are intent on pursuing a vision of the Web as a global repository of knowledge that is free of business interests or restrictions....

“There are two opposed pathways being mapped out,” said Paul Duguid, an adjunct professor at the School of Information at the University of California, Berkeley. “One is shaped by commercial concerns, the other by a commitment to openness, and which one will win is not clear.” ...

The Library of Congress has a pilot program with Google to digitize some books. But in January, it announced a project with a more inclusive approach. With $2 million from the Alfred P. Sloan Foundation, the library’s first mass digitization effort will make 136,000 books accessible to any search engine through the Open Content Alliance. The library declined to comment on its future digitization plans.

The Open Content Alliance is the brainchild of Brewster Kahle, the founder and director of the Internet Archive....

Although Google is making public-domain books readily available to individuals who wish to download them, Mr. Kahle and others worry about the possible implications of having one company store and distribute so much public-domain content.

“Scanning the great libraries is a wonderful idea, but if only one corporation controls access to this digital collection, we’ll have handed too much control to a private entity,” Mr. Kahle said....

Microsoft joined the Open Content Alliance at its start in 2005, as did Yahoo, which also has a book search project. Google also spoke with Mr. Kahle about joining the group, but they did not reach an agreement.

A year after joining, Microsoft added a restriction that prohibits a book it has digitized from being included in commercial search engines other than Microsoft’s.

“Unlike Google, there are no restrictions on the distribution of these copies for academic purposes across institutions,” said Jay Girotto, group program manager for Live Book Search from Microsoft. Institutions working with Microsoft, he said, include the University of California and the New York Public Library....

On Wednesday the Internet Archive announced, together with the Boston Public Library and the library of the Marine Biological Laboratory and Woods Hole Oceanographic Institution, that it would start scanning out-of-print but in-copyright works to be distributed through a digital interlibrary loan system.

PS: For more background, see the post from September 24 on the Boston Library Consortium's decision to work with the Open Content Alliance, and my article from November 2005 comparing the OCA and Google book-scanning projects.

Posted by Peter Suber at 10/23/2007 09:41:00 AM.