Millions of searchable, digitized books & journals: HathiTrust

 
 

US: HathiTrust offers full-text search of millions of digitised books and journals (23 Nov 2009)

"The HathiTrust Digital Library, a partnership among some of the US’s largest academic research libraries, has announced a service that is expected to transform how researchers use the more than 1.6 billion pages (4.6 million volumes) in its collections.

The service allows for full-text searching capabilities across the entire library. Researchers can now search public domain and in-copyright works by keyword or phrase. Based on open source Solr/Lucene technology, the service expands on an experimental search of public domain volumes, introduced in November 2008. Full-text search will continue to be supported across the repository as it grows at a rate of hundreds of thousands of volumes every month.

In combination with the HathiTrust Digital Library’s carefully curated bibliographic data, the new functionality allows researchers to more efficiently locate items relevant to their research. It also lays the foundation for future services such as full-text search with faceted browsing, advanced search, ‘more like this’ options, and tools that can be used in computational research.

HathiTrust (http://www.hathitrust.org) is a collaboration of the thirteen universities of the Committee on Institutional Cooperation, the University of California system, and the University of Virginia. It currently includes digitised volumes from the University of Michigan, University of California, Indiana University and the University of Wisconsin. The HathiTrust partners seek to develop the repository and its services to meet the long-term needs of their academic communities, and offer a unique resource on the Web for scholarship and research."
 

From today’s Knowledgespeak Newsletter

University Libraries in Google Project to Offer Backup Digital Library

On Monday, October 13, 2008, The Chronicle of Higher Education reported the formation of a giant digital library, named HathiTrust, that will serve as a backup for Google Books.

"The…HathiTrust, …consists of the members of the Committee on Institutional Cooperation, a consortium of the 11 universities in the Big Ten Conference and the University of Chicago, and the 10 campuses in the University of California system. The University of Virginia is joining the project, it will be announced today, and officials hope to bring in other colleges as well."

HathiTrust, a shared digital repository, already contains the full text of more than two million books scanned by Google. However, only about 16 percent of the books in HathiTrust (about 327,000 volumes) are out of copyright, so only those can be delivered in full text to all readers.

To read the whole article: http://chronicle.com/free/2008/10/5061n.htm

 
