Open Access News

News from the open access movement


Friday, November 07, 2008

More on the Textpresso text-mining tool

Hans-Michael Müller, et al., Textpresso for Neuroscience: Searching the Full Text of Thousands of Neuroscience Research Papers, Neuroinformatics, October 24, 2008. Abstract:
Textpresso is a text-mining system for scientific literature. Its two major features are access to the full text of research papers and the development and use of categories of biological concepts as well as categories that describe or relate objects. A search engine enables the user to search for one or a combination of these categories and/or keywords within an entire literature. Here we describe Textpresso for Neuroscience, part of the core Neuroscience Information Framework (NIF). The Textpresso site currently consists of 67,500 full text papers and 131,300 abstracts. We show that using categories in literature can make a pure keyword query more refined and meaningful. We also show how semantic queries can be formulated with categories only. We explain the build and content of the database and describe the main features of the web pages and the advanced search options. We also give detailed illustrations of the web service developed to provide programmatic access to Textpresso. This web service is used by the NIF interface to access Textpresso....
See also our past posts on Textpresso.