<p>From the reviews:</p> <p></p> <p>"The main idea of this book, based on the author’s PhD thesis, is to use markup information as a series of cues to the significance of words and concepts in a text, thus enhancing the indexing of that text. The technique is developed for collections of texts with a specific focus, such as a Web site or a collection of documents … . The presented approach is attractive, because it can be adapted to different contexts in a straightforward manner … ." (D. T. Barnard, Computing Reviews, July, 2006)</p>

Collections of digital documents can nowadays be found everywhere in institutions, universities or companies. Examples are Web sites or intranets. But searching them for information can still be painful. Searches often return either large numbers of matches or no suitable matches at all. Such document collections can vary a lot in size and how much structure they carry. What they have in common is that they typically do have some structure and that they cover a limited range of topics. The second point is significantly different from the Web in general. The type of search system that we propose in this book can suggest ways of refining or relaxing the query to assist a user in the search process. In order to suggest sensible query modifications we would need to know what the documents are about. Explicit knowledge about the document collection encoded in some electronic form is what we need. However, typically such knowledge is not available. So we construct it automatically.
Les mer
Searches often return either large numbers of matches or no suitable matches at all. The type of search system that we propose in this book can suggest ways of refining or relaxing the query to assist a user in the search process.
Les mer
Related Work.- Data Analysis and Domain Model Construction.- Incorporating Additional Knowledge.- A Dialogue System for Partially Structured Data.- UKSearch - Intelligent Web Search.- UKSearch - Evaluation and Discussion.- YPA - Searching Classified Directories.- Future Directions and Conclusions.
Les mer
From the reviews: "The main idea of this book, based on the author’s PhD thesis, is to use markup information as a series of cues to the significance of words and concepts in a text, thus enhancing the indexing of that text. The technique is developed for collections of texts with a specific focus, such as a Web site or a collection of documents … . The presented approach is attractive, because it can be adapted to different contexts in a straightforward manner … ." (D. T. Barnard, Computing Reviews, July, 2006)
Les mer
Using markup structure alone to extract knowledge from documents is new Domain knowledge is extracted from documents in a fully automated process The techniques outlined avoid the bottleneck of manual customization Searching a document collection can be seen as navigating the user through the automatically extracted domain knowledge Combines the theoretical framework and detailed evaluation steps
Les mer
GPSR Compliance The European Union's (EU) General Product Safety Regulation (GPSR) is a set of rules that requires consumer products to be safe and our obligations to ensure this. If you have any concerns about our products you can contact us on ProductSafety@springernature.com. In case Publisher is established outside the EU, the EU authorized representative is: Springer Nature Customer Service Center GmbH Europaplatz 3 69115 Heidelberg, Germany ProductSafety@springernature.com
Les mer

Produktdetaljer

ISBN
9781402037672
Publisert
2005-10-24
Utgiver
Vendor
Springer-Verlag New York Inc.
Høyde
232 mm
Bredde
156 mm
Aldersnivå
Professional/practitioner, P, 06
Språk
Product language
Engelsk
Format
Product format
Innbundet

Forfatter