Discovery of context-specific ranking functions for effective information retrieval using genetic programming



Reference

W. Fan, M.D. Gordon, P. Pathak: Discovery of context-specific ranking functions for effective information retrieval using genetic programming. IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 4, pp. 523-527, 2004.

DOI

http://dx.doi.org/10.1109/TKDE.2004.1269663

Abstract

The Internet and corporate intranets have brought a lot of information. People usually resort to search engines to find required information. However, these systems tend to use only one fixed ranking strategy regardless of the contexts. This poses serious performance problems when characteristics of different users, queries, and text collections are taken into account. We argue that the ranking strategy should be context specific and we propose a new systematic method that can automatically generate ranking strategies for different contexts based on genetic programming (GP). The new method was tested on TREC data and the results are very promising.
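
As a rough illustration of what "generating ranking strategies with GP" can look like, the sketch below represents a ranking function as an expression tree over term statistics and scores candidate trees by average precision on a toy collection. It is not the authors' implementation: the terminal names (tf, df, N, doc_len), the operator set, the toy documents, and the choice of average precision as the fitness measure are all assumptions made for this example, and the crossover and mutation steps of a full GP run are left out.

```python
import math

def protected_div(a, b):
    # GP convention: return a constant instead of failing on division by zero.
    return a / b if abs(b) > 1e-9 else 1.0

def evaluate(tree, stats):
    """Evaluate an expression tree: a terminal name, a number, or (op, *args)."""
    if isinstance(tree, str):
        return float(stats[tree])
    if isinstance(tree, (int, float)):
        return float(tree)
    op, *args = tree
    vals = [evaluate(a, stats) for a in args]
    if op == "+":
        return vals[0] + vals[1]
    if op == "*":
        return vals[0] * vals[1]
    if op == "/":
        return protected_div(vals[0], vals[1])
    if op == "log":
        return math.log(abs(vals[0]) + 1.0)
    raise ValueError(f"unknown operator: {op}")

def score(tree, query, doc, df, n_docs):
    """Sum the tree's value over all query terms that occur in the document."""
    total = 0.0
    for term in query:
        tf = doc.count(term)
        if tf:
            stats = {"tf": tf, "df": df[term], "N": n_docs, "doc_len": len(doc)}
            total += evaluate(tree, stats)
    return total

def average_precision(ranking, relevant):
    hits, total = 0, 0.0
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

def fitness(tree, docs, queries):
    """Mean average precision of the ranking induced by the tree."""
    df = {}
    for words in docs.values():
        for term in set(words):
            df[term] = df.get(term, 0) + 1
    aps = []
    for query, relevant in queries:
        ranking = sorted(
            docs,
            key=lambda d: score(tree, query, docs[d], df, len(docs)),
            reverse=True,
        )
        aps.append(average_precision(ranking, relevant))
    return sum(aps) / len(aps)

# Toy collection and relevance judgments (hypothetical data).
DOCS = {
    "d1": "genetic programming evolves ranking functions".split(),
    "d2": "search engines rank web documents".split(),
    "d3": ("genetic algorithms for document retrieval and ranking "
           "of documents by genetic search").split(),
}
QUERIES = [(["genetic", "ranking"], {"d1"})]

TF_ONLY = "tf"                           # raw term frequency
TF_BY_LENGTH = ("/", "tf", "doc_len")    # length-normalised term frequency

for name, tree in [("tf", TF_ONLY), ("tf/doc_len", TF_BY_LENGTH)]:
    print(name, fitness(tree, DOCS, QUERIES))
```

On this toy data the length-normalised tree ranks the relevant document first while raw term frequency does not, which is the kind of difference a GP search would exploit when selecting and recombining trees for a given context.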

Extended Abstract

Bibtex

@ARTICLE{1269663,
author={W. Fan and M. D. Gordon and P. Pathak},
journal={IEEE Transactions on Knowledge and Data Engineering},
title={Discovery of context-specific ranking functions for effective information retrieval using genetic programming},
year={2004},
volume={16},
number={4},
pages={523-527},
keywords={data mining;genetic algorithms;information retrieval;search engines;tree data structures;Internet;TREC data;context-specific ranking function discovery;corporate intranets;fixed ranking strategy;genetic programming;information routing;intelligent contextual information retrieval;search engines;term weighting strategy;text mining;Documentation;Genetic programming;Information retrieval;Information systems;Internet;Manuals;Routing;Search engines;Testing;Text mining},
doi={10.1109/TKDE.2004.1269663},
url={http://dx.doi.org/10.1109/TKDE.2004.1269663 http://de.evo-art.org/index.php?title=Discovery_of_context-specific_ranking_functions_for_effective_information_retrieval_using_genetic_programming },
ISSN={1041-4347},
month={April},
}

Used References

M. Gordon, "Probabilistic and Genetic Algorithms for Document Retrieval," Comm. ACM, vol. 31, no. 2, pp. 152-169, 1988. http://dx.doi.org/10.1145/63039.63044

M. Gordon and P. Pathak, "Finding Information on the WWW: The Retrieval Effectiveness of Search Engines," Information Processing and Management, vol. 35, no. 2, pp. 141-180, 1999. http://dx.doi.org/10.1016/S0306-4573(98)00041-7

D.K. Harman, "Relevance Feedback Revisited," Proc. 11th ACM SIGIR Conf., pp. 321-331, 1992. http://dx.doi.org/10.1145/133160.133167

D.K. Harman, "Overview of the Fourth Text Retrieval Conference (TREC-4)," Proc. Fourth Text Retrieval Conf., D.K. Harman, ed., pp. 1-24, NIST Special Publication 500-236, 1996.

J.R. Koza, Genetic Programming: On the Programming of Computers by Means of Natural Selection. Cambridge, Mass.: MIT Press, 1992.

F.W. Lancaster and A.J. Warner, Information Retrieval Today. Information Resources Press, 1993.

W. Langdon and R. Poli, Foundations of Genetic Programming. Springer Verlag, 2002. http://dx.doi.org/10.1007/978-3-662-04726-2

T.M. Mitchell, Machine Learning. McGraw Hill, 1997.

H. Ng, W. Goh and K.L. Low, "Feature Selection, Perceptron Learning, and a Usability Case Study for Text Categorization," Proc. 20th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, 1997. http://dx.doi.org/10.1145/258525.258537

S.E. Robertson, S. Walker, S. Jones, M.M. Hancock-Beaulieu and M. Gatford, "Okapi at TREC-4," Proc. Fourth Text Retrieval Conf., D.K. Harman, ed., pp. 73-97, NIST Special Publication 500-236, 1996.

G. Salton, Automatic Text Processing. Addison-Wesley Publishing Co., 1989.

H. Schutze, D. Hull and J. Pedersen, "A Comparison of Classifiers and Document Representations for the Routing Problem," Proc. 16th ACM SIGIR '95, pp. 229-237, 1995. http://dx.doi.org/10.1145/215206.215365

A. Singhal, G. Salton, M. Mitra and C. Buckley, "Document Length Normalization," Information Processing and Management, vol. 32, no. 5, pp. 619-633, 1996. http://dx.doi.org/10.1016/0306-4573(96)00008-8

K. Sparck Jones, "Automatic Indexing," J. Documentation, vol. 30, pp. 393-432, 1974. http://dx.doi.org/10.1108/eb026588

E.M. Voorhees, "Variations in Relevance Judgments and the Measurement of Retrieval Effectiveness," Information Processing and Management, vol. 36, no. 5, pp. 697-716, 2000. http://dx.doi.org/10.1016/S0306-4573(00)00010-8

Links

Full Text

internal file


Other Links