Abstract
Machine learning techniques have been considered a very promising solution to web information retrieval, which is based on the ranking of the relevance of samples to a query input. However, the connotation of labeling in ranking is quite different from that in classification. Specifically, the labeling of samples for ranking is usually incomplete, i.e. only a part of samples are labeled. In order to remedy this methodological gap, in this paper we propose a hybrid learning framework, called fuzzy-label learning, which consists of two layers. First, we utilize a label-propagation algorithm to estimate those labels of unlabeled samples by their neighborhoods. Second, we adopt RankBoost on the samples with fuzzy labels. Experiments with five-fold cross-validation using the Letor benchmark datasets show that the proposed hybrid learning framework can definitively improve the search performance achieved by the RankBoost algorithm for web information retrieval.
Original language | English |
---|---|
Title of host publication | 2008 IEEE International Conference Neural Networks and Signal Processing, ICNNSP |
Pages | 569-574 |
Number of pages | 6 |
DOIs | |
Publication status | Published - 22 Sept 2008 |
Event | 2008 IEEE International Conference Neural Networks and Signal Processing, ICNNSP - Zhenjiang, China Duration: 7 Jun 2008 → 11 Jun 2008 |
Conference
Conference | 2008 IEEE International Conference Neural Networks and Signal Processing, ICNNSP |
---|---|
Country/Territory | China |
City | Zhenjiang |
Period | 7/06/08 → 11/06/08 |
Keywords
- Fuzzy set
- Machine learning
ASJC Scopus subject areas
- Artificial Intelligence
- Signal Processing