Research Article
A Novel Term-Term Similarity Score Based Information Foraging Assessment
@INPROCEEDINGS{10.1007/978-3-319-19656-5_5, author={Ilyes Khennak and Habiba Drias and Hadia Mosteghanemi}, title={A Novel Term-Term Similarity Score Based Information Foraging Assessment}, proceedings={Internet of Things. User-Centric IoT. First International Summit, IoT360 2014, Rome, Italy, October 27-28, 2014, Revised Selected Papers, Part I}, proceedings_a={IOT360}, year={2015}, month={7}, keywords={Information retrieval Information foraging theory Query expansion Term proximity Term co-occurrence}, doi={10.1007/978-3-319-19656-5_5} }
- Ilyes Khennak
Habiba Drias
Hadia Mosteghanemi
Year: 2015
A Novel Term-Term Similarity Score Based Information Foraging Assessment
IOT360
Springer
DOI: 10.1007/978-3-319-19656-5_5
Abstract
The dramatic proliferation of information on the web and the tremendous growth in the number of files published and uploaded online each day have led to the appearance of new words in the Internet. Due to the difficulty of reaching the meanings of these new terms, which play a central role in retrieving the desired information, it becomes necessary to give more importance to the sites and topics where these new words appear, or rather, to give value to the words that occur frequently with them. For this aim, in this paper, we propose a novel term-term similarity score based on the co-occurrence and closeness of words for retrieval performance improvement. A novel efficiency/effectiveness measure based on the principle of optimal information forager is also proposed in order to assess the quality of the obtained results. Our experiments were performed using the OHSUMED test collection and show significant effectiveness enhancement over the state-of-the-art.