ExcluIR: Exclusionary Neural Information Retrieval

Wenhao Zhang, Mengqi Zhang, Shiguang Wu, Jiahuan Pei, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Pengjie Ren

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Exclusion is an important and universal linguistic skill that humans use to express what they do not want. There is little research on exclusionary retrieval, where users express what they do not want to be part of the results produced for their queries. We investigate the scenario of exclusionary retrieval in document retrieval for the first time. We present ExcluIR, a set of resources for exclusionary retrieval, consisting of an evaluation benchmark and a training set for helping retrieval models to comprehend exclusionary queries. The evaluation benchmark includes 3,452 high-quality exclusionary queries, each of which has been manually annotated. The training set contains 70,293 exclusionary queries, each paired with a positive document and a negative document. We conduct detailed experiments and analyses, obtaining three main observations: (i) existing retrieval models with different architectures struggle to comprehend exclusionary queries effectively; (ii) although integrating our training data can improve the performance of retrieval models on exclusionary retrieval, there still exists a gap compared to human performance; and (iii) generative retrieval models have a natural advantage in handling exclusionary queries.
Original languageEnglish
Title of host publicationAAAI'25/IAAI'25/EAAI'25: Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence and Thirty-Seventh Conference on Innovative Applications of Artificial Intelligence and Fifteenth Symposium on Educational Advances in Artificial Intelligence
EditorsToby Walsh, Julie Shah, Zico Kolter
PublisherACM
Pages13295-13303
Number of pages9
ISBN (Electronic)9781577358978
DOIs
Publication statusPublished - 2025
Externally publishedYes

Funding

This work was supported by the Key R&D Program of Shandong Province with grant 2024CXGC010108, the Natural Science Foundation of China (62472261, 62102234, 62372275, 62272274, 62202271, T2293773, 62072279), the National Key R&D Program of China with grant No.2022YFC3303004, the Natural Science Foundation of Shandong Province (ZR2021QF129), and by the Dutch Research Council (NWO), under project numbers 024.004.022, NWA.1389.20.183, and KICH3.LTP.20.006, and the European Union's Horizon Europe program under grant agreement No 101070212. All content represents the opinion of the authors, which is not necessarily shared or endorsed by their respective employers and/or sponsors.

FundersFunder number
Nederlandse Organisatie voor Wetenschappelijk OnderzoekKICH3.LTP.20.006, 024.004.022, NWA.1389.20.183
National Key Research and Development Program of China2022YFC3303004
European Union's Horizon Europe program101070212
National Natural Science Foundation of China62202271, 62372275, T2293773, 62272274, 62472261, 62072279, 62102234
Key R&D Program of Shandong Province2024CXGC010108
Natural Science Foundation of Shandong ProvinceZR2021QF129

    Fingerprint

    Dive into the research topics of 'ExcluIR: Exclusionary Neural Information Retrieval'. Together they form a unique fingerprint.

    Cite this