Research Article

A Survey of Query Refinement Techniques From Neural Architectures to Practical Applications

by  Zahra Taheri, Mahdis Saeedi, Ziad Kobti, Hossein Fani
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Issue 36
Published: September 2025
Authors: Zahra Taheri, Mahdis Saeedi, Ziad Kobti, Hossein Fani
10.5120/ijca2025925604
PDF

Zahra Taheri, Mahdis Saeedi, Ziad Kobti, Hossein Fani . A Survey of Query Refinement Techniques From Neural Architectures to Practical Applications. International Journal of Computer Applications. 187, 36 (September 2025), 1-10. DOI=10.5120/ijca2025925604

                        @article{ 10.5120/ijca2025925604,
                        author  = { Zahra Taheri,Mahdis Saeedi,Ziad Kobti,Hossein Fani },
                        title   = { A Survey of Query Refinement Techniques From Neural Architectures to Practical Applications },
                        journal = { International Journal of Computer Applications },
                        year    = { 2025 },
                        volume  = { 187 },
                        number  = { 36 },
                        pages   = { 1-10 },
                        doi     = { 10.5120/ijca2025925604 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2025
                        %A Zahra Taheri
                        %A Mahdis Saeedi
                        %A Ziad Kobti
                        %A Hossein Fani
                        %T A Survey of Query Refinement Techniques From Neural Architectures to Practical Applications%T 
                        %J International Journal of Computer Applications
                        %V 187
                        %N 36
                        %P 1-10
                        %R 10.5120/ijca2025925604
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Query refinement plays a central role in modern information retrieval (IR) systems by improving query clarity, resolving ambiguity, and enhancing result relevance. This survey provides a comprehensive overview of the model architectures and application domains associated with query refinement techniques. The paper first examines classical non-neural models and then explores a range of neural architectures, including embedding-based methods, recurrent neural networks (RNNs), sequence-to-sequence (seq2seq) frameworks, and transformer-based models. Special attention is given to the progression from static representations to contextaware and generative approaches, with an emphasis on how these models capture user intent and session context. The study then reviews the deployment of query refinement methods across practical domains such as product search, music retrieval, job search, and personalized information access. These applications demonstrate the real-world impact of query refinement in handling ambiguous queries, adapting to user preferences, and improving overall retrieval performance. By highlighting key advancements and challenges, this survey offers insight into the current state and future direction of query refinement research.

References
  • Sanjay Agrawal, Srujana Merugu, and Vivek Sembium. Enhancing e-commerce product search through reinforcement learning-powered query reformulation. In Proceedings of the 32nd ACM International CIKM, 2023.
  • Wasi Uddin Ahmad, Kai-Wei Chang, and Hongning Wang. Context attentive document ranking and query suggestion. In Proceedings of the 42nd international ACM SIGIR, 2019.
  • Mohammad Aliannejadi, Hamed Zamani, Fabio Crestani, and W Bruce Croft. Asking clarifying questions in opendomain information-seeking conversations. In Proceedings of the 42nd international acm sigir, 2019.
  • Rajul Anand and Alexander Kotov. An empirical comparison of statistical term association graphs with dbpedia and conceptnet for query expansion. In Proceedings of the 7th Annual Meeting of the Forum for Information Retrieval Evaluation, 2015.
  • Hiteshwar Kumar Azad and Akshay Deepak. A new approach for query expansion using wikipedia and wordnet. Information sciences, 2019.
  • Andrea Bacciu, Enrico Palumbo, Andreas Damianou, Nicola Tonellotto, and Fabrizio Silvestri. Generating query recommendations via llms. arXiv preprint arXiv:2405.19749, 2024.
  • Jinheon Baek, Nirupama Chandrasekaran, Silviu Cucerzan, Allen Herring, and Sujay Kumar Jauhar. Knowledgeaugmented large language models for personalized contextual query suggestion. In Proceedings of the ACMWeb Conference 2024, 2024.
  • Jing Bai, Dawei Song, Peter Bruza, Jian-Yun Nie, and Guihong Cao. Query expansion using term relationships in language models for information retrieval. In Proceedings of the 14th ACM international CIKM, 2005.
  • Paul N Bennett, Filip Radlinski, Ryen W White, and Emine Yilmaz. Inferring and using location metadata to personalize web search. In Proceedings of the 34th international ACM SIGIR, 2011.
  • Sumit Bhatia, Debapriyo Majumdar, and Prasenjit Mitra. Query suggestions in the absence of query logs. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, 2011.
  • Alexey Borisov, Martijn Wardenaar, Ilya Markov, and Maarten de Rijke. A click sequence model for web search. SIGIR ’18. Association for Computing Machinery.
  • Fei Cai, Maarten De Rijke, et al. A survey of query auto completion in information retrieval. Foundations and Trends® in Information Retrieval, 2016.
  • Guihong Cao, Jian-Yun Nie, Jianfeng Gao, and Stephen Robertson. Selecting good expansion terms for pseudorelevance feedback. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, 2008.
  • Kaibo Cao, Chunyang Chen, Sebastian Baltes, Christoph Treude, and Xiang Chen. Automated query reformulation for efficient search based on query logs from stack overflow. In 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE). IEEE.
  • Mark J Carman, Fabio Crestani, Morgan Harvey, and Mark Baillie. Towards query log based personalization using topic models. In Proceedings of the 19th ACM international CIKM, 2010.
  • Claudio Carpineto, Renato De Mori, Giovanni Romano, and Brigitte Bigi. An information-theoretic approach to automatic query expansion. ACM Transactions on Information Systems (TOIS), 2001.
  • Claudio Carpineto and Giovanni Romano. A survey of automatic query expansion in information retrieval. Acm Computing Surveys (CSUR), 2012.
  • Jia Chen, Jiaxin Mao, Yiqun Liu, Min Zhang, and Shaoping Ma. A context-aware click model for web search. In Proceedings of the 13th international conference on web search and data mining, 2020.
  • Wanyu Chen, Fei Cai, Honghui Chen, and Maarten de Rijke. Hierarchical neural query suggestion with an attention mechanism. Information Processing & Management, 2020.
  • Mostafa Dehghani, Sascha Rothe, Enrique Alfonseca, and Pascal Fleury. Learning to attend, copy, and generate for session-based query suggestion. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017.
  • Chenlong Deng, Yujia Zhou, and Zhicheng Dou. Improving personalized search with dual-feedback network. In Proceedings of the fifteenth ACM international conference on web search and data mining, 2022.
  • Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics, 2019.
  • Pierre Erbacher, Ludovic Denoyer, and Laure Soulier. Interactive query clarification and refinement via user simulation. In Proceedings of the 45th International ACM SIGIR, 2022.
  • Songwei Ge, Zhicheng Dou, Zhengbao Jiang, Jian-Yun Nie, and Ji-Rong Wen. Personalizing search results using hierarchical rnn with query-aware attention. In Proceedings of the 27th ACM international CIKM, 2018.
  • Helia Hashemi, Hamed Zamani, andWBruce Croft. Learning multiple intent representations for search queries. In Proceedings of the 30th ACM International CIKM, 2021.
  • Sharon Hirsch, Ido Guy, Alexander Nus, Arnon Dagan, and Oren Kurland. Query reformulation in e-commerce search. In Proceedings of the 43rd International ACM SIGIR, pages 1319–1328, 2020.
  • Nick Jardine and Cornelis Joost van Rijsbergen. The use of hierarchic clustering in information retrieval. Information storage and retrieval, 1971.
  • Jyun-Yu Jiang and Wei Wang. Rin: Reformulation inference network for context-aware query suggestion. In Proceedings of the 27th ACM international CIKM, 2018.
  • Karen Sparck Jones. Automatic keyword classification for information retrieval. Archon Books, 1971.
  • Payam Karisani, Maseud Rahgozar, and Farhad Oroumchian. A query term re-weighting approach using document similarity. Information Processing & Management, 2016.
  • Andisheh Keikha, Faezeh Ensan, and Ebrahim Bagheri. Query expansion using pseudo relevance feedback on wikipedia. Journal of Intelligent Information Systems, 2018.
  • Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Ves Stoyanov, and Luke Zettlemoyer. Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461, 2019.
  • Ruirui Li, Liangda Li, Xian Wu, Yunhong Zhou, and Wei Wang. Click feedback-aware query recommendation using adversarial examples. In The World Wide Web Conference, 2019.
  • Alva Liu, Humberto Jes´us Corona Pampin, and Enrico Palumbo. Bootstrapping query suggestions in spotify’s instant search system. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023.
  • Shuang Liu, Fang Liu, Clement Yu, and Weiyi Meng. An effective approach to document retrieval via utilizing wordnet and recognizing phrases. In Proceedings of the 27th annual international ACM SIGIR, 2004.
  • Shuqi Lu, Zhicheng Dou, Xu Jun, Jian-Yun Nie, and Ji-Rong Wen. Psgan: A minimax game for personalized search with limited and noisy click data. In Proceedings of the 42nd international ACM SIGIR, 2019.
  • Yuanhua Lv and ChengXiang Zhai. Positional relevance model for pseudo-relevance feedback. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, 2010.
  • Melvin Earl Maron and John L Kuhns. On relevance, probabilistic indexing and information retrieval. Journal of the ACM (JACM), 1960.
  • Alan Medlar, Jing Li, and Dorota Głowacka. Query suggestions as summarization in exploratory search. In Proceedings of the 2021 Conference on Human Information Interaction and Retrieval, 2021.
  • Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013.
  • George A Miller. Wordnet: a lexical database for english. Communications of the ACM, 1995.
  • Jack Minker, Gerald A Wilson, and Barbara H Zimmerman. An evaluation of query expansion by the addition of clustered terms for a document retrieval system. Information Storage and Retrieval, 1972.
  • Bhaskar Mitra. Exploring session context using distributed representations of queries and reformulations. In Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, 2015.
  • Agn`es Mustar, Sylvain Lamprier, and Benjamin Piwowarski. Using bert and bart for query suggestion. In Joint Conference of the Information Retrieval Communities in Europe, 2020.
  • Shumpei Okura, Yukihiro Tagami, Shingo Ono, and Akira Tajima. Embedding-based news recommendation for millions of users. In Proceedings of the 23rd ACM SIGKDD, 2017.
  • Rajvardhan Patil, Sorio Boit, Venkat Gudivada, and Jagadeesh Nandigam. A survey of text representation and embedding techniques in nlp. 2023.
  • Joseph John Rocchio Jr. Relevance feedback in information retrieval. The SMART retrieval system: experiments in automatic document processing, 1971.
  • Dwaipayan Roy, Debjyoti Paul, Mandar Mitra, and Utpal Garain. Using word embeddings for automatic query expansion. arXiv preprint arXiv:1606.07608, 2016.
  • Tobias Schnabel, Igor Labutov, David Mimno, and Thorsten Joachims. Evaluation methods for unsupervised word embeddings. In Proceedings of the 2015 conference on empirical methods in natural language processing, 2015.
  • Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob Grue Simonsen, and Jian-Yun Nie. A hierarchical recurrent encoder-decoder for generative contextaware query suggestion. In proceedings of the 24th ACM international CIKM, 2015.
  • Cornelis Joost Van Rijsbergen. A theoretical basis for the use of co-occurrence data in information retrieval. Journal of documentation, 1977.
  • Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. Advances in neural information processing systems, 2017.
  • Thanh Vu, Dat Quoc Nguyen, Mark Johnson, Dawei Song, and Alistair Willis. Search personalization with embeddings. In European Conference on Information Retrieval. Springer, 2017.
  • Sida Wang, Weiwei Guo, Huiji Gao, and Bo Long. Efficient neural query auto completion. In Proceedings of the 29th ACM International CIKM, 2020.
  • Bin Wu, Chenyan Xiong, Maosong Sun, and Zhiyuan Liu. Query suggestion with feedback memory network. In Proceedings of the 2018 World Wide Web Conference, 2018.
  • Jing Yao, Zhicheng Dou, and Ji-Rong Wen. Employing personal word embeddings for personalized search. In Proceedings of the 43rd international ACM SIGIR, 2020.
  • Jiuling Zhang, Beixing Deng, and Xing Li. Concept based query expansion using wordnet. In 2009 international econference on advanced science and technology. IEEE, 2009.
  • Zhi Zheng, Kai Hui, Ben He, Xianpei Han, Le Sun, and Andrew Yates. Contextualized query expansion via unsupervised chunk selection for text retrieval. Information Processing & Management, 2021.
  • Jianling Zhong, Weiwei Guo, Huiji Gao, and Bo Long. Personalized query suggestions. In Proceedings of the 43rd International ACM SIGIR, 2020.
  • Dong Zhou, S´eamus Lawless, Xuan Wu, Wenyu Zhao, and Jianxun Liu. A study of user profile representation for personalized cross-language information retrieval. Aslib Journal of Information Management, 2016.
  • Yujia Zhou, Zhicheng Dou, Bingzheng Wei, Ruobing Xie, and Ji-Rong Wen. Group based personalized search by integrating search behaviour and friend network. In Proceedings of the 44th international ACM SIGIR, 2021.
  • Yujia Zhou, Zhicheng Dou, and Ji-Rong Wen. Encoding history with context-aware representation learning for personalized search. In Proceedings of the 43rd international ACM SIGIR, 2020.
  • Yujia Zhou, Zhicheng Dou, Yutao Zhu, and Ji-Rong Wen. Pssl: self-supervised learning for personalized search with contrastive sampling. In Proceedings of the 30th ACM international CIKM, 2021.
  • Zile Zhou, Xiao Zhou, Mingzhe Li, Yang Song, Tao Zhang, and Rui Yan. Personalized query suggestion with searching dynamic flow for online recruitment. In Proceedings of the 31st ACM International CIKM, 2022.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Query Reformulation Query Suggestion Information retrieval Web Search

Powered by PhDFocusTM