Information Retrieval's Role in RAG Systems
More information to come: stay tuned!
Find the schedule
Find the important dates
Find the call for papers
Find the submission instructions
If you are a RAG-enthusiast, this is the workshop for you


In recent years, Retrieval Augmented Generation (RAG) systems have emerged as a pivotal component in the field of artificial intelligence, gaining significant attention and importance across various domains. These systems, which combine the strengths of information retrieval and generative models, have shown promise in enhancing the capabilities and performance of machine learning applications. However, despite their growing prominence, RAG systems are not without their limitations and continue to be in need of exploration and improvement. This workshop seeks to delve into the critical aspect of information retrieval and its integral role within RAG frameworks. We argue that current efforts have undervalued the role of Information Retrieval (IR) in the RAG and have concentrated their attention on the generative part. As the cornerstone of these systems, IR's effectiveness dramatically influences the overall performance and outcomes of RAG models. We call for papers that will seek to revisit and emphasize the fundamental principles underpinning RAG systems. At the end of the workshop, we aim at having a clearer understanding of how robust information retrieval mechanisms can significantly enhance the capabilities of RAG systems. Participants will engage in discussions and presentations focusing on the latest research, challenges, and potential pathways for advancing the information retrieval component within RAG systems. The workshop will serve as a platform for experts, researchers, and practitioners. We intend to foster discussions, share insights, and encourage research that underscores the vital role of Information Retrieval in the future of generative systems.

Important Dates

Submission deadline: April 25, 2024
Acceptance Notification: May 23, 2024
Camera-ready versions of accepted papers due: June 22, 2024
IR-RAG Workshop: July 18, 2024
Deadlines refer to 23:59 (11:59pm) in the AoE (Anywhere on Earth) time zone.

Call for Papers

The primary purpose of this workshop is to shift the focus onto the often-overlooked retriever mechanism of Retrieval-Augmented Generation (RAG) systems while pondering the question:
Should research in information retrieval change now that RAG systems exist?
By gathering experts, practitioners, and enthusiasts in a dedicated forum, the workshop seeks to spotlight and deliberate on challenges and innovative ideas associated with the retrieval aspect of RAG systems, aiming to foster a solid community around this critical topic. The intent is to generate a collective effort to understand and enhance the retrieval mechanisms better, ensuring they are given as much importance as the generative components. This collaborative environment will not only help in sharing knowledge and best practices but also in inspiring new research and development, ultimately leading to more effective and reliable RAG systems. Through this workshop, we aspire to build a strong foundation and a vibrant community committed to advancing the state of the art in information retrieval for RAG systems.
We invite submissions related to (but not limited to):
Use Of The Retrieved Context By The LLM:
Recent work has demonstrated that RAG systems are sensible to the order and the nature of the retrieved context. These can be considered preliminary results that pave the way for future research.
(Query) Representation Learning:
Improving how queries are represented can significantly enhance the retriever's ability to find relevant documents. This could involve using more advanced natural language processing techniques to understand the context and nuances of the query better.
Incorporating Contextual Information:
Including more context in the retrieval process can improve the relevance of the documents retrieved. This could mean taking into account the broader conversation, user preferences, or historical interactions
Updating the Document Database:
Keeping the document database up-to-date ensures that the retriever has access to the latest and most relevant information. This is particularly important for topics that are rapidly evolving.
Reducing Computational Load:
Optimizing the retriever for speed and efficiency, especially when dealing with large databases, can improve its usability in real-time applications. This might involve techniques for reducing the dimensionality of data or faster search algorithms.
Bias Mitigation:
Actively working to identify and mitigate biases in the retrieval process can improve the fairness and reliability of the retrieved content.
Cross-Lingual Retrieval Capabilities:
For systems operating in multilingual environments, improving the retriever's ability to handle and retrieve documents in various languages can enhance its effectiveness.
Most of the current research has focused on textual RAG, even though multimodality is highly needed in many applications.
One of the goals of this workshop is to collect new ideas and challenges, so proposals in this sense are very much welcomed.

Submission Instructions

All submissions will be peer reviewed (double-blind) by the program committee and judged by their relevance to the workshop, especially to the main themes identified above, and their potential to generate discussion. All submission must be written in English and formatted according to the latest ACM SIG proceedings template available at
Submissions must describe work that is not previously published, not accepted for publication elsewhere, and not currently under review elsewhere.
The workshop follows a double-blind reviewing process. Please note that at least one of the authors of each accepted paper must register for the workshop and present the paper.
We invite research contributions, position, demo and opinion papers. Submissions must either be short (at most 4 pages) or full papers (at most 9 pages). References do not count against the page limit.
We encourage but do not require authors to release any code and/or datasets associated with their paper.

Schedule (tentative)

Start Time
End Time
9:00 AM
9:15 AM
9:15 AM
10:15 AM
Keynote speaker (1)
Nicola Tonellotto, University of Pisa
10:15 AM
11:00 AM
Paper presentations (1)
Accepted presentations here
11:00 AM
11:30 AM
Coffee break
11:30 AM
12:15 PM
Paper presentations (2)
Accepted presentations here
12:15 PM
1:45 PM
Lunch break
1:45 PM
2:45 PM
Keynote speaker (2)
Yuhao Zhang, Samaya AI
2:45 PM
3:30 PM
Poster session
Accepted posters here
3:30 PM
4:00 PM
Refreshment break
4:00 PM
5:00 PM
Breakout Session & discussion among participants
5:00 PM
5:30 PM
Round up & concluding remarks
There are no rows in this table

Keynote Speakers

Nicola Tonellotto
Yuhao Zhang


Foto - Amazon.jpg

Fabio Petroni

Federico Siciliano

Fabrizio Silvestri

Giovanni Trappolini


European Union (EU
Ministero dell’Università e della Ricerca (MUR)

Want to print your doc?
This is not the way.
Try clicking the ⋯ next to your doc name or using a keyboard shortcut (
) instead.