The 7th of November 2024, Daniel and Jettro will visit Pathe in Ede for the annual JFall conference. In the morning, we will deliver our workshop Build the best knowledge retriever for your Large Language Model.. Generative AI is here to stay. Tools to generate text, images, or data are now common goods. Large Language models (LLMs) only have the knowledge they acquired through learning, and even that knowledge does not include all the details. To overcome the knowledge problem, the Retrieval Augmented Generation (RAG) pattern arose. An essential part of RAG is the retrieval part. Retrieval is not new. The search or retrieval domain is rich with tools, metrics and research. The new kid on the block is semantic search using vectors. Vector search got a jump start with the rise of LLMs and RAG. This workshop aims to build a high-quality retriever, integrate the retriever into your LLM solution and measure the overall quality of your RAG system.

The workshop uses our Rag4j/Rag4p framework, which we created especially for workshops. It is easy to learn, so you can focus on understanding and building the details of the components during the workshop. You experiment with different chunking mechanisms (sentence, max tokens, semantic). After that, you use various strategies to construct the context for the LLM (TopN, Window, Document, Hierarchical). To find the optimum combination, you’ll use quality metrics for the retriever as well as the other components of the RAG system. You can do the workshop using Python or Java (21). We provide access to a remote LLM (OpenAI). You can also run an open-source LLM on Ollama on your local machine.

We hope to see you in Pathe Ede and that you will enjoy the workshop. If you have any questions, feel free to contact us.

7th of November 2024 10:35 - 12:30 Space for Hands-on labs


Preparing for the workshop

First, choose your programming language (Java or Python). Then follow the instructions below:

Java: RAG4j

Python: RAG4p

The repositories contain a README.md file with instructions on setting up your environment.


The presentation used during the workshop is available here.


 

Workshop Image