PyTerrier RAG

pyterrier-rag is an extension for PyTerrier that makes it easier to produce retrieval augmented generation pipelines. PyTerrier-RAG supports:

  1. Easy access to common QA datasets

  2. Pre-built indices for common corpora

  3. Popular reader models, such as Fusion-in-Decoder, LLama

  4. Evaluation measures

As well as access to all of the retrievers (sparse, learned sparse, and dense) and rerankers (from MonoT5 to RankGPT) accessible through the wider PyTerrier ecosystem.

Example Notebooks

Try out the following example notebooks to get started with PyTerrier RAG:

Credits

  • Craig Macdonald, University of Glasgow

  • Jinyuan Fang, University of Glasgow

  • Andrew Parry, University of Glasgow

  • Zaiqiao Meng, University of Glasgow

  • Sean MacAvaney, University of Glasgow