Skip to content

Terminology Enhanced Retrieval Augmented Generation for Spanish Legal Corpora

Notifications You must be signed in to change notification settings

oeg-upm/term-rag

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Terminology Enhanced Retrieval Augmented Generation for Spanish Legal Corpora

About this work

This work intends to highlight the importance of reusing terminologies in the context of Large Language Models (LLMs), particularly within a Retrieval-Augmented Generation (RAG) scenario. We explore the application of query expansion techniques using a controlled terminology enriched with synonyms. Our case study focuses on the Spanish legal domain, investigating both query expansion and improvements in retrieval effectiveness within the RAG model. The experimental setup includes various LLMs, such as Mistral, Llama 3.2, and Granite 3, along with multiple Spanish-language embedding models. The results demonstrate that integrating current neural approaches with linguistic resources enhances RAG performance, reinforcing the role of structured lexical and terminological knowledge in modern NLP pipelines.

About

Terminology Enhanced Retrieval Augmented Generation for Spanish Legal Corpora

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages