Skip to content
View FlackoJodye1's full-sized avatar

Block or report FlackoJodye1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
FlackoJodye1/README.md

Hi, I'm Paul 👋

AI / NLP Engineer focused on RAG systems, semantic search, and LLM evaluation

🎓 M.Sc. Data Science @ TU Wien (2024–2026)
📍 Vienna
📄 Co-author @ ACL BioNLP 2025


🚀 Featured Projects

Verbatim RAG Framework

Evidence-grounded QA with span-level retrieval and citation generation

  • End-to-end retrieval + grounding pipeline
  • Designed for clinical QA (EHR setting)
  • Applied in ArchEHR-QA shared task

👉 https://github.com/KRLabsOrg/verbatim-rag


Sentence Relevance Model

Core component of the Verbatim RAG pipeline for sentence-level relevance classification

  • Developed transformer-based relevance model for fine-grained retrieval
  • Trained on RAGBench + medical QA datasets
  • Enables sentence-level grounding and evaluation

👉 https://github.com/KRLabsOrg/verbatim-rag


Temporal Word Embeddings

Modeling semantic change using diachronic embeddings and TPPMI

  • Trained on 100k+ documents (NYT corpus)
  • Evaluated on 8k+ temporal analogy queries
  • Published at KONVENS / CPSS 2024

👉 https://github.com/FlackoJodye1/temporal-word-embeddings


🧪 Current Work

Word Sense Disambiguation & Semantic Change (German)

Ongoing research on WSD and diachronic semantic change using contextual embeddings

  • Sense representation with GermaNet + LLM enrichment
  • Embedding-based WSD (multilingual-e5)
  • Large-scale diachronic analysis on DWDS corpus

📄 Planned submission to EMNLP 2026 👉 https://2026.emnlp.org/
(repo is private for the moment)


📚 Publications


🧠 Interests

  • Retrieval-Augmented Generation (RAG)
  • NLP & Transformer Models
  • LLM Evaluation & Grounding
  • Semantic Search & Embeddings

🔗 Links

Pinned Loading

  1. KRLabsOrg/verbatim-rag KRLabsOrg/verbatim-rag Public

    Hallucination-prevention RAG system with verbatim span extraction. Ensures all generated content is grounded in source documents with exact citations.

    Python 174 22

  2. Word-Sense-Disambiguation Word-Sense-Disambiguation Public

    BERT-based system for German word sense disambiguation

    Python

  3. temporal-word-embeddings temporal-word-embeddings Public

    Source code of my bachelors thesis on the topic of temporal word embeddings

    Python

  4. edge-path-bundling edge-path-bundling Public

    Interactive visualization of Edge-Path Bundling algorithm — implemented from scratch in Python with support for geospatial, migration, and 3D brain connectivity datasets.

    Python