FlackoJodye1

Follow

Paul FlackoJodye1

Follow

Machine Learning Engineer — NLP, LLM Systems & Production ML — Data Science @ TU Wien

3 followers · 10 following

23:18 (UTC +02:00)
in/paul-leon-schmitt-977b9419b

FlackoJodye1/README.md

Hi, I'm Paul 👋

AI / NLP Engineer focused on RAG systems, semantic search, and LLM evaluation

🎓 M.Sc. Data Science @ TU Wien (2024–2026)
📍 Vienna
📄 Co-author @ ACL BioNLP 2025

🚀 Featured Projects

Verbatim RAG Framework

Evidence-grounded QA with span-level retrieval and citation generation

End-to-end retrieval + grounding pipeline
Designed for clinical QA (EHR setting)
Applied in ArchEHR-QA shared task

👉 https://github.com/KRLabsOrg/verbatim-rag

Sentence Relevance Model

Core component of the Verbatim RAG pipeline for sentence-level relevance classification

Developed transformer-based relevance model for fine-grained retrieval
Trained on RAGBench + medical QA datasets
Enables sentence-level grounding and evaluation

👉 https://github.com/KRLabsOrg/verbatim-rag

Temporal Word Embeddings

Modeling semantic change using diachronic embeddings and TPPMI

Trained on 100k+ documents (NYT corpus)
Evaluated on 8k+ temporal analogy queries
Published at KONVENS / CPSS 2024

👉 https://github.com/FlackoJodye1/temporal-word-embeddings

🧪 Current Work

Word Sense Disambiguation & Semantic Change (German)

Ongoing research on WSD and diachronic semantic change using contextual embeddings

Sense representation with GermaNet + LLM enrichment
Embedding-based WSD (multilingual-e5)
Large-scale diachronic analysis on DWDS corpus

📄 Planned submission to EMNLP 2026 👉 https://2026.emnlp.org/
(repo is private for the moment)

📚 Publications

ACL BioNLP 2025 – ArchEHR-QA Shared Task (Top-10)
https://aclanthology.org/2025.bionlp-share.8/
KONVENS / CPSS 2024 – Temporal Word Embeddings
https://aclanthology.org/2024.cpss-1.10/

🧠 Interests

Retrieval-Augmented Generation (RAG)
NLP & Transformer Models
LLM Evaluation & Grounding
Semantic Search & Embeddings

🔗 Links

GitHub: https://github.com/FlackoJodye1
Google Scholar: https://scholar.google.com/citations?user=hmV4GPEAAAAJ

Pinned Loading

KRLabsOrg/verbatim-rag KRLabsOrg/verbatim-rag Public

Hallucination-prevention RAG system with verbatim span extraction. Ensures all generated content is grounded in source documents with exact citations.

Python 174 22
Word-Sense-Disambiguation Word-Sense-Disambiguation Public

BERT-based system for German word sense disambiguation

Python
temporal-word-embeddings temporal-word-embeddings Public

Source code of my bachelors thesis on the topic of temporal word embeddings

Python
edge-path-bundling edge-path-bundling Public

Interactive visualization of Edge-Path Bundling algorithm — implemented from scratch in Python with support for geospatial, migration, and 3D brain connectivity datasets.

Python