- Groningen
- http://andreasvc.github.io
Stars
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Efficient few-shot learning with Sentence Transformers
Replication code for Chaudhuri et al., "A small set of stylometric features differentiates Latin prose and verse," Digital Scholarship in the Humanities 2018
Package to extract connotation frames
A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python
Material for UvA course Coding the Humanities 2023
Positive-unlabeled learning with Python.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Extraction of structured and unstructured information from fandom.com pages
A simple, bare-bones, no-frills note taking app for Android.
Debian, Ubuntu, and others packaging for ungoogled-chromium
Using machine learning to classify book reviews based on genre
The code used for the paper "Evaluating and Improving the Coreference Capabilities of Machine Translation Models"
An extremely fast Python linter and code formatter, written in Rust.
Cultural Analytics Open Science Guide (powered by Quarto)
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Using OpenAI's Whisper to automatically generate YouTube subtitles
Robust Speech Recognition via Large-Scale Weak Supervision
Master's thesis comparing a rule-based and neural approach for quote attribution evaluated on Dutch literature