Skip to content
Change the repository type filter

All

    Repositories list

    • hugme

      Public
      HuGME is an easy-to-use LLM assessment framework, which explicitly rates the Hungarian language skills and cultural knowledge of the models. It can be used to evaluate LLM outputs based on metrics such as hallucination, relevance of response, bias and more.
      Python
      0310Updated Jul 25, 2025Jul 25, 2025
    • HuLU

      Public
      Hungarian Language Understanding Benchmark Kit
      Python
      0900Updated Jul 15, 2025Jul 15, 2025
    • Conference and thesis projects of students
      Python
      0000Updated Apr 29, 2025Apr 29, 2025
    • TSV files of the Parallel Bible Reader
      1120Updated Apr 29, 2025Apr 29, 2025
    • emtsv

      Public
      e-magyar text processing system -- inter-module communication via tsv + REST API
      Python
      112961Updated Apr 18, 2025Apr 18, 2025
    • xtsv

      Public
      A generic TSV-style format based intermodular communication framework and REST API implemented in Python
      HTML
      3101Updated Apr 18, 2025Apr 18, 2025
    • HuCoPA

      Public
      Hungarian Choice of Plausible Alternatives Corpus
      0101Updated Jan 22, 2025Jan 22, 2025
    • HuSST

      Public
      Hungarian version of the Stanford Sentiment Treebank
      0101Updated Jan 17, 2025Jan 17, 2025
    • HuRTE

      Public
      Hungarian version of the Recognising Textual Entailment datasets
      1001Updated Jan 17, 2025Jan 17, 2025
    • HuCOLA

      Public
      Hungarian Corpus of Linguistic Acceptability
      0101Updated Jan 17, 2025Jan 17, 2025
    • 0001Updated Jan 17, 2025Jan 17, 2025
    • HunTag3

      Public
      A sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models
      Lex
      0301Updated Jun 17, 2024Jun 17, 2024
    • HAPP

      Public
      0100Updated Feb 19, 2024Feb 19, 2024
    • The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.
      Shell
      61600Updated Sep 20, 2023Sep 20, 2023
    • emdummy

      Public template
      An example module for emtsv
      Python
      0022Updated May 1, 2023May 1, 2023
    • Dockerfile
      1500Updated Apr 27, 2023Apr 27, 2023
    • news-please - an integrated web crawler and information extractor for news that just works
      Python
      444000Updated Mar 11, 2023Mar 11, 2023
    • Personal site data for Research Group for LangTech @ MTA NYTI
      Perl
      0000Updated Mar 6, 2023Mar 6, 2023
    • PWS

      Public
      TeX
      0000Updated Feb 7, 2023Feb 7, 2023
    • Python
      0100Updated Feb 2, 2023Feb 2, 2023
    • 0000Updated Jan 29, 2023Jan 29, 2023
    • HuWS

      Public
      Hungarian Winograd Schemes
      0100Updated Jan 23, 2023Jan 23, 2023
    • HuWNLI

      Public
      Anaphora resolution datasets for Hungarian as an inference task
      0000Updated Jan 17, 2023Jan 17, 2023
    • Hunspell integrated with the xtsv framework
      Python
      0200Updated Dec 23, 2022Dec 23, 2022
    • Python
      0000Updated Dec 2, 2022Dec 2, 2022
    • Various models trained on parts of Webcorpus 2.0
      0000Updated Nov 10, 2022Nov 10, 2022
    • ParlaMint

      Public
      ParlaMint: Comparable Parliamentary Corpora
      GLSL
      52000Updated Oct 24, 2022Oct 24, 2022
    • Sentence embeddings with autoencoders
      Python
      0000Updated Oct 12, 2022Oct 12, 2022
    • HuWiC

      Public
      Hungarian Word-in-Context Corpus
      Jupyter Notebook
      0100Updated Oct 10, 2022Oct 10, 2022
    • The Hungarian anonymization tool for CURLICAT
      Python
      0100Updated Aug 18, 2022Aug 18, 2022