[NLPCC 2024] Shared Task 10: Regulating Large Language Models
Updated Jun 12, 2024
Detoxifying Online Discourse: A Guided Response Generation Approach for Reducing Toxicity in User-Generated Text
Fine-tuning FLAN-T5 with PPO and PEFT to generate less toxic text summaries. The notebook uses Meta AI's hate-speech reward model and RLHF techniques to improve safety.
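The detoxification loop described above scores generated text with a reward model and uses PPO to steer the policy toward higher-reward (less toxic) outputs. As a minimal sketch of the reward side, the snippet below substitutes a simple keyword-based scorer for the actual hate-speech classifier (an assumption for illustration, not the notebook's real reward model):

```python
# Sketch of the reward signal used in RLHF-style detoxification.
# The real notebook uses Meta AI's hate-speech classifier as the reward
# model; this keyword scorer is a hypothetical stand-in so the example
# runs without any model downloads.

TOXIC_WORDS = {"hate", "stupid", "idiot"}


def toxicity_reward(text: str) -> float:
    """Return a reward in [-1, 1]; higher means less toxic."""
    tokens = [t.strip(".,!?") for t in text.lower().split()]
    if not tokens:
        return 1.0
    toxic = sum(t in TOXIC_WORDS for t in tokens)
    return 1.0 - 2.0 * toxic / len(tokens)


# PPO would use these per-sample rewards to update the fine-tuned
# FLAN-T5 policy; here we only score candidate generations.
candidates = ["a clear, polite summary", "you idiot, this is stupid"]
scores = [toxicity_reward(c) for c in candidates]
```

In the actual pipeline, this scalar reward feeds a PPO trainer (e.g. from the `trl` library) while PEFT/LoRA keeps the number of trainable parameters small.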
PMLDL Assignment 1