
Starred repositories
Variable expansion for dotenv. Expand variables already on your machine for use in your .env file.
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
基于pytorch的中文拼写纠错,使用的模型是Bert以及SoftMaskedBert
This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
OpenMMLab Pre-training Toolbox and Benchmark
Automatically remove the mosaics in images and videos, or add mosaics to them.
all kinds of text classificaiton models and more with deep learning
100+ Chinese Word Vectors 上百种预训练中文词向量
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
Reproducing the paper — Deep Short Text Classification with Knowledge Powered Attention
Open source annotation tool for machine learning practitioners.
State-of-the-Art Text Embeddings
Dynamic graph/network dataset for dynamic graph/network embedding/representation
Archive of Temporal Knowledge Reasoning in Social Network and Knowledge Graph