From 63da6857d86beaaa14a8da349f38dd74de3483d7 Mon Sep 17 00:00:00 2001 From: Kavya Manohar Date: Sat, 23 Mar 2024 14:13:00 +0530 Subject: [PATCH] Update README with features of normalizer --- README.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/README.md b/README.md index de3d1fe..de2d2ad 100644 --- a/README.md +++ b/README.md @@ -4,6 +4,13 @@ LibIndic's normalizer module may be used to normalize the text to a canonical format to handle inconsistencies in text. Right now, it supports Malayalam language only. +## Features + +- Removes punctuations by default +- Changes combination chillus to atomic chillu characters +- Normalization of vowel signs +- Corrects some common typos in Malayalam (needs thorough review) + ## Installation ### Directly from git @@ -55,3 +62,4 @@ OK flake8 --max-complexity 10 libindic ``` +