Skip to content

Commit

Permalink
Update README with features of normalizer
Browse files Browse the repository at this point in the history
  • Loading branch information
kavyamanohar committed Mar 23, 2024
1 parent e29b301 commit 63da685
Showing 1 changed file with 8 additions and 0 deletions.
8 changes: 8 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,13 @@ LibIndic's normalizer module may be used to normalize the text to a canonical
format to handle inconsistencies in text. Right now, it supports
Malayalam language only.

## Features

- Removes punctuations by default
- Changes combination chillus to atomic chillu characters
- Normalization of vowel signs
- Corrects some common typos in Malayalam (needs thorough review)

## Installation

### Directly from git
Expand Down Expand Up @@ -55,3 +62,4 @@ OK
flake8 --max-complexity 10 libindic
```

0 comments on commit 63da685

Please sign in to comment.