Accented characters #23

nilem · 2015-10-05T15:42:47Z

Hi! Your project is really great! But it doesn't support accented characters well. Right now, converting the word "préféré" for instance will return "pr_f_e_r", which is much less readable then "prefere".

I added those lines in case_parse.py (39), which doesn't handle all cases, but help a little bit :

arrayR = [(u'\xc9', 'e'),(u'\xc8', 'e'),(u'\xca', 'e'),(u'\xe8', 'e'),(u'\xe9', 'e'),(u'\xea', 'e'),(u'\xc0', 'a'),(u'\xc1', 'a'),(u'\xc2', 'a'),(u'\xe0', 'a'),(u'\xe1', 'a'),(u'\xe2', 'a')]

for a,b in arrayR:
    var = var.replace(a,b)

I hope this can help.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accented characters #23

Accented characters #23

nilem commented Oct 5, 2015

Accented characters #23

Accented characters #23

Comments

nilem commented Oct 5, 2015