-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Epic | Model improvements 2023-02-15 #74
Labels
data preprocessing
consists of all changing, cleaning and validating the data before running the model
feature-engineering
Milestone
Comments
This was referenced Feb 14, 2023
This was referenced Feb 14, 2023
Merged
1 task
apmt
pushed a commit
that referenced
this issue
Feb 17, 2023
apmt
pushed a commit
that referenced
this issue
Feb 17, 2023
apmt
added a commit
that referenced
this issue
Feb 17, 2023
* #76 remove 'y' from consonant sequences feature * #77 add all Mexico states abbreviations and its source in the docstring * #73 implement shannon entropy method and adapt the threshold calculation to also match values below * #73 new model with shannon entropy and notebook sets * #73 fix baja california abbreviation * #73 fix keysmash sequence test to also consider special characters * #74 fix private methods naming convention to double underscores * #75 and #78 add KeySmash features: repeated bigrams and unique chars ratios * #74 fix tests and parser private methods * #74 add model test * #74 update initial sets models --------- Co-authored-by: atarchetti <[email protected]>
apmt
pushed a commit
that referenced
this issue
Feb 23, 2023
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
data preprocessing
consists of all changing, cleaning and validating the data before running the model
feature-engineering
Description
This story was created to describe the new models adjustments for the 2023-02-05 delivery
User story
Shannon Entropy
Tasks
Bigrams Sequence Feature
Tasks
Fix Y as not consonant in consonant sequence KeySmash feature
Tasks
Get context states abbreviations into context abbreviations file
Tasks
Unique Characters
Tasks
The text was updated successfully, but these errors were encountered: