Epic | Model improvements 2023-02-15 #74

apmt · 2023-02-14T04:25:44Z

Description

This story describes the new model adjustments for the 2023-02-05 delivery.

User story

Shannon Entropy

Who	When	Then
Tech support	needs to add Shannon Entropy to the KeySmash features	in order to validate whether the model performs better

Tasks

Implement shannon entropy #73
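
As a rough illustration of the feature requested here, Shannon entropy over the characters of a string could be computed as below. This is a minimal sketch, not the implementation from #73: the function name `shannon_entropy` and the choice of bits (log base 2) are assumptions.

```python
import math
from collections import Counter


def shannon_entropy(text: str) -> float:
    """Return the Shannon entropy of `text` in bits per character.

    Hypothetical helper: the name and the log base are assumptions,
    not taken from the project's code.
    """
    if not text:
        return 0.0
    total = len(text)
    counts = Counter(text)
    # H = sum(p * log2(1 / p)) over the distinct characters
    return sum((n / total) * math.log2(total / n) for n in counts.values())
```

A string made of one repeated character scores 0, while a string whose characters are all distinct scores log2 of its length, so the value depends on the string length and would probably need a threshold or normalization before it is used as a feature.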

Bigrams Sequence Feature

Who	When	Then
Tech support	needs to add the Bigrams Sequence Feature to the KeySmash features	in order to validate whether the model performs better

Tasks

Bigrams Sequence Feature #75
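
One possible reading of a repeated bigrams ratio is sketched below. The definition is an assumption (the share of bigram occurrences whose bigram appears more than once in the string), and the name `repeated_bigrams_ratio` is hypothetical; the feature implemented in #75 may be defined differently.

```python
from collections import Counter


def repeated_bigrams_ratio(text: str) -> float:
    """Share of bigram occurrences whose bigram appears more than once.

    Assumed definition, for illustration only.
    """
    # Overlapping character pairs: "abab" -> ["ab", "ba", "ab"]
    bigrams = [text[i:i + 2] for i in range(len(text) - 1)]
    if not bigrams:
        return 0.0
    counts = Counter(bigrams)
    repeated = sum(n for n in counts.values() if n > 1)
    return repeated / len(bigrams)
```

Under this definition, a string built from a short repeating pattern such as "asdasdasd" scores 1.0, and a string with no repeated pair scores 0.0.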

Fix Y as not consonant in consonant sequence KeySmash feature

Who	When	Then
Tech support	needs to stop treating Y as a consonant in the consonant sequence KeySmash feature	in order to validate whether the model performs better

Tasks

Fix Y as not consonant in consonant sequence KeySmash feature #76
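
To make the intended behavior concrete, the sketch below measures the longest consonant run while treating 'y' as a non-consonant. It assumes the feature is based on run length over ASCII letters; the names `CONSONANTS_WITHOUT_Y` and `longest_consonant_run` are hypothetical and not taken from the project.

```python
import re

# The 20 consonants of the English alphabet, with 'y' deliberately left out
CONSONANTS_WITHOUT_Y = "bcdfghjklmnpqrstvwxz"
_CONSONANT_RUN = re.compile(f"[{CONSONANTS_WITHOUT_Y}]+")


def longest_consonant_run(text: str) -> int:
    """Length of the longest run of consonants, with 'y' breaking a run."""
    runs = _CONSONANT_RUN.findall(text.lower())
    return max((len(run) for run in runs), default=0)
```

With this rule, "rhythm" splits into the runs "rh" and "thm" and so yields 3 rather than 6, which keeps ordinary words containing 'y' from looking like keyboard smashes.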

Get context states abbreviations into context abbreviations file

Who	When	Then
Tech support	needs to move the context state abbreviations into the context abbreviations file	in order to validate whether the model performs better

Tasks

Get context states abbreviations into context abbreviations file #77

Unique Characters

Who	When	Then
Tech support	needs to add Unique Characters to the KeySmash features	in order to validate whether the model performs better

Tasks

Unique Characters #78
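
A unique characters ratio could be sketched as the number of distinct characters divided by the string length. This is an assumed definition with a hypothetical function name, `unique_chars_ratio`, and may differ from what #78 implements.

```python
def unique_chars_ratio(text: str) -> float:
    """Distinct characters divided by total characters (assumed definition)."""
    if not text:
        return 0.0
    return len(set(text)) / len(text)
```

Low values indicate heavy repetition (for example "aaaa" gives 0.25), while 1.0 means every character is different.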

* #76 remove 'y' from consonant sequences feature
* #77 add all Mexico states abbreviations and its source in the docstring
* #73 implement shannon entropy method and adapt the threshold calculation to also match values below
* #73 new model with shannon entropy and notebook sets
* #73 fix baja california abbreviation
* #73 fix keysmash sequence test to also consider special characters
* #74 fix private methods naming convention to double underscores
* #75 and #78 add KeySmash features: repeated bigrams and unique chars ratios
* #74 fix tests and parser private methods
* #74 add model test
* #74 update initial sets models

---------

Co-authored-by: atarchetti <[email protected]>

This was referenced Feb 14, 2023

Implement shannon entropy #73

Open

Bigrams Sequence Feature #75

Open

apmt self-assigned this Feb 14, 2023

apmt added the feature-engineering and data preprocessing (changing, cleaning and validating the data before running the model) labels Feb 14, 2023

apmt added this to the Sprint 5 milestone Feb 14, 2023

This was referenced Feb 14, 2023

Fix Y as not consonant in consonant sequence KeySmash feature #76

Open

Get context states abbreviations into context abbreviations file #77

Open

Unique Characters #78

Open

apmt changed the title ~~Epic | Model adjustments 2023-02-15~~ Epic | Model improvements 2023-02-15 Feb 15, 2023

apmt linked a pull request Feb 16, 2023 that will close this issue

74/model improvements #81

Merged

1 task

apmt pushed a commit that referenced this issue Feb 17, 2023

#74 fix private methods naming convention to double underscores

ff7bbb4

apmt pushed a commit that referenced this issue Feb 17, 2023

#74 fix tests and parser private methods

1ae15d4

apmt pushed a commit that referenced this issue Feb 17, 2023

#74 add model test

4ed19ee

apmt pushed a commit that referenced this issue Feb 17, 2023

#74 update initial sets models

19bbc80

apmt closed this as completed in #81 Feb 17, 2023

apmt pushed a commit that referenced this issue Feb 23, 2023

#74 update model sets

d01aee8

apmt pushed a commit that referenced this issue Feb 23, 2023

#74 update examples retrainning model sets

acbb1e7
