Skip to content

Cluster quality results? #15

@gmobile15

Description

@gmobile15

Hi,

Massive thanks for this great tool and it works absolutely fine with my data. In the paper, you mentioned that you experimented with different values of k to create the k-gram sequences. What metric would you recommend to evaluate these clusters?

For e.g. if I experiment with k -> [1,2,3,4,5] I would have 5 set of results (assuming I dont include time gaps at this stage, as that would double the number of results). How would I decide which clustering is the best? Is it simply the modularity score? If yes, each cluster has a modularity value but is there a way to amalgamate that for an entire set of results?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions