You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When a canonical document is selected, the info pane should show a list of commonly-occuring concepts in the documents that are most similar to it.
These should probably be pre-SVD counts, not post-SVD similarity scores. For example, counting the number of times the words "chinese" and "thai" occur, weighted by which documents they are in, but not weighted by anything involving the "chinese" and "thai" concept vectors themselves. This would reassure users that the data reflects reality, even if the SVD comes out kind of weird.
Probably the best way to report these values would be as percentages.
[This bug transferred from Launchpad]
The text was updated successfully, but these errors were encountered:
[from sgt101 on Launchpad]
Related to this it would be very useful to put the info on canonical documents into some structured file like .xls or .csv separate from the general concepts file that is in results now.
I would like to see the following records produced :
When a canonical document is selected, the info pane should show a list of commonly-occuring concepts in the documents that are most similar to it.
These should probably be pre-SVD counts, not post-SVD similarity scores. For example, counting the number of times the words "chinese" and "thai" occur, weighted by which documents they are in, but not weighted by anything involving the "chinese" and "thai" concept vectors themselves. This would reassure users that the data reflects reality, even if the SVD comes out kind of weird.
Probably the best way to report these values would be as percentages.
[This bug transferred from Launchpad]
The text was updated successfully, but these errors were encountered: