Research: Scale Activation Factorization #131
Description
🔬 This is an experiment in doing radically open research. I plan to post all my work on this openly as I do it, tracking it in this issue. I'd love for people to comment, or better yet collaborate! See more.
Please be respectful of the fact that this is unpublished research and that people involved in this are putting themselves in an unusually vulnerable position. Please treat it as you would unpublished work described in a seminar or by a colleague.
In Building Blocks, we create neuron groups by applying Non-Negative Matrix Factorization (NMF) to the activations of a single example. This can be seen as creating a local basis for explaining that single example.
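To make the single-example setup concrete, here is a minimal sketch using plain multiplicative-update NMF in NumPy. It is not the Building Blocks implementation; the activation shape `(14, 14, 64)`, the random stand-in data, and the choice of 6 groups are all assumptions for illustration.

```python
import numpy as np

def nmf(X, n_groups, n_iter=200, eps=1e-9, seed=0):
    """Factor non-negative X (n x m) as W (n x k) @ H (k x m), with W, H >= 0."""
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, n_groups)) + eps
    H = rng.random((n_groups, m)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep both factors non-negative.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical stand-in for one example's post-ReLU activations (height, width, channels).
acts = np.random.default_rng(0).random((14, 14, 64))

# Flatten spatial positions into rows so each row is one position's channel vector.
flat = acts.reshape(-1, acts.shape[-1])
W, H = nmf(flat, n_groups=6)

spatial_factors = W.reshape(14, 14, 6)  # where each group is active
channel_factors = H                     # which channels make up each group
```

Each row of `channel_factors` is one neuron group, and the matching slice of `spatial_factors` shows where that group fires in this one example. The basis is local because `H` is fit to this example only.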
A natural next step would be to factor activations from many examples at once. The blocker has been figuring out how to scale NMF to many examples. I think a few hours of reading documentation (especially on mini-batch factorization approaches) would likely give us enough to solve this.
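As a rough illustration of what a mini-batch approach could look like, the sketch below keeps one shared basis `H` and streams batches of flattened activations past it, so the full activation matrix never has to sit in memory. This is an assumption about one workable scheme (online multiplicative updates with running sufficient statistics), not a settled design; the helper names, batch sizes, and synthetic data are hypothetical.

```python
import numpy as np

def encode(X, H, n_iter=50, eps=1e-9, seed=0):
    """Solve for non-negative weights W with the shared basis H held fixed."""
    rng = np.random.default_rng(seed)
    W = rng.random((X.shape[0], H.shape[0])) + eps
    XHt, HHt = X @ H.T, H @ H.T
    for _ in range(n_iter):
        W *= XHt / (W @ HHt + eps)
    return W

def minibatch_nmf(batches, n_channels, n_groups, eps=1e-9, seed=0):
    """Learn a shared basis H (n_groups x n_channels) from a stream of batches."""
    rng = np.random.default_rng(seed)
    H = rng.random((n_groups, n_channels)) + eps
    A = np.zeros((n_groups, n_groups))    # running sum of W.T @ W
    B = np.zeros((n_groups, n_channels))  # running sum of W.T @ X
    for X in batches:
        W = encode(X, H, eps=eps)
        A += W.T @ W
        B += W.T @ X
        # One multiplicative step on H against everything seen so far.
        H *= B / (A @ H + eps)
    return H

def fake_activation_batches(n_batches, weight_seed, batch_rows=256, n_channels=32):
    """Hypothetical data source: non-negative rows drawn from a shared rank-4 basis."""
    true_basis = np.random.default_rng(123).random((4, n_channels))
    rng = np.random.default_rng(weight_seed)
    for _ in range(n_batches):
        yield rng.random((batch_rows, 4)) @ true_basis

H = minibatch_nmf(fake_activation_batches(40, weight_seed=1), n_channels=32, n_groups=4)

# Check the learned basis on a batch it has not seen.
held_out = next(fake_activation_batches(1, weight_seed=99))
W_new = encode(held_out, H)
rel_err = np.linalg.norm(held_out - W_new @ H) / np.linalg.norm(held_out)
```

Memory use depends on the batch size and the number of groups, not on the number of examples. scikit-learn's `MiniBatchNMF` estimator is one existing implementation worth checking before hand-rolling anything like this.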
An intermediate approach might be to factor the activations for examples of a single class, or of a set of similar classes.