Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Understanding Black-box Predictions via Influence Functions #1

Open
YeonwooSung opened this issue Aug 25, 2020 · 2 comments
Open

Understanding Black-box Predictions via Influence Functions #1

YeonwooSung opened this issue Aug 25, 2020 · 2 comments

Comments

@YeonwooSung
Copy link
Owner

YeonwooSung commented Aug 25, 2020

Abstract

  • Use influence function to trace a model's prediction back to its training data.
  • Approximation of influence function that requires gradients and Hessian vectors provides valuable information
  • Useful in debugging models and detecting dataset errors

Details

  • Using influence function, one can ask questions such as "What is the model parameter like when certain training data was missing/altered?" without re-training the whole model
  • Useful in detecting adversarial examples
  • Useful in fixing mislabeled examples by providing good candidate lists, but limited boost compared to the simple listing via highest training loss

Personal Thoughts

  • Understanding neural networks is difficult because all the theoretical assumptions do not hold in non-convex, data-dependent, .. environment.
  • Good approximation methods are always powerful and applicable

Link: https://arxiv.org/pdf/1703.04730.pdf
Authors: Pang Wei Koh(Stanford), Percy Liang(Stanford)

@Helaly96
Copy link

Are you still interested in writing this part?
Would love to discuss it with someone

@YeonwooSung
Copy link
Owner Author

Yeh, having discussion with others is always welcome. Please share your thoughts about this paper :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants