
Question about coreset selection #2

Open
zhangxin-xd opened this issue Aug 8, 2023 · 5 comments
Labels
good first issue Good for newcomers

Comments

@zhangxin-xd

Thank you for sharing your excellent work! I have a question about coreset selection. I noticed that in Algorithm 1, all samples are re-sorted according to $d_{\text{ACS}}$ and the subset is then reconstituted, so the coreset selection appears to be dynamic, akin to dropping some unimportant samples during training (the dropped ones can be re-selected later). However, some of the comparison methods are static (the dropping is permanent). Is the comparison reasonable?
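
For clarity, here is how I read the two regimes, as a rough sketch with placeholder names (`score_fn`, `train_one_epoch`, and `fraction` are mine, not from this repository):

```python
def select_topk(scores, fraction):
    """Indices of the highest-scoring `fraction` of the samples."""
    k = int(fraction * len(scores))
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]

def static_coreset_training(model, dataset, score_fn, train_one_epoch, fraction, epochs):
    # Static: rank once before training; dropped samples never return.
    scores = [score_fn(model, x, y) for x, y in dataset]
    keep = select_topk(scores, fraction)
    for _ in range(epochs):
        train_one_epoch(model, [dataset[i] for i in keep])

def adaptive_coreset_training(model, dataset, score_fn, train_one_epoch, fraction, epochs):
    # Adaptive (my reading of Algorithm 1): re-rank with the current weights
    # every epoch, so a sample dropped earlier can be re-selected later.
    for _ in range(epochs):
        scores = [score_fn(model, x, y) for x, y in dataset]
        keep = select_topk(scores, fraction)
        train_one_epoch(model, [dataset[i] for i in keep])
```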

I'm looking forward to your reply!

@HuangOwen
Owner

Thanks for the question and your interest in our work! We believe the comparison is reasonable because all the coreset methods use the same coreset data fraction per epoch. Since our goal is to improve training efficiency, the training-time reduction is therefore the same across methods. In addition, prior work [1][2] also adopts a similar adaptive coreset strategy and compares it with fixed-coreset methods.
[1] Adaptive Second Order Coresets for Data-efficient Machine Learning, ICML 2022
[2] RETRIEVE: Coreset Selection for Efficient and Robust Semi-Supervised Learning, NeurIPS 2021
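
As a toy illustration (the numbers below are mine, not from the paper), the per-epoch cost is identical whether the subset is fixed or re-selected, because the fraction is the same:

```python
# Toy numbers, not from the paper: an equal coreset fraction means an equal
# per-epoch sample count, whether the subset is fixed or re-selected.
num_samples = 50_000   # assumed full training-set size
fraction = 0.5         # assumed coreset data fraction per epoch
samples_per_epoch_static = int(fraction * num_samples)    # subset chosen once
samples_per_epoch_adaptive = int(fraction * num_samples)  # subset re-chosen each epoch
assert samples_per_epoch_static == samples_per_epoch_adaptive == 25_000
```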

@HuangOwen HuangOwen added the good first issue Good for newcomers label Aug 8, 2023
@zhangxin-xd
Author

Thanks for your reply!!

@zhangxin-xd
Author

Hi, I have another question, about the Error Vector Score, which acts like EL2N.

In EL2N, the score is $\mathbb{E}\,\lVert p(\mathbf{w}_t, \mathbf{x}) - \mathbf{y} \rVert_2$, and the average is calculated across several models with different initialization weights.

In your ACS, [equation image: the ACS score, written with an expectation]. When QAT starts, the weights at time $t$ are fixed, so how is the average calculated?
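
To make my reading of the EL2N average concrete, here is a rough sketch (my own names; it assumes a softmax classifier and one-hot labels):

```python
import torch
import torch.nn.functional as F

def el2n_score(models_at_t, x, y_onehot):
    """EL2N as I understand it: the L2 norm of the error vector
    p(w_t, x) - y, averaged over K models trained from K different
    random initializations up to the same time t."""
    errors = []
    for model in models_at_t:  # K independently initialized models at time t
        with torch.no_grad():
            probs = F.softmax(model(x), dim=-1)
            errors.append((probs - y_onehot).norm(dim=-1))
    return torch.stack(errors).mean(dim=0)  # average across the K models
```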

@HuangOwen
Owner

Thanks for your question. Unlike the GraNd score proposed in the EL2N paper, the expectation in our ACS is computed over all logits $m \in M$ at a given training time $t$ (it is an average of these gradients rather than a sum). Since we then use $d_{\text{EVS}}$ to approximate it, the analysis still holds. We will correct the expectation equation in the manuscript to avoid confusion.
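
Concretely, here is a rough sketch of what I mean (my own naming; it assumes a softmax cross-entropy head, for which the gradient of the loss w.r.t. logit $z_m$ is $p_m - y_m$):

```python
import torch
import torch.nn.functional as F

def per_logit_gradient_average(model_at_t, x, y_onehot):
    """Average over the M logits of |dL/dz_m| at a single training time t
    (for cross-entropy with one-hot labels, dL/dz_m = p_m - y_m)."""
    with torch.no_grad():
        probs = F.softmax(model_at_t(x), dim=-1)
        per_logit_grad = (probs - y_onehot).abs()
    return per_logit_grad.mean(dim=-1)  # mean over m in M, not a sum

def d_evs_proxy(model_at_t, x, y_onehot):
    """The error-vector quantity used as the approximation, in my reading."""
    with torch.no_grad():
        probs = F.softmax(model_at_t(x), dim=-1)
    return (probs - y_onehot).norm(dim=-1)
```

Because these quantities depend only on the single checkpoint $w_t$, no average over different initializations is required.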

@zhangxin-xd
Author

Got it! Thanks for the reply!
