Your evaluation function evaluates the indicators of the first category, and the zeroth category is the negative class. This indicator cannot directly represent the performance of the model, and the global TP, P, T should be counted to calculate the global F1, pre, recall indicators. Is there any purpose in using the indicators of the first category as the indicators to evaluate the performance of the model
