Taming Overconfident Prediction on Unlabeled Data From Hindsight

Page view(s)

Checked on Aug 10, 2025

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/20943

Title:

Taming Overconfident Prediction on Unlabeled Data From Hindsight

Journal Title:

IEEE Transactions on Neural Networks and Learning Systems

DOI:

10.1109/TNNLS.2023.3274845

Publication URL:

http://dx.doi.org/10.1109/tnnls.2023.3274845

Authors:

Jing Li, Yuangang Pan, Ivor W. Tsang

Keywords:

semi-supervised learning, distillation, probability transformation

Publication Date:

23 May 2023

Citation:

Li, J., Pan, Y., Tsang, I. W. (2024). Taming Overconfident Prediction on Unlabeled Data From Hindsight. IEEE Transactions on Neural Networks and Learning Systems, 1–13. https://doi.org/10.1109/tnnls.2023.3274845

Abstract:

Minimizing prediction uncertainty on unlabeled data is a key factor to achieve good performance in semi-supervised learning (SSL). The prediction uncertainty is typically expressed as the \emph{entropy} computed by the transformed probabilities in output space. Most existing works distill low-entropy prediction by either accepting the determining class (with the largest probability) as the true label or suppressing subtle predictions (with the smaller probabilities). Unarguably, these distillation strategies are usually heuristic and less informative for model training. From this discernment, this paper proposes a dual mechanism, named ADaptive Sharpening (\ADS), which first applies a soft-threshold to adaptively mask out determinate and negligible predictions, and then seamlessly sharpens the informed predictions, distilling certain predictions with the informed ones only. More importantly, we theoretically analyze the traits of \ADS by comparing it with various distillation strategies. Numerous experiments verify that \ADS significantly improves the state-of-the-art SSL methods by making it a plug-in. Our proposed \ADS forges a cornerstone for future distillation-based SSL research.

License type:

Publisher Copyright

Funding Info:

This research is supported by core funding from: A*STAR Science and Engineering Research Council
Grant Reference no. : C222812019

Description:

© 2023 IEEE.  Personal use of this material is permitted.  Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

URI:

https://oar.a-star.edu.sg/communities-collections/articles/20943

ISSN:

2162-237X
2162-2388

Collections:

Institute of High Performance Computing

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
taming-overconfident-prediction-on-unlabeled-data-from-hindsight.pdf	953.50 KB	PDF	Open