Computing Reviews

Learning safe multi-label prediction for weakly labeled data
Wei T., Guo L., Li Y., Gao W. Machine Learning107(4):703-725,2018.Type:Article
Date Reviewed: 09/11/18

Many real-world applications involve learning in the presence of multiple labels. For example, in the case of images, a single image may be labeled sky, cloud, or even flower. To make matters more complicated, the dataset for training may have missing labels. The challenge, then, is to learn to (multi)label items even in the presence of missing labels. In many cases, using weakly labeled data may degrade performance. It is thus desirable to have a method that does not degrade learning.

This paper presents the SafeML method, an algorithm that addresses the issue. There are in fact two algorithms given. One is directed toward the evaluation of performance through the F1 score, which trades off precision and recall, and the other through top-k precision. Both algorithms are formulated as zero-sum games that use only an active set of constraints. Both use linear programming for the iterative improvement of the predictor’s label matrix, so both algorithms are efficient.

The authors compare eight state-of-the-art methods, evaluated on a number of datasets. As the proportion of missing labels increases, SafeML tends to perform much better than the other methods. The algorithms are explained in detail, and a lower bound for the performance of SafeML is given. The paper is clearly written and should be of interest to anyone interested in learning in a multi-label situation.

Reviewer:  J. P. E. Hodgson Review #: CR146238 (1902-0050)

Reproduction in whole or in part without permission is prohibited.   Copyright 2024 ComputingReviews.com™
Terms of Use
| Privacy Policy