Taylor & Francis Group
Browse
1/1
11 files

Multiclass Probability Estimation With Support Vector Machines

Version 2 2019-06-17, 15:28
Version 1 2019-03-11, 18:58
dataset
posted on 2019-06-17, 15:28 authored by Xin Wang, Hao Helen Zhang, Yichao Wu

Multiclass classification and probability estimation have important applications in data analytics. Support vector machines (SVMs) have shown great success in various real-world problems due to their high classification accuracy. However, one main limitation of standard SVMs is that they do not provide class probability estimates, and thus fail to offer uncertainty measure about class prediction. In this article, we propose a simple yet effective framework to endow kernel SVMs with the feature of multiclass probability estimation. The new probability estimator does not rely on any parametric assumption on the data distribution, therefore, it is flexible and robust. Theoretically, we show that the proposed estimator is asymptotically consistent. Computationally, the new procedure can be conveniently implemented using standard SVM softwares. Our extensive numerical studies demonstrate competitive performance of the new estimator when compared with existing methods such as multiple logistic regression, linear discrimination analysis, tree-based methods, and random forest, under various classification settings. Supplementary materials for this article are available online.

Funding

This research was supported in part by National Science Foundation grants CCF-1740858 and DMS-1055210 and NSFC 11571009.

History