Survival SVM model
Instead of modeling the probability that an event will occur, we could look at Survival Analysis as a Ranking Problem. Indeed, the idea behind formulating the survival problem as a ranking problem is that in some applications, like clinical applications, one is only interested in defining risks groups, and not the prediction of the survival time, but whether the unit has a high or low risk of experiencing the event.
Van Belle et al. developed the Rank Support Vector Machines (RankSVMs) and Polsterl et al. designed a straightforward algorithm to efficiently use the primal formulation, by computing a convex quadratic loss function, so that we can use the Newton optimization to minimize it, for a linear approach and a nonlinear/kernel based approach.
Linear approach
The objective function of ranking-based linear survival support vector machine is defined as:
with and
The objective function, gradient and Hessian can be expressed in matrix form as:
with:
- , the L2 regularization parameter
- , the identity matrix
-
-
if ; if and ; and otherwise.
- and
- and
Kernel approach
It is possible to model non-linearities and interactions within the covariates by using kernel-based methods.
The objective function, gradient and Hessian can be expressed in matrix form as:
with:
-
, the L2 regularization parameter
-
is the symmetric positive definite kernel matrix such that , , with , a kernel function and , the number of samples.
-
-
-
if ; if and ; and otherwise.
- and
- and
References
- Van Belle, Vanya, et al. "Support vector machines for survival analysis." Proceedings of the Third International Conference on Computational Intelligence in Medicine and Healthcare (CIMED2007). 2007.
- Pölsterl, Sebastian, et al. "Fast training of support vector machines for survival analysis." Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, Cham, 2015.
- Slides about "Fast training of support vector machines for survival analysis."
- Pölsterl, Sebastian, et al. "An Efficient Training Algorithm for Kernel Survival Support Vector Machines." arXiv preprint arXiv:1611.07054 (2016).