MultiTask Logistic Regression models
When it comes to predicting the survival function for a specific unit, the Cox Proportional Hazard Model is usually the goto model. However, it presents some important drawbacks:
 It relies on the proportional hazard assumption, which specifies that the ratio of the hazards for any two individuals is constant over time.
 The exact formula of the model that can handle ties isn’t computationally efficient, and is often rewritten using approximations, such as the Efron’s or Breslow’s approximations, in order to fit the model in a reasonable time.
 The fact that the time component of the hazard function remains unspecified makes the CoxPH model illsuited for actual survival function predictions.
The MultiTask Logistic Regression models were therefore introduced to provide great alternatives.
Linear MTLR model
The MultiTask Logistic Regression (MTLR) model created by Yu, ChunNam, et al. in 2011 can be seen as a series of logistic regression models built on different time intervals so as to estimate the probability that the event of interest happened within each interval.
The model can be built using the following steps:

We start by dividing the time axis into time intervals such that , with and .
Figure 1  Subdivisions of the time axis in J intervals 
We then build a logistic regression model on each interval , with the parameters and the response variable But, because we are not analyzing the effects of recurrent events, we need to make sure that when a unit experiences an event on interval with , its status for the remaining intervals stays the same. Thus, the response vector is: ChunNam Yu et al. proposed the following definitions for the density and survival functions:

Density function:

Survival function:
with:
 , the Hadamard product

the coefficients matrix,

the bias vector:

the triangular matrix:

the normalization constant


We can therefore compute the loss function and minimize it to find the optimal model parameters.
Neural MTLR model
Although the MTLR model provides similar results as the CoxPH model without having to rely on the assumptions required by the latter, at its core, it is still powered by a linear transformation. Thus, in the presence of nonlinear elements in the data, it will stop yielding satisfactory performances. The Neural MultiTask Logistic Regression (NMTLR) which allows the use of Neural Networks within the original MTLR design, will help solve this issue.
In the case of Neural MultiTask Logistic Regression, the density and survival functions become:

Density function:

Survival function:
with is the nonlinear transformation using feature vector as its input. Its output is a vector whose values are mapped to the subdivisions of the time axis
References
 "Learning patientspecific cancer survival distributions as a sequence of dependent regressors." Yu, ChunNam, et al. Advances in Neural Information Processing Systems. 2011.
 "Deep Neural Networks for Survival Analysis Based on a MultiTask Framework." Fotso, S. (2018). arXiv:1801.05512.
 Square's Technical Blog  Deep Learning + Survival Analysis: Our Approach to MultiTask Frameworks