**Open Journal of Statistics**

Vol.08 No.04(2018), Article ID:86068,9 pages

10.4236/ojs.2018.84042

Analysis of Influencing Factors on Survival Time of Patients with Heart Failure

Jianwei Sheng^{1}, Xiyuan Qian^{1}, Tong Ruan^{2}^{ }

^{1}School of Science, East China University of Science and Technology, Shanghai, China

^{2}School of Information Science and Engineering, East China University of Science and Technology, Shanghai, China

Copyright © 2018 by authors and Scientific Research Publishing Inc.

This work is licensed under the Creative Commons Attribution International License (CC BY 4.0).

http://creativecommons.org/licenses/by/4.0/

Received: May 6, 2018; Accepted: July 16, 2018; Published: July 19, 2018

ABSTRACT

To explore the influencing factors of survival time of patients with heart failure, a total of 1789 patients with heart failure were collected from Shanghai Shuguang Hospital. The Cox proportional hazards model and the mixed effects Cox model were used to analyze the factors on survival time of patients. The results of Cox proportional hazards model showed that age (RR = 1.32), hypertension (RR = 0.67), ARB (RR = 0.55), diuretic (RR = 1.48) and antiplatelet (RR = 0.53) have significant impacts on the survival time of patients. The results of mixed effects Cox model showed that age (RR = 1.16), hypertension (RR = 0.61), lung infection (RR = 1.43), ARB (RR = 0.64), β-blockers (RR = 0.77) and antiplatelet (RR = 0.69) have a significant impact on the survival time of patients. The results are consistent with the covariates age, hypertension, ARB and antiplatelet but inconsistent with the covariates lung infection and β-blockers.

**Keywords:**

Heart Failure, Survival Analysis, Longitudinal Data, Mixed Effects Cox Model

1. Introduction

Heart failure is a syndrome with symptoms and signs caused by cardiac dysfunction, resulting in reduced longevity [1] . The prevalence of heart failure in western countries is 1% - 2% of the adult population and 5 - 10 per 1000 population per year, respectively [2] [3] . In China, the prevalence of heart failure in Chinese population aged 35 - 74 is 0.9% and the population significantly increases with age [4] [5] . With the acceleration of population aging in China, it is foreseeable that the burden caused by heart failure will become heavier in the near future. So it is important to study and analyze the influencing factors of the survival time of patients with heart failure.

In medical research, follow-up is the common way to study the law of things; for instance: study the efficacy of a drug, study the survival time after surgery, study the lifetime of a medical device [6] [7] . The common ground of the above studies is that it will take some time to trace the research objects, which was called the survival time in statistics. The study of the distribution and influencing factors of survival time is the so-called survival analysis [8] [9] [10] . Proportional hazard regression model has become the most common used procedure for modeling the relationship of covariates to a survival or other censored outcome since this model was proposed by D.R. Cox in 1972 [11] . In clinical practice, many studies collect both longitudinal data [12] [13] (longitudinal data are data in which a response variable is measured at different time points over time) and survival-time data. In this paper, Cox proportional hazards model was used to model the survival-time data and mixed effects Cox model [14] [15] was used to model the survival-time and longitudinal data.

2. Models

2.1. Cox Proportional Hazards Model

The Cox proportional hazards model was proposed by British statistician D.R. Cox in 1972, which has been widely applied to analyze the effect of exposure and other covariates on patient’s survival. The Cox model specifies the hazard for individual i as:

${\lambda}_{i}\left(t\right)={\lambda}_{0}\left(t\right)\mathrm{exp}\left({\beta}_{1}{X}_{i1}+{\beta}_{2}{X}_{i2}+\cdots +{\beta}_{p}{X}_{ip}\right)={\lambda}_{0}\left(t\right)\mathrm{exp}\left({X}_{i}\left(t\right)\beta \right)$ (1)

where $\beta ={\left({\beta}_{1},{\beta}_{2},\cdots {\beta}_{p}\right)}^{\text{T}}$ is a $p\times 1$ column vector of coefficients, ${X}_{i}=\left({X}_{i1},{X}_{i2},\cdots ,{X}_{ip}\right)$ is a $1\times p$ vector of covariates for subject i, and ${\lambda}_{0}\left(t\right)$ is an unspecified nonnegative function of time called the baseline hazard, describing how the risk of event per time unit changes over time at baseline levels of covariates. Since the hazard ratio for two subjects with fixed covariate vectors ${X}_{i}$ and ${X}_{j}$

$\frac{{\lambda}_{i}\left(t\right)}{{\lambda}_{j}\left(t\right)}=\frac{{\lambda}_{0}\left(t\right)\mathrm{exp}\left({X}_{i}\beta \right)}{{\lambda}_{0}\left(t\right)\mathrm{exp}\left({X}_{j}\beta \right)}=\mathrm{exp}\left(\left({X}_{i}-{X}_{j}\right)\beta \right)$ (2)

is constant over time, the model is called proportional hazards model.

Let the event be observed to have occurred with subject i at time ${t}_{i}$ . The probability that happened can be written as

${L}_{i}\left(\beta \right)=\frac{\lambda \left({t}_{i}|{X}_{i}\right)}{{\displaystyle {\sum}_{:{t}_{j}\ge {t}_{i}}\lambda \left({t}_{i}|{X}_{j}\right)}}=\frac{{\theta}_{i}}{{\displaystyle {\sum}_{:{t}_{j}\ge {t}_{i}}{\theta}_{j}}}$ (3)

where ${\theta}_{j}=\mathrm{exp}\left({X}_{j}\beta \right)$ and the summation is over the set of subjects j who is still under observation at time ${t}_{i}$ , the set is called risk set and denoted by $R\left({t}_{i}\right)$ , this is the partial likelihood for subject i. So taking the product of Equation (3) yields the partial likelihood function:

$PL\left(\beta \right)={\displaystyle \underset{i=1}{\overset{n}{\prod}}{\left[\frac{\mathrm{exp}\left({X}_{i}\beta \right)}{{\displaystyle {\sum}_{j\in R\left({t}_{i}\right)}\mathrm{exp}\left({X}_{j}\beta \right)}}\right]}^{{\delta}_{i}}}$ (4)

where ${\delta}_{i}$ is 1 if the event is happened to subject i and 0 otherwise.

2.2. Mixed Effects Cox Model

In clinical practice, some subjects may be observed more than once during the time from first hospitalization to death. The number of hospitalizations and the days between two hospitalizations varies from patient to patient in the heart failure set. The Cox proportional hazards model only uses the survival-time data, which inevitably lose some useful information. The data obtained from multiple measurements of a series of experimental individuals over time are called longitudinal data. More precisely, suppose there are m individuals in an experiment where each individual is measured over time. ${Y}_{i1},{Y}_{i2},\cdots {Y}_{i{n}_{i}},i=1,\cdots ,m$ are the measured data for the individual i at time ${t}_{i1}<{t}_{i2}<\cdots <{t}_{i{n}_{i}}$ , then $\left\{{Y}_{ik}:1\le k\le {n}_{i},1\le i\le m\right\}$ is called longitudinal data, which is also called panel data in econometrics [16] . This type of data is different from cross-section data and time series data. The linear mixed effects model is a common model to dealing with the longitudinal data [17] . It adds individual difference as random effects into the regression model. These random effects describe how every object’s measurement changes over time and reflect the internal structure of the longitudinal data. In matrix notation a mixed model can be represented as:

$Y={X}^{\text{T}}\beta +{Z}^{\text{T}}b+\epsilon ,b\sim N\left(0,\Sigma \right)$ (5)

where

$\lambda \left(t\right)={\lambda}_{0}\left(t\right)\mathrm{exp}\left({X}^{\text{T}}\beta +{Z}^{\text{T}}b\right)\text{,}b\sim N\left(0,\Sigma \right)$ (6)

Coefficients can be estimated based on the partial likelihood:

$\mathrm{ln}\left[PL\left(\beta ,b\right)\right]={\displaystyle \underset{i=1}{\overset{n}{\sum}}{\displaystyle {\int}_{0}^{\infty}\left\{{Y}_{i}\left(t\right){\eta}_{i}\left(t\right)-\mathrm{ln}\left[{\displaystyle \underset{j}{\sum}{Y}_{j}\left(t\right)\mathrm{exp}\left({\eta}_{j}\left(t\right)\right)}\right]\right\}\text{d}t}}$ (7)

where ${\eta}_{i}\left(t\right)={X}_{i}\left(t\right)\beta +{Z}_{i}\left(t\right)b$ is the linear score for subject i at time t and ${Y}_{i}\left(t\right)=1$ if subject i is still under observation at time t and 0 otherwise [18] [19] .

3. Data

We collected patient basic information, laboratory information, medical records, doctor’s advice information and other information from Shanghai Shuguang Hospital database during January 1, 2003 to December 31, 2013. The start point of survival analysis is the first time in hospital date and the end point is the last time out of hospital date or the date of death or the end date of the study. According to the guidance of the doctor formed the heart failure dataset used in this paper. This dataset contains data from 1789 patients with heart failure, for a total of 8332 observations and 23 covariates. See Table 1 for details.

Most are categorical variables, but age is a multi-variable. Its distribution is shown in Figure 1.

Statistics for other binary variables are shown in Table 2.

4. Results

Firstly, we use the Cox proportional hazards to model the survival-time data with all covariates. The results are shown in Table 3.

Table 1. Variables description in heart failure dataset.

Table 2. Statistics for binary variable in hear failure set (total = 1789).

Secondly, we use the mixed effects Cox model to model the survival-time data and longitudinal data with all the covariates and variable day as the covariate for random effects. The results are shown in Table 4.

5. Conclusions

Cox proportional hazards model showed that age, hypertension, ARB, diuretics and antiplatelet have a statistically significant effect on the survival time of patients. Age (RR = 1.32) and diuretic (RR = 1.48) were risk factors. Hypertension (RR = 0.67), ARB (RR = 0.55) and antiplatelet (RR = 0.53) were protective factors. The mixed effects Cox model showed that age, hypertension, lung infection, ARB, β-blockers, and antiplatelet have statistically significant effects on the survival time of patients. Age (RR = 1.16) and lung infection (RR = 1.43) were risk

Figure 1. Distribution of heart failure patients’ age.

Table 3. Result of Cox proportional hazards model with all covariates.

*coef is the estimation of the coefficients; RR is relative risk; Se (coef) is the standard error of the estimation.

Table 4. Results of mixed effects Cox model.

Figure 2. Survival distributions by significant covariates.

factors; hypertension (RR = 0.61), ARB (RR = 0.64), β blockers (RR = 0.77) and antiplatelet (RR = 0.69) were protective factors. Results of the two models are consistent with the covariates age, hypertension, ARB and antiplatelet. Further, age was risk factor, namely the older has lower survival rate. Hypertension, ARB, and antiplatelet were protective factors, namely patients with hypertension have higher survival rates than those without hypertension; patients who used ARBs had higher survival rates than unused patients; patients who used antiplatelet drugs had higher survival rates than those who did not. Survival distributions by these covariates are shown in Figure 2.

The difference is that there are another two covariates which have significantly effect on the survival rate in the mixed effects Cox model: one was risk factor lung infection (RR = 1.43), and the other was protective factor β blocker (RR = 0.67). In addition, the protective factor diuretic in the Cox proportional hazards model became insignificant in the mixed effects Cox model, which shows that the effect of diuretics on survival rate gradually reduces.

Acknowledgements

This work was partially supported by The National High-Tech R&D Program of China (863 Program) under Grant No. 2015AA020107.

Cite this paper

Sheng, J.W., Qian, X.Y. and Ruan, T. (2018) Analysis of Influencing Factors on Survival Time of Patients with Heart Failure. Open Journal of Statistics, 8, 651-659. https://doi.org/10.4236/ojs.2018.84042

References

- 1. Mosterd, A. and Hoes, A.W. (2007) Clinical Epidemiology of Heart Failure. Heart, 93, 1137-1146. https://doi.org/10.1136/hrt.2003.025270
- 2. McMurray, J.J., Adamopoulos, S., Anker, S.D., et al. (2012) ESC Guidelines for the Diagnosis and Treatment of Acute and Chronic Heart Failure. European Heart Journal, 33, 1787-1146.
- 3. Cowie, M.R., Wood, D.A., Coats, A.J., et al. (1999) Incidence an Aetiology of Heart Failure: A Population-Based Study. European Heart Journal, 20, 421-428. https://doi.org/10.1053/euhj.1998.1280
- 4. Guo, Y., Zhao, D. and Liu, J. (2015) Epidemiological Study of Heart Failure in China. Cardiovascular Innovations and Applications, 1, 47-55. https://doi.org/10.15212/CVIA.2015.0003
- 5. Gu, D.F., Huang, G.Y., He, J., et al. (2003) Investigation of Prevalence and Distribution Feature of Chronic Heart Failure in Chinese Adult Population. Chinese Journal Cardio, 1, 3-6.
- 6. Bland, J.M. (2000) An Introduction to Medical Statistics. 3rd Edition, Oxford University Press, New York.
- 7. Kirkwood, B.R. and Sterne, J.C. (2003) Essential Medical Statistics. 2nd Edition, Blackwell Publishers, Malden.
- 8. Klein, J.P. (2013) Handbook of Survival Analysis. CRC Press, Boca Raton.
- 9. Lawless, J.F. (2003) Statistical Models and Methods for Lifetime Data. 2nd Edition, John Wiley and Sons, New York.
- 10. Kalbfleisch, J.D. and Prentice, R.L. (2002) The Statistical Analysis of Failure Time Data. John Wiley and Sons, New York. https://doi.org/10.1002/9781118032985
- 11. Cox, D.R. (1972) Regression Models and Lifetables (with Discussion). Journal of the Royal Statistical Society B, 34, 187-220.
- 12. Frees, E. (2004) Longitudinal and Panel Data: Analysis and Applications in the Social Sciences. Cambridge University Press, New York. https://doi.org/10.1017/CBO9780511790928
- 13. Diggle, P.J., Heagerty, P., Liang, K.-Y. and Zeger, S.L. (2002) Analysis of Longitudinal Data. 2nd Edition, Oxford University Press, Oxford.
- 14. Therneau. T.M. (2015) Mixed Effects Cox Models. BMC Genetics, 6, S127.
- 15. Therneau, T.M. and Grambsch, P.M. (2000) Modeling Survival Data: Extending the Cox Model. Springer, New York. https://doi.org/10.1007/978-1-4757-3294-8
- 16. Wooldridge, J.M. (2013) Introductory Econometrics: A Modern Approach. 5th Edition, South-Western, Mason, OH.
- 17. Militino, A.F. (2010) Mixed Effects Models and Extensions in Ecology with R. Journal of Royal Statistical Society, 173, 938-939. https://doi.org/10.1111/j.1467-985X.2010.00663_9.x
- 18. Ripatti, S. and Palmgren, J. (2000) Estimation of Multivariate Frailty Models Using Penalized Partial Likelihood. Biometrics, 56, 1016-1022. https://doi.org/10.1111/j.0006-341X.2000.01016.x
- 19. Gamst, A., Donohue, M. and Xu, R. (2009) Asymptotic Properties and Empirical Evaluation of the Npmle in the Proportional Hazards Mixed-Effects Model. Statistical Sinica, 19, 997-1011.