<script type="text/javascript" src="https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.1/MathJax.js?config=TeX-AMS-MML_HTMLorMML">
</script>
本文于r format(Sys.Date(), "%Y-%m-%d")更新。 如发现问题或者有建议,欢迎提交 Issue
相关笔记可见 github
The Kaplan-Meier estimator, independently described by Edward Kaplan and Paul Meier and conjointly published in 1958 in the Journal of the American Statistical Association, is a non-parametric statistic that allows us to estimate the survival function. [@Schuette2018]
Remember that a non-parametric statistic is not based on the assumption of an underlying probability distribution, which makes sense since survival data has a skewed distribution. [@Schuette2018]
Kaplan-Meier 估计是非参数的,因此主要用户描述性统计,主要见 github
@Schuette2018 给出了相关教程但是我觉得github对初学者更友好,直观地展现了生存模型。
- 可练习的数据来自 @Edmonson1979Different ,R中可以直接调用
survival::ovarian。
这个模型类似于互联网运营的漏斗模型和转化率相关的模型。
$$\begin{alignat}{2} S(t)&=p_{1} \cdot \dots \cdot p_{t}\ &=\prod_{i=1}^t p_i\ \end{alignat}$$
- $S(t)$衡量在$t$时刻,总样本存活率、转化率。
- $p(t)$衡量在$t$时刻,总样本在该时刻的存活率、转化率。
且满足关系
$$S(t) = S(t-1) \cdot p_{t}$$