The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. T...The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.展开更多
Receiver operating characteristic (ROC) curves are often used to study the two sample problem in medical studies. However, most data in medical studies are censored. Usually a natural estimator is based on the Kapla...Receiver operating characteristic (ROC) curves are often used to study the two sample problem in medical studies. However, most data in medical studies are censored. Usually a natural estimator is based on the Kaplan-Meier estimator. In this paper we propose a smoothed estimator based on kernel techniques for the ROC curve with censored data. The large sample properties of the smoothed estimator are established. Moreover, deficiency is considered in order to compare the proposed smoothed estimator of the ROC curve with the empirical one based on Kaplan-Meier estimator. It is shown that the smoothed estimator outperforms the direct empirical estimator based on the Kaplan-Meier estimator under the criterion of deficiency. A simulation study is also conducted and a real data is analyzed.展开更多
In cancer survival analysis, it is very frequently to estimate the confidence intervals for survival probabilities.But this calculation is not commonly involve in most popular computer packages, or only one methods of...In cancer survival analysis, it is very frequently to estimate the confidence intervals for survival probabilities.But this calculation is not commonly involve in most popular computer packages, or only one methods of estimation in the packages. In the present Paper, we will describe a microcomputer Program for estimating the confidence intervals of survival probabilities, when the survival functions are estimated using Kaplan-Meier product-limit or life-table method. There are five methods of estimation in the program (SPCI), which are the classical(based on Greenwood's formula of variance of S(ti), Rothman-Wilson, arcsin transformation, log(-Iog) transformation, Iogit transformation methods. Two example analysis are given for testing the performances of the program running.展开更多
Breast cancer is one of the leading diseases that affect women’s lives. It affects their lives in so many ways by denying them the required standard of health needed to carry out all of their daily activities for som...Breast cancer is one of the leading diseases that affect women’s lives. It affects their lives in so many ways by denying them the required standard of health needed to carry out all of their daily activities for some days, weeks, months or years before eventually causing death. This research estimates the survival rate of breast cancer patients and investigates the effects of stage of tumor, gender, age, ethnic group, occupation, marital status and type of cancer upon the survival of patients. Data used for the study were extracted from the case file of patients in the Radiation Oncology Department, University College Hospital, Ibadan using a well-structured pro forma in which 74 observations were censored and 30 events occurred. The Kaplan-Meier estimator was used to estimate the overall survival probability of breast cancer patients following their recruitment into the study and determine the mean and median survival times of breast cancer patients following their time of recruitment into the study. Since there are different groups with respect to the stages of tumor at the time of diagnosis, the log-rank test was used to compare the survival curve of the stages of tumor with considering p-values below 0.05 as statistically significant. Multivariate Cox regression was used to investigate the effects of some variables on the survival of patients. The overall cumulative survival probability obtained is 0.175 (17.5%). The overall estimated mean time until death is 28.751 weeks while the median time between admission and death is 23 weeks. As the p-value (0.000032) of the log-rank test for comparing stages of tumor is less than 0.05, it is concluded that there is significant evidence of a difference in survival times for the stages of tumor. The survival function plot for the stages of tumor shows that patients with stage III tumor are less likely to survive. From the estimated mean time until death for the stages of tumor, it was deduced that stage I tumor patients have an increased chance of survival. Types of展开更多
The analysis of survival data is a major focus of statistics. Interval censored data reflect uncertainty as to the exact times the units failed within an interval. This type of data frequently comes from tests or situ...The analysis of survival data is a major focus of statistics. Interval censored data reflect uncertainty as to the exact times the units failed within an interval. This type of data frequently comes from tests or situations where the objects of interest are not constantly monitored. Thus events are known only to have occurred between the two observation periods. Interval censoring has become increasingly common in the areas that produce failure time data. This paper explores the statistical analysis of interval-censored failure time data with applications. Three different data sets, namely Breast Cancer, Hemophilia, and AIDS data were used to illustrate the methods during this study. Both parametric and nonparametric methods of analysis are carried out in this study. Theory and methodology of fitted models for the interval-censored data are described. Fitting of parametric and non-parametric models to three real data sets are considered. Results derived from different methods are presented and also compared.展开更多
Lung cancer is one of the leading causes of death worldwide, accounting for an estimated 2.1 million cases in 2018. To analyze the risk factors behind the lung cancer survival, this paper employs two main models: Kapl...Lung cancer is one of the leading causes of death worldwide, accounting for an estimated 2.1 million cases in 2018. To analyze the risk factors behind the lung cancer survival, this paper employs two main models: Kaplan-Meier estimator and Cox proportional hazard model [1]. Also, log-rank test and wald test are utilized to test whether a correlation exists or not, which is discussed in detail in later parts of the paper. The aim is to find out the most influential factors for the survival probability of lung cancer patients. To summarize the results, stage of cancer is always a significant factor for lung cancer survival, and time has to be taken into account when analyzing the survival rate of patients in our data sample, which is from TCGA. Future study on lung cancer is also required to make improvement for the treatment of lung cancer, as our data sample might not represent the overall condition of patients diagnosed with lung cancer;also, more appropriate and advanced models should be employed in order to reflect factors that can affect survival rate of patients with lung cancer in detail.展开更多
In this article, we study a model of a single variable sampling plan with Type I censoring.Assume that the quality of an item in a batch is measured by a random variable which follows aWeibull distribution W(λ,m), wi...In this article, we study a model of a single variable sampling plan with Type I censoring.Assume that the quality of an item in a batch is measured by a random variable which follows aWeibull distribution W(λ,m), with scale parameter A and shape parameter m having a gammadiscrete prior distribution or θ=1/λ and m having an inverse gamma-uniform prior distribution.The decision function is based on the Kaplan-Meter estimator. Then, the explicit expressions ofthe Bayes risk are derived. In addition, an algorithm is suggested so that an optimal samplingplan can be determined approximately after a finite number of searching steps.展开更多
It is of great interest to estimate quantile residual lifetime in medical science and many other fields. In survival analysis, Kaplan-Meier(K-M) estimator has been widely used to estimate the survival distribution. ...It is of great interest to estimate quantile residual lifetime in medical science and many other fields. In survival analysis, Kaplan-Meier(K-M) estimator has been widely used to estimate the survival distribution. However, it is well-known that the K-M estimator is not continuous, thus it can not always be used to calculate quantile residual lifetime. In this paper, the authors propose a kernel smoothing method to give an estimator of quantile residual lifetime. By using modern empirical process techniques, the consistency and the asymptotic normality of the proposed estimator are provided neatly.The authors also present the empirical small sample performances of the estimator. Deficiency is introduced to compare the performance of the proposed estimator with the naive unsmoothed estimator of the quantile residaul lifetime. Further simulation studies indicate that the proposed estimator performs very well.展开更多
文摘The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.
基金partially supported by National Natural Science Foundation of China (NSFC) (No.70911130018,No.71271128)National Natural Science Funds for Distinguished Young Scholar (No.70825004)+1 种基金Creative Research Groups of China (No.10721101)Shanghai University of Finance and Economics through Project 211Phase III and Shanghai Leading Academic Discipline Project, Project Number: B803
文摘Receiver operating characteristic (ROC) curves are often used to study the two sample problem in medical studies. However, most data in medical studies are censored. Usually a natural estimator is based on the Kaplan-Meier estimator. In this paper we propose a smoothed estimator based on kernel techniques for the ROC curve with censored data. The large sample properties of the smoothed estimator are established. Moreover, deficiency is considered in order to compare the proposed smoothed estimator of the ROC curve with the empirical one based on Kaplan-Meier estimator. It is shown that the smoothed estimator outperforms the direct empirical estimator based on the Kaplan-Meier estimator under the criterion of deficiency. A simulation study is also conducted and a real data is analyzed.
文摘In cancer survival analysis, it is very frequently to estimate the confidence intervals for survival probabilities.But this calculation is not commonly involve in most popular computer packages, or only one methods of estimation in the packages. In the present Paper, we will describe a microcomputer Program for estimating the confidence intervals of survival probabilities, when the survival functions are estimated using Kaplan-Meier product-limit or life-table method. There are five methods of estimation in the program (SPCI), which are the classical(based on Greenwood's formula of variance of S(ti), Rothman-Wilson, arcsin transformation, log(-Iog) transformation, Iogit transformation methods. Two example analysis are given for testing the performances of the program running.
文摘Breast cancer is one of the leading diseases that affect women’s lives. It affects their lives in so many ways by denying them the required standard of health needed to carry out all of their daily activities for some days, weeks, months or years before eventually causing death. This research estimates the survival rate of breast cancer patients and investigates the effects of stage of tumor, gender, age, ethnic group, occupation, marital status and type of cancer upon the survival of patients. Data used for the study were extracted from the case file of patients in the Radiation Oncology Department, University College Hospital, Ibadan using a well-structured pro forma in which 74 observations were censored and 30 events occurred. The Kaplan-Meier estimator was used to estimate the overall survival probability of breast cancer patients following their recruitment into the study and determine the mean and median survival times of breast cancer patients following their time of recruitment into the study. Since there are different groups with respect to the stages of tumor at the time of diagnosis, the log-rank test was used to compare the survival curve of the stages of tumor with considering p-values below 0.05 as statistically significant. Multivariate Cox regression was used to investigate the effects of some variables on the survival of patients. The overall cumulative survival probability obtained is 0.175 (17.5%). The overall estimated mean time until death is 28.751 weeks while the median time between admission and death is 23 weeks. As the p-value (0.000032) of the log-rank test for comparing stages of tumor is less than 0.05, it is concluded that there is significant evidence of a difference in survival times for the stages of tumor. The survival function plot for the stages of tumor shows that patients with stage III tumor are less likely to survive. From the estimated mean time until death for the stages of tumor, it was deduced that stage I tumor patients have an increased chance of survival. Types of
文摘The analysis of survival data is a major focus of statistics. Interval censored data reflect uncertainty as to the exact times the units failed within an interval. This type of data frequently comes from tests or situations where the objects of interest are not constantly monitored. Thus events are known only to have occurred between the two observation periods. Interval censoring has become increasingly common in the areas that produce failure time data. This paper explores the statistical analysis of interval-censored failure time data with applications. Three different data sets, namely Breast Cancer, Hemophilia, and AIDS data were used to illustrate the methods during this study. Both parametric and nonparametric methods of analysis are carried out in this study. Theory and methodology of fitted models for the interval-censored data are described. Fitting of parametric and non-parametric models to three real data sets are considered. Results derived from different methods are presented and also compared.
文摘Lung cancer is one of the leading causes of death worldwide, accounting for an estimated 2.1 million cases in 2018. To analyze the risk factors behind the lung cancer survival, this paper employs two main models: Kaplan-Meier estimator and Cox proportional hazard model [1]. Also, log-rank test and wald test are utilized to test whether a correlation exists or not, which is discussed in detail in later parts of the paper. The aim is to find out the most influential factors for the survival probability of lung cancer patients. To summarize the results, stage of cancer is always a significant factor for lung cancer survival, and time has to be taken into account when analyzing the survival rate of patients in our data sample, which is from TCGA. Future study on lung cancer is also required to make improvement for the treatment of lung cancer, as our data sample might not represent the overall condition of patients diagnosed with lung cancer;also, more appropriate and advanced models should be employed in order to reflect factors that can affect survival rate of patients with lung cancer in detail.
文摘In this article, we study a model of a single variable sampling plan with Type I censoring.Assume that the quality of an item in a batch is measured by a random variable which follows aWeibull distribution W(λ,m), with scale parameter A and shape parameter m having a gammadiscrete prior distribution or θ=1/λ and m having an inverse gamma-uniform prior distribution.The decision function is based on the Kaplan-Meter estimator. Then, the explicit expressions ofthe Bayes risk are derived. In addition, an algorithm is suggested so that an optimal samplingplan can be determined approximately after a finite number of searching steps.
基金supported by the National Natural Science Foundation of China under Grant No.71271128the State Key Program of National Natural Science Foundation of China under Grant No.71331006+4 种基金NCMISKey Laboratory of RCSDSCAS and IRTSHUFEPCSIRT(IRT13077)supported by Graduate Innovation Fund of Shanghai University of Finance and Economics under Grant No.CXJJ-2011-429
文摘It is of great interest to estimate quantile residual lifetime in medical science and many other fields. In survival analysis, Kaplan-Meier(K-M) estimator has been widely used to estimate the survival distribution. However, it is well-known that the K-M estimator is not continuous, thus it can not always be used to calculate quantile residual lifetime. In this paper, the authors propose a kernel smoothing method to give an estimator of quantile residual lifetime. By using modern empirical process techniques, the consistency and the asymptotic normality of the proposed estimator are provided neatly.The authors also present the empirical small sample performances of the estimator. Deficiency is introduced to compare the performance of the proposed estimator with the naive unsmoothed estimator of the quantile residaul lifetime. Further simulation studies indicate that the proposed estimator performs very well.