Fit a generalized linear model (GLM) using clusters as predictors

FKM.glm() fits a generalized linear model (GLM) using clusters output from cluster.fitted() or cluster.coefs() as predictors, along with additional covariates.

Usage

FKM.glm(FKM_object, data, y, covariates, refclus = 1, family = "gaussian", ...)

Arguments

FKM_object: An object of class 'FKM.TPS' output from cluster.fitted() or cluster.coefs().
data: A data frame with the same subjects used for spline-fitting and clustering that includes an outcome variable of interest and optional covariates.
y: Name of the outcome variable (e.g. y="Death")
covariates: A vector of covariates of interest to be included in the model.
refclus: Numeric identification of the cluster to be used as the reference cluster. Default is cluster 1 (refclus=1). Use refclus=0 to identify the noise cluster as the reference cluster.
family: A description of the error distribution and link function to be used in the model.
...: Additional arguments for the glm() function.

Value

An object of class 'FKM.glm' containing the following components:

FKM_object The inputted object of class 'FKM.TPS'.
model_data A data frame containing the variables used in the model, including degree of cluster membership.
formula The formula used in the model.
family The family call used in the model.
covariates The covariates that were included.
model_full The GLM model using clusters as predictors and any additional covariates of interest.
model_noclusters The GLM model using the covariates of interest but no clusters.
anova ANOVA comparing the models with and without clusters as predictors.
anova_pval P-value for the ANOVA comparing the models with and without clusters as predictors.

Details

FKM.glm() applies the glm() function to fit a generalized linear model using clusters as predictors. Clusters are obtained using cluster.fitted() or cluster.coefs(), and the output object of class FKM.TPS is input into the FKM.glm() function, along with a dataset containing the output variable and additional covariates of interest. Clusters are included using the "partial assignment" method that employs the degree of cluster membership for each individual to account for uncertainty in the cluster assignment.

Examples