I have a dataset consisting of questionnaires from patient survey data. There are around 10 questions which are asked during several stages of treatment like during first day of visit, after a week, after two weeks and so on till after 3 months. Now some patients dropout in between the treatment stage. The dataset I have consists of around 50 columns(10 questions repeated over 5 times during the course of treatment), but there are missing data for some patients as they dropout from the treatment.
My questions are:
how do I handle the missing data as it is not filled by the patient?
Should I impute that with mean values or is there any other way?
P.S.: I am new to survival analysis. So any help will be appreciated. Thanks in advance.
id age sex dropout s1_q1 s1_q2 s1_q3 s1_q4 s1_q5.... s5_q10 217 50 m 0 2 3 3 3 2 3 202 58 f 0 4 9 10 10 10 N/A 222 72 m 1 3 8 9 10 9 N/A 207 50 m 0 2 7 6 7 7 6 277 55 f 0 2 4 5 5 5 6 281 62 m 0 4 10 10 10 10 10