I am building a model to predict customer churn using survival analysis, COX regression in particular. While it is pretty straight forward to collect churn events, it is a little tricky to sample for the not churned events. I think it is safe to include the services that were renewed at the end of term as non-event. Due to the nature of our services (subscritpion based), the contract terms are somewhere between 3 and 10 years. So it is not surprising the churn events and renewals usually happen at the end of term.
However there are many more active services with different months in their terms (tenure). Should I consider them as non-event? One of the potential problem is that it will make the dataset extremely imbalanced. The month-in-term of current services is a wide range. Some of them just started a month ago. Maybe I need to selectively chose some of the active services with longer tenure. What would be a good approach of sampling from these current non-event samples?