-
Notifications
You must be signed in to change notification settings - Fork 188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about preprocessing functions #37
Comments
Hi, great questions. From the data pipeline template:
The most thorough explanation can be found here
So TL:DR, in your case (non-recurrent events) it might be safe, but does it make sense for inference? I.e, when does your data arrive? I guess you want to predict About question 2: Yes this sounds correct! |
Hi,
I've two questions regarding the preprocessing functions:
prep_tensors
- the linesSimply throw away the first event, right? Is this a necessity? In my data, a significant portion of the chruners churn at the beginning, and I'd be happy to try and predict these, as well.
nanmask_to_keras_mask
function: As far as I understand, they
variable returned by this function is of dimension(n_subjects,t_timesteps,2)
, such thaty[i]
is the matrix whose rows are the different times and its columns are time-to-event and censoring indicator, respectively, for subjecti
. In my data, each subject is either churned or not churned (no recurrent events). This means that for each subject, the second column is either all ones (if it's a churned subject) or all zeros (if it's a censored subject); this, of course, without taking into account the 0.95 mask. Is this the correct input format for training the model?The text was updated successfully, but these errors were encountered: