-
Notifications
You must be signed in to change notification settings - Fork 8
Allow or help handle epix_as_of
last recorded version even when this version has no update
#109
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
To make this more concrete, using HHS COVID-19 hospitalization data; suppose every Monday we prepare a forecast of COVID-19 admissions based on recent COVID-19 admissions.
yields [at time of running; should have set the end issue to something before today...]
So on 12 of these Mondays, even if we made forecasts at the very end of the day, we would not have a nonempty patch on that day. Earlier in the day, we might have more Mondays with this situation cropping up. We have to consider various possibilities
|
Part 1, in progress:
(Should require Part 2: functions/parameters that work off of this information. These might belong here or might belong in |
Part 1 is addressed in #101. For Part 2: The |
The |
Is there still some work to be done here? I don't quite follow the discussion, but wonder if step_adjust_latency has mitigated some of the core concerns. If there's more to be done, then we probably need to clarify that and then icebox it. In any case, I'm going to unassign myself from it. |
We might decide to produce forecasts So there are maybe three potential remaining elements:
|
When preparing prospective production forecasts, we may want to be able to mirror
epix_slide
orepix_as_of
operations that we use to prepare pseudoprospective analyses. However, this may involve getting a snapshotepix_as_of
a version with no update data (e.g., due to a nondaily update cadence, a holiday or some other occurrence causing the data source to be stale, or running the forecasts earlier than the data source has been updated and not being able to distinguish this from the data source responding that there has been no change). Currently, unless we have some "redundant"DT
rows duplicating previous values with the latest no-change version, this is going to raise an error (max_version
>self_max
). (This also means that compactification (#101) could change the error behavior here.) We should consider one of the following:self_max
, and would allow this operation to go through with a warning rather than an error, ormax(DT$version)
<forecast_ref_time
), do we interpret this as if the data source reported no change (unless it looks really stale) and allow getting the snapshot, or do we interpret it as an issue fetching the data and, up to some point, allow ourselves to back up toepix_as_of
the max DT version, providing this as a parameter to the slide function to adjust things? These are actually different options when you think about fetching backfill-aware training sets and forecast aheads relative to the snapshot version.The text was updated successfully, but these errors were encountered: