Handle missing forecasts before summarisation by seabbs-bot · Pull Request #1156 · epiforecasts/scoringutils

seabbs-bot · 2026-03-30T20:51:08Z

Summary

Closes #1122

Adds filter_missing_scores() with strategy pattern for dropping incomplete targets
Adds impute_missing_scores() with strategy pattern for filling missing score rows
Filter strategies: filter_to_intersection(min_coverage, models)
Impute strategies: impute_worst_score(), impute_mean_score(), impute_na_score(), impute_model_score(model)
Shared internal build_missing_grid() detects missing model × target combinations
.imputed column added to protected columns for tracking

Test plan

21 filter tests pass (coverage thresholds, model subsetting, class preservation)
35 impute tests pass (all strategies, custom functions, integration with summarise_scores())
Full test suite (788 tests) passes with no regressions
Integration pipeline: filter_missing_scores(strategy = filter_to_intersection(models = "baseline")) |> impute_missing_scores(strategy = impute_model_score("baseline"))

This was opened by a bot. Please ping @seabbs for any questions. # checks-passed

codecov · 2026-03-30T20:56:20Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 98.15%. Comparing base (4a222da) to head (1cf4beb).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1156      +/-   ##
==========================================
+ Coverage   97.98%   98.15%   +0.16%     
==========================================
  Files          37       40       +3     
  Lines        1984     2163     +179     
==========================================
+ Hits         1944     2123     +179     
  Misses         40       40

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

seabbs-bot · 2026-03-30T20:59:51Z

Review

Design feedback (from @seabbs)

Should this even be filter_missing_scores or might people want to filter for other reasons, with the intersection being a special case? Not sure the filter even needs to know the missing grid.

Agreed. Renamed filter_missing_scores() → filter_scores() — a general-purpose filtering function where filter_to_intersection() is just one strategy. Removed the redundant build_missing_grid() call from the filter path entirely. The strategy handles everything independently.

Does the filter always need to do the expand grid thing?

No. filter_to_intersection() works purely by counting coverage per target combination. It never calls build_missing_grid(). The expand-grid detection is only needed on the impute side.

Code review findings (addressed)

compare hardcoded as "model" in all impute strategies — impute_worst_score(), impute_mean_score(), impute_model_score() all did setdiff(fu, "model") instead of using the actual compare value. The strategy signature didn't receive compare at all. Fixed: strategy signature changed to function(scores, missing_rows, metrics, compare), all hardcoded "model" references replaced.
build_missing_grid() called redundantly in filter — result stored but never passed to strategy. Fixed: removed entirely from filter path.
No validation that compare column exists — both functions now have assert_subset(compare, names(scores)).
Inconsistent @keywords — all functions now use @keywords handle-metrics.
Weak test for impute_worst_score() — was checking global max, now verifies per-target-combination maximum.

This was opened by a bot. Please ping @seabbs for any questions.

nikosbosse · 2026-03-31T14:36:53Z

I'll wait until you tag me for review, right?

seabbs

I think this all works as expected. I was a bit uncertain that compare was the correct name but I think its the best option overall.

seabbs

Snapshots look like a ggplot2 change

…actories Tests for impute_missing_scores(), impute_worst_score(), impute_mean_score(), impute_na_score(), impute_model_score(), and integration with summarise_scores.

…_to_intersection Add shared internal build_missing_grid() for detecting missing model-target combinations. Add filter_missing_scores() with strategy pattern and filter_to_intersection() strategy factory. Add .imputed to protected columns and globalVariables. Update test for protected columns. All tests green. Ref #1122

Add impute_missing_scores() with .imputed column tracking, plus four strategy factories: impute_worst_score(), impute_mean_score(), impute_na_score(), and impute_model_score(). Fix data.table scoping issues in closures by using explicit namespacing and avoiding variable name collisions with column names.

Generalise the function as a strategy-based score filter that delegates all logic to the strategy function. Remove the redundant build_missing_grid() call and early-return check. Add validation that the compare column exists in scores. Rename source and test files accordingly.

@Keywords

- Add compare to strategy function signature (4th argument) - Replace hardcoded "model" with compare in all strategies - Add assert_subset validation for compare column - Change @Keywords from scoring to handle-metrics - Strengthen impute_worst_score test to verify per-target max

- Rename filter_to_intersection(models=) to include= for genericity - Add validation for unknown values in include argument - Fix cli_abort messages with embedded whitespace - Add non-default compare tests for both filter and impute - Pass compare through to all impute strategies (was already done, now tested)

Restructure the handling-missing-forecasts vignette to introduce concepts incrementally with summarise_scores() shown after each approach. Add cli_inform() messages to impute_missing_scores() to match the pattern used by filter_scores(). Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Add tests for edge cases in imputation strategies: - impute_worst_score/impute_mean_score skip metrics not in columns - impute_model_score errors for nonexistent reference model Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Moved to a separate issue (#1159). Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

- Rename "Relaxing with min_coverage" to "Requiring partial coverage" with clearer explanation of why 0.75 keeps all targets here - Mention compare argument early in the Scoring section - Reorder imputation strategies by severity: NA (diagnostic), worst (heavy penalty), reference model (moderate), mean (least severe) - Show imputed row counts by model and target_type instead of raw rows - Add context to each strategy about when to use it - Fix combined workflow text Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

- filter_to_intersection(include) with scores_quantile verifies EpiNow2's targets are correctly selected - filter_to_intersection(min_coverage) boundary: 0.75 keeps cases, 1.0 drops them - filter then summarise gives equal target counts per model - impute then summarise gives equal row counts per model - impute_model_score values match reference model scores exactly Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Vignette: - Rename "Relaxing with min_coverage" to "Requiring partial coverage" with explanation of why 0.75 keeps all targets here - Mention compare argument early in the Scoring section - Reorder imputation: NA, worst, reference model, mean (severity order) - Show imputed row counts by model/target_type not raw rows - Add context to each strategy about when to use it - Soften absolute claims Tests: - Wrap filter_scores/impute_missing_scores calls in suppressMessages to reduce test output noise, matching existing test conventions Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Move filter_scores, filter_to_intersection, and impute_* functions from handle-metrics to a new postprocess-scores keyword and pkgdown section, separating them from metric selection functions. Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

…r messages - impute_mean_score now verifies imputed values match per-target mean - imputation preserves original score values (compared as numeric to handle logical-to-integer coercion from rbindlist) - filter_scores reports correct "Filtered out N rows" message Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

seabbs self-requested a review March 30, 2026 20:51

seabbs-bot force-pushed the issue-1122-missing-scores branch from a2b0160 to 1c2996e Compare March 30, 2026 20:53

seabbs approved these changes Mar 31, 2026

View reviewed changes

seabbs requested review from nikosbosse and sbfnk March 31, 2026 17:13

seabbs approved these changes Apr 1, 2026

View reviewed changes

seabbs-bot force-pushed the issue-1122-missing-scores branch from bc98a89 to 8d0c866 Compare April 1, 2026 09:44

seabbs-bot and others added 20 commits April 1, 2026 11:01

test(red): add failing tests for impute_missing_scores and strategy f…

42bf377

…actories Tests for impute_missing_scores(), impute_worst_score(), impute_mean_score(), impute_na_score(), impute_model_score(), and integration with summarise_scores.

style: fix lint warnings in impute tests

3e1298d

style: suppress lint warning for internal function reference

b15c5cf

fix: update integration test to use renamed filter_scores

1d50511

style: fix indentation in impute test

06b65e7

style: fix redundant_equals_linter in test

6ca383f

docs: add vignette for handling missing forecasts

797b926

docs: add vignette and NEWS entry for missing scores handling

048be7c

docs: improve vignette clarity and fix review issues

3d266ab

style: put cli_inform message on single line

3fde364

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

fix: use cli::qty() for correct pluralisation in impute message

3ff432a

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

docs: credit Kim et al (2026) as inspiration for missing scores handling

3490100

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

docs: soften absolute claims in vignette prose

e11bc62

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

seabbs-bot and others added 11 commits April 1, 2026 11:01

docs: add articles section to pkgdown config

8856e8d

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

docs: rename articles group from Getting started to Articles

bf60b74

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

revert: remove articles section from pkgdown config

12ef1a7

Moved to a separate issue (#1159). Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

style: use bare i = in cli_inform to avoid nolint blocks

a7934c6

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

style: remove unhelpful comment in build_missing_grid

7c88db2

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

test: update vdiffr plot snapshots after merge from main

1cf4beb

Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

seabbs-bot force-pushed the issue-1122-missing-scores branch from 8d0c866 to 1cf4beb Compare April 1, 2026 10:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle missing forecasts before summarisation#1156

Handle missing forecasts before summarisation#1156
seabbs-bot wants to merge 31 commits intomainfrom
issue-1122-missing-scores

seabbs-bot commented Mar 30, 2026

Uh oh!

codecov bot commented Mar 30, 2026 •

edited

Loading

Uh oh!

seabbs-bot commented Mar 30, 2026

Uh oh!

nikosbosse commented Mar 31, 2026

Uh oh!

seabbs left a comment

Uh oh!

seabbs left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

seabbs-bot commented Mar 30, 2026

Summary

Test plan

Uh oh!

codecov bot commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

seabbs-bot commented Mar 30, 2026

Review

Design feedback (from @seabbs)

Code review findings (addressed)

Uh oh!

nikosbosse commented Mar 31, 2026

Uh oh!

seabbs left a comment

Choose a reason for hiding this comment

Uh oh!

seabbs left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Mar 30, 2026 •

edited

Loading