BOT: Fix #1022: allow pairwise comparisons with two models by nikosbosse · Pull Request #1075 · epiforecasts/scoringutils

nikosbosse · 2026-02-13T01:35:32Z

Summary

Fixes Let get_pairwise_comparison() run with two models? #1022: get_pairwise_comparisons() now warns instead of erroring when there are exactly 2 models with one as the baseline
Changed cli_abort() to cli_warn() for the two-model-with-baseline case, allowing the function to proceed and compute the single score ratio
Fixed the "compairisons" typo in the warning message
The single-model case (truly insufficient comparators) still errors via pairwise_comparison_one_group()

Root cause

The check length(setdiff(comparators, baseline)) < 2 at line 161 of R/pairwise-comparisons.R called cli_abort(), blocking the legitimate use case of comparing 2 models where one is the baseline. While the pairwise comparison is just a single ratio in this case, it's still useful output.

What the fix does

Adds !is.null(baseline) guard so the check only applies when a baseline is specified
Changes cli_abort() to cli_warn() so the function proceeds with a warning
Improves the warning message to be more informative

Test coverage added

get_pairwise_comparisons() warns but works with two models and a baseline
add_relative_skill() warns but works with two models and a baseline
Two-model ratio is mathematically correct (deterministic test with known values)
Single-model case still errors (regression test)
Two models without baseline still works without warning (regression test)
Updated existing expect_error to expect_warning for the two-model-with-baseline case

Test plan

New tests pass
Full test suite passes (695 tests, 0 failures)
R CMD check: 0 errors, 0 warnings, 2 pre-existing notes

🤖 Generated with Claude Code

…aseline The check at line 161 of pairwise-comparisons.R called cli_abort() when fewer than 2 non-baseline models existed, blocking the legitimate case of comparing exactly 2 models where one is the baseline. Changed to cli_warn() so the function proceeds with a warning instead of erroring. Also fixed the "compairisons" typo and improved the warning message. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

codecov · 2026-02-13T01:38:33Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 97.83%. Comparing base (ac0c01a) to head (92468f7).
⚠️ Report is 93 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1075   +/-   ##
=======================================
  Coverage   97.83%   97.83%           
=======================================
  Files          35       35           
  Lines        1845     1845           
=======================================
  Hits         1805     1805           
  Misses         40       40

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

nikosbosse

CLAUDE: Approving (posted as comment since self-approval is blocked). Clean, correct fix. The !is.null(baseline) guard is well-considered — it ensures the single-model-without-baseline case still errors downstream at pairwise_comparison_one_group(). All edge cases (1 model, 2 models with/without baseline) are handled correctly. The deterministic mathematical correctness test (Test 3) with known score ratios is excellent. Typo fix and improved warning message are appropriate. No issues found.

nikosbosse · 2026-04-04T19:43:06Z

Duplicated

nikosbosse added a commit that referenced this pull request Feb 13, 2026

Pipeline: mark #1022 as implemented (PR #1075)

669c0cb

nikosbosse commented Feb 13, 2026

View reviewed changes

nikosbosse marked this pull request as draft February 13, 2026 08:26

nikosbosse changed the title ~~Fix #1022: allow pairwise comparisons with two models~~ BOT: Fix #1022: allow pairwise comparisons with two models Feb 13, 2026

nikosbosse closed this Apr 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BOT: Fix #1022: allow pairwise comparisons with two models#1075

BOT: Fix #1022: allow pairwise comparisons with two models#1075
nikosbosse wants to merge 1 commit intomainfrom
fix/1022-pairwise-two-models

nikosbosse commented Feb 13, 2026

Uh oh!

codecov bot commented Feb 13, 2026 •

edited

Loading

Uh oh!

nikosbosse left a comment

Uh oh!

nikosbosse commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

nikosbosse commented Feb 13, 2026

Summary

Root cause

What the fix does

Test coverage added

Test plan

Uh oh!

codecov bot commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

nikosbosse left a comment

Choose a reason for hiding this comment

Uh oh!

nikosbosse commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

codecov bot commented Feb 13, 2026 •

edited

Loading