Create new kfold.print method by florence-bockting · Pull Request #342 · stan-dev/loo

florence-bockting · 2026-03-27T12:18:14Z

Description

In brms (PR#1869), we updated the kfold function such that it now also returns pareto-k diagnostics.

This PR suggests a new kfold.print method that prints additionally to the loo.print output information about pareto-k diagnostics.

TODO

add kfold.print method to support pareto-k diagnostics if they exist
add test data and unittests to ensure that pareto-k diagnostics are printed correctly with the updated method
add unittest that ensures that traditional behavior of kfold.print reduces to loo.print if no diagnostics$pareto_k in kfold object exists

codecov-commenter · 2026-03-27T12:23:36Z

Codecov Report

❌ Patch coverage is 91.66667% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 92.80%. Comparing base (03a6932) to head (9bd9ff0).
⚠️ Report is 52 commits behind head on master.

Files with missing lines	Patch %	Lines
R/print.R	91.66%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #342      +/-   ##
==========================================
+ Coverage   92.78%   92.80%   +0.02%     
==========================================
  Files          31       31              
  Lines        2992     3004      +12     
==========================================
+ Hits         2776     2788      +12     
  Misses        216      216

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

florence-bockting · 2026-03-27T12:31:33Z

The current behavior shows the following:

If all pareto-k are acceptable:

Based on 10-fold cross-validation.

           Estimate   SE
elpd_kfold   -284.7 10.0
p_kfold         2.4  0.6
kfoldic       569.3 20.1
------

All Pareto k estimates are good (k < 0.7).
See help('pareto-k-diagnostic') for details.

If some pareto-k are problematic:

Based on 10-fold cross-validation.

           Estimate     SE
elpd_kfold  -5521.0  713.1
p_kfold       318.5   97.9
kfoldic     11042.0 1426.3
------

Pareto k diagnostic values:
                         Count Pct.    Min. ESS
(-Inf, 0.7]   (good)     249   95.0%   <NA>    
   (0.7, 1]   (bad)        6    2.3%   <NA>    
   (1, Inf)   (very bad)   7    2.7%   <NA>    
See help('pareto-k-diagnostic') for details.

If no pareto-k diagnostics exist in kfold output structure:

Based on 10-fold cross-validation.

           Estimate     SE
elpd_kfold  -5521.0  713.1
p_kfold       318.5   97.9
kfoldic     11042.0 1426.3

As we have only pareto-k information in the diagnostics the Min. ESS column is always <NA>. Shall we somehow communicate this further to the user @avehtari ? I didn't want to change the default structure of the pareto-k-table, therefore I didn't consider further the option to simply delete this column.

florence-bockting · 2026-03-27T13:12:47Z

I don't really understand what the issue with the failing R-CMD-check for ubuntu-latest is. Do you have any idea what the problem on my side can be?

avehtari · 2026-03-27T13:41:54Z

As we have only pareto-k information in the diagnostics the Min. ESS column is always .

Or we could add the pointwise ESS's to diagnostics

jgabry · 2026-03-27T16:34:11Z

I don't really understand what the issue with the failing R-CMD-check for ubuntu-latest is. Do you have any idea what the problem on my side can be?

I think there's a bug in r-devel. I'm seeing this with cmdstanr too. I bet it will be fixed soon.

jgabry · 2026-03-27T16:41:13Z

tests/testthat/test_print_plot.R

+test_that("print.loo supports kfold with pareto-k diagnostics - calibrated", {
+  kfold1 <- readRDS("data-for-tests/kfold-calibrated.Rds")
+
+  expect_output(print(kfold1), "All Pareto k estimates are good")


Could we use expect_snapshot here to test the entire output since it should be deterministic, right? Same with the others below. I know that the existing print tests don't do that, but they were written a long time ago and maybe we should update them to do that too? (not in this PR but at some point)

Florence Bockting added 3 commits March 27, 2026 13:16

feat: update print.loo to support kfold pareto-k diagnostics

e0a8bcb

tests: add unittest for updated print method and test data

0efb8b1

refactor: create new kfold.print method instead of changing print.loo

5145c66

florence-bockting marked this pull request as ready for review March 27, 2026 13:10

florence-bockting requested review from avehtari and jgabry March 27, 2026 13:10

jgabry reviewed Mar 27, 2026

View reviewed changes

tests: use expect_snapshot to check table output

9bd9ff0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Create new kfold.print method#342

Create new kfold.print method#342
florence-bockting wants to merge 4 commits intostan-dev:masterfrom
florence-bockting:update-loo-print

florence-bockting commented Mar 27, 2026 •

edited

Loading

Uh oh!

codecov-commenter commented Mar 27, 2026 •

edited

Loading

Uh oh!

florence-bockting commented Mar 27, 2026

Uh oh!

florence-bockting commented Mar 27, 2026 •

edited

Loading

Uh oh!

avehtari commented Mar 27, 2026

Uh oh!

jgabry commented Mar 27, 2026 •

edited

Loading

Uh oh!

jgabry Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

florence-bockting commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

TODO

Uh oh!

codecov-commenter commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

florence-bockting commented Mar 27, 2026

Uh oh!

florence-bockting commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

avehtari commented Mar 27, 2026

Uh oh!

jgabry commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jgabry Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

florence-bockting commented Mar 27, 2026 •

edited

Loading

codecov-commenter commented Mar 27, 2026 •

edited

Loading

florence-bockting commented Mar 27, 2026 •

edited

Loading

jgabry commented Mar 27, 2026 •

edited

Loading