fix(eval): exclude booleans from parsed benchmark metrics by sacredvoid · Pull Request #30 · sacredvoid/alignrl

sacredvoid · 2026-03-26T00:45:39Z

Summary

Adds and not isinstance(v, bool) guard to the metric filter in parse_results
bool is a subclass of int in Python, so isinstance(True, (int, float)) returns True
Boolean metadata from lm-eval (e.g. config flags) was leaking into benchmarks as 1/0

Fixes #29

Test plan

New test: test_filters_booleans verifies True/False are excluded
All 153 tests pass

bool is a subclass of int in Python, so isinstance(True, (int, float)) returns True. Boolean metadata from lm-eval results was leaking into benchmark metrics as 1/0. Added explicit bool exclusion. Fixes #29

fix(eval): exclude booleans from parsed benchmark metrics

7df0024

bool is a subclass of int in Python, so isinstance(True, (int, float)) returns True. Boolean metadata from lm-eval results was leaking into benchmark metrics as 1/0. Added explicit bool exclusion. Fixes #29

sacredvoid merged commit 2fba9ba into main Mar 26, 2026

sacredvoid deleted the fix/parse-results-bool-filter branch March 26, 2026 00:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(eval): exclude booleans from parsed benchmark metrics#30

fix(eval): exclude booleans from parsed benchmark metrics#30
sacredvoid merged 1 commit intomainfrom
fix/parse-results-bool-filter

sacredvoid commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sacredvoid commented Mar 26, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant