fix(inference): use config fields instead of hardcoded max_seq_length/load_in_4bit #28

Merged

sacredvoid merged 1 commit into main from fix/inference-config-hardcoded-params on Mar 26, 2026

fix(inference): use config fields instead of hardcoded max_seq_length/load_in_4bit#28
sacredvoid merged 1 commit intomainfrom
fix/inference-config-hardcoded-params

Conversation

@sacredvoid
Owner

Summary

  • Adds max_seq_length and load_in_4bit fields to InferenceConfig (defaults: 2048, True)
  • _load_unsloth now reads from config instead of hardcoding
  • Fixes silent truncation when training used max_seq_length=4096 but inference was hardcoded to 2048

Fixes #27

Test plan

  • All 152 tests pass
  • Defaults match previous hardcoded values (no breaking change)
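The change described above can be sketched as follows. This is a minimal illustration, not the repository's actual code: the field names `max_seq_length`, `load_in_4bit`, and the identifiers `InferenceConfig` / `_load_unsloth` come from the PR, while `model_path` and the `unsloth_kwargs` helper are hypothetical stand-ins for however the model path reaches the loader.

```python
from dataclasses import dataclass


@dataclass
class InferenceConfig:
    model_path: str = "outputs/model"  # hypothetical field for illustration
    max_seq_length: int = 2048         # new field; default matches old hardcoded value
    load_in_4bit: bool = True          # new field; default matches old hardcoded value


def unsloth_kwargs(config: InferenceConfig) -> dict:
    # Previously _load_unsloth hardcoded max_seq_length=2048 and
    # load_in_4bit=True; now both values flow from the config, so they
    # can match whatever was used during training.
    return {
        "model_name": config.model_path,
        "max_seq_length": config.max_seq_length,
        "load_in_4bit": config.load_in_4bit,
    }


# A user who trained at 4096 tokens can now run inference at 4096 too:
cfg = InferenceConfig(max_seq_length=4096)
print(unsloth_kwargs(cfg)["max_seq_length"])  # → 4096
```

Because the defaults equal the previously hardcoded values, existing configs that omit these fields behave exactly as before.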

…ead of hardcoding

_load_unsloth hardcoded max_seq_length=2048 and load_in_4bit=True
instead of reading from InferenceConfig. Added both fields to
InferenceConfig with matching defaults. Users training with
max_seq_length=4096 can now set it for inference too.

Fixes #27
@sacredvoid sacredvoid merged commit 5a167c3 into main Mar 26, 2026
@sacredvoid sacredvoid deleted the fix/inference-config-hardcoded-params branch March 26, 2026 00:44


Development

Successfully merging this pull request may close these issues.

Bug: ModelServer._load_unsloth hardcodes max_seq_length=2048, ignoring config
