Skip to content

grpo_compute_loss_slow called with wrong positional args#4887

Merged
danielhanchen merged 1 commit intounslothai:mainfrom
jonahsamost:main
Apr 15, 2026
Merged

grpo_compute_loss_slow called with wrong positional args#4887
danielhanchen merged 1 commit intounslothai:mainfrom
jonahsamost:main

Conversation

@jonahsamost
Copy link
Copy Markdown
Contributor

fixes #4885

Simple fix to make sampling_per_token_logps a positional arg not a kwarg

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@Datta0
Copy link
Copy Markdown
Collaborator

Datta0 commented Apr 7, 2026

So trl changed the positions of the arguments? Is that what caused the bug?

@danielhanchen danielhanchen added auto-reviewing PR is being auto-reviewed auto-approved Auto-review passed, ready to merge and removed auto-reviewing PR is being auto-reviewed auto-approved Auto-review passed, ready to merge labels Apr 11, 2026
@danielhanchen danielhanchen merged commit 777e1bd into unslothai:main Apr 15, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug] GRPO: grpo_compute_loss_slow called with wrong positional args

3 participants