Fix docker builds and latest tag by codelion · Pull Request #270 · algorithmicsuperintelligence/optillm

codelion · 2025-11-08T00:55:22Z

Fixes #269

Introduces two new scripts: eval_imobench_answer.py for evaluating short-answer mathematical problems from the AnswerBench dataset, and eval_imobench_proof.py for evaluating rigorous proof problems from the ProofBench dataset. Both scripts support model evaluation, result saving, and detailed performance analysis.

Added a step to remove unused SDKs and prune Docker system volumes in both amd64 and arm64 Docker publish workflows. This helps prevent disk space issues during CI builds.

Co-Authored-By: Claude <noreply@anthropic.com>

codelion and others added 4 commits November 6, 2025 09:09

Free up disk space in Docker publish workflows

2558d80

Added a step to remove unused SDKs and prune Docker system volumes in both amd64 and arm64 Docker publish workflows. This helps prevent disk space issues during CI builds.

Bump version to 0.3.6

909502e

Co-Authored-By: Claude <noreply@anthropic.com>

Update __init__.py

5030c52

codelion merged commit 1d65beb into main Nov 8, 2025
3 checks passed

codelion deleted the feat-add-proofbench-answerbench-evals branch November 8, 2025 01:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix docker builds and latest tag#270

Fix docker builds and latest tag#270
codelion merged 4 commits intomainfrom
feat-add-proofbench-answerbench-evals

codelion commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

codelion commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant