-
Notifications
You must be signed in to change notification settings - Fork 75
Pull requests: vllm-project/speculators
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add FP8 quantization for hidden state data generation
needs-rebase
two-reviews
#461
opened Apr 22, 2026 by
shubhra
Collaborator
Loading…
feat(mtp): add MTP model architecture, converter, and stitcher
two-reviews
#452
opened Apr 21, 2026 by
rahul-tuli
Collaborator
Loading…
3 of 4 tasks
Add guidellm-based performance benchmarking utilities
documentation
Improvements or additions to documentation
#434
opened Apr 17, 2026 by
anmarques
Collaborator
Loading…
2 of 4 tasks
fix: resolve Pydantic v2 + transformers MRO conflicts
two-reviews
#431
opened Apr 16, 2026 by
ianliuy
Loading…
docs: add FastMTP documentation, benchmarks, and integration tests
documentation
Improvements or additions to documentation
#430
opened Apr 16, 2026 by
ianliuy
Loading…
Fix local_rank usage in distributed batch sampler for multi-node training
#427
opened Apr 15, 2026 by
ianliuy
Loading…
Add support for async hidden states connector
#424
opened Apr 14, 2026 by
fynnsu
Collaborator
Loading…
3 of 4 tasks
Update pytest requirement from ~=8.2.2 to ~=9.0.3
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
quality-failed
#421
opened Apr 14, 2026 by
dependabot
Bot
Loading…
Update setuptools-git-versioning requirement from <3,>=2.0 to >=3.0.1,<4
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#415
opened Apr 14, 2026 by
dependabot
Bot
Loading…
[WIP] P-eagle Training Phase 1 implementation
documentation
Improvements or additions to documentation
feat(fast-mtp): FastMTP speculative head finetuning for Qwen3-Next
documentation
Improvements or additions to documentation
training
#366
opened Mar 26, 2026 by
rahul-tuli
Collaborator
•
Draft
fix(datagen): fix VllmHiddenStatesGenerator for hybrid KV cache models and large datasets
#365
opened Mar 26, 2026 by
rahul-tuli
Collaborator
•
Draft
[Doc] Add a choosing algorithms section
documentation
Improvements or additions to documentation
#355
opened Mar 22, 2026 by
DonaghBr
Loading…
4 tasks done
add speculators finetune examples
documentation
Improvements or additions to documentation
needs-rebase
#339
opened Mar 10, 2026 by
Annarine
Loading…
4 tasks
feat: P-EAGLE Phase 1 - COD sampling, model, config, and loss utilities
needs-rebase
#327
opened Mar 4, 2026 by
NJX-njx
Loading…
docs: add algorithm documentation for EAGLE-3 and FastMTP
documentation
Improvements or additions to documentation
#326
opened Mar 4, 2026 by
NJX-njx
Loading…
Avoid D2H copies in VllmHiddenStatesGenerator to speed up offline data generation by x5
#322
opened Mar 2, 2026 by
qGentry
Loading…
Materialize non-layer modules after FSDP sharding
#314
opened Feb 27, 2026 by
guan404ming
Contributor
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.