benchmark

Benchmark MAX

This directory contains tools to benchmark the performance of an LLM model server—measuring throughput, latency, and resource utilization. You can use these scripts to compare other serving backends, namely vLLM, against MAX.

The benchmark_serving.py script is adapted from vLLM, licensed under Apache 2.0. We forked this script to ensure consistency with vLLM's measurement methodology and extended it with features we found helpful, such as client-side GPU metric collection via max.profiler.

benchmark_serving.py supports:

text generation
text-to-image generation
image-to-image generation via /v1/responses

For image-to-image benchmarks, use:

--dataset-name local-image --dataset-path /path/to/file.jsonl for a generic local JSONL dataset with prompt and image_path rows
--dataset-name synthetic-pixel for a synthetic image-edit workload backed by a generated local placeholder image

Example local-image JSONL:

{"prompt": "Turn this into watercolor", "image_path": "images/sample.png"}
{"prompt": "Replace the sky with a sunset", "image_path": "/abs/path/to/photo.jpg"}

synthetic-pixel is analogous to the placeholder-image path used in diffusion serving benchmarks such as vLLM-Omni: MAX generates a white PNG in the system temp directory and reuses it for each request in the run.

For benchmark_serving.py usage instructions, see Benchmarking a MAX endpoint.

Note

This benchmarking script is also available with the max benchmark command, which you can get by installing modular with pip, uv, conda, or pixi package managers. Try it now by following the detailed guide to benchmark MAX on GPUs.

Name		Name	Last commit message	Last commit date
parent directory ..
benchmark_shared		benchmark_shared
configs		configs
.gitattributes		.gitattributes
.gitignore		.gitignore
BUILD.bazel		BUILD.bazel
README.md		README.md
benchmark_config.md		benchmark_config.md
benchmark_serving.py		benchmark_serving.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
serve_replay.py		serve_replay.py
sweep_benchmark_serving.py		sweep_benchmark_serving.py
sweep_benchmark_serving_result_utils.py		sweep_benchmark_serving_result_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Benchmark MAX

FilesExpand file tree

benchmark

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmark

Folders and files

parent directory

README.md

Benchmark MAX