perf: add microbenchmarks for crc32c and MRD reads by chandra-siri · Pull Request #17392 · googleapis/google-cloud-python

chandra-siri · 2026-06-08T10:56:25Z

Benchmarks for crc32c execution time and MRD reads with checksum enabled vs disabled.

gemini-code-assist

Code Review

This pull request introduces two new microbenchmark scripts: "benchmark_crc32c.py" to measure the execution time of "google_crc32c.value", and "benchmark_mrd_reads.py" to benchmark GCS Object Range Downloads using "AsyncMultiRangeDownloader" (MRD) with and without checksums. Feedback on "benchmark_mrd_reads.py" highlights two potential issues: hardcoding the object size to 10 GiB (which can cause out-of-range errors if a smaller object is specified) and a potential crash in "random.randint" if the requested download size exceeds the object size. Suggestions were provided to dynamically fetch the object size and add a guard clause.

gemini-code-assist · 2026-06-08T10:57:49Z

+    # 10 GiB in bytes
+    object_size_bytes = 10 * 1024 * 1024 * 1024
+
+    grpc_client = AsyncGrpcClient()


Hardcoding object_size_bytes to 10 GiB breaks the benchmark when a different --object is specified via the command line. If the specified object is smaller than 10 GiB, generating random offsets up to 10 GiB will result in out-of-range errors during download.

To fix this, dynamically retrieve the actual object size using the grpc_client.

Suggested change

# 10 GiB in bytes

object_size_bytes = 10 * 1024 * 1024 * 1024

grpc_client = AsyncGrpcClient()

grpc_client = AsyncGrpcClient()

try:

object_metadata = await grpc_client.get_object(args.bucket, args.object)

object_size_bytes = object_metadata.size

except Exception as e:

print(f"Warning: Could not retrieve object metadata ({e}). Defaulting to 10GiB.", file=sys.stderr)

object_size_bytes = 10 * 1024 * 1024 * 1024

gemini-code-assist · 2026-06-08T10:57:50Z

+    for size_str, size_bytes in sizes_to_test:
+        # Pre-generate random offsets so that both Enabled and Disabled configurations run on the exact same offsets
+        offsets = [random.randint(0, object_size_bytes - size_bytes) for _ in range(args.iterations)]


If size_bytes is larger than object_size_bytes, random.randint(0, object_size_bytes - size_bytes) will raise a ValueError: empty range for randrange() and crash the script.

Adding a guard clause to skip sizes larger than the object size makes the benchmark much more robust.

for size_str, size_bytes in sizes_to_test: if size_bytes > object_size_bytes: print(f"Skipping size {size_str} ({size_bytes} bytes) because it is larger than the object size ({object_size_bytes} bytes).", file=sys.stderr) continue # Pre-generate random offsets so that both Enabled and Disabled configurations run on the exact same offsets offsets = [random.randint(0, object_size_bytes - size_bytes) for _ in range(args.iterations)]

…benchmark

…r progress output

…esting Full and Full-1 ranges

…benchmark

…mark

perf: add microbenchmarks for crc32c and MRD reads

75d0127

gemini-code-assist Bot reviewed Jun 8, 2026

View reviewed changes

chandra-siri added 10 commits June 8, 2026 12:23

perf: add warmup phase to MRD reads benchmark

02fa59c

perf: update warmup chunk size to 10MiB

891a981

perf: make GCS object size configurable in MRD reads benchmark

88f0e21

perf: add % change when checksum disabled column to MRD reads report

1dbb5f2

perf: support pre-upload and full range downloads in MRD reads benchmark

fc85a68

perf: align upload/download sizes and use random temp objects in MRD …

0334d2c

…benchmark

perf: migrate prints to stderr logging, introducing --debug option fo…

f268e4f

…r progress output

perf: skip Full-1 case when checksum validation is disabled

1fe6c69

perf: add pytest-benchmark test for checksum overhead in MRD reads

9403ed4

perf: calculate and report average throughput in test_checksum_overhead

bc0ac8d

parthea assigned chandra-siri Jun 8, 2026

chandra-siri added 3 commits June 8, 2026 16:55

perf: convert test parameter to (object_size, download_size) tuple, t…

3efb011

…esting Full and Full-1 ranges

perf: upload fresh object for each enable_chk iteration in MRD reads …

183e297

…benchmark

perf: compare Full-1 throughput with Full baseline in MRD reads bench…

39d633c

…mark

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: add microbenchmarks for crc32c and MRD reads#17392

perf: add microbenchmarks for crc32c and MRD reads#17392
chandra-siri wants to merge 14 commits into
mainfrom
perf-checksum-benchmarks

chandra-siri commented Jun 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

chandra-siri commented Jun 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant