perf(spanner): optimize query result decoding by olavloite · Pull Request #17375 · googleapis/google-cloud-python

olavloite · 2026-06-04T14:40:49Z

Work in progress.

Optimizes the decoding and reading of (large) result sets for Spanner.

gemini-code-assist

Code Review

This pull request optimizes performance across Spanner streaming and helper modules by caching list append methods, pre-compiling regular expressions, and replacing helper functions with direct attribute getters or lambdas. However, several critical issues were identified in _helpers.py: missing imports for datetime and base64 will cause runtime errors, BYTES values are incorrectly encoded to UTF-8 instead of being base64-decoded, and the new timestamp parsing logic ignores timezone offsets for non-UTC inputs, resulting in incorrect times.
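The BYTES point comes down to two different operations on the same wire string. The sketch below only contrasts them; it uses a plain string in place of value_pb.string_value, the function names are hypothetical, and it makes no claim about which result the client is meant to return.

```python
import base64

# Stand-in for value_pb.string_value: a base64 string carrying raw bytes.
wire_value = base64.b64encode(b"\x00\xffhello").decode("ascii")


def encode_utf8(string_value: str) -> bytes:
    """Return the base64 text itself as bytes (no decoding happens)."""
    return string_value.encode("utf-8")


def decode_base64(string_value: str) -> bytes:
    """Return the raw bytes that the base64 text represents."""
    return base64.b64decode(string_value)


as_utf8 = encode_utf8(wire_value)   # still base64 text, just as bytes
as_raw = decode_base64(wire_value)  # the original payload
```

The two results differ for every non-empty payload, so a test that pins down the expected one guards against an accidental switch.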

olavloite · 2026-06-04T15:50:28Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces several performance optimizations to the Spanner client library, such as caching local variables and methods during row merging, using lambdas and operator.attrgetter for type decoding, pre-compiling regex patterns, and optimizing timestamp parsing. The reviewer provided valuable feedback to further boost performance, including checking the more common number_value field first for float decoding, using string slicing instead of regular expressions for timestamp parsing, and removing the unused re import.
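As a rough illustration of the "caching local variables and methods" idea, the sketch below binds list.append to a local name once instead of looking it up on every iteration. The function name and the use of int as a decoder are assumptions for the example, not code from this PR.

```python
def decode_column(raw_values, decoder):
    """Decode every raw value with `decoder` and collect the results."""
    decoded = []
    # Resolve the bound method once; the loop then calls a local name
    # instead of repeating the attribute lookup for each value.
    append = decoded.append
    for raw in raw_values:
        append(decoder(raw))
    return decoded


ints = decode_column(["1", "2", "3"], int)
```

The saving per call is tiny, so this only pays off in hot loops such as decoding very large result sets.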

olavloite · 2026-06-04T16:30:01Z

The time needed to decode 100,000 rows is reduced from approximately 6-7s to approximately 3-3.5s. The increasing gap between the two lines is due to the nature of this benchmark, which simulates a randomized Poisson load. A small burst in traffic is enough to completely overload the old implementation, causing requests to queue up and take exponentially longer.

rahul2393

LGTM - with minor comments/questions and test assert fix

rahul2393 · 2026-06-04T18:26:09Z

+        if val[19] == ".":
+            if val.endswith("Z"):
+                offset = "Z"
+                fraction = val[20:-1]


Spanner TIMESTAMP values seem to always be UTC (Z) at the API/wire level. Non-Z offsets only come from CAST(ts AS STRING), which produces a STRING column and routes through the STRING decoder.

So the +/-HH:MM handling plus the astimezone call will never be executed on a real TIMESTAMP value? Can we drop it to shrink the risky code, or add a comment marking it as deliberately defensive? The Z fast path is the legitimate win and short-circuits before this anyway.
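A Z-only parser along the lines suggested here could look roughly like the sketch below. It assumes the fixed layout YYYY-MM-DDTHH:MM:SS[.fraction]Z, uses only the standard library, and returns the nanoseconds separately instead of building a DatetimeWithNanoseconds. The function name is hypothetical and this is not the PR's implementation.

```python
from datetime import datetime, timezone


def parse_utc_timestamp(val: str):
    """Parse 'YYYY-MM-DDTHH:MM:SS[.fraction]Z' by slicing.

    Returns (datetime in UTC with microsecond precision, nanoseconds).
    """
    if not val.endswith("Z"):
        # Deliberately no +/-HH:MM handling in this sketch.
        raise ValueError(f"expected a UTC 'Z' timestamp, got {val!r}")
    nanos = 0
    if len(val) > 20 and val[19] == ".":
        fraction = val[20:-1]
        # Right-pad to 9 digits so "5" means 500000000 ns.
        nanos = int(fraction.ljust(9, "0")[:9])
    parsed = datetime(
        int(val[0:4]), int(val[5:7]), int(val[8:10]),
        int(val[11:13]), int(val[14:16]), int(val[17:19]),
        nanos // 1000,
        tzinfo=timezone.utc,
    )
    return parsed, nanos


stamp, nanos = parse_utc_timestamp("2026-06-04T14:40:49.123456789Z")
```

Rejecting non-Z input outright keeps the code path small; whether to keep a defensive offset branch instead is the open question in this thread.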

rahul2393 · 2026-06-04T18:27:49Z

+
+        parsed = self._callFUT(value_pb, field_type, field_name)
+        self.assertIsInstance(parsed, datetime_helpers.DatetimeWithNanoseconds)
+        self.assertEqual(parsed, expected)


DatetimeWithNanoseconds.__eq__ is inherited from datetime and compares only to microsecond precision; it ignores .nanosecond. So this passes even if the sub-microsecond digits are wrong.
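The effect can be reproduced without the library. StampWithNanos below is a hypothetical stand-in for DatetimeWithNanoseconds (a datetime subclass that adds a nanosecond attribute and does not override equality), used only to show why the assertion needs to compare .nanosecond as well.

```python
from datetime import datetime, timezone


class StampWithNanos(datetime):
    """Hypothetical stand-in: a datetime that also carries nanoseconds."""

    def __new__(cls, *args, nanosecond=0, **kwargs):
        inst = super().__new__(cls, *args, **kwargs)
        inst._nanosecond = nanosecond
        return inst

    @property
    def nanosecond(self):
        return self._nanosecond


# Same microseconds, different sub-microsecond digits.
parsed = StampWithNanos(2026, 6, 4, 14, 40, 49, 123456,
                        tzinfo=timezone.utc, nanosecond=123456000)
expected = StampWithNanos(2026, 6, 4, 14, 40, 49, 123456,
                          tzinfo=timezone.utc, nanosecond=123456789)

equal_as_datetimes = parsed == expected                 # inherited __eq__
equal_nanos = parsed.nanosecond == expected.nanosecond  # explicit check
```

In the test under review, a second assertion such as self.assertEqual(parsed.nanosecond, expected.nanosecond) would close the gap.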

rahul2393 · 2026-06-04T18:29:24Z

+                    index = 0
+        else:
+            for value in values:
+                if value.HasField("null_value"):


value.HasField("null_value") duplicates _parse_nullables's semantics (still used in the lazy/array/single-value paths). Should we add a comment here to keep them in sync so the duplication doesn't silently drift?

rahul2393 · 2026-06-04T18:31:01Z

+        return operator.attrgetter("bool_value")
    elif type_code == TypeCode.INT64:
-        return _parse_int64
+        return lambda value_pb: int(value_pb.string_value)


Maybe add a comment on why only STRING/BOOL use attrgetter (faster) while the transforming types use lambdas?
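The distinction being asked about can be sketched as follows: attrgetter is enough when the Python value is the field itself, while a type such as INT64 (read from string_value in the diff above) needs a conversion step, which is what the lambda adds. SimpleNamespace stands in for the protobuf Value here, and the variable names are hypothetical.

```python
import operator
from types import SimpleNamespace

# Pass-through types: the decoded value is the field as-is.
get_string = operator.attrgetter("string_value")
get_bool = operator.attrgetter("bool_value")

# Transforming type: the field has to be converted after it is read.
parse_int64 = lambda value_pb: int(value_pb.string_value)

# Stand-in for a protobuf Value; not the real message class.
value = SimpleNamespace(string_value="42", bool_value=True)

as_string = get_string(value)
as_bool = get_bool(value)
as_int = parse_int64(value)
```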

perf(spanner): optimize query result decoding

9d80835

gemini-code-assist Bot reviewed Jun 4, 2026

Comment thread packages/google-cloud-spanner/google/cloud/spanner_v1/_helpers.py Outdated

Comment thread packages/google-cloud-spanner/google/cloud/spanner_v1/_helpers.py

Comment thread packages/google-cloud-spanner/google/cloud/spanner_v1/_helpers.py Outdated

perf: further optimizations + timestamp fix

7820f5c

gemini-code-assist Bot reviewed Jun 4, 2026

Comment thread packages/google-cloud-spanner/google/cloud/spanner_v1/_helpers.py Outdated

Comment thread packages/google-cloud-spanner/google/cloud/spanner_v1/_helpers.py

Comment thread packages/google-cloud-spanner/google/cloud/spanner_v1/_helpers.py Outdated

chore: address review comments

10173de

olavloite marked this pull request as ready for review June 4, 2026 16:30

olavloite requested a review from a team as a code owner June 4, 2026 16:30

olavloite changed the title ~~perf(spanner): [WIP] optimize query result decoding~~ perf(spanner): optimize query result decoding Jun 4, 2026

rahul2393 approved these changes Jun 4, 2026

olavloite wants to merge 3 commits into main from spanner-optimize-decoding

2 participants
