python: add support for adhoc query as pyarrow table by monochromatti · Pull Request #5814 · feldera/feldera

monochromatti · 2026-03-13T06:51:09Z

Ran tests locally against a running Feldera API.

From python/:

Full Python SDK suite (excluding tests/runtime_aggtest):
- uv run python -m pytest tests/ --ignore=tests/runtime_aggtest -ra
- Local result: 122 passed, 45 skipped
Targeted reruns:
- uv run python -m pytest tests/platform/test_shared_pipeline.py::TestPipeline::test_adhoc_query_arrow -q
- uv run python -m pytest tests/unit/test_query_as_arrow.py -q

Checklist

Unit tests added/updated
Integration tests added/updated
Documentation updated
Changelog updated

Breaking Changes?

Mark if you think the answer is yes for any of these components:

OpenAPI / REST HTTP API / feldera-types / manager (What is a breaking change?)
Feldera SQL (Syntax, Semantics)
feldera-sqllib (incl. dependencies fxp, etc.) (What is a breaking change?)
Python SDK (What is a breaking change?)
fda (CLI arguments)
Adapters (including configuration)
Storage Format / Checkpoints
Others (specify)

Describe Incompatible Changes

None.

Summary

This PR adds Arrow IPC query support to the Python SDK so ad-hoc query results can be consumed as streamed PyArrow record batches.

What changed

Added a new client API:
- FelderaClient.query_as_arrow(pipeline_name, query) -> Generator[pyarrow.RecordBatch, None, None]
Added a pipeline convenience method:
- Pipeline.query_arrow(query) -> Generator[pyarrow.RecordBatch, None, None]
Added optional Arrow dependency extra:
- pip install "feldera[arrow]"
Updated Python README with Arrow installation guidance
Added unit and platform tests for Arrow IPC query behavior

Notes

The Arrow response is consumed from an HTTP stream (stream=True) and yielded batch-by-batch.
Users can materialize a pyarrow.Table when desired via pyarrow.Table.from_batches(...).

mythical-fred

LGTM — but see inline: there is an existing open PR covering the same feature.

python/feldera/rest/feldera_client.py

gz · 2026-03-13T16:39:29Z

hi @monochromatti this looks good thanks a lot for your contribution. @abhizer can you review this

monochromatti · 2026-03-13T16:43:58Z

I'd like input on whether to return Generator[pyarrow.RecordBatch, ...] or a pyarrow.Table directly. The latter is the current state of the PR, but after some thinking it feels like generating batches is more in style with similar existing functionality and better suited for big payloads.

abhizer

Thank you!

As a heads up, the reason we didn't merge the prior PR is because the server intermittently sent bad data and we were unable to figure out why.

abhizer · 2026-03-13T17:07:47Z

I'd like input on whether to return Generator[pyarrow.RecordBatch, ...] or a pyarrow.Table directly

We normally return a generator, and it might be a good idea to keep this behavior consistent.

mihaibudiu · 2026-03-18T18:33:50Z

@monochromatti please re-request a review from @abhizer when this is ready again

abhizer

Thank you!

monochromatti · 2026-04-04T10:26:43Z

Rebased on main to solve a uv.lock conflict

mythical-fred

LGTM

monochromatti · 2026-04-04T14:20:37Z

Sorry I might be missing something, but the PR still requires an approval to run CI?

abhizer · 2026-04-04T14:28:07Z

Done!

mythical-fred · 2026-04-05T07:23:53Z

The "Pre Merge Queue Tasks" CI failure looks transient — the failing step is the Rust build check, but this PR has no Rust changes. The same step failed and then passed for other PRs around the same time. Could someone re-trigger CI?

mythical-fred · 2026-04-06T07:38:48Z

CI is still showing a failure on "Pre Merge Queue Tasks" from Apr 4 — looks like nobody re-triggered it yet. Could someone queue a fresh run? This is a Python-only PR and that step has been transiently failing for unrelated Rust check reasons.

abhizer · 2026-04-06T15:21:48Z

You might have to run "ruff format" for it to pass the pre merge queue.

mythical-fred

LGTM

Signed-off-by: Mattias Matthiesen <mattias.matthiesen@eviny.no>

monochromatti · 2026-04-08T05:44:24Z

Updated PR body and solved uv.lock conflict (exclude-newer timestamp). @abhizer

abhizer · 2026-04-08T14:18:31Z

Thank you!

monochromatti force-pushed the arrow-ipc-sdk branch 2 times, most recently from 4065f37 to edcaa7e Compare March 13, 2026 06:54

mythical-fred approved these changes Mar 13, 2026

View reviewed changes

python/feldera/rest/feldera_client.py Show resolved Hide resolved

monochromatti mentioned this pull request Mar 13, 2026

py: support arrow_ipc format for adhoc queries #4226

Open

gz requested a review from abhizer March 13, 2026 16:39

abhizer approved these changes Mar 13, 2026

View reviewed changes

monochromatti force-pushed the arrow-ipc-sdk branch from edcaa7e to dd5c74e Compare March 18, 2026 13:06

monochromatti force-pushed the arrow-ipc-sdk branch 2 times, most recently from 379bfe8 to 5f06e6a Compare March 24, 2026 12:16

monochromatti requested a review from abhizer March 24, 2026 12:18

abhizer approved these changes Apr 2, 2026

View reviewed changes

abhizer changed the title ~~arrow ipc sdk~~ python: add support for adhoc query as pyarrow table Apr 2, 2026

monochromatti force-pushed the arrow-ipc-sdk branch from 5f06e6a to 2cc02ae Compare April 4, 2026 10:15

monochromatti requested a review from mythical-fred April 4, 2026 10:17

mythical-fred approved these changes Apr 4, 2026

View reviewed changes

monochromatti force-pushed the arrow-ipc-sdk branch from 2cc02ae to d0a2187 Compare April 7, 2026 05:40

mythical-fred approved these changes Apr 7, 2026

View reviewed changes

monochromatti added 3 commits April 8, 2026 07:43

[python] Add optional arrow dependency and installation docs

95d8faa

Signed-off-by: Mattias Matthiesen <mattias.matthiesen@eviny.no>

[python] Add Arrow IPC query API to client and pipeline

8d285cf

Signed-off-by: Mattias Matthiesen <mattias.matthiesen@eviny.no>

[python] Add tests for Arrow IPC query results

541b6b7

Signed-off-by: Mattias Matthiesen <mattias.matthiesen@eviny.no>

monochromatti force-pushed the arrow-ipc-sdk branch from d0a2187 to 541b6b7 Compare April 8, 2026 05:43

monochromatti requested a review from abhizer April 8, 2026 05:45

abhizer approved these changes Apr 8, 2026

View reviewed changes

Conversation

monochromatti commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Breaking Changes?

Describe Incompatible Changes

Summary

What changed

Notes

Uh oh!

mythical-fred left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gz commented Mar 13, 2026

Uh oh!

monochromatti commented Mar 13, 2026

Uh oh!

abhizer left a comment

Choose a reason for hiding this comment

Uh oh!

abhizer commented Mar 13, 2026

Uh oh!

mihaibudiu commented Mar 18, 2026

Uh oh!

abhizer left a comment

Choose a reason for hiding this comment

Uh oh!

monochromatti commented Apr 4, 2026

Uh oh!

mythical-fred left a comment

Choose a reason for hiding this comment

Uh oh!

monochromatti commented Apr 4, 2026

Uh oh!

abhizer commented Apr 4, 2026

Uh oh!

mythical-fred commented Apr 5, 2026

Uh oh!

mythical-fred commented Apr 6, 2026

Uh oh!

abhizer commented Apr 6, 2026

Uh oh!

mythical-fred left a comment

Choose a reason for hiding this comment

Uh oh!

monochromatti commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

abhizer commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

monochromatti commented Mar 13, 2026 •

edited

Loading

monochromatti commented Apr 8, 2026 •

edited

Loading