feat(gooddata-sdk): handle datetime granularity and support bytes limits in arrow fetch#1563
Conversation
Codecov Report

❌ Patch coverage is

```
@@            Coverage Diff             @@
##           master    #1563       +/-   ##
===========================================
+ Coverage    0.00%   78.71%   +78.71%
===========================================
  Files         158      230       +72
  Lines       11048    15449     +4401
===========================================
+ Hits            0    12161    +12161
+ Misses      11048     3288     -7760
```
Force-pushed from 26b9953 to 660ac1a.
```python
total_ref_cols = [f.name for f in table.schema if f.name.startswith(_COL_TOTAL_REF_PREFIX)]
if total_ref_cols:
    if len(total_ref_cols) > 1:
        logger.warning(
            "Arrow table has %d __total_ref* columns; only %r is used for aggregation names.",
            len(total_ref_cols),
            total_ref_cols[0],
        )
```
I wonder, did this actually happen? Multiple total ref columns mean something is very hosed on the backend...
Being defensive is fine, no problem here, but if you ever run into this warning, perhaps there's a more serious problem in the xtab impl?
No, it hasn't happened yet; it's just a defensive "what if" warning. As you say, if it happens in the future, we should react to it. The original implementation silently picked the first column with the prefix.
```python
# Read the full HTTP response body before releasing the connection.
# pyarrow's IPC stream reader stops at the Arrow EOS marker and may leave
# the HTTP chunked-encoding terminator unread, which puts the connection in
# Request-sent state and breaks HTTP keep-alive reuse.
data = response.read()
```
I'm not 100% sure about this; it's worth investigating the implications for memory usage & allocations.
Before: Arrow reads the stream and incrementally builds the table chunk by chunk.
Now: most likely, the data for all chunks is read first and only then is the table built. This may (I'm not 100% certain, though) lead to bigger spikes in RSS.
IMHO it's worth investigating / measuring. It could be that there is no impact due to some technicalities in the underlying Arrow code; it could be that the load now needs ~2x more memory.
I did some benchmarks; there was a bit of a memory usage increase, but it wasn't that brutal (I can run the tests again to get the exact numbers). In any case, from my other measurements, the Arrow path has lower memory consumption than the original AFM, so it's kind of a trade-off.
However, I did run into the broken-connection issue, and this was the fix for it.
OK. I wonder, though, isn't there a subtler way to address it?
Let's say: try to read the stream as before (passing the response to Arrow), the happy path with the ideal memory footprint. Then, if an error happens (except), do response.read() to consume everything that remains, and finally close.
Or does Arrow take ownership of the stream and close it 'incorrectly' before reading everything, or something like that, so the exception handler cannot do the cleanup anymore?
- `AttributeConverterStore` for fetching data using Arrow.
- `max_bytes` parameter to Arrow table fetch.
- risk: low