fix(histogram): add NULL handling for histogram #35693

janani-gurram · 2025-10-17T00:18:55Z

SUMMARY

Fixes the histogram chart so that the chart renders even when the x-axis variable contains NULL values.
Adds unit tests to verify the behavior when:
- The target column has all nulls (with and without grouping)
- The target column has some nulls (with and without grouping)

This issue is fixed by removing empty values from the input DataFrame before rendering. In cases where dropping these values results in an empty DataFrame, we now safely return the empty DataFrame instead of failing.
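For readers who want to see the shape of the fix, here is a minimal sketch of the approach described above. It is not the code in `superset/utils/pandas_postprocessing/histogram.py`: the function name `histogram_counts`, its `bins` parameter, and the bin-label format are assumptions made purely for illustration.

```python
import numpy as np
import pandas as pd


def histogram_counts(df: pd.DataFrame, column: str, bins: int = 5) -> pd.DataFrame:
    """Hypothetical stand-in for the post-processing step (not Superset's code)."""
    # Drop rows whose target column is NULL; this returns a new DataFrame.
    cleaned = df.dropna(subset=[column])
    # If nothing is left after dropping, return the empty frame instead of binning.
    if cleaned.empty:
        return cleaned
    counts, edges = np.histogram(cleaned[column], bins=bins)
    labels = [f"{edges[i]:g} - {edges[i + 1]:g}" for i in range(bins)]
    return pd.DataFrame([counts], columns=labels)


some_nulls = pd.DataFrame({"x": [1.0, None, 3.0, None, 5.0]})
all_nulls = pd.DataFrame({"x": [None, None]})

result_some = histogram_counts(some_nulls, "x", bins=2)
result_all = histogram_counts(all_nulls, "x", bins=2)
```

In this sketch the three non-NULL values are binned and the two NULL rows are ignored, while the all-NULL input comes back as an empty DataFrame rather than raising.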

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

Before
The chart failed to render when the selected column contained NULLs.

After
The chart now renders correctly, ignoring NULL values.

TESTING INSTRUCTIONS

Open any dataset and create a histogram chart.
Select a column that contains NULL values.
Verify that the chart renders correctly, excluding NULL entries.

ADDITIONAL INFORMATION

Has associated issue: Fixes Histogram fails to load when null values present in x-axis variable (regression from 4.1.2, occured in 5.0.0rc3, present in 6.0.0rc2) #33738
Required feature flags:
Changes UI
Includes DB Migration (follow approval process in SIP-59)
- Migration is atomic, supports rollback & is backwards-compatible
- Confirm DB migration upgrade and downgrade tested
- Runtime estimates and downtime expectations provided
Introduces new feature or API
Removes existing feature or API

korbit-ai

Review by Korbit AI

Korbit automatically attempts to detect when you fix issues in new commits.

Category	Issue	Status
	Inefficient DataFrame copy for null filtering	✅ Fix detected

Files scanned

File Path	Reviewed
superset/utils/pandas_postprocessing/histogram.py	✅

superset/utils/pandas_postprocessing/histogram.py

korbit-ai

Review by Korbit AI

Category	Issue	Status
	Input DataFrame modified in-place

Files scanned

File Path	Reviewed
superset/utils/pandas_postprocessing/histogram.py	✅

korbit-ai · 2025-10-17T15:53:15Z

superset/utils/pandas_postprocessing/histogram.py

+    df.dropna(subset=[column], inplace=True)
+    if df.empty:
+        return df


Input DataFrame modified in-place

What is the issue?

The function modifies the input DataFrame in-place by dropping rows with NULLs, which violates the principle of not mutating input parameters and can cause unexpected side effects for the caller.

Why this matters

Callers may not expect their original DataFrame to be modified, leading to data loss in the calling code and potential bugs in downstream processing that depends on the original DataFrame structure.

Suggested change ∙ Feature Preview

Create a copy of the DataFrame before dropping NULLs:

    # drop empty values from the target column
    df = df.dropna(subset=[column])
    if df.empty:
        return df
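To make the side effect concrete, the following sketch contrasts the two styles. The helper names `drop_in_place` and `drop_with_copy` are hypothetical and exist only for this illustration; they are not part of Superset.

```python
import pandas as pd


def drop_in_place(df: pd.DataFrame, column: str) -> pd.DataFrame:
    # Mutates the DataFrame object the caller passed in.
    df.dropna(subset=[column], inplace=True)
    return df


def drop_with_copy(df: pd.DataFrame, column: str) -> pd.DataFrame:
    # Rebinds the local name to a new DataFrame; the caller's object is untouched.
    df = df.dropna(subset=[column])
    return df


original = pd.DataFrame({"x": [1.0, None, 3.0]})

drop_with_copy(original, "x")
rows_after_copy = len(original)      # still 3: the caller's frame is unchanged

drop_in_place(original, "x")
rows_after_in_place = len(original)  # now 2: the NULL row is gone for the caller too
```

The trade-off is that the non-mutating form allocates a new DataFrame, which is the cost the earlier "inefficient copy" finding was concerned with.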

codecov · 2025-10-17T17:35:06Z

Codecov Report

❌ Patch coverage is 0% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 71.92%. Comparing base (fb8fca4) to head (55b453f).
⚠️ Report is 18 commits behind head on master.

Files with missing lines	Patch %	Lines
superset/utils/pandas_postprocessing/histogram.py	0.00%	3 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           master   #35693       +/-   ##
===========================================
+ Coverage        0   71.92%   +71.92%     
===========================================
  Files           0      589      +589     
  Lines           0    43638    +43638     
  Branches        0     4726     +4726     
===========================================
+ Hits            0    31388    +31388     
- Misses          0    11006    +11006     
- Partials        0     1244     +1244

Flag	Coverage Δ
hive	`46.27% <0.00%> (?)`
mysql	`70.96% <0.00%> (?)`
postgres	`71.01% <0.00%> (?)`
presto	`49.97% <0.00%> (?)`
python	`71.89% <0.00%> (?)`
sqlite	`70.61% <0.00%> (?)`
unit	`100.00% <ø> (?)`

github-actions · 2025-10-17T18:05:10Z

🎪 Showtime is building environment on GHA for 55b453f

github-actions · 2025-10-17T18:18:48Z

🎪 Showtime deployed environment on GHA for 55b453f

• Environment: http://52.37.95.52:8080 (admin/admin)
• Lifetime: 48h auto-cleanup
• Updates: New commits create fresh environments automatically

rusackas · 2025-10-20T17:23:13Z

Are there cases where we should use zero-imputation for these values (counting them as zero)? If selecting zero imputation in Advanced Analytics solves that use case, we may be good.

rusackas · 2025-10-20T17:26:19Z

This viz doesn't have the Advanced Analytics feature; it seems that would be worth adding here, since it provides zero-imputation. Advanced Analytics doesn't have an option to strip out null values, however. It's probably better to use Advanced Analytics to "fix" null values OR use a Filter to remove null values right from the control panel. Stripping them out in post-processing doesn't give users the chance to set them to 0, and they won't even know there ARE null values, which seems dangerous.
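As a rough illustration of why the choice matters, the sketch below bins the same made-up values twice: once with NULLs dropped and once with NULLs imputed as zero. The sample data, bin count, and range are assumptions for the example, and this is plain pandas/NumPy rather than Superset's Advanced Analytics code.

```python
import numpy as np
import pandas as pd

# Made-up sample with two NULLs.
values = pd.Series([1.0, None, 2.0, None, 9.0])

# Option A: drop NULLs before binning (what this PR does in post-processing).
dropped = values.dropna()

# Option B: zero-imputation (count each NULL as 0).
imputed = values.fillna(0)

# Same three bins over [0, 9] for both, so the counts are comparable.
counts_dropped, _ = np.histogram(dropped, bins=3, range=(0, 9))
counts_imputed, _ = np.histogram(imputed, bins=3, range=(0, 9))

# Surfacing the NULL count is one way to keep users aware that rows were excluded.
null_count = int(values.isna().sum())
```

With this data the first bin holds 2 values when NULLs are dropped and 4 when they are imputed as zero, so the two strategies produce visibly different charts.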

panrach and others added 6 commits October 16, 2025 14:55

fix(histogram): add NULL handling for histogram (apache#33738)

2f23078

fix(histogram): remove unnecessary check

ccef6c9

move dropna so the non numeric check is not redundant

50cfe80

fix(histogram): add NULL handling for histogram (apache#33738)

49d7165

fix(histogram): add handling for empty df

8e93aca

test(backend): add histogram tests

7e37e67

pull-request-size bot added the size/M label Oct 17, 2025

janani-gurram marked this pull request as draft October 17, 2025 00:19

janani-gurram changed the title ~~Fix/handle nulls in hist~~ fix(histogram): add NULL handling for histogram Oct 17, 2025

korbit-ai bot suggested changes Oct 17, 2025

superset/utils/pandas_postprocessing/histogram.py (outdated, resolved)

dosubot bot added the viz:charts:histogram label (Related to the Histogram chart) Oct 17, 2025

fix(histogram): prevent creating new df

55b453f

korbit-ai bot approved these changes Oct 17, 2025

janani-gurram marked this pull request as ready for review October 17, 2025 15:51

korbit-ai bot suggested changes Oct 17, 2025

janani-gurram mentioned this pull request Oct 17, 2025

Histogram fails to load when null values present in x-axis variable (regression from 4.1.2, occured in 5.0.0rc3, present in 6.0.0rc2) #33738

Open

3 tasks

sfirke added the 🎪 ⚡ showtime-trigger-start label (Create new ephemeral environment for this PR) Oct 17, 2025

github-actions bot added the 🎪 55b453f 🚦 running (Environment 55b453f status: running) and 🎪 55b453f 🌐 52.37.95.52:8080 (Environment 55b453f URL: http://52.37.95.52:8080) labels and removed the 🎪 🎯 55b453f label (Active environment pointer - 55b453f is receiving traffic) Oct 17, 2025
