Skip to content

Statistics::total_byte_size does not account for projection in FileScanConfig::with_projection #14936

Open
@alamb

Description

@alamb

Describe the bug

As @blaginin found in #14685, the statistics when a File is projected (aka only a subset of the columns are present) is incorrect

Specifically, the projected statistics have the same total_byte_size as the input. However, given only a subset of columns are selected this will mean that the total_byte_size should actually be lower

To Reproduce

See tests referenced in #14685

Expected behavior

total_byte_size should take into account the subset of columns

Additional context

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions