Describe the bug
As @blaginin found in #14685, the statistics are incorrect when a file is projected (that is, when only a subset of its columns is present).
Specifically, the projected statistics report the same total_byte_size as the input. Because only a subset of the columns is selected, total_byte_size should actually be lower.
To Reproduce
See tests referenced in #14685
Expected behavior
total_byte_size should take into account the subset of columns that is projected.
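
To illustrate the expected behavior, here is a minimal sketch of one possible way to scale a byte-size estimate down to the projected columns. Everything in it is an assumption made for illustration: the `Estimate` enum, the `project_total_byte_size` function, and the per-column `column_widths` input are hypothetical stand-ins, not DataFusion's actual types or API, and proportional scaling by column width is only one heuristic among several.

```rust
/// Hypothetical, simplified stand-in for a statistics value.
/// This is NOT DataFusion's actual type.
#[allow(dead_code)]
#[derive(Debug, Clone, Copy, PartialEq)]
enum Estimate {
    Exact(usize),
    Inexact(usize),
    Absent,
}

/// Scale a file-level byte size down to the projected columns.
///
/// `column_widths[i]` is an assumed per-row width in bytes of column `i`
/// (a hypothetical input), and `projection` lists the indices of the
/// columns that are kept.
fn project_total_byte_size(
    total: Estimate,
    column_widths: &[usize],
    projection: &[usize],
) -> Estimate {
    // Without an input size there is nothing to scale.
    let bytes = match total {
        Estimate::Exact(b) | Estimate::Inexact(b) => b,
        Estimate::Absent => return Estimate::Absent,
    };

    let all: usize = column_widths.iter().sum();
    if all == 0 {
        // No width information, so no meaningful estimate can be made.
        return Estimate::Absent;
    }

    // Sum the widths of the projected columns, ignoring out-of-range indices.
    let kept: usize = projection
        .iter()
        .filter_map(|&i| column_widths.get(i))
        .sum();

    // Widen to u128 so the multiplication cannot overflow.
    let scaled = (bytes as u128 * kept as u128 / all as u128) as usize;

    // The scaled value is a heuristic, so it is always reported as inexact.
    Estimate::Inexact(scaled)
}
```

With assumed widths of 8, 4, and 12 bytes and a projection onto the first two columns, an input of 1200 bytes would be reported as an inexact 600 bytes rather than the unchanged 1200.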
Additional context
No response