-
Notifications
You must be signed in to change notification settings - Fork 32
Improved stats inspection #101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
5f53ea9 to
2721808
Compare
2721808 to
af49018
Compare
1fc60b1 to
44f4190
Compare
90593f3 to
f373345
Compare
44f4190 to
92a3ef0
Compare
f373345 to
47d17cb
Compare
| > { | ||
| let uri = parse_uri(&uri); | ||
|
|
||
| ensure_access_privilege_to_uri(&uri, true); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
random thought: wonder whether we should have a separate role for public URLs
(probably not needed for now)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
still a network connection. might think on it though
src/parquet_udfs/stats.rs
Outdated
| .get_basic_info() | ||
| .has_id() | ||
| { | ||
| continue; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does not having a filed ID imply not having stats?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good point
22d6a6b to
cfb3d2c
Compare
cfb3d2c to
1d1d248
Compare
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #101 +/- ##
==========================================
+ Coverage 91.97% 92.43% +0.46%
==========================================
Files 84 85 +1
Lines 10651 11288 +637
==========================================
+ Hits 9796 10434 +638
+ Misses 855 854 -1 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
src/parquet_udfs/stats.rs
Outdated
| }) if !is_adjusted_to_u_t_c | ||
| ); | ||
|
|
||
| let is_timetz = matches!( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
doing all these checks upfront seems a bit strange, should they perhaps move into the match below?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done, still prefer matches to match {}
97c2b26 to
28b8448
Compare
Previously, we printed column statistics for each column per row group via `parquet.metadata(uri)`. With the new udf `parquet.column_stats(uri)`, we print stats for each column aggregated by row groups. Stats for some of the types were printed in a text format that cannot be converted to actual Postgres type. This PR also makes sure the output format is convertible to the actual Postgres type.
28b8448 to
648d129
Compare
Previously, we printed column statistics for each column per row group via
parquet.metadata(uri). With the new udfparquet.column_stats(uri), we print the column stats for each column aggregated by row groups.Stats for some of the types were printed in a text format that cannot be converted to actual Postgres type. This PR also makes sure the output format is convertible to the actual Postgres type.