Conversation

@ianton-ru
Changelog category (leave one):

  • Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

More metrics for Iceberg, S3 and Azure

Documentation entry for user-facing changes

New Iceberg profile metrics:

  • IcebergAvroFileParsing - counter of parsed Avro metadata files
  • IcebergAvroFileParsingMicroseconds - time spent parsing Avro metadata files
  • IcebergJsonFileParsing - counter of parsed JSON metadata files
  • IcebergJsonFileParsingMicroseconds - time spent parsing JSON metadata files

New S3 profile metrics:

  • S3ListObjectsMicroseconds - time spent on ListObjects requests
  • S3HeadObjectMicroseconds - time spent on HeadObject requests

New Azure profile metric:

  • AzureListObjectsMicroseconds - time spent on ListObjects requests

Small optimization: previously, dumpMetadataObjectToString was called unconditionally, including the case when insertRowToLogTable dumps nothing and exits early after the iceberg_metadata_log_level setting check. Now serialization happens only when required.

CI/CD Options

Exclude tests:

  • Fast test
  • Integration Tests
  • Stateless tests
  • Stateful tests
  • Performance tests
  • All with ASAN
  • All with TSAN
  • All with MSAN
  • All with UBSAN
  • All with Coverage
  • All with Aarch64
  • All Regression
  • Disable CI Cache

Regression jobs to run:

  • Fast suites (mostly <1h)
  • Aggregate Functions (2h)
  • Alter (1.5h)
  • Benchmark (30m)
  • ClickHouse Keeper (1h)
  • Iceberg (2h)
  • LDAP (1h)
  • Parquet (1.5h)
  • RBAC (1.5h)
  • SSL Server (1h)
  • S3 (2h)
  • Tiered Storage (2h)

@github-actions

github-actions bot commented Nov 4, 2025

Workflow [PR], commit [b746e01]

ProfileEvents::increment(ProfileEvents::AzureListObjects);
if (client->IsClientForDisk())
ProfileEvents::increment(ProfileEvents::DiskAzureListObjects);
ProfileEventTimeIncrement<Microseconds> watch(ProfileEvents::AzureListObjectsMicroseconds);
Collaborator


Perhaps putting it in a scope would be more accurate and safer (e.g., future changes to this method that introduce slow operations could otherwise lead to wrong values)?

Example:

ListBlobsPagedResponse blob_list_response;

{
    ProfileEventTimeIncrement<Microseconds> watch(ProfileEvents::AzureListObjectsMicroseconds);
    blob_list_response = client->ListBlobs(options);
}

The same comment applies to the other list object operations.

In any case, I see that it is already implemented elsewhere without this "protection", so it is not a must to implement it. 👍

Author


You are right. I want to measure the time spent on S3/Azure communication, not on parsing the response.

return removeEscapedSlashes(oss.str());
}

void insertRowToLogTable(
Collaborator


Do you still need the "old" overload? If so, add a comment explaining the difference between the two overloads.

And btw, is dumpMetadataObjectToString that expensive that you need to optimize it in a few code paths?

Author


In other places, insertRowToLogTable is called with strings from other sources, like https://github.com/Altinity/ClickHouse/blob/antalya-25.8/src/Storages/ObjectStorage/DataLakes/Iceberg/AvroForIcebergDeserializer.cpp#L119.
It is also a JSON string, but one obtained through a fairly complex procedure.

In my test on my desktop with ~7k Parquet files, metadata.json is about 5 MB.
The query select count() from iceberg.table does not read data; it gets the size from metadata only.
Speed changed from 3.5 seconds to 0.3 seconds.
JSON parsing is fast, but serialization back to a string isn't.

Collaborator


Ok. What if you unified the interface?

void insertRowToLogTable(
...
const std::function<std::string()> & metadata_string_resolver)
{
...
Context::getGlobalContextInstance()->getIcebergMetadataLog()->add(
     ...
    .metadata_content = metadata_string_resolver());
}

The above would potentially allow a single method, but I am not sure it is a good idea. Adding a comment is enough, I guess.

Author


But getting the JSON after the condition check makes sense in all cases. Changed it everywhere.

arthurpassos
arthurpassos previously approved these changes Nov 5, 2025
Collaborator

@arthurpassos arthurpassos left a comment


Looking good 👍


void insertRowToLogTable(
const ContextPtr & local_context,
String row,
Member

@Enmk Enmk Nov 7, 2025


That needs a comment that explains WHY we need not a value but a function here... Do I get it right that the major reason is not to skew the time measurements by getting the metadata content?

Author


To build the row string only when actually required. Serialization from the JSON structure to a string can take a lot of time; in my test, select count() from iceberg.table went from 3.5 seconds to 0.2 with this optimization alone. Several thousand Parquet files, metadata.json about 5 MB.

Member

@Enmk Enmk left a comment


Minor changes required

Refactor insertRowToLogTable to use a get_row function for lazy evaluation and improve the exit logic.
