Support Aws STS AssumeRoleWithWebIdentity #3170

cccs-cat001 · 2025-11-26T14:14:07Z

Here is a preliminary pass at fixing #3038, I've added in an option to have the sts use assumeRoleWithWebIdentity and pass along the user credentials. This will allow the STS to know who is asking for credentials instead of the request coming from some shared service credentials.

We've found that using our on-prem S3 solution that the way Polaris accesses the STS results in connectivity errors, and the recommended way of accessing the STS is with a web identity token (A.K.A. access token), so adding this optional feature will result in Polaris being usable by more S3 appliances than just AWS :)

Checklist

🛡️ Don't disclose security issues! (contact [email protected])
🔗 Clearly explained why the changes are needed, or linked related issues: Fixes #
🧪 Added/updated tests with good coverage, or manually tested (and explained how)
💡 Added comments for complex logic
🧾 Updated CHANGELOG.md (if needed)
📚 Updated documentation in site/content/in-dev/unreleased (if needed)

…gging levels

Co-authored-by: Alexandre Dutra <[email protected]>

…ting/DefaultMetricsReporter.java Co-authored-by: Alexandre Dutra <[email protected]>

…test/PolarisRestCatalogIntegrationBase.java Co-authored-by: Alexandre Dutra <[email protected]>

dimas-b · 2025-11-28T17:32:19Z

@adutra : could you double check quarkus auth integration changes (they LGTM)

dimas-b · 2025-12-02T19:59:38Z

@cccs-cat001 : please resolve conflicts... otherwise CI does not run 🤷

adnanhemani

Based on the response here, I'm afraid I will be a -1 on this PR.

If the user has the ability to communicate with the "STS" service directly and bypass any access controls rules that the Polaris server has set, then Polaris should not be in the business of vending credentials to this client. By creating this approach, we are introducing a security loophole that is completely against Polaris' security modelling today.

If there is a dire need to use such Polaris authentication tokens with an on-prem "AWS-compatible" service using only the user's tokens, then the client should be modified to talk to STS directly to gain credentials without Polaris' involvement. IMO Polaris should not be involved in abetting ANY security loopholes.

Or we can look towards creating a similar security model as we have today whereby using the Polaris service credential or setting Polaris up as an intermediate token broker which the "STS" service trusts, a new credential can be minted for the client to access the storage layer.

(This is not a new problem that we are setting out to solve - look into IAM Roles for Service Accounts (IRSA) used by AWS EKS, which at its core, solves a very similar base problem. Trying to fit a simplistic solution to quick-solve a problem for a system we don't even fully support is a very dangerous precedence to set. Speeding through a solution for functionality is one thing, but to risk security postures while doing that is a whole different situation.)

adutra · 2025-12-03T13:08:02Z

@adutra : could you double check quarkus auth integration changes (they LGTM)

I find the changes in this PR a bit invasive tbh. I would rather explore Quarkus OIDC token propagation techniques, and more specifically, I would try to inject io.quarkus.oidc.client.Tokens wherever necessary:

https://quarkus.io/guides/security-openid-connect-client-reference#inject-tokens

But I don't want to delay this PR that is extremely useful. I think we can go with the approach taken here and then switch to token propagation in a follow-up PR.

adutra · 2025-12-03T13:54:25Z

By creating this approach, we are introducing a security loophole that is completely against Polaris' security modelling today.

I am completely failing to see what security loopholes we would be introducing by leveraging AssumeRoleWithWebIdentity. Propagating the user's access token to AWS STS using AssumeRoleWithWebIdentity is the standard pattern recommended by AWS itself for federated OIDC access.

the client should be modified to talk to STS directly to gain credentials without Polaris' involvement. IMO Polaris should not be involved in abetting ANY security loopholes.

Are you serious about allowing clients to talk to STS directly? THAT, indeed, would be a giant security loophole.

Or we can look towards [...] setting Polaris up as an intermediate token broker which the "STS" service trusts, a new credential can be minted for the client to access the storage layer.

IMHO this is unrealistic, and over-engineered. That would require a form of token exchange and would be extremely hard to implement for little added-value. And again: let's please stop considering Polaris as an OAuth2 token broker. This is legacy behavior.

Trying to fit a simplistic solution to quick-solve a problem for a system we don't even fully support is a very dangerous precedence to set.

Can you clarify what "simplistic solution" you are talking about and what is this "system [that] we don't [...] fully support"?

dimas-b · 2025-12-03T14:24:36Z

I support @adutra 's point about Polaris not acting as a token broker or authorization server. Polaris, in general, should be a resource server in OAuth2 terms.

cccs-cat001 · 2025-12-03T15:08:30Z

I find the changes in this PR a bit invasive tbh. I would rather explore Quarkus OIDC token propagation techniques, and more specifically, I would try to inject io.quarkus.oidc.client.Tokens wherever necessary:

https://quarkus.io/guides/security-openid-connect-client-reference#inject-tokens

But I don't want to delay this PR that is extremely useful. I think we can go with the approach taken here and then switch to token propagation in a follow-up PR.

that looks pretty simple compared to all the changes I'm making tbh. Is everything set up so that I can just @Inject Token token;? 🤔

dimas-b · 2025-12-03T15:12:17Z

Is everything set up so that I can just @Inject Token token;? 🤔

Injecting in runtime/service classes should be ok, but polaris-core should remain CDI-neutral by convention (i.e. no CDI annotations in core).

That is to say, I think current PolarisPrincipal changes will probably have to remain.

adutra · 2025-12-03T16:44:39Z

that looks pretty simple compared to all the changes I'm making tbh. Is everything set up so that I can just @Inject Token token;? 🤔

You may need to add the io.quarkus:quarkus-oidc-client dependency, I don't remember if it's already there or not.

adutra · 2025-12-03T16:46:23Z

That is to say, I think current PolarisPrincipal changes will probably have to remain.

True, at some point we'd need to pass the token as a parameter. @cccs-cat001 please don't rush into using token propagation if you can't find a simple path forward – what you have currently is perfectly acceptable.

tokoko · 2025-12-03T18:28:51Z

To be honest I'm very confused about the implementation in the PR as well. Not sure about other s3-compatible systems, but in the case of minio, (if I'm reading this correctly...) this option will only work if all catalog users have the ability to assume the role that's configured centrally on the catalog level and which also needs to have read/write privileges on all catalog locations. I fail to see how this can be useful to anyone. why would anyone hand out assume role privileges on what's basically a superuser?

If the problem that the PR is trying to solve is simply to support those systems that only support AssumeRoleWithWebIdentity, the more straightforward solution would be to enable the catalog to acquire required web identity instead of acquiring it from a user. The same way that the standard AssumeRole option relies on AWS environment variables to authenticate the call, the new AssumeRoleWithWebIdentity solution should read necessary oauth configs and have an internal background process that obtains and refreshes a token needed to authenticate AssumeRoleWithWebIdentity calls. I'm not sure why we would complicate this with some sort of token pass-through mechanism.

dimas-b · 2025-12-03T18:58:52Z

@tokoko : AFAIK (and @cccs-cat001 may have a better answer) this feature is mostly intended for custom systems. Using the workflow it enables with AWS, for example, may be possible, yet whether it is the best approach with AWS is subject to specific user demands. I tend to agree that this may not be the best approach for MinIO or AWS S3.

Nonetheless, I do not see why Polaris as an OSS project should not open this use case for deployments what may benefit from it. The feature is well isolated under a catalog property, there is no impact to other use cases.

For reference, here's the related dev ML thread: https://lists.apache.org/thread/tm76ntbgdqt31r6402dro8vb7m4pdzzq

adnanhemani · 2025-12-04T00:04:12Z

Propagating the user's access token to AWS STS using AssumeRoleWithWebIdentity is the standard pattern recommended by AWS itself for federated OIDC access.

This is only the standard pattern recommended by AWS for federated OIDC access - if you do not have a governance system in between. If you feel that's wrong, please feel free to drop links to the AWS documentation here that support your claim.

Are you serious about allowing clients to talk to STS directly? THAT, indeed, would be a giant security loophole.

(Also mentioned in Laurent's email to the Dev ML regarding this approach) Isn't this possible today with how the code is written? If it isn't, what stops it? Is it a network policy? And as a result, are we betting the whole of Polaris' security posture on such a network policy? Is that truly wise?

Let's compare this to the AssumeRole path we have today. In our best practices, are IAM roles allowed to be assumed by the clients directly? That is clearly not the model that Polaris has today - so why are we breaking that security model for this feature?

Nonetheless, I do not see why Polaris as an OSS project should not open this use case for deployments what may benefit from it. The feature is well isolated under a catalog property, there is no impact to other use cases.

Not sure I agree, we are enabling a use case here that does not align with our current security best practices. And to be clear, this is not as a result of there not being a better way of doing this, but rather because this is the fastest way of achieving this goal. Trampling security best practices for velocity is not something that we have a track record of within this project.

I agree that this use case is a valid one - and that we should be supporting it. But I highly disagree in the way that this implementation does so. Rather than bending the security model that we have today (which generally can be stated like: Polaris user who is unprivileged with regards to storage credentials, authenticates and authorizes with Polaris in order to gain storage credentials) to fit this use case, we should be tailoring the implementation of this use case to fit the security model.

cccs-cat001 · 2025-12-04T18:20:16Z

Marking as draft as tests are failing for unknown reasons, they'll potentially be fixed with #3203, so we'll wait for that.

cccs-cat001 · 2025-12-08T14:18:33Z

Closing this PR in favour of #3224 and #3236

cccs-cat001 and others added 30 commits October 24, 2025 08:16

Added interface for reporting metrics

8940129

Added metrics reporting config to application.properties

0acbb64

PR Fixes

95e0566

fixes for the PR fixes

2a78172

reworked iceberg metrics logging to be one class with configurable lo…

c9097f7

…gging levels

simplified report logging

ef22353

run precommit

5d0b0d5

fix tests for metrics addition

a5e9d37

Update runtime/defaults/src/main/resources/application.properties

d39abc4

Co-authored-by: Alexandre Dutra <[email protected]>

Update runtime/service/src/main/java/org/apache/polaris/service/repor…

0215cac

…ting/DefaultMetricsReporter.java Co-authored-by: Alexandre Dutra <[email protected]>

Update runtime/service/src/main/java/org/apache/polaris/service/repor…

44b039b

…ting/DefaultMetricsReporter.java Co-authored-by: Alexandre Dutra <[email protected]>

Update integration-tests/src/main/java/org/apache/polaris/service/it/…

1849ab0

…test/PolarisRestCatalogIntegrationBase.java Co-authored-by: Alexandre Dutra <[email protected]>

pre-commit + testing changes

e6677d3

Merge branch 'apache:main' into main

236f65b

added license to new files

ff0c342

Merge branch 'apache:main' into main

c41a23e

Added info to changelog

b5362fb

reworded properties

8498ffb

removed reportMetrics from icebergCatalogHandler

cfe0851

cleanup tests

9b9552e

changed warehouse to catalogName

380de92

removed scope

f3eaf23

added back scope

dcacc5c

Merge branch 'apache:main' into main

ebee75c

fixed issue where value cannot be compared with nil

e4618ca

customized build scripts

0260b44

made the regex more generic

fa7b282

cleanup aisle 2

c340ab3

updated ROLE_ARN_PATTERN comment

bd2c3f8

removed test-case that would not be considered valid

79fd424

dimas-b requested a review from adutra November 28, 2025 17:31

cccs-cat001 requested a review from dimas-b November 28, 2025 18:39

cccs-cat001 added 3 commits December 2, 2025 07:31

removed build.gradle changes (again)

13f935a

removed cli change

f3e1e85

set config same as stsUnavailable

a68225c

adnanhemani suggested changes Dec 2, 2025

View reviewed changes

Merge branch 'main' into s3

ea6afe5

dimas-b mentioned this pull request Dec 3, 2025

Pass principal name as part of aws subscoped credentials session name #3196

Open

cccs-cat001 added 2 commits December 3, 2025 14:27

Passing PolarisPrincipal's token down the rabbit hole

f3763e6

Merge branch 's3' of github.com:cccs-cat001/polaris into s3

7ef2de0

Fixing tests

d1fdb38

cccs-cat001 marked this pull request as draft December 4, 2025 18:17

dimas-b mentioned this pull request Dec 5, 2025

[FEATURE REQUEST] - Support S3 STS AssumeRoleWithWebIdentity for on‑prem S3 providers #3038

Open

cccs-cat001 closed this Dec 8, 2025

github-project-automation bot moved this from PRs In Progress to Done in Basic Kanban Board Dec 8, 2025

adutra mentioned this pull request Dec 8, 2025

Added user token to the PolarisPrincipal #3236

Open

6 tasks

Support Aws STS AssumeRoleWithWebIdentity #3170

Support Aws STS AssumeRoleWithWebIdentity #3170

Uh oh!

Conversation

cccs-cat001 commented Nov 26, 2025

Checklist

Uh oh!

dimas-b commented Nov 28, 2025

Uh oh!

dimas-b commented Dec 2, 2025

Uh oh!

adnanhemani left a comment

Choose a reason for hiding this comment

Uh oh!

adutra commented Dec 3, 2025

Uh oh!

adutra commented Dec 3, 2025

Uh oh!

dimas-b commented Dec 3, 2025

Uh oh!

cccs-cat001 commented Dec 3, 2025

Uh oh!

dimas-b commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adutra commented Dec 3, 2025

Uh oh!

adutra commented Dec 3, 2025

Uh oh!

tokoko commented Dec 3, 2025

Uh oh!

dimas-b commented Dec 3, 2025

Uh oh!

adnanhemani commented Dec 4, 2025

Uh oh!

cccs-cat001 commented Dec 4, 2025

Uh oh!

cccs-cat001 commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

dimas-b commented Dec 3, 2025 •

edited

Loading