feat: implement RFC 8628 #3851

nsklikas · 2024-09-30T06:56:04Z

Implements the Device Authorization Grant to enable authentication for headless machines (see https://datatracker.ietf.org/doc/html/rfc8628)

Related issue(s)

Implements RFC 8628.

This PR is based on the work done on #3252, by @supercairos and @BuzzBumbleBee. That PR was based on an older version of Hydra and was missing some features/tests.

We have prepared a spec, that describes our design and implementation. We have tried to mimic the existing logic in Hydra and not make changes that would disrupt the existing workflows

Checklist

I have read the contributing guidelines.
I have referenced an issue containing the design document if my change
introduces a new feature.
I am following the
contributing code guidelines.
I have read the security policy.
I confirm that this pull request does not address a security
vulnerability. If this pull request addresses a security vulnerability, I
confirm that I got the approval (please contact
[email protected]) from the maintainers to push
the changes.
I have added tests that prove my fix is effective or that my feature
works.
I have added or changed the documentation.

Further Comments

Notes:

The current implementation has been manually tested only for memory and postgres databases. The tests pass all of them.
Fosite is installed from our fork to ease testing. Once the relevant PR in fosite is merged, we will update go.mod.

Testing

To test this you need to built the hydra image:

make docker

This will create an image with the name: oryd/hydra:latest-sqlite

To run the flow you can use our UI, from https://github.com/canonical/identity-platform-login-ui/tree/hydra-device-test:

git clone [email protected]:canonical/identity-platform-login-ui.git -b hydra-device-test
cd identity-platform-login-ui/
# The image name is hard-coded in the docker-compose file
docker compose up --remove-orphans --force-recreate -d

Create a client for Hydra:

docker exec -it identity-platform-login-ui-hydra-1 hydra create client   --endpoint http://localhost:4445   --grant-type authorization_code,refresh_token,urn:ietf:params:oauth:grant-type:device_code --scope openid,offline_access,email,profile --token-endpoint-auth-method client_secret_post

Use that client to perform the device flow:

docker exec -it identity-platform-login-ui-hydra-1 hydra perform device-code --client-id <client-id> --client-secret <client-secret> -e http://localhost:4444 --scope openid,offline_access,email,profile

The user for logging in is:

username: [email protected]
password: test

CLAassistant · 2024-09-30T06:56:11Z

All committers have signed the CLA.

supercairos · 2024-10-16T15:00:51Z

You kept the user_code & device_code in separate tables ?
I thought It could be merge with the flow table but might be tricky to do (lots of SQL constrains to manage)

Otherwise, it's great work! Would love to see this land into Hydra as IMO it's a much needed feature :)

zepatrik · 2024-10-30T16:15:18Z

One thing I am struggling to understand is the device_challenge parameter. On one side, I cannot find any mention of that in the RFC, on the other side I don't see how it would be available in all variants of the device flow. Sure, when using the verification_uri_complete from the device authorization response, it is easy to set additional query parameters. However, the device auth flow also has to work when only the user code is entered into a generic website, like on https://youtube.com/tv/activate

I just noticed the design doc you linked, should have taken a look there first.

zepatrik · 2024-10-31T08:34:34Z

OK, I think I now cleared up the confusion I had about the device_challenge.
IMO it is not really necessary because it would also work to send the user straight to the UI implementation, and let the /admin/oauth2/auth/requests/device/accept create the flow. The relevant information at this point is always in the user-code, and only the UI provides that. However, I see how it fits better into the existing code and architecture, so I do like the proposal 👍

nsklikas · 2024-11-01T10:18:18Z

Apparently the tests are broken because of https://github.com/ory/fosite/pull/827/files#diff-b92270a81f4021a9cdf52dfcfaeac9b66254471b85fd5ef4101acdbad02e4296R161, not sure if it's a bug or if the hydra tests need to be updated. @zepatrik any tip on how to fix this?

zepatrik · 2024-11-05T08:57:27Z

@nsklikas I have fixed the upstream fosite issue.

I also thought about the overall flow a bit more, and I think this flow is a bit better wrt database strain (writes and storage). Also it is less complex from the bigger picture, but I agree that it might be more complex to implement. Did you consider something similar, and what are your thoughts? I would especially prefer the user and device codes to be in one table, so we can rely on the database to ensure the link between the two.
The big difference is that we would create the flow already when the device initializes everything, and then reuse it for the user as soon as we have the user code.

Also happy to discus synchronously.

Instead of updating the device session, we were over-writing it causing existing session info that were created from fosite to be lost.

nsklikas · 2024-11-06T14:26:49Z

We considered merging the 2 tables (I think it was also discussed in the older PR). Merging the 2 tables would complicate the schema (you would need to have 2 expiration periods, 2 active fields, more indexes, etc) and we would have to create new logic to handle calls to this table (now we re-use the logic that is used for all the other tokens), of course that is not a blocker but would require a decently sized refactor of this PR. We would also have to merge the 2 fosite APIs (DeviceCodeStorage and UserCodeStorage), again not a big blocker. AFAICT by merging the 2 tables we would be making 1 less read to the database (we wouldn't need to fetch the device_code in performOAuth2DeviceVerificationFlow) and one less write (we could invalidate the user_code and mark the device_code as ready to be used in a single query). The main drawback with the current approach is that the 2 tables (user_code and device_code) are not directly merged, instead we use the requestID to connect them. The reason we decided not to go that route is that we thought that the performance benefit does not out-weight having a uniform experience with the existing fosite APIs and hydra database, but I can see the value of changing this.

About persisting the flow table to the database at the beginning of the flow, we would be doing one less redirect (we could send the user directly to the login UI, they wouldn't need to go through Hydra). But we would have to:

persist the flow to the database every time a device starts a flow (1 write)
fetch the flow every time a user_code is accepted (we wouldn't update it on the db, because we want to allow the user to to restart the flow)
update the database constraints to handle the device flow status
This was something we considered, but decided that we wouldn't be gaining much for these extra call/changes. What is the reason you think we should change this?

I would rather we keep the design as it is, because we think that these changes wouldn't improve the current flow much (I understand that, depending on the load, merging the 2 tables can offer considerable improved performance) and it would complicate the implementation. Ideally I would rather we do not change the design of the current PR (unless you think that there is something wrong with it), to get something going and to avoid getting lost on the many changes that it introduces. We can always iterate on it on subsequent PRs, BUT I understand that if we want to change the database schema (by merging the tables), it would be best to do it as early as possible to avoid having to create a migration plan.

nsklikas · 2024-11-06T14:40:42Z

About persisting the flow table to the database at the beginning of the flow, we would be doing one less redirect (we could send the user directly to the login UI, they wouldn't need to go through Hydra). But we would have to:
1. persist the flow to the database every time a device starts a flow (1 write)

2. fetch the flow every time a user_code is accepted (we wouldn't update it on the db, because we want to allow the user to to restart the flow)

3. update the database constraints to handle the device flow status
   This was something we considered, but decided that we wouldn't be gaining much for these extra call/changes. What is the reason you think we should change this?

I now realize that we wouldn't be avoiding the first redirect, as we want to setup csrf protection. I don't think I see the value of making this change (referring only to writing the flow in the database when creating the user code).

zepatrik · 2024-11-06T15:31:36Z

Thanks for revisiting, I was just adding some follow-up clarifications.

TL;DR we would like to do the refactor to have one-table for the codes, everything else looks good.

We had even more discussions also with @alnr and basically came to these conclusions:

We need to write the device & user codes into a table when creating them, mainly to avoid collisions. Ideally this would be one table (further device_auth_codes) that is only used for this purpose. The table should have the PK (nid, device_code_sig) and a secondary unique index (nid, user_code_sig). This table will be polled by the device using the device code.
The flow should not be persisted in the beginning by the device, but used only by the user browser and persisted after successful completion.
Once the user code is used, we should mark it as such and release it by setting user_code_sig=null. We can then include the device code signature in the flow. The main reason here is that we can make sure the code is only used once.
However, this is not a strong opinion and just what we thought would be the better option. Making the code reusable for error cases would probably reduce some friction in the UX. Open to discuss.
There seems to be no value in adding the device_verifier. We propose to remove that. The existing CSRF token should be sufficient to ensure that a flow was completed in the same browser it was started. The reason here is so that we can persist the flow state. Makes sense now.
The "accept user code API" should be an admin API (as it is now), for flexibility and consistency reasons.

Overall, the refactor to use only one table should be worth it right away. We can also help out with the refactor if necessary.

nsklikas requested review from aeneasr, hperl and alnr as code owners September 30, 2024 06:56

nsklikas changed the title ~~Implement RFC 8628~~ feat: Implement RFC 8628 Sep 30, 2024

nsklikas changed the title ~~feat: Implement RFC 8628~~ feat: implement RFC 8628 Sep 30, 2024

nsklikas mentioned this pull request Sep 30, 2024

Implement RFC 8628 ory/fosite#826

Open

6 tasks

bateller approved these changes Oct 16, 2024

View reviewed changes

nsklikas force-pushed the canonical-master branch 2 times, most recently from 87c6315 to 349743e Compare October 18, 2024 13:42

nsklikas force-pushed the canonical-master branch from 349743e to e0b066f Compare October 25, 2024 12:03

nsklikas force-pushed the canonical-master branch 3 times, most recently from 20555bd to 140f75e Compare November 1, 2024 09:44

nsklikas added 11 commits November 6, 2024 16:21

chore: install fosite from branch (remove)

07b5313

fix: set utc expires_at

f038745

fix: add redirect_uri to test

366fe01

chore: update go.mod

3211246

fix: add rfc8628 providers to registry

b9ccf06

fix: update database schema

467d9c3

fix: update oauth persister logic

8c11107

feat: add device authorization endpoint handler

09a00ee

refactor: move logic to updateSessionWithRequest method

27e029c

fix: rename device auth endpoint handler

f6da362

feat: add device user verification handler

76fd069

nsklikas and others added 22 commits November 6, 2024 16:22

fix: implement device user verification logic

a488e83

feat: update flow

a8233fb

fix: add post device auth handler

1df9bcd

feat: add consent handler for accepting a user_code

30678c2

chore: add post_device_done to config schema

a956f32

chore: add e2e tests

554b5dc

feat: token request handling for device flow

04ce2df

chore: update config

5ebeb51

fix: fix the OIDC token and refresh token issue for device flow

d4391d9

fix: update OpenID Connect session after user consent

0a2eadd

fix: add GetDeviceCodeSessionByRequestID method

da85bb1

fix: return client_id to post_device page

d874a9f

fix: update existing device session

f1d6341

Instead of updating the device session, we were over-writing it causing existing session info that were created from fosite to be lost.

fix: update tests

44ca5df

fix: add device auth endpoint in discovery metadata

20e1fe3

fix: make device grant lifetimes configurable

b32093c

test: update sql fixtures

1568863

fix: perform device flow from CLI

5b6cc1f

fix: wrap db calls in transaction

a5bb44b

chore: fix license

e897168

chore: update sdk

111eea0

fix: duplicate user_code update

b7767f9

nsklikas force-pushed the canonical-master branch from 140f75e to b7767f9 Compare November 6, 2024 14:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: implement RFC 8628 #3851

feat: implement RFC 8628 #3851

nsklikas commented Sep 30, 2024 •

edited

Loading

CLAassistant commented Sep 30, 2024 •

edited

Loading

supercairos commented Oct 16, 2024 •

edited

Loading

zepatrik commented Oct 30, 2024 •

edited

Loading

zepatrik commented Oct 31, 2024

nsklikas commented Nov 1, 2024

zepatrik commented Nov 5, 2024

nsklikas commented Nov 6, 2024

nsklikas commented Nov 6, 2024

zepatrik commented Nov 6, 2024 •

edited

Loading

feat: implement RFC 8628 #3851

Are you sure you want to change the base?

feat: implement RFC 8628 #3851

Conversation

nsklikas commented Sep 30, 2024 • edited Loading

Related issue(s)

Checklist

Further Comments

Testing

CLAassistant commented Sep 30, 2024 • edited Loading

supercairos commented Oct 16, 2024 • edited Loading

zepatrik commented Oct 30, 2024 • edited Loading

zepatrik commented Oct 31, 2024

nsklikas commented Nov 1, 2024

zepatrik commented Nov 5, 2024

nsklikas commented Nov 6, 2024

nsklikas commented Nov 6, 2024

zepatrik commented Nov 6, 2024 • edited Loading

nsklikas commented Sep 30, 2024 •

edited

Loading

CLAassistant commented Sep 30, 2024 •

edited

Loading

supercairos commented Oct 16, 2024 •

edited

Loading

zepatrik commented Oct 30, 2024 •

edited

Loading

zepatrik commented Nov 6, 2024 •

edited

Loading