Port API to FastAPI #133 #134

VKTB · 2024-11-04T16:15:41Z

Description

Drops support for all Python versions below 3.11
Ports the API to FastAPI
Updates project dependencies to latest versions
Adds missing severity key value pair to the scheduled_maintenance.json file
Defines new custom exception classes to improve error handling
Logger is now configured using a logging.ini file
App is now configured using a .env file instead of the old config.json. The config is loaded when the first starts up which means that updates to the the AUTHENTICATION__JWT_REFRESH_TOKEN_BLACKLIST and AUTHENTICATION__ADMIN_USERS values now require the application to be restarted to take effect. If this is a problem then I suggest that we store these values in separate files like we do with the active usernames in LDAP JWT Auth.
Refactors the majority of the app logic including tests
Fixes a bug where any refresh token could refresh any access token
The linter was complaining about no use of timeout in requests calls so I added an option in the config to allow for this to be set. I set it to 5 seconds in the example file (hopefully this is enough?).
Updates the Dockerfile to have multistages:
- dev for local development
- test for running the tests
- prod for production
Updates the tests CI job to run the tests in a Docker container

Testing Instructions

Add a set up instructions describing how the reviewer should test the code

Review code
Check GitHub Actions build
Review changes to test coverage
Test it with SciGateway, ensure it doesn't break anything
Test endpoints (swagger docs available at /docs) work as expected

Agile Board Tracking

closes #93
closes #94
closes #95
closes #129
closes #133

VKTB · 2024-11-12T11:54:00Z

@louise-davies As I mentioned, earlier there are a few things I want to check

Do the Ansible scripts create the maintenance files? If so, does the scheduled_maintenace.json have a severity key value pair?
Do the Ansible scripts need to be updated to create a .env and logging.ini config files?
Are there any other changes that need to be made to the ansible scripts?
The response status code/message have changed for the below endpoints. Will this break SciGateway?
- (For POST requests to /login) If the mnemonic field in the body is missing, it now returns 422 { "detail": [ { "type": "missing", "loc": [ "body", "mnemonic" ], "msg": "Field required", "input": {} } ] } instead of 400 Missing mnemonic. (Message is built by Pydantic)
- (For POST requests to /refresh) If JWT refresh token (HTTP-only cookie) is missing, it now returns 400 No JWT refresh token found instead of 400 No refresh token cookie found.
- (For POST requests to /refresh) If JWT refresh token (HTTP-only cookie) is blacklisted, it now returns 403 Unable to refresh access token instead of 403 Refresh token was not valid".
- (For POST requests to /refresh If token field (which is the JWT access token) in the body is missing, it now returns 422 { "detail": [ { "type": "missing", "loc": [ "body", "token" ], "msg": "Field required", "input": null } ] } instead of 400 No access token found. (Message is built by Pydantic)
- (For POST requests to /refresh) If anything goes wrong while it is trying to refresh the access token like invalid JWT tokens, the usernames in the tokens are not matching, or ICAT errors, it now returns 403 Unable to refresh access token instead of 403 Unable to refresh token.
- (For POST requests to /verify) For 200 responses, it no longer returns a body . Previously, it returned an empty string (""). I think returning 204 instead would be more suitable here given no body is returned but not sure how that will impact SciGateway.
- (For POST requests to /verify) If JWT token in the body is invalid, it now returns 403 Invalid JWT token provided instead of 403 Token could not be verified.
- (For GET requests to /maintenance) If it fails to get the maintenance state, it now returns 500 Failed to get maintenance state instead of 500 Failed to retrieve maintenance mode state.
- (For PUT requests to /maintenance) If there are validation errors as a result of invalid/missing JSON data in the body, it now returns 422 instead of 400. The message returned is different as it the validation is handled by Pydantic instead of the jsonschema library.
- (For PUT requests to /maintenance) If JWT access token in the body is invalid, it now returns 403 Unable to update maintenance state instead of 403 Access token was not valid.
- (For PUT requests to /maintenance) If user is not admin, it now returns 403 Unable to update maintenance state instead of 403 Unauthorized.
- (For PUT requests to /maintenance) If anything goes wrong while it is trying to update the maintenance state like file write error, it now returns 500 Failed to update maintenance state instead of 500 Failed to update maintenance mode state.
- (For PUT requests to /maintenance) For 200 responsnes, it now returns Maintenance state successfully updated instead of Maintenance mode state successfully updated.
- (For GET requests to /scheduled_maintenance) If it fails to get the scheduled maintenance state, it now returns 500 Failed to get scheduled maintenance state instead of 500 Failed to return scheduled maintenance state.
- (For PUT requests to /scheduled_maintenance) If there are validation errors as a result of invalid/missing JSON data in the body, it now returns 422 instead of 400. The message returned is different as it the validation is handled by Pydantic instead of the jsonschema library.
- (For PUT requests to /scheduled_maintenance) If JWT access token in the body is invalid, it now returns 403 Unable to update scheduled maintenance state instead of 403 Access token was not valid.
- (For PUT requests to /scheduled_maintenance) If user is not admin, it now returns 403 Unable to update scheduled maintenance state instead of 403 Unauthorized.
- (For PUT requests to /scheduled_maintenance) If anything goes wrong while it is trying to update the scheduled maintenance state like file write error, it now returns 500 Failed to update scheduled maintenance state instead of 500 Failed to update scheduled maintenance mode state.
- (For PUT requests to /scheduled_maintenance) For 200 responsnes, it now returns Maintenance state successfully updated instead of Maintenance mode state successfully updated.

louise-davies · 2024-11-14T10:14:11Z

@VKTB

Do the Ansible scripts create the maintenance files? If so, does the scheduled_maintenace.json have a severity key value pair?

Yes, they create them if they don't already exist. But no they weren't updated to have severity yet - this was added later. Can easily update the ansible to have that set to warning by default and I can update the files already installed on machines too. I presume this is because this PR adds validation to the JSON now - previously we didn't care and just stored any JSON iirc.

Do the Ansible scripts need to be updated to create a .env and logging.ini config files?

Yes they would do if they are required. The .env file replaces config.json right? Which is configured in the ansible so can just adapt it. Should not be hard to add, just standard templating stuff/copying files if they're supplied.

Are there any other changes that need to be made to the ansible scripts?

I wouldn't think so, unless it's installed differently all the ansible scripts do is set up directories/files, install packages into virtualenv & set up the apache config to run the WSGI script.

The response status code/message have changed for the below endpoints. Will this break SciGateway?

(For POST requests to /login) If the mnemonic field in the body is missing, it now returns 422 { "detail": [ { "type": "missing", "loc": [ "body", "mnemonic" ], "msg": "Field required", "input": {} } ] } instead of 400 Missing mnemonic. (Message is built by Pydantic)

Shouldn't matter - we don't really care how the login commands fail - we just ensure the user is logged out and display a generic "Failed to login" message (aka we don't parse the error from the API)

(For POST requests to /refresh) If JWT refresh token (HTTP-only cookie) is missing, it now returns 400 No JWT refresh token found instead of 400 No refresh token cookie found.

Again, we don't parse the error message so no concern, would just display a "Signed out due to token invalidation" message

(For POST requests to /refresh) If JWT refresh token (HTTP-only cookie) is blacklisted, it now returns 403 Unable to refresh access token instead of 403 Refresh token was not valid".

See above

(For POST requests to /refresh If token field (which is the JWT access token) in the body is missing, it now returns 422 { "detail": [ { "type": "missing", "loc": [ "body", "token" ], "msg": "Field required", "input": null } ] } instead of 400 No access token found. (Message is built by Pydantic)

See above

(For POST requests to /refresh) If anything goes wrong while it is trying to refresh the access token like invalid JWT tokens, the usernames in the tokens are not matching, or ICAT errors, it now returns 403 Unable to refresh access token instead of 403 Unable to refresh token.

See above

(For POST requests to /verify) For 200 responses, it no longer returns a body . Previously, it returned an empty string (""). I think returning 204 instead would be more suitable here given no body is returned but not sure how that will impact SciGateway.

We don't care about the specific body or code, as long as it's a 2XX code

(For POST requests to /verify) If JWT token in the body is invalid, it now returns 403 Invalid JWT token provided instead of 403 Token could not be verified.

Any failure to verify like refresh is not parsed, and will just result in a generic "Signed out due to token invalidation"

(For GET requests to /maintenance) If it fails to get the maintenance state, it now returns 500 Failed to get maintenance state instead of 500 Failed to retrieve maintenance mode state.

We don't parse the message so no concern

(For PUT requests to /maintenance) If there are validation errors as a result of invalid/missing JSON data in the body, it now returns 422 instead of 400. The message returned is different as it the validation is handled by Pydantic instead of the jsonschema library.

Again, we don't parse the message and the error code is still an error code so error handling will work the same

(For PUT requests to /maintenance) If JWT access token in the body is invalid, it now returns 403 Unable to update maintenance state instead of 403 Access token was not valid.

See above, we don't parse the message

(For PUT requests to /maintenance) If user is not admin, it now returns 403 Unable to update maintenance state instead of 403 Unauthorized.

See above

(For PUT requests to /maintenance) If anything goes wrong while it is trying to update the maintenance state like file write error, it now returns 500 Failed to update maintenance state instead of 500 Failed to update maintenance mode state.

See above

(For PUT requests to /maintenance) For 200 responses, it now returns Maintenance state successfully updated instead of Maintenance mode state successfully updated.

We don't parse success messages either, it just returns the body. So the message the admins see will change slightly but not a concern in my opinion.

(For GET requests to /scheduled_maintenance) If it fails to get the scheduled maintenance state, it now returns 500 Failed to get scheduled maintenance state instead of 500 Failed to return scheduled maintenance state.

We don't parse the error

(For PUT requests to /scheduled_maintenance) If there are validation errors as a result of invalid/missing JSON data in the body, it now returns 422 instead of 400. The message returned is different as it the validation is handled by Pydantic instead of the jsonschema library.

See above

(For PUT requests to /scheduled_maintenance) If JWT access token in the body is invalid, it now returns 403 Unable to update scheduled maintenance state instead of 403 Access token was not valid.

See above

(For PUT requests to /scheduled_maintenance) If user is not admin, it now returns 403 Unable to update scheduled maintenance state instead of 403 Unauthorized.

See above

(For PUT requests to /scheduled_maintenance) If anything goes wrong while it is trying to update the scheduled maintenance state like file write error, it now returns 500 Failed to update scheduled maintenance state instead of 500 Failed to update scheduled maintenance mode state.

See above

(For PUT requests to /scheduled_maintenance) For 200 responsnes, it now returns Maintenance state successfully updated instead of Maintenance mode state successfully updated.

Like with the maintenance endpoint, we don't parse success messages, it just returns the body. So the message the admins see will change slightly but not a concern in my opinion.

Overall, I think the changes to the API endpoints themselves don't require any changes in SG and overall look for the better, whereas scigateway-ansible will need updating but it's relatively easy to do so I think

VKTB · 2024-11-14T12:06:04Z

@louise-davies Thank you for providing these answers! Sounds like we don't need to do anything in SciGateway which is good news.

I presume this is because this PR adds validation to the JSON now - previously we didn't care and just stored any JSON iirc.

That's right, it now uses Pydantic models which automatically validate the data stored in the JSON files.

The .env file replaces config.json right?

Correct. The configuration is now handled using Pydantic Settings which allows for loading config values from system environment variables or the .env file. It's up to us to decide whether we use system ENV VARs or a .env file. Please note that when both are used, Pydantic will still read system ENV VARs as well as the .env file but system ENV VARs will always take priority over values loaded from the .env file.

I wouldn't think so, unless it's installed differently all the ansible scripts do is set up directories/files, install packages into virtualenv & set up the apache config to run the WSGI script.

There is no WSGI script anymore as FastAPI is ASGI and in the world of containers the app is run in production mode using the FastAPI CLI command line program which internally uses a production-ready Uvicorn ASGI server. I have never run a FastAPI app on an apache server like that so we should look at how we get it to work.

louise-davies · 2024-11-14T12:44:53Z

@VKTB ah yeah, forgot about the WSGI to ASGI change. Will probably just have to set up a system service for the API, and use Apache to reverse proxy to whatever port we run it on - so slightly more changes to the ansible scripts than I thought

VKTB added 30 commits October 28, 2024 16:47

Drop support for Python versions below 3.11 #93 #94 #95

9a5d86b

Replace Flask-RESTful dependencies with FastAPI #133

f7cc27a

Update dependencies to latest versions #133

03c59c7

Delete wsgi.py module #133

e9ce95c

Set black's and flake's max line length to 120 #133

b4ee440

Add severity value to scheduled_maintenance.json file #133

cf5ffb0

Define custom exception classes #133

286c554

Refactor logger logic to use configuration from .ini file #133

5450937

Add a logging.ini example file #133

f7e5768

Define API schema models #133

3aa29ee

Refactor ICATAuthenticator and AuthenticationHandler classes #133

e03840e

Refactor MaintenanceMode and ScheduledMaintenanceMode classes #133

c98de7b

Implement authentication API router #133

2dfbbdc

Implement maintenance API router #133

69583f0

Remove unnecessary config values #133

45ac714

Update config.json example file #133

248dff5

Update constants.py module #133

8009cc0

Update .gitignore #133

bd480b3

Update main module #133

da03c8f

Update noxfile.py #133

6db30b2

Update CI workflow file #133

a86f3f9

Fix syntax error in CI workflow file #133

af54b65

Remove openapi.yaml file #133

88dae1c

Use .env file and pydantic BaseSettings for configuration #133

11a5460

Add pytest-env as dev dependency #133

fd56c1a

Refactor logic #133

fb6894c

Check usernames in tokens match before refreshing access token #129

c80ac0c

Redo tests #133

840d253

Configure nox tests session to use pytest config file #133

aea6b5d

Fix failing test CI job #133

04d05f1

VKTB added 7 commits November 12, 2024 11:01

Set timeout for ICAT requests #133

b093a52

Fix failing tests #133

9eab439

Define a Docker stage for local development #133

f5ef933

Define a Docker stage for running tests #133

ef559d8

Define a Docker stage for production #133

1a229ee

Set target stage in Docker CI job #133

4d506ee

Change responses for PUT maintenance endpoints #133

445d618

Configure tests CI job to run tests in Docker container #133

b923526

VKTB force-pushed the port-over-to-fastapi-#133 branch from c76ad13 to 6221365 Compare November 12, 2024 17:31

Only push Docker images to Harbor on workflow_dispatch #133

a470be7

VKTB force-pushed the port-over-to-fastapi-#133 branch from 6221365 to a470be7 Compare November 12, 2024 17:35

VKTB added 2 commits November 12, 2024 17:48

Fix failing docker CI job #133

92a7311

Get coverage.xml report from container #133

0fb5fa1

VKTB force-pushed the port-over-to-fastapi-#133 branch from b76ccfb to 0fb5fa1 Compare November 12, 2024 18:08

VKTB marked this pull request as ready for review November 12, 2024 18:10

VKTB added 2 commits November 14, 2024 16:31

Createa docker compose file #133

5d49497

Update README #133

9d1192d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Port API to FastAPI #133 #134

Port API to FastAPI #133 #134

VKTB commented Nov 4, 2024 •

edited

Loading

VKTB commented Nov 12, 2024 •

edited

Loading

louise-davies commented Nov 14, 2024 •

edited

Loading

VKTB commented Nov 14, 2024

louise-davies commented Nov 14, 2024 •

edited

Loading

Port API to FastAPI #133 #134

Are you sure you want to change the base?

Port API to FastAPI #133 #134

Conversation

VKTB commented Nov 4, 2024 • edited Loading

Description

Testing Instructions

Agile Board Tracking

VKTB commented Nov 12, 2024 • edited Loading

louise-davies commented Nov 14, 2024 • edited Loading

VKTB commented Nov 14, 2024

louise-davies commented Nov 14, 2024 • edited Loading

VKTB commented Nov 4, 2024 •

edited

Loading

VKTB commented Nov 12, 2024 •

edited

Loading

louise-davies commented Nov 14, 2024 •

edited

Loading

louise-davies commented Nov 14, 2024 •

edited

Loading