Switch OpenAI Image Generation to use new `gpt-image-1` model #897

dkotter · 2025-04-23T21:54:28Z

Description of the Change

OpenAI recently introduced a new image generation model into their API, gpt-image-1, which can be used as a replacement for the dall-e-3 and dall-e-2 models. We upgraded from dall-e-2 to dall-e-3 in #717 and seems worthwhile to now upgrade to gpt-image-1, as the quality of the images generated is better (and pricing is similar).

There are some differences between these models that we need to account for:

Different quality options: New: auto, low, medium, high. Old: hd, standard
Different size options; New: 1024x1024, 1536x1024, 1024x1536. Old: 1024x1024, 1792x1024, 1024x1792
New model doesn't support the style options at all (vivid and natural)
New model only returns base64 encoded images, not an image URL

For new users, this won't matter but for existing users, there's code in place that will automatically use the new options when needed. As an example, for an existing user that has an image size set of 1792x1024, when they generate an image, it will force them to use 1024x1024. They'll need to go to the Feature settings screen to update their defaults.

Also worth noting I've updated the use of DALLE to be more general (OpenAI Images) in multiple places, as this makes more sense to what we're doing. I did not deprecate anything and I'm personally fine with that as it's not likely someone is directly using the Provider class. I did leave things like settings alone so none of those configurations will be lost.

How to test the Change

Turn on and configure the Image Generation Feature
Ensure you can generate images in the stand-alone page and within the media modal
Ensure the various settings work as expected
If desired, configure an environment using the current released version of the plugin and then update to this PR. Test and ensure image generation still works even if using old settings

Changelog Entry

Added - Support for the new OpenAI gpt-image-1 image generation model
Developer - Rename the DallE Provider class to Images. If you directly extend that class yourself, you'll need to update your code to account for this. Also updated a handful of other references to DALLE to Images

Credits

Props @dkotter

Checklist:

I agree to follow this project's Code of Conduct.
I have updated the documentation accordingly.
I have added Critical Flows, Test Cases, and/or End-to-End Tests to cover my change.
All new and existing tests pass.

… Add new helper methods and filters allowing more control over each option. Start removing the DALLE name

…odel

dkotter · 2025-06-05T16:18:23Z

includes/Classifai/Providers/OpenAI/Images.php


-class DallE extends Provider {
+class Images extends Provider {


I am changing the class name here but don't think we need to worry about deprecating this as it's unlikely anyone was directly extending this class

dkotter · 2025-06-05T17:25:54Z

Couple things to keep in mind here:

As noted here, your account needs to be verified to use the gpt-image-1 model. There may be some sites using image generation that aren't verified and will need to complete that before things will work, I think it's worth the minor pain there as the images generated are a much higher quality
You can get way more specific in the image generation prompt and the more specific you get, the longer it will take for the image to be generated. We currently have a timeout of 60 seconds on the request and in my testing, that seemed okay. But we may want to consider increasing that limit. Also a chance some sites have server-level limits (max execution time) that could be run into

As an example complex prompt, I pulled this from OpenAI's cookbook to test. Took about 45 seconds for me locally:

Render a realistic image of this character:
Blobby Alien Character Spec Name: Glorptak (or nickname: "Glorp")
Visual Appearance Body Shape: Amorphous and gelatinous. Overall silhouette resembles a teardrop or melting marshmallow, shifting slightly over time. Can squish and elongate when emotional or startled.
Material Texture: Semi-translucent, bio-luminescent goo with a jelly-like wobble. Surface occasionally ripples when communicating or moving quickly.
Color Palette:
- Base: Iridescent lavender or seafoam green
- Accents: Subsurface glowing veins of neon pink, electric blue, or golden yellow
- Mood-based color shifts (anger = dark red, joy = bright aqua, fear = pale gray)
Facial Features:
- Eyes: 3–5 asymmetrical floating orbs inside the blob that rotate or blink independently
- Mouth: Optional—appears as a rippling crescent on the surface when speaking or emoting
- No visible nose or ears; uses vibration-sensitive receptors embedded in goo
- Limbs: None by default, but can extrude pseudopods (tentacle-like limbs) when needed for interaction or locomotion. Can manifest temporary feet or hands.
Movement & Behavior Locomotion:
- Slides, bounces, and rolls.
- Can stick to walls and ceilings via suction. When scared, may flatten and ooze away quickly.
Mannerisms:
- Constant wiggling or wobbling even at rest
- Leaves harmless glowing slime trails
- Tends to absorb nearby small objects temporarily out of curiosity

jeffpaul · 2025-06-05T19:29:26Z

Also worth noting I've updated the use of DALLE to be more general (OpenAI Images) in multiple places, as this makes more sense to what we're doing. I did not deprecate anything and I'm personally fine with that as it's not likely someone is directly using the Provider class. I did leave things like settings alone so none of those configurations will be lost.

That all sounds good to me.

jeffpaul · 2025-06-05T19:30:50Z

As noted here, your account needs to be verified to use the gpt-image-1 model. There may be some sites using image generation that aren't verified and will need to complete that before things will work, I think it's worth the minor pain there as the images generated are a much higher quality

Do we have any handling (or even a way with the OAI API) to detect someone's not verified and to throw a notice for them to complete that to ensure the image gen features continue to work as expected?

jeffpaul · 2025-06-05T19:31:49Z

You can get way more specific in the image generation prompt and the more specific you get, the longer it will take for the image to be generated. We currently have a timeout of 60 seconds on the request and in my testing, that seemed okay. But we may want to consider increasing that limit. Also a chance some sites have server-level limits (max execution time) that could be run into

Perhaps as part of an error messaging in-context we add a simple note that if they're seeing timeouts beyond 60 seconds that they consider filtering at the app or server level to adjust that limitation?

dkotter · 2025-06-05T20:36:03Z

Do we have any handling (or even a way with the OAI API) to detect someone's not verified and to throw a notice for them to complete that to ensure the image gen features continue to work as expected?

Not that I can find (as far as an API that tells us that). We could try and make an image generation request and use that response but not sure I love the idea of automatically doing that (as it costs them money).

Perhaps as part of an error messaging in-context we add a simple note that if they're seeing timeouts beyond 60 seconds that they consider filtering at the app or server level to adjust that limitation?

If the request times out, an error message will already show (though that's just the HTTP request timing out, not a PHP-level timeout, which I don't think we can capture).

jeffpaul · 2025-06-09T15:05:32Z

Not that I can find (as far as an API that tells us that). We could try and make an image generation request and use that response but not sure I love the idea of automatically doing that (as it costs them money).

I agree.

Sidsector9

Code looks good, the PR tested well for me 👍

dkotter added 5 commits April 23, 2025 14:55

Start work on bringing the new image model to our OpenAI integration.…

b422b5b

… Add new helper methods and filters allowing more control over each option. Start removing the DALLE name

Update the settings to match the new model

6fef805

More renaming of DALLE to Images

a9a0aae

Ensure we only send supported params when using the new gpt-image-1 m…

b035c54

…odel

Update spacing

148ad38

dkotter self-assigned this Apr 23, 2025

github-actions bot added this to the 3.4.0 milestone Apr 23, 2025

dkotter mentioned this pull request Apr 23, 2025

Add ability to modify DALL·E 3 generated images #723

Open

1 task

dkotter and others added 5 commits April 23, 2025 16:01

Fix eslint error

47b1932

Fix E2E tests

9b96957

More E2E fixes

40f4fe6

Merge branch 'develop' of github.com:10up/classifai into develop

ce87045

Merge branch 'develop' into feature/new-openai-image-model

aca3c3d

dkotter commented Jun 5, 2025

View reviewed changes

dkotter added 3 commits June 5, 2025 10:57

Update readmes to reference new image generation

31d120b

Update caption we set on generated images

fe3aa04

Trim down the filename we use to avoid long filenames breaking things

1da56fe

dkotter marked this pull request as ready for review June 5, 2025 17:21

dkotter requested review from jeffpaul and a team as code owners June 5, 2025 17:21

github-actions bot added the needs:code-review This requires code review. label Jun 5, 2025

Ensure all error messages are returned

7a61fc5

jeffpaul requested review from Sidsector9 and removed request for jeffpaul June 9, 2025 15:05

github-actions bot added the needs:refresh This requires a refreshed PR to resolve. label Jun 9, 2025

Merge branch 'develop' into feature/new-openai-image-model

f7045f0

github-actions bot removed the needs:refresh This requires a refreshed PR to resolve. label Jun 9, 2025

Sidsector9 added 2 commits June 11, 2025 17:51

Merge branch 'develop' of github.com:10up/classifai into develop

ca0549e

Merge branch 'develop' into feature/new-openai-image-model

352c1c6

Sidsector9 approved these changes Jun 11, 2025

View reviewed changes

dkotter merged commit 4eb0d6b into develop Jun 11, 2025
19 checks passed

dkotter deleted the feature/new-openai-image-model branch June 11, 2025 16:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Switch OpenAI Image Generation to use new `gpt-image-1` model #897

Switch OpenAI Image Generation to use new `gpt-image-1` model #897

Uh oh!

dkotter commented Apr 23, 2025 •

edited

Loading

Uh oh!

dkotter Jun 5, 2025

Uh oh!

dkotter commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 5, 2025

Uh oh!

dkotter commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 9, 2025

Uh oh!

Sidsector9 left a comment

Uh oh!

Uh oh!

Uh oh!


		class DallE extends Provider {
		class Images extends Provider {

Switch OpenAI Image Generation to use new gpt-image-1 model #897

Switch OpenAI Image Generation to use new gpt-image-1 model #897

Uh oh!

Conversation

dkotter commented Apr 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of the Change

How to test the Change

Changelog Entry

Credits

Checklist:

Uh oh!

dkotter Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

dkotter commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 5, 2025

Uh oh!

dkotter commented Jun 5, 2025

Uh oh!

jeffpaul commented Jun 9, 2025

Uh oh!

Sidsector9 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Switch OpenAI Image Generation to use new `gpt-image-1` model #897

Switch OpenAI Image Generation to use new `gpt-image-1` model #897

dkotter commented Apr 23, 2025 •

edited

Loading