Stable Diffusion 1024x1024 Pixel Image Generator (CLI)

Fig.1 Generated Images (1024×1024) using this code: (a) Futuristic Forest Details, (b) Cosmic Constellation Map, (c) Paint Strokes of a Virtual Reality Art (d) Augmented Human Eye, (e) Architectural Marvels in a Smart City, (f) Macro Shot of a Nano-Insect, (g) Interactive Globe Details, (h) Clockwork of a Time Machine

▶ Generated image files (1024×1024) and a pre-upscaled image file (512×512) are located in this directory. Fig.2 Comparison of Resolution Before and After Processing: (left) 512×512, (right) 1024×1024

Overview

This project aims to generate high-resolution 1024x1024 pixel images using the Stable Diffusion model 1.4/1.5 and an advanced upsampler prototype. It is specifically designed to meet the requirements of strategically important clients who demand higher image resolutions than what is currently available in the standard Stable Diffusion packages.

Currently, a researcher at Stability AI has released it on Colab. However, it only works in a notebook environment. Therefore, I have prepared a program that can generate 1024x1024 pixel images in a Linux environment by providing prompts through command line arguments

▼ Related Tweet

https://x.com/StabilityAI/status/1590531946026717186?s=20

Features

Generate 1024x1024 pixel images from given prompts.
Utilizes Stable Diffusion 1.4/1.5 model.
Runs locally on Linux.

Requirements

Python 3.8 or higher
Version 0.0.15 of the k-diffusion package
Version 4.31.0 of the transformers package
Docker (for DevContainer support)

Installation

Using DevContainer in Visual Studio Code

Clone this repository.

git clone https://github.com/hogaku/StableDiffusion-Upscaler-CLI.git

Open the project folder in Visual Studio Code.
```
cd StableDiffusion-Upscaler-CLI
code .
```
When Visual Studio Code prompts you to reopen the folder in a DevContainer, click "Reopen in Container". Alternatively, you can press F1 and select the "Remote-Containers: Reopen Folder in Container" command. This will build the DevContainer defined in the .devcontainer folder, installing all the required dependencies automatically.

Manual Installation

Clone this repository.

git clone https://github.com/hogaku/StableDiffusion-Upscaler-CLI.git

Open the project folder in Visual Studio Code.
```
cd StableDiffusion-Upscaler-CLI
code .
```
Install required packages.
```
pip install -r requirements.txt
```
Run the setup script.
```
./setup.sh
```

Usage

First, Rename the .env.example`` file to .envand configure the Stable Diffusion Engine and API key as appropriate. Example.env` configuration:

SD_ENGINE_ID="stable-diffusion-v1-5"
SD_API_SECRET_KEY="sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxx"

Run the following command on your Linux terminal to generate a 1024x1024 pixel image.

python main.py --seed=<YOUR_SEED> --prompt=<YOUR_PROMPT>

or

python main.py -s=<YOUR_SEED> -p=<YOUR_PROMPT>

Replace <YOUR_SEED> and <YOUR_PROMPT> with the desired seed and prompt for image generation.

Example:

python main.py --seed=12345 --prompt="A beautiful sunset over the mountains."

This will produce a 1024x1024 pixel image based on the prompt and seed provided.

(Others) Standalone Execution for 512x512 Pixel Image Generation

The code for generating 512×512 pixel images using stable-diffusion-v1-5 can operate independently.

Pattern1: CLI Execution

python generate.py <ENGINE_ID> <YOUR_PROMPT>

Pattern2: Class Invocation(No Return Value)

from generate import SDImageGenerator
engine_id = os.getenv('SD_ENGINE_ID')
generator_key = os.getenv('SD_API_SECRET_KEY')
engine_id = "stable-diffusion-v1-5"
generator = SDImageGenerator(engine_id, generator_key)
# No return value
generator.generate_image(prompt, <OUTPUT_SAVE_DIR>)

Pattern3: Class Invocation (Return Value: PIL Image)

from generate import SDImageGenerator
engine_id = os.getenv('SD_ENGINE_ID')
generator_key = os.getenv('SD_API_SECRET_KEY')
engine_id = "stable-diffusion-v1-5"
generator = SDImageGenerator(engine_id, generator_key)
# Return value: PIL Image
output_image = generator.generate_image(prompt, <OUTPUT_SAVE_DIR>, return_image_data=True)

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.devcontainer		.devcontainer
clip_utils		clip_utils
img		img
samples		samples
upscaler_utils		upscaler_utils
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Stable_Diffusion_Upscaler_Demo.ipynb		Stable_Diffusion_Upscaler_Demo.ipynb
check_can_GPU.py		check_can_GPU.py
generate.py		generate.py
main.py		main.py
requirements.txt		requirements.txt
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stable Diffusion 1024x1024 Pixel Image Generator (CLI)

Overview

Features

Requirements

Installation

Using DevContainer in Visual Studio Code

Manual Installation

Usage

(Others) Standalone Execution for 512x512 Pixel Image Generation

About

Releases

Packages

Languages

License

hogaku/StableDiffusion-Upscaler-CLI

Folders and files

Latest commit

History

Repository files navigation

Stable Diffusion 1024x1024 Pixel Image Generator (CLI)

Overview

Features

Requirements

Installation

Using DevContainer in Visual Studio Code

Manual Installation

Usage

(Others) Standalone Execution for 512x512 Pixel Image Generation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages