DreamPixelForge

A modern GUI application for running multiple AI image generation models locally on your machine, transforming your text prompts into stunning images.

Features

Clean and intuitive user interface
Multi-model support:
- Stable Diffusion 1.5
- Stable Diffusion 2.1
- Stable Diffusion XL
- Dreamlike Diffusion
- Kandinsky 2.2
- Pony Diffusion V6 XL
Text-to-image generation with various models
Support for negative prompts (model-specific defaults for optimal quality)
Adjustable generation parameters (steps and guidance scale)
Batch image generation (generate up to 10 images at once)
Model-specific resolution presets
Automatic image saving to outputs folder
Save generated images in various formats
Prompt enhancement using local LLMs via Ollama
Seed control for reproducible results
Multiple sampler algorithms for different generation styles
Cross-platform support (Windows, macOS, Linux)
GPU acceleration support where available (CUDA on Windows/Linux, Metal on macOS)
Real-time progress tracking
Clear feedback during model downloads
Support for local models from Civitai and other sources
App Icon Generation presets for professional icon creation
Icon Post-Processing for rounded corners and platform-specific sizing

Requirements

Python 3.8 or higher
For Windows/Linux: CUDA-capable GPU (recommended, 8+ GB VRAM for SDXL)
For macOS: Apple Silicon Mac (M1/M2/M3) for Metal acceleration (Intel Macs will use CPU)
At least 8GB of RAM (16GB recommended)
4-7GB free disk space per model (~20GB for all models)

Installation

Windows/Linux

Clone this repository:

git clone https://github.com/yourusername/dream-pixel-forge.git
cd dream-pixel-forge

Create a virtual environment (recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install the required packages:
```
pip install -r requirements.txt
```

macOS

Clone this repository:

git clone https://github.com/yourusername/dream-pixel-forge.git
cd dream-pixel-forge

Use the provided installation script:

chmod +x platform_specific/macos/install_macos.sh
./platform_specific/macos/install_macos.sh

Or follow these manual steps:

Create a virtual environment (recommended):

python -m venv venv
source venv/bin/activate

Install the required packages:
```
pip install -r requirements.txt
```
Notes for macOS:
- Apple Silicon Macs (M1/M2/M3) will automatically use Metal Performance Shaders (MPS) for acceleration
- Intel Macs will run in CPU mode (significantly slower)
- Image generation will be slower than on equivalent NVIDIA GPUs

Usage

Run the application:
```
python dream_pixel_forge.py
```
Select the model you want to use:
- Choose from the "Hugging Face Models" tab for pre-configured models
- Or select from the "Local Models" tab for your own custom models
Enter your prompt in the text field
(Optional) Enter a negative prompt to specify what you don't want in the image
Adjust the generation parameters if needed:
- Number of Steps: Higher values (30-50) generally produce better quality but take longer
- Guidance Scale: Higher values (7.5-15) make the image more closely match the prompt
- Batch Size: Number of images to generate at once (1-10)
- Seed: Control the randomness of the generation
  - Use -1 for a random seed each time
  - Set a specific number for reproducible results
  - The random dice button generates a new random seed
- Sampler: Select the algorithm used for the diffusion process
  - Different samplers have different characteristics and speeds
  - "Euler a" is a good default for most images
Click "Generate Images" and wait for the result
All generated images are automatically saved to the "outputs" folder, with filenames that include:
- The first few words of your prompt
- A unique generation counter
- The seed used for the image
For batch generation:
- Use the "Previous" and "Next" buttons to navigate between generated images
Use the "Save Image to Custom Location" button if you want to save your image to a specific location

Models

Stable Diffusion 1.5

The original Stable Diffusion model - fast and versatile.

Stable Diffusion 2.1

Improved version with better quality and consistency.

Stable Diffusion XL

Larger model with higher quality outputs (requires more VRAM).

Dreamlike Diffusion

Artistic model that creates dreamlike, surreal images.

Kandinsky 2.2

Russian alternative to SD with unique artistic style.

Pony Diffusion V6 XL

Specialized model for creating stylized art with high quality outputs. This model uses the SDXL architecture and automatically applies quality-enhancing tags to prompts.

Special features:

Optimal for generating stylized art
Automatically applies quality boosting tags for better results
Uses CLIP skip feature for improved output quality
Based on SDXL, so needs 8+ GB VRAM for optimal performance

Quality Score Tags:

All prompts for Pony Diffusion (both official and local models) are automatically enhanced with score_9, score_8_up, score_7_up quality tags
These tags tell the model to generate high-quality images according to its internal aesthetic scoring system
You don't need to add these tags manually - they're added automatically
This applies to both the official Hugging Face model and any local Pony models you add

Using Local Models

DreamPixelForge supports loading custom models from Civitai and other sources. These models should be in .safetensors or .ckpt format.

Adding Local Models

There are two ways to add local models:
- Direct import: Place .safetensors or .ckpt files in the models folder in the application directory and use "Import from Models Folder"
- Manual selection: Use the "Add Model" button to select a model file from anywhere on your system
When adding a model, you'll need to provide:
- Model Name: A name to identify the model in the UI
- Model Type: The base model architecture (SD 1.5, SD 2.1, SDXL)
- Model File: The path to the .safetensors or .ckpt file
- Description: (Optional) A description of the model
After adding a model, it will appear in the "Local Models" tab

Using Local Models

Switch to the "Local Models" tab
Select your model from the dropdown list
The appropriate resolution presets will be loaded based on the model type you specified
Generate images as you would with built-in models
Model-specific negative prompts will be automatically applied based on the model type

Managing Local Models

Use the "Manage Models" button to:

Add new models
Import models from the models folder
Remove models from the registry (this doesn't delete the model files)
Auto-detection of Pony models with appropriate configuration

App Icon Generation

DreamPixelForge includes specialized features for creating professional app icons:

App Icon Generation Preset

Access the preset from the Presets menu → App Icon Generator
This applies optimal settings for app icon generation:
- Square resolution (512x512 or 1024x1024 depending on model)
- Optimal steps (25) and guidance scale (7.0)
- Batch size of 4 to provide multiple options
- Specialized negative prompt to avoid text and common artifacts
- Sampler optimized for detailed icons
The preset also enhances your prompt by adding app icon specific terms if needed
You can then customize the prompt further to match your app's purpose

App Icon Post-Processing

After generating your app icons, use the post-processing tool to prepare them for different platforms:

Generate an icon using the App Icon Preset
Go to Post Processing menu → App Icon Processing
In the dialog, set:
- Corner Radius - Apply rounded corners from 0% (square) to 50% (fully rounded)
- Target Platform - iOS, Android, Windows, macOS, or All Platforms
- Output Directory - Where to save the processed icons
The tool automatically:
- Applies the specified corner radius with proper transparency
- Generates all required sizes for the selected platform
- Names files according to platform conventions
- Preserves transparency for platforms that support it

This workflow makes it easy to go from a text prompt to a complete set of properly sized and formatted icons for your application.

Model-Specific Negative Prompts

DreamPixelForge now features optimized negative prompts for each supported model type:

Stable Diffusion 1.5: Basic negative prompt to avoid common artifacts and issues
Stable Diffusion 2.1: Extended negative prompt tailored to SD 2.1's characteristics
Stable Diffusion XL: Comprehensive negative prompt optimized for SDXL models
Dreamlike Diffusion: Special negative prompt for artistic models
Kandinsky 2.2: Negative prompt adapted to Kandinsky's unique architecture
Pony Diffusion V6 XL: Specialized negative prompt for stylized art generation

These model-specific negative prompts will be automatically applied when you switch between models, improving image quality without manual adjustment. You can still customize the negative prompt as needed for specific results.

Automatic Model Detection

The application now automatically detects and properly configures certain model types:

Pony Models: Automatically detected and configured as SDXL models with appropriate settings
Local Models: Properly categorized based on their architecture (SD 1.5, SD 2.1, SDXL)

This ensures you get the best quality output with minimal manual configuration.

Ollama Prompt Enhancement

DreamPixelForge supports prompt enhancement using local large language models via the Ollama project. This feature helps you:

Convert descriptive sentences into a concise set of 5-10 optimized image generation tags
Enhance existing tags with 3-5 additional very closely related keywords that improve your results while maintaining the original style and concept

Requirements for Ollama Integration

Ollama installed and running on your machine
At least one language model installed through Ollama

Setting up Ollama

Download and install Ollama from ollama.ai
Run Ollama according to the instructions for your operating system
Pull a language model using the Ollama command:
```
ollama pull llama2
```

Using Prompt Enhancement

Choose an Ollama model from the dropdown
Select the input type:
- Description to Tags: Enter a full description of what you want to see
- Enhance Tags: Enter existing tags/keywords to expand them
Type your prompt in the enhancement field
Click "Enhance Prompt" to process it through the selected LLM
The enhanced prompt will be placed in the main prompt field, ready for image generation

Note: You need to start Ollama separately before using this feature. If Ollama is not detected, a message will be shown with an option to check for availability.

Model Downloads and First Use

When you first use a model, it will be downloaded automatically from Hugging Face. The application will show:

A first-time use notice with download size information
Real-time download status in the progress area
Elapsed time for longer downloads

Download sizes for each model:

Stable Diffusion 1.5: ~4GB
Stable Diffusion 2.1: ~4.2GB
Dreamlike Diffusion: ~4GB
Kandinsky 2.2: ~4.5GB
Stable Diffusion XL: ~6.5GB
Pony Diffusion V6 XL: ~7GB

Note: Downloads happen only once per model. After downloading, the model will be loaded directly from your local cache.

Model Storage and Cache Management

Models are downloaded automatically by the Hugging Face Diffusers library when first used and stored in a cache directory:

Windows: C:\Users\<YOUR_USERNAME>\.cache\huggingface\hub
macOS: /Users/<YOUR_USERNAME>/.cache/huggingface/hub
Linux: /home/<YOUR_USERNAME>/.cache/huggingface/hub

Disk Space Requirements

Each model requires significant disk space:

Stable Diffusion 1.5/2.1: ~4GB each
Dreamlike Diffusion: ~4GB
Kandinsky 2.2: ~4-5GB
Stable Diffusion XL: ~6.5GB
Pony Diffusion V6 XL: ~7GB

Note: You only need disk space for the models you actually use. The ~27GB total is only if you plan to use all models. Most users will only need 4-7GB for their preferred model.

Managing the Cache

You can manage the model cache in several ways:

Clear the cache - You can safely delete the cache directory if you need to free up space. Models will be re-downloaded when needed.

Custom cache location - Set a custom cache directory by setting the HF_HOME environment variable before running the application:

# Windows (PowerShell)
$env:HF_HOME = "D:\custom_model_cache"
python dream_pixel_forge.py

# Linux/macOS
export HF_HOME="/path/to/custom_model_cache"
python dream_pixel_forge.py

One-time downloads - Models are only downloaded once, so subsequent runs will be faster.

macOS Performance Expectations

On macOS, performance will vary depending on your hardware:

Apple Silicon (M1/M2/M3):
- Basic models (SD 1.5, SD 2.1) should run reasonably well
- Higher-end models (M2/M3 Pro, Max, Ultra) can handle SDXL with decent performance
- Expect ~2-4x slower generation than equivalent NVIDIA GPUs
- Use 30-40 steps rather than 50 for faster generation
Intel Macs:
- Will run in CPU-only mode
- Very slow performance (minutes per image)
- Best to use smaller models (SD 1.5) and lower resolutions
- Consider reducing steps to 20-30 for faster results

macOS Troubleshooting

If you encounter issues on macOS:

Memory errors:
- Try reducing resolution (512x512 instead of 768x768)
- Use smaller batch sizes (1-2 images at once)
- Close other memory-intensive applications
Crashing on model loading:
- Some models may not be compatible with MPS
- Try updating to the latest PyTorch version
- Restart the application between model changes
Very slow loading or generation:
- First generation after starting the app is always slower
- Consider using smaller models (SD 1.5 instead of SDXL)
- Reduce the image resolution and number of steps

Development

This project uses Git for version control. After making changes:

# View changed files
git status

# Add files to staging area
git add .

# Commit changes with a descriptive message
git commit -m "Description of changes"

# Push changes to remote repository (if set up)
git push

Notes

The first run will download the selected model (SD models ~4GB, SDXL ~6.5GB)
Generation time depends on your hardware (GPU recommended)
If you don't have a GPU, the application will run on CPU but will be significantly slower
Different models have different VRAM requirements:
- SD 1.5 and 2.1: ~4GB VRAM
- Dreamlike/specialized models: ~4-6GB VRAM
- SDXL: 8+GB VRAM recommended

Troubleshooting

If you encounter any issues:

Make sure all dependencies are installed correctly
Check if you have enough disk space and VRAM for the selected model
If using GPU, ensure you have the latest CUDA drivers installed
Try reducing the number of steps or image size if you run into memory issues
For SDXL, you need a GPU with at least 8GB VRAM, or consider using CPU mode

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
docs/images		docs/images
models		models
platform_specific/macos		platform_specific/macos
tests		tests
.coverage		.coverage
.gitignore		.gitignore
README.md		README.md
dream_pixel_forge.py		dream_pixel_forge.py
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
run_tests.py		run_tests.py
test_coverage_improvement.md		test_coverage_improvement.md
test_fixes_summary.md		test_fixes_summary.md
test_summary.md		test_summary.md

Legorobotdude/dream-pixel-forge

Folders and files

Latest commit

History

Repository files navigation

DreamPixelForge

Features

Requirements

Installation

Windows/Linux

macOS

Usage

Models

Stable Diffusion 1.5

Stable Diffusion 2.1

Stable Diffusion XL

Dreamlike Diffusion

Kandinsky 2.2

Pony Diffusion V6 XL

Using Local Models

Adding Local Models

Using Local Models

Managing Local Models

App Icon Generation

App Icon Generation Preset

App Icon Post-Processing

Model-Specific Negative Prompts

Automatic Model Detection

Ollama Prompt Enhancement

Requirements for Ollama Integration

Setting up Ollama

Using Prompt Enhancement

Model Downloads and First Use

Model Storage and Cache Management

Disk Space Requirements

Managing the Cache

macOS Performance Expectations

macOS Troubleshooting

Development

Notes

Troubleshooting

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages