Add Unified Gradio Web UI for Image Understanding, Generation, and Editing #1

Merged

mi804 merged 2 commits into modelscope:main from Enderfga:main

May 7, 2025

Contributor

Enderfga commented May 6, 2025 (edited)

This PR introduces a Gradio-powered Web UI (app.py) that unifies the three main capabilities of Nexus-Gen:

Multimodal Q&A: Ask questions about uploaded images or in text-only mode
Image Generation: Generate images from detailed prompts, with optional prompt polishing by an LLM
Image Editing: Edit uploaded images with natural language instructions

The UI is modular, user-friendly, and runnable locally with:

python app.py

Example inputs and outputs are pre-loaded via gr.Examples for quick testing.
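
For readers who have not opened app.py, the layout described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the handler names (answer_question, generate_image, edit_image), the build_demo helper, the example strings, and the placeholder bodies are all assumptions, and the real app calls the Nexus-Gen model where the placeholders are.

```python
# Hypothetical sketch of a three-tab Gradio UI. Handler bodies are placeholders,
# NOT the Nexus-Gen inference code from app.py.

def answer_question(image, question):
    """Placeholder Q&A handler; image is None in text-only mode."""
    mode = "text-only" if image is None else "image"
    return f"[{mode}] answer to: {question}"


def generate_image(prompt, polish):
    """Placeholder generation handler; returns (image, prompt actually used)."""
    final_prompt = prompt.strip()
    if polish:
        # Stand-in for an LLM rewrite of the prompt (assumption).
        final_prompt += ", highly detailed"
    return None, final_prompt  # a real handler would return a generated image


def edit_image(image, instruction):
    """Placeholder editing handler; returns the input image unchanged."""
    if image is None:
        raise ValueError("Editing requires an uploaded image")
    return image


def build_demo():
    # Imported lazily so the handlers above can be exercised without Gradio.
    import gradio as gr

    with gr.Blocks(title="Nexus-Gen demo (sketch)") as demo:
        with gr.Tab("Multimodal Q&A"):
            qa_image = gr.Image(type="pil", label="Image (optional)")
            qa_text = gr.Textbox(label="Question")
            qa_out = gr.Textbox(label="Answer")
            gr.Button("Ask").click(answer_question, [qa_image, qa_text], qa_out)
            # Pre-loaded example inputs for quick testing.
            gr.Examples(examples=[["Describe this image."]], inputs=[qa_text])

        with gr.Tab("Image Generation"):
            gen_prompt = gr.Textbox(label="Prompt")
            gen_polish = gr.Checkbox(label="Polish prompt", value=False)
            gen_image = gr.Image(label="Result")
            gen_used = gr.Textbox(label="Prompt used")
            gr.Button("Generate").click(
                generate_image, [gen_prompt, gen_polish], [gen_image, gen_used]
            )

        with gr.Tab("Image Editing"):
            edit_in = gr.Image(type="pil", label="Input image")
            edit_text = gr.Textbox(label="Instruction")
            edit_out = gr.Image(label="Edited image")
            gr.Button("Edit").click(edit_image, [edit_in, edit_text], edit_out)

    return demo
```

With Gradio installed, build_demo().launch() would serve this sketch locally. Keeping each handler as a plain function, separate from the layout code, is one way to get the modularity mentioned above, since a handler can be swapped or tested without touching the UI.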

Enderfga added 2 commits

May 6, 2025 14:50


          support web demo

76f0d17


          fix some bug

46420a4

Collaborator

mi804 commented May 7, 2025

Thanks a lot for your contribution!!!

mi804 merged commit 2b6a64b into modelscope:main
