Add Unified Gradio Web UI for Image Understanding, Generation, and Editing #1

Merged: 2 commits merged into modelscope:main on May 7, 2025

Conversation

Enderfga (Contributor) commented on May 6, 2025

This PR introduces a Gradio-powered Web UI (app.py) that unifies the three main capabilities of Nexus-Gen:

  • Multimodal Q&A: Ask questions about uploaded images or in text-only mode
  • Image Generation: Generate images from detailed prompts, with optional prompt polishing by an LLM
  • Image Editing: Edit uploaded images with natural language instructions

The UI is modular, user-friendly, and runnable locally with:

python app.py

Example inputs and outputs are pre-loaded via gr.Examples for quick testing.
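
For readers unfamiliar with this layout, here is a minimal sketch of how a three-tab Gradio app with pre-loaded examples might be wired. It is not the app.py from this PR: the handler functions (answer_question, generate_image, edit_image), the build_demo helper, the labels, and the example prompt are all hypothetical stand-ins, and the handlers are stubs that never call Nexus-Gen. Gradio is imported lazily so the stubs can be exercised without it installed.

```python
# Hypothetical sketch only: the names and stub logic below are assumptions,
# not the actual app.py added by this PR.

def answer_question(image, question):
    """Stub for multimodal Q&A; image may be None in text-only mode."""
    mode = "text-only" if image is None else "image"
    return f"[{mode}] stub answer to: {question}"


def generate_image(prompt, polish):
    """Stub for generation: returns the prompt that would reach the model."""
    return f"(polished) {prompt}" if polish else prompt


def edit_image(image, instruction):
    """Stub for editing: returns the input image unchanged."""
    return image


def build_demo():
    """Assemble the tabbed UI; requires the third-party gradio package."""
    import gradio as gr  # lazy import so the stubs above work without Gradio

    with gr.Blocks(title="Nexus-Gen demo (sketch)") as demo:
        with gr.Tab("Multimodal Q&A"):
            qa_image = gr.Image(type="pil", label="Image (optional)")
            qa_question = gr.Textbox(label="Question")
            qa_answer = gr.Textbox(label="Answer")
            gr.Button("Ask").click(
                answer_question, [qa_image, qa_question], qa_answer
            )

        with gr.Tab("Image Generation"):
            gen_prompt = gr.Textbox(label="Prompt")
            gen_polish = gr.Checkbox(label="Polish prompt with LLM")
            gen_out = gr.Textbox(label="Prompt sent to generator (stub)")
            gr.Button("Generate").click(
                generate_image, [gen_prompt, gen_polish], gen_out
            )
            # Clicking an example row fills the input components.
            gr.Examples(
                examples=[["a red fox in fresh snow", False]],
                inputs=[gen_prompt, gen_polish],
            )

        with gr.Tab("Image Editing"):
            edit_in = gr.Image(type="pil", label="Image to edit")
            edit_instr = gr.Textbox(label="Instruction")
            edit_out = gr.Image(label="Edited image")
            gr.Button("Edit").click(
                edit_image, [edit_in, edit_instr], edit_out
            )

    return demo

# To serve the sketch locally (assuming Gradio is installed):
#     build_demo().launch()
```

Keeping each capability in its own tab, with a plain function per tab, means a handler can be swapped for a real model call without touching the layout code.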

mi804 (Collaborator) commented on May 7, 2025

Thanks a lot for your contribution!

mi804 merged commit 2b6a64b into modelscope:main on May 7, 2025
2 participants