I'm QI QIN(秦θ), a Researcher at Shanghai AI Lab and incoming PhD student at the University of Sydney.
- Image Generation:
- Lumina-Image 2.0 (
) - A Unified and Efficient Image Generative Framework.
- Lumina-mGPT (
) - Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining.
- Lumina-Image 2.0 (
- Video Generation:
- Lumina-Video (
) - Efficient and Flexible Video Generation with Multi-scale Next-DiT.
- Lumina-Video (
- AutoRegressive:
- Lumina-mGPT-2.0 (
) - Stand-Alone AutoRegressive Image Modeling.
- Lumina-mGPT-2.0 (
- Diffusion Large Language Model:
- Lumina-DiMOO (
) - An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding.
- Lumina-DiMOO (
- OmniCaptioner (
) - One Captioner to Rule Them All.
π§ Contact: Feel free to drop me an email ([email protected]) if you're interested.