An innovative framework for generating manga with dynamic multi-character control, integrating diffusion-based image generation with multimodal large language models.
CVPR 2025The first comprehensive empirical benchmark of GPT-4o's image generation performance across more than 20 distinct tasks.
arXiv 2025