Efficient Text-to-Image Generation: An Adaptive Step Schedule Controller for Diffusion Models

Page view(s)

Checked on

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/22322

Title:

Efficient Text-to-Image Generation: An Adaptive Step Schedule Controller for Diffusion Models

Journal Title:

2025 IEEE International Conference on Image Processing (ICIP)

DOI:

10.1109/ICIP55913.2025.11084691

Publication URL:

https://doi.org/10.1109/icip55913.2025.11084691

Authors:

Kuluhan Binici, Cihan Acar, Shivam Aggarwal, Siying Liu, Tulika Mitra

Keywords:

Diffusion models , efficient AI

Publication Date:

18 August 2025

Citation:

Binici, K., Acar, C., Aggarwal, S., Liu, S., & Mitra, T. (2025). Efficient Text-to-Image Generation: An Adaptive Step Schedule Controller for Diffusion Models. 2025 IEEE International Conference on Image Processing (ICIP), 355–360. https://doi.org/10.1109/icip55913.2025.11084691

Abstract:

Text-to-image diffusion models often use a fixed number of denoising steps, balancing time costs and image qual- ity. However, the optimal number of steps depends on the complexity of the input text prompt. We propose an adap- tive diffusion controller that dynamically adjusts the number of steps to generate high-quality images efficiently, without additional model training. By leveraging a mixture of step schedules with varying step sizes and evaluating the error term discrepancy at each timestep, our method transitions between schedules to optimize performance. Experiments on COCO and DiffusionDB show that our approach reduces inference time while maintaining visual fidelity, offering a more efficient alternative for text-to-image diffusion models.

License type:

Publisher Copyright

Funding Info:

Student was funded by A*STAR SINGA PhD scholarship

Description:

© 2025 IEEE.  Personal use of this material is permitted.  Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

URI:

https://oar.a-star.edu.sg/communities-collections/articles/22322

ISSN:

2381-8549

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
icip2025-final-version.pdf	9.18 MB	PDF	Request a copy