Preview

Proceedings of the Southwest State University. Series: IT Management, Computer Science, Computer Engineering. Medical Equipment Engineering

Advanced search

Semantically invariant conditioning of diffusion models: a unified framework for cross-model positive prompting

https://doi.org/10.21869/2223-1536-2025-15-4-35-49

Abstract

The purpose of research is development of a universal methodology of positive industrial engineering for image generation by diffusion models based on a deep linguistic and semantic analysis of Human-AI interaction and identification of cross-model invariants.

Methods. Within the framework of this study, an interdisciplinary scientific approach was applied, combining methods of cognitive analysis and empirical verification.

Results. The results of the study confirmed the high efficiency of the proposed universal methodology of positive industrial engineering, which significantly improved the quality of image generation by diffusion models. Experimental data have shown that promptas formed according to the developed structure and lexical optimization strategies provide better compliance with the specified characteristics and more stable results across different models, while statistically significantly exceeding the quality of unstructured promptas (p < 0,01). The use of a multi-level system of components and implicit control methods has made it possible to reduce the variability of unwanted artifacts, increase the accuracy of visual characteristics, and simplify the process of creating designs, making it more predictable, reproducible, and universal for various platforms. In general, the implementation of this methodology improves human interaction with AI, increases the stability and quality of visual results, and facilitates the adaptation of products to different models and tasks.

Conclusion. The conducted research has confirmed the effectiveness of the proposed universal methodology of positive industrial engineering for image generation by diffusion models. The introduction of a structured approach and lexical optimization strategies can significantly improve the quality, stability and predictability of results, as well as reduce the number of unwanted artifacts. This approach promotes more manageable and universal human-AI interaction, making it easier to create high-quality images in various models and conditions. In the future, the use of the developed methodology can become the basis for improving the efficiency of automated visual content generation systems and expanding their practical capabilities.

About the Authors

A. A. Zotkina
Penza State Technological University
Россия

Alena A. Zotkina, Candidate of Sciences (Engineering), Associate Professor at the Department of Programming

1a/11 Baidukova Pass. / Gagarina Str., Penza 440039



A. I. Martyshkin
Penza State Technological University
Россия

Alexey I. Martyshkin, Candidate of Sciences (Engineering), Associate Professor, Head of the Department of Programming

Researcher ID: S-7452-2016

1a/11 Baidukova Pass. / Gagarina Str., Penza 440039



A. A. Pavlov
Penza State Technological University
Россия

Akim A. Pavlov, Student at the Department of Programming

1a/11 Baidukova Pass. / Gagarina Str., Penza 440039



A. V. Tkachenko
Penza State Technological University
Россия

Alexandra V. Tkachenko, Student at the Department of Programming

1a/11 Baidukova Pass. / Gagarina Str., Penza 440039



References

1. Cao C., Tan C., Gao J., et al. Opportunities and challenges of diffusion models for generative AI. National Science Review. 2024;11(12):Nwae348. https://doi.org/10.1093/nsr/nwae348

2. Yan Liu, Tao Jiang, Rui Li, et al. A state-of-the-art review of diffusion model applications for microscopic image and micro-alike image analysis. Frontiers in Medicine. 2025;12:1551894. https://doi.org/10.3389/fmed.2025.1551894

3. What is Prompt Engineering? Google Cloud. Available at: https://cloud.google.com/discover/what-is-prompt-engineering (accessed 15.09.2025).

4. Prompt engineering concepts. Amazon Bedrock. Available at: https://docs.aws.amazon.com/bedrock/latest/userguide/prompt-engineering-guidelines.html (accessed 20.09.2025).

5. Industrial engineering: creating effective queries for GigaChat. Sber Developers. Available at: https://developers.sber.ru/docs/ru/gigachat/prompts-hub/prompt-engineering (accessed 22.09.2025).

6. Stable Diffusion 3.5 Prompt Guide. Stability AI. Available at: https://stability.ai/learninghub/stable-diffusion-3-5-prompt-guide (дата обращения: 25.09.2025).

7. Chain-of-Thought Prompting. Anthropic. Available at: Available at: https://docs.anthropic.com/ru/docs/build-with-claude/prompt-engineering/chain-prompts (accessed 27.04.2025).

8. Imagen prompt guide | Gemini API. Google AI for Developers. Available at: https://ai.google.dev/gemini-api/docs/imagen-prompt-guide (accessed 30.09.2025).

9. Collection of Dall-E 3 prompting tips, issues and bugs, Simplified. OpenAI. Available at: https://community.openai.com/t/collection-of-dall-e-3-prompting-tips-issues-and-bugs-simplified/994822 (accessed 03.09.2025).

10. How to Design Prompts for Flux 1.1 Pro. Available at: https://fluxproweb.com/blog/detail/How-to-Design-Prompts-for-Flux-1-1-Pro-0a16b61b16e6/ (accessed 05.09.2025).

11. Version. Midjourney Docs. Available at: https://docs.midjourney.com/hc/en-us/articles/32199405667853-Version (accessed 07.09.2025).

12. Ideogram 3.0. Ideogram. Available at: https://about.ideogram.ai/3.0 (accessed 10.09.2025).

13. Kandinsky 3.0. AI Forever. Available at: https://ai-forever.github.io/Kandinsky-3/ (accessed 12.09.2025).

14. How to Craft Prompts for Accurate, AI-Generated Images. Recraft. Available at: https://www.recraft.ai/blog/how-to-craft-prompts-for-accurate-ai-generated-images (accessed 15.09.2025).

15. Navigating the Alignment Challenges of Diffusion Models: Insights and Innovations. Preprints.org. 2025;(1502). Available at: https://www.preprints.org/manuscript/202501.1502/v1 (accessed 18.09.2025).

16. Recraft introduces a revolutionary AI model that thinks in design language. Recraft. Available at: https://www.recraft.ai/blog/recraft-introduces-a-revolutionary-ai-model-that-thinksin-design-language (accessed 20.09.2025).

17. Ramesh A., Dhariwal P., Nichol A., et al. Hierarchical Text-Conditional Image Generation with CLIP Latents. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022. Available at: https://arxiv.org/abs/2204.06125 (accessed 18.09.2025).

18. Ho J., Jain A., Abbeel P. Denoising Diffusion Probabilistic Models. Advances in Neural Information Processing Systems (NeurIPS). 2020;33. Available at: https://proceedings.neurips.cc/paper/2020/hash/4c5bcfec8584af0d967f1ab10179ca4b-Abstract.html (accessed 11.09.2025).

19. Rombach R., Blattmann A., Lorenz D., et al. High-Resolution Image Synthesis with Latent Diffusion Models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022. Available at: https://openaccess.thecvf.com/content/CVPR2022/papers/Rombach_High-Resolution_Image_Synthesis_With_Latent_Diffusion_Models_CVPR_2022_paper.pdf (accessed 11.09.2025).


Review

For citations:


Zotkina A.A., Martyshkin A.I., Pavlov A.A., Tkachenko A.V. Semantically invariant conditioning of diffusion models: a unified framework for cross-model positive prompting. Proceedings of the Southwest State University. Series: IT Management, Computer Science, Computer Engineering. Medical Equipment Engineering. 2025;15(4):35-49. (In Russ.) https://doi.org/10.21869/2223-1536-2025-15-4-35-49

Views: 17

JATS XML


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2223-1536 (Print)