Name: Text-Driven Image Synthesis: Optimizing Prompts with ChatGPT in DF-GAN Framework
Start: 2024-08-09T12:15:00+0530
End: 2024-08-09T14:15:00+0530

Friday August 9, 2024 12:15pm - 2:15pm IST

Virtual Room C

Authors - Karan Chopra, Vatsal Mehta, Jayu Jain, Gayatri Joshi, Shanthi Therese
Abstract - This project aims to improve the quality of images generated from textual descriptions by combining DF-GAN with ChatGPT. Initially, we experimented with a GAN on the MNIST dataset and then tried StackGAN on the Oxford-102 dataset. These approaches suffered from sub-par image quality and long training times, so we moved on to DF-GAN with the CUB and COCO datasets and observed how better user prompts improve image generation. A key development in this project is the integration of ChatGPT into the backend to improve prompt quality. With ChatGPT, we can create more nuanced and contextually relevant prompts that significantly improve the expressiveness and accuracy of the generated images. The evaluation uses metrics such as sharpness and noise to assess image quality. In addition, a user-friendly interface built with Streamlit improves accessibility, allowing a wider range of users to interact with our image generation model. The project developed as a systematic analysis of different GAN architectures and dataset combinations. It provides an extensive approach for advancing text-to-image generation, with an emphasis on practical usability.

Paper Presenter

Jayu Jain

India

Virtual Room C Goa, India

Virtual Room 8C, Virtual Room C

Host Organization Global Knowledge Research Foundation

9th International Conference on ICT for Sustainable Development

