Exploring Multimodal Learning: Text Conditioned Image Generation
Authors: Priyank a, Shailesh D Kamble and Amita Dev
Publishing Date: 26-05-2024
ISBN: 978-81-955020-7-3
Abstract
Over the years, with the advancement of technologies, Artificial Intelligence has played a huge role. Text to image-based conversion has taken up the market when the user looks to make their tasks simpler and easier. With plain text commands, one may obtain an image without wasting time in searching for that image. With the use GAN (generative adversarial network) and through the intersection of Natural language processing in decoding the texts through tokens, deep learning, and Artificial intelligence and with the help of image datasets, we would be able to generate images by preprocessing the text and understanding it.
Keywords
Artificial Intelligence, Natural Language processing, Machine learning, Generative adversarial network.
Cite as
Priyank a, Shailesh D Kamble and Amita Dev, "Exploring Multimodal Learning: Text Conditioned Image Generation", In: Ashish Kumar Tripathi and Vivek Shrivastava (eds), Advancements in Communication and Systems, SCRS, India, 2024, pp. 387-396. https://doi.org/10.56155/978-81-955020-7-3-34