Beyond Text-to-Image: Multimodal Prompts to Explore Generative AI
Informations
Type:
inproceedings
Auteurs:
Liu, Vivian
Pertinence:
Faible
Référence:
Doi:
10.1145/3544549.3577043
Mots-clés:
Large Language Models, creativity support tools, multimodal, text-to-image generation, co-creative A...
Url:
https://doi.org/10.1145/3544549.3577043
Date de publication:
02/2023
Résumé:
pas trouvé le pdf
Abstract:
Text-to-image AI systems have proven to have extraordinary generative capacities that have facilitated widespread adoption. However, these systems are primarily text-based, which is a fundamental inversion of what many artists are traditionally used to: having full control over the composition of their work. Prior work has shown that there is great utility in using text prompts and that AI augmented workflows can increase momentum on creative tasks for end users. However, multimodal interactions beyond text need to be further defined, so end users can have rich points of interaction that allow them to truly co-pilot AI-generated content creation. To this end, the goal of my research is to equip creators with workflows that 1) translate abstract design goals into prompts of visual language, 2) structure exploration of design outcomes, and 3) integrate creator contributions into generations.
Références
1 articles
Titre Type Pertinence Auteurs Date Publication Références Citations Actions
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompt... inproceedings Haute Wu, Tongshuang and Terry, Michael and Cai, Carrie Jun 04/2022 0 3
Citations
0 articles
Titre Type Pertinence Auteurs Date Publication Références Citations Actions
Pas encore d'article
Mots-clés
7 mots-clés
Nom Nombre d'articles Actions
Large Language Models 6
multimodal 2
creativity support tools 1
co-creative AI 1
text-to-image generation 1
prompt engineering 1
prompting 1
Auteurs
2 auteurs
Nom Nombre d'articles Actions
Vivian 1
Liu 1