Beyond Text-to-Image: Multimodal Prompts to Explore Generative AI
Information
- Type: inproceedings
- Authors: Liu, Vivian
- Relevance: Low
- Reference:
- DOI: 10.1145/3544549.3577043
- Keywords: Large Language Models, creativity support tools, multimodal, text-to-image generation, co-creative A...
- URL: https://doi.org/10.1145/3544549.3577043
- Publication date: 02/2023
- Summary: PDF not found
- Abstract: Text-to-image AI systems have proven to have extraordinary generative capacities that have facilitated widespread adoption. However, these systems are primarily text-based, which is a fundamental inversion of what many artists are traditionally used to: having full control over the composition of their work. Prior work has shown that there is great utility in using text prompts and that AI augmented workflows can increase momentum on creative tasks for end users. However, multimodal interactions beyond text need to be further defined, so end users can have rich points of interaction that allow them to truly co-pilot AI-generated content creation. To this end, the goal of my research is to equip creators with workflows that 1) translate abstract design goals into prompts of visual language, 2) structure exploration of design outcomes, and 3) integrate creator contributions into generations.
References
1 article
Title | Type | Relevance | Authors | Publication date | References | Citations |
---|---|---|---|---|---|---|
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompt... | inproceedings | High | Wu, Tongshuang and Terry, Michael and Cai, Carrie Jun | 04/2022 | 0 | 3 |
Citations
0 articles
Title | Type | Relevance | Authors | Publication date | References | Citations |
---|---|---|---|---|---|---|
No articles yet |
Keywords