[UIST 2023] GenAssist: Making Image Generation Accessible

This is the repository for the paper:

GenAssist: Making Image Generation Accessible.

Mina Huh, Yi-Hao Peng, Amy Pavel

🏆 Best Paper Award (top 1%)

✨ Check out our website for examples and presentation!

Abstract

Blind or low vision (BLV) creators use images to communicate with sighted audience members. However, creating or retrieving images remains challenging as it is difficult for BLV creators to use existing graphical authoring tools or assess image search results. Thus, creators describe their desired images to sighted people who create or retrieve the images or limit the types of images they create. While text-to-image generation models could let creators generate multiple high-fidelity images based on a text description (i.e. prompt), it is difficult to assess the content and quality of the generated image. We propose GenAssist, a system to make text-to-image generation accessible. Using our interface, creators can verify whether generated image candidates followed their prompt, access additional details added to the image but not specified in the prompt, and skim a summary of similarities and differences between image candidates. To power the interface, GenAssist uses a large language model to generate visual questions, vision-language models to extract answers, and a large language model to summarize the results. Our study with 12 BLV creators demonstrated that GenAssist enables and simplifies the process of image selection and generation, making visual authoring more user-friendly and accessible to all.

Citation and contact

If you find our work helpful, please cite us as

@inproceedings{huh2023genassist,
  title={GenAssist: Making image generation accessible},
  author={Huh, Mina and Peng, Yi-Hao and Pavel, Amy},
  booktitle={Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology},
  pages={1--17},
  year={2023}
}

📧 Please contact Mina Huh at minahuh[at]cs.utexas.edu if you have any questions or suggestions.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.github/workflows		.github/workflows
static		static
.DS_Store		.DS_Store
.nojekyll		.nojekyll
README.md		README.md
demo.html		demo.html
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[UIST 2023] GenAssist: Making Image Generation Accessible

Abstract

Citation and contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

[UIST 2023] GenAssist: Making Image Generation Accessible

Abstract

Citation and contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages