Toward a Structured Theoretical Framework for the Evaluation of Generative AI-based Visualizations

dc.contributor.authorPodo, Lucaen_US
dc.contributor.authorIshmal, Muhammaden_US
dc.contributor.authorAngelini, Marcoen_US
dc.contributor.editorEl-Assady, Mennatallahen_US
dc.contributor.editorSchulz, Hans-Jörgen_US
dc.date.accessioned2024-05-21T08:30:27Z
dc.date.available2024-05-21T08:30:27Z
dc.date.issued2024
dc.description.abstractThe automatic generation of visualizations is an old task that, through the years, has shown more and more interest from the research and practitioner communities. Recently, large language models (LLM) have become an interesting option for supporting generative tasks related to visualization, demonstrating initial promising results. At the same time, several pitfalls, like the multiple ways of instructing an LLM to generate the desired result, the different perspectives leading the generation (code-based, image-based, grammar-based), and the presence of hallucinations even for the visualization generation task, make their usage less affordable than expected. Following similar initiatives for benchmarking LLMs, this paper explores the problem of modeling the evaluation of a generated visualization through an LLM. We propose a theoretical evaluation stack, EvaLLM, that decomposes the evaluation effort in its atomic components, characterizes their nature, and provides an overview of how to implement them. One use case on the Llama2-70-b model shows the benefits of EvaLLM and illustrates interesting results on the current state-of-the-art LLM-generated visualizations. The materials are available at this GitHub repository: https://github.com/lucapodo/evallm_llama2_70b.giten_US
dc.description.sectionheadersVisual Analytics Applications and Systems
dc.description.seriesinformationEuroVis Workshop on Visual Analytics (EuroVA)
dc.identifier.doi10.2312/eurova.20241118
dc.identifier.isbn978-3-03868-253-0
dc.identifier.pages6 pages
dc.identifier.urihttps://doi.org/10.2312/eurova.20241118
dc.identifier.urihttps://diglib.eg.org/handle/10.2312/eurova20241118
dc.publisherThe Eurographics Associationen_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCCS Concepts: Human-centered computing→Visualization design and evaluation methods
dc.subjectHuman centered computing→Visualization design and evaluation methods
dc.titleToward a Structured Theoretical Framework for the Evaluation of Generative AI-based Visualizationsen_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
12_eurova20241118.pdf
Size:
2.81 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
2004-i7.mp4
Size:
5.26 MB
Format:
Video MP4
Collections