Toward a Structured Theoretical Framework for the Evaluation of Generative AI-based Visualizations

Podo, Luca; Ishmal, Muhammad; Angelini, Marco

Toward a Structured Theoretical Framework for the Evaluation of Generative AI-based Visualizations

dc.contributor.author	Podo, Luca	en_US
dc.contributor.author	Ishmal, Muhammad	en_US
dc.contributor.author	Angelini, Marco	en_US
dc.contributor.editor	El-Assady, Mennatallah	en_US
dc.contributor.editor	Schulz, Hans-Jörg	en_US
dc.date.accessioned	2024-05-21T08:30:27Z
dc.date.available	2024-05-21T08:30:27Z
dc.date.issued	2024
dc.description.abstract	The automatic generation of visualizations is an old task that, through the years, has shown more and more interest from the research and practitioner communities. Recently, large language models (LLM) have become an interesting option for supporting generative tasks related to visualization, demonstrating initial promising results. At the same time, several pitfalls, like the multiple ways of instructing an LLM to generate the desired result, the different perspectives leading the generation (code-based, image-based, grammar-based), and the presence of hallucinations even for the visualization generation task, make their usage less affordable than expected. Following similar initiatives for benchmarking LLMs, this paper explores the problem of modeling the evaluation of a generated visualization through an LLM. We propose a theoretical evaluation stack, EvaLLM, that decomposes the evaluation effort in its atomic components, characterizes their nature, and provides an overview of how to implement them. One use case on the Llama2-70-b model shows the benefits of EvaLLM and illustrates interesting results on the current state-of-the-art LLM-generated visualizations. The materials are available at this GitHub repository: https://github.com/lucapodo/evallm_llama2_70b.git	en_US
dc.description.sectionheaders	Visual Analytics Applications and Systems
dc.description.seriesinformation	EuroVis Workshop on Visual Analytics (EuroVA)
dc.identifier.doi	10.2312/eurova.20241118
dc.identifier.isbn	978-3-03868-253-0
dc.identifier.pages	6 pages
dc.identifier.uri	https://doi.org/10.2312/eurova.20241118
dc.identifier.uri	https://diglib.eg.org/handle/10.2312/eurova20241118
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Human-centered computing→Visualization design and evaluation methods
dc.subject	Human centered computing→Visualization design and evaluation methods
dc.title	Toward a Structured Theoretical Framework for the Evaluation of Generative AI-based Visualizations	en_US

Files

Original bundle

Now showing 1 - 2 of 2

Name:: 12_eurova20241118.pdf
Size:: 2.81 MB
Format:: Adobe Portable Document Format

Download

Name:: 2004-i7.mp4
Size:: 5.26 MB
Format:: Video MP4

Download

Collections

EuroVA2024