WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval

Xiao, Shishi; Hou, Yihan; Jin, Cheng; Zeng, Wei

dc.contributor.author	Xiao, Shishi	en_US
dc.contributor.author	Hou, Yihan	en_US
dc.contributor.author	Jin, Cheng	en_US
dc.contributor.author	Zeng, Wei	en_US
dc.contributor.editor	Bujack, Roxana	en_US
dc.contributor.editor	Archambault, Daniel	en_US
dc.contributor.editor	Schreck, Tobias	en_US
dc.date.accessioned	2023-06-10T06:17:02Z
dc.date.available	2023-06-10T06:17:02Z
dc.date.issued	2023
dc.description.abstract	Retrieving charts from a large corpus is a fundamental task that can benefit numerous applications such as visualization recommendations. The retrieved results are expected to conform to both explicit visual attributes (e.g., chart type, colormap) and implicit user intents (e.g., design style, context information) that vary across application scenarios. However, existing example-based chart retrieval methods are built upon non-decoupled and low-level visual features that are hard to interpret, while definition-based ones are constrained to pre-defined attributes that are hard to extend. In this work, we propose a new framework, namely WYTIWYR (What-You-Think-Is-What-You-Retrieve), that integrates user intents into the chart retrieval process. The framework consists of two stages: first, the Annotation stage disentangles the visual attributes within the query chart; and second, the Retrieval stage embeds the user's intent with a customized text prompt as well as a bitmap query chart, to recall targeted retrieval results. We develop a prototype WYTIWYR system leveraging a contrastive language-image pre-training (CLIP) model to achieve zero-shot classification as well as multi-modal input encoding, and test the prototype on a large corpus with charts crawled from the Internet. Quantitative experiments, case studies, and qualitative interviews are conducted. The results demonstrate the usability and effectiveness of our proposed framework.	en_US
dc.description.number	3
dc.description.sectionheaders	Interaction and Accessibility
dc.description.seriesinformation	Computer Graphics Forum
dc.description.volume	42
dc.identifier.doi	10.1111/cgf.14832
dc.identifier.issn	1467-8659
dc.identifier.pages	311-322
dc.identifier.pages	12 pages
dc.identifier.uri	https://doi.org/10.1111/cgf.14832
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14832
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Human-centered computing -> Visualization; Information systems -> Query intent; Computing methodologies -> Artificial intelligence
dc.subject	Human centered computing
dc.subject	Visualization
dc.subject	Information systems
dc.subject	Query intent
dc.subject	Computing methodologies
dc.subject	Artificial intelligence
dc.title	WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval	en_US

Files

Original bundle

Name: v42i3pp311-322_cgf14832.pdf
Size: 2.8 MB
Format: Adobe Portable Document Format

Name: 1068-file-i7.pdf
Size: 1.79 MB
Format: Adobe Portable Document Format

Collections

42-Issue 3
EuroVis23: Eurographics Conference on Visualization