Ferret: Reviewing Tabular Datasets for Manipulation
dc.contributor.author | Lange, Devin | en_US |
dc.contributor.author | Sahai, Shaurya | en_US |
dc.contributor.author | Phillips, Jeff M. | en_US |
dc.contributor.author | Lex, Alexander | en_US |
dc.contributor.editor | Bujack, Roxana | en_US |
dc.contributor.editor | Archambault, Daniel | en_US |
dc.contributor.editor | Schreck, Tobias | en_US |
dc.date.accessioned | 2023-06-10T06:16:44Z | |
dc.date.available | 2023-06-10T06:16:44Z | |
dc.date.issued | 2023 | |
dc.description.abstract | How do we ensure the veracity of science? The act of manipulating or fabricating scientific data has led to many high-profile fraud cases and retractions. Detecting manipulated data, however, is a challenging and time-consuming endeavor. Automated detection methods are limited due to the diversity of data types and manipulation techniques. Furthermore, patterns automatically flagged as suspicious can have reasonable explanations. Instead, we propose a nuanced approach where experts analyze tabular datasets, e.g., as part of the peer-review process, using a guided, interactive visualization approach. In this paper, we present an analysis of how manipulated datasets are created and the artifacts these techniques generate. Based on these findings, we propose a suite of visualization methods to surface potential irregularities. We have implemented these methods in Ferret, a visualization tool for data forensics work. Ferret makes potential data issues salient and provides guidance on spotting signs of tampering and differentiating them from truthful data. | en_US |
dc.description.number | 3 | |
dc.description.sectionheaders | Visual Analysis and Processes | |
dc.description.seriesinformation | Computer Graphics Forum | |
dc.description.volume | 42 | |
dc.identifier.doi | 10.1111/cgf.14822 | |
dc.identifier.issn | 1467-8659 | |
dc.identifier.pages | 187-198 | |
dc.identifier.pages | 12 pages | |
dc.identifier.uri | https://doi.org/10.1111/cgf.14822 | |
dc.identifier.uri | https://diglib.eg.org:443/handle/10.1111/cgf14822 | |
dc.publisher | The Eurographics Association and John Wiley & Sons Ltd. | en_US |
dc.subject | CCS Concepts: Human-centered computing -> Information visualization; Human computer interaction (HCI) | |
dc.subject | Human centered computing | |
dc.subject | Information visualization | |
dc.subject | Human computer interaction (HCI) | |
dc.title | Ferret: Reviewing Tabular Datasets for Manipulation | en_US |