Comparative Exploration of Document Collections: a Visual Analytics Approach
dc.contributor.author | Oelke, Daniela | en_US |
dc.contributor.author | Strobelt, Hendrik | en_US |
dc.contributor.author | Rohrdantz, Christian | en_US |
dc.contributor.author | Gurevych, Iryna | en_US |
dc.contributor.author | Deussen, Oliver | en_US |
dc.contributor.editor | H. Carr, P. Rheingans, and H. Schumann | en_US |
dc.date.accessioned | 2015-03-03T12:34:46Z | |
dc.date.available | 2015-03-03T12:34:46Z | |
dc.date.issued | 2014 | en_US |
dc.description.abstract | We present an analysis and visualization method for computing what distinguishes a given document collection from others. We determine topics that discriminate a subset of collections from the remaining ones by applying probabilistic topic modeling and subsequently approximating the two relevant criteria distinctiveness and characteristicness algorithmically through a set of heuristics. Furthermore, we suggest a novel visualization method called DiTop-View, in which topics are represented by glyphs (topic coins) that are arranged on a 2D plane. Topic coins are designed to encode all information necessary for performing comparative analyses such as the class membership of a topic, its most probable terms and the discriminative relations. We evaluate our topic analysis using statistical measures and a small user experiment and present an expert case study with researchers from political sciences analyzing two real-world datasets. | en_US |
dc.description.seriesinformation | Computer Graphics Forum | en_US |
dc.identifier.doi | 10.1111/cgf.12376 | en_US |
dc.identifier.issn | 1467-8659 | en_US |
dc.identifier.uri | https://doi.org/10.1111/cgf.12376 | en_US |
dc.publisher | The Eurographics Association and John Wiley and Sons Ltd. | en_US |
dc.title | Comparative Exploration of Document Collections: a Visual Analytics Approach | en_US |