Visible to the public VarifocalReader #x2014; In-Depth Visual Analysis of Large Text Documents

TitleVarifocalReader #x2014; In-Depth Visual Analysis of Large Text Documents
Publication TypeJournal Article
Year of Publication2014
AuthorsKoch, S., John, M., Worner, M., Muller, A., Ertl, T.
JournalVisualization and Computer Graphics, IEEE Transactions on
Volume20
Pagination1723-1732
Date PublishedDec
ISSN1077-2626
Keywordsdata mining, data visualisation, Data visualization, distant reading, document analysis, document handling, focus-context techniques, in-depth visual analysis, interactive systems, intermediate text levels, learning (artificial intelligence), literary analysis, machine learning, machine learning techniques, natural language processing, Navigation, Tag clouds, text analysis, text documents, text mining, varifocalreader, visual abstraction, visual analytics
Abstract

Interactive visualization provides valuable support for exploring, analyzing, and understanding textual documents. Certain tasks, however, require that insights derived from visual abstractions are verified by a human expert perusing the source text. So far, this problem is typically solved by offering overview-detail techniques, which present different views with different levels of abstractions. This often leads to problems with visual continuity. Focus-context techniques, on the other hand, succeed in accentuating interesting subsections of large text documents but are normally not suited for integrating visual abstractions. With VarifocalReader we present a technique that helps to solve some of these approaches' problems by combining characteristics from both. In particular, our method simplifies working with large and potentially complex text documents by simultaneously offering abstract representations of varying detail, based on the inherent structure of the document, and access to the text itself. In addition, VarifocalReader supports intra-document exploration through advanced navigation concepts and facilitates visual analysis tasks. The approach enables users to apply machine learning techniques and search mechanisms as well as to assess and adapt these techniques. This helps to extract entities, concepts and other artifacts from texts. In combination with the automatic generation of intermediate text levels through topic segmentation for thematic orientation, users can test hypotheses or develop interesting new research questions. To illustrate the advantages of our approach, we provide usage examples from literature studies.

DOI10.1109/TVCG.2014.2346677
Citation Key6875959