home > publications >
  publications











Listing only

UpLib and the future of digital personal libraries
We examine one possible future for the library of a professional knowledge worker by examining the UpLib personal digital library system. The system consists of a full-text indexed repository accessed through an active agent via a Web interface. It is suitable for personal collections comprising tens of thousands of documents (including papers, books, photos, receipts, email, etc.), and provides for ease of document entry and access as well as high levels of security and privacy. Unlike many other systems of the sort, user access to the document collection is assured even if the UpLib system is unavailable. It is 'universal' in the sense that documents are canonically represented as projections into the text and image domains, and uses a predominantly visual user interface based on page images. UpLib can thus handle any document format which can be rendered as pages. Provision is made for alternative representations existing alongside the text-domain and image-domain representation, either stored or generated on demand. The system is highly extensible through user scripting, and is intended to be used as a platform for further work in document engineering.
Janssen, W. (2006).
Electronic Imaging 2006.
Collaborative Intelligence Analysis with CACHE and its Effects on Information Gathering and Cognitive Bias
CACHE is a system supporting collaborative analysis tasks. We describe CACHE and a study of the effects of its use on reducing confirmation bias in analysis—the tendency to seek information that confirms rather than refutes an initial belief. We compare collaborative vs individual (Nominal) use of CACHE, as well as groups made of members with diverse (Heterogeneous) beliefs vs Homogeneous beliefs. We found reduction in confirmation bias for Heterogeneous collaborative groups using CACHE and working alone, but Homogeneous groups accentuated initial biases, rather than debiasing. Benefits of CACHE, future extensions, and our perspective on design are discussed.
Billman, D., Convertino, G., Shrager, J., Massar, J. P. and Pirolli, P. (2006).
Human Computer Interaction Consortium Workshop, Snow Mountain, CO. [PDF]
Entity workspace: an evidence file that aids memory, inference, and reading
An intelligence analyst often needs to keep track of more facts than can be held in human memory. As a result, analysts use a notebook or evidence file to record facts learned so far. In practice, the evidence file is often an electronic document into which text snippets and hand-typed notes are placed. While this kind of evidence file is easy to read and edit, it provides little help for making sense of the captured information. We describe Entity Workspace, a tool designed to be used in place of a traditional evidence file. Entity Workspace combines user interface and entity extraction technologies to build up an explicit model of important entities (people, places, organizations, phone numbers, etc.) and their relationships. Using this model, it helps the analyst find and re-find facts rapidly, notice connections between entities, and identify good documents and entities to explore next.
Bier, E., Ishak, E. and Chi, E. H. (2006).
Intelligence and Security Informatics 2006. [PDF]
Social information foraging and collaborative search
In this short paper, we summarized recent work in the understanding of social information foraging, and models of how users socially foraging with diverse hints. In communities of practice that depend on foraging in overly rich information environments, there appears to be pressure to self-organize into a balance of some division of labor, plus some degree of cooperation. One way to bridge between different communities of users is to diversify their information sources. We have been examining the possibility of using a collaborative search engine to achieve this effect.
Chi, E. H. and Pirolli, P. (2006).
HCIC Workshop. [PDF]
Generality of Transfer of Navigation Knowledge from Degree-of-Interest Trees
An experiment was conducted to assess the generality of navigation knowledge learned from interaction with the Degree-of-Interest (DOI) Tree. The degree of transfer of learning from the DOI Tree was compared to transfer of learning from a standard Internet Explorer browser. Twenty participants worked with either the DOI or Explorer browser in a Training Phase in which they performed information-finding or comparison tasks requiring navigation through a widely used Web directory. Degree of transfer was measured by effects on task completion time in a Transfer Phase in situations with varying degrees of similarity to the original training conditions. An analysis based on percent transfer scores indicated that participants trained on navigation of an information structure using the DOI showed superior transfer across a range of conditions as compared to participants trained using the Explorer.
Pirolli, P. and Gumbrecht, M. (2006).
CHI 2006. [PDF]
Turning Pages of 3D Electronic Books
Taking the form of physical books, virtual 3D books can be used as basic components of e-book systems, information workspaces, and digital libraries. This paper describes the page turning design of 3Book, a 3D book system that we recently developed. Our design aims to find a sensible balance among important factors such as visual realism, readability, interactivity, and scalability. To convey the impression of reading or viewing an actual physical book, we model all the faces of the book and synchronize the movements of various portions of the book during page turning. Our design delivers a seamless transition between two states of the book (i.e., when it is lying still and when it is turning pages). In addition, we deform the turning pages around an imaginary cone of changing sizes to produce realistically-looking curved pages.
Hong, L., Card, S. K. and Chen, J. (2006).
IEEE Symposium on 3D user interfaces. [DOC]
Entity quick click-rapid text copying based on automatic entity extractions
Retyping text phrases can be time consuming. As a result, techniques for copying text from one software application to another, such as copy-and-paste and drag-and-drop are now commonplace. However, even these techniques can be too slow in situations where many phrases need to be copied. In the special case where the phrases to be copied represent syntactically identifiable entities, such as person names, company names, telephone numbers, or street addresses, much faster phrase copying is possible. We describe entity quick click, an approach that reduces both the amount of cursor travel and the number of button presses needed to copy a phrase.
Bier, E., Ishak, E. and Chi, E. H. (2006).
CHI 2006. [PDF]
Navigation in Degree of Interest Trees
We present an experiment that compares how people perform search tasks in a degree-of-interest browser and in a Windows-Explorer-like browser. Our results show that, whereas users do attend to more information in the DOI browser, they do not complete the task faster than in an Explorer-like browser. However, in both types of browser, users are faster to complete high information scent search tasks than low information scent tasks. We present an ACT-R computational model of the search task in the DOI browser. The model describes how a visual search strategy may combine with semantic aspects of processing, as captured by information scent. We also describe a way of automatically estimating information scent in an ontological hierarchy by querying a large corpus (in our case, Google's corpus).
Budiu, R., Pirolli, P. and Fleetwood, M. (2006).
AVU 2006. [PDF]
Modeling Informattion Scent: A Comparison of LSA, PMI-IR, and GLSA Similarity Measures on Common Test and Corpora
In this paper we describe a comparison among three systems that estimate semantic similarity between words: Latent Semantic Analysis [6], Pointwise Mutual Information [17], and Generalized Latent Semantic Analysis [8]. We compare all these techniques on a unique corpus (TASA) and, for PMI and GLSA, we also report performance on a different web-based corpus. The evaluation is carried out through two kinds of tests: (1) synonymy tests, and (2) comparison with human word similarity judgments.
Budiu, R., Royer, C. and Pirolli, P. (2006).
CHI 2006. [PDF]
Document representation with Generalized Latent Semantic Analys
Methods for dimensionality reduction, notably LSI, have been successfully applied to the information retrieval task and document classification on small document collections. Since they involve a computation of the eigenvalue or singular value decomposition of a document-term matrix, their use for large real world applications is somewhat limited. In addition to it, the information about the term similarity that these methods can use is inferred only from the current document collection. We present an algorithm that computes a low dimensional vector space representation of documents using point-wise mutual information as a term similarity measure. Point-wise term similarity can be computed using any additional resources, such as the Web. Our method uses the term by term matrix and can therefore be applied to large document collections. Experimental results show a considerable performance improvement on the information retrieval tasks.
Matveeva, I., Farahat, A. and Royer, C. (2005).
Conference 0n Research and Development in Information Retrieval (SIGIR 2005). [PDF]
Document representation with Generalized Latent Semantic Analysis
Methods for dimensionality reduction, notably LSA, have been successfully applied to the information retrieval task and document classification. Recently, corpus-based association measures such as point-wise mutual information have been found to outperform LSA on a variety of tasks. We have developed an algorithmic framework that computes a low-dimensional vector space representation of documents combining different measures of association with different dimensionality reduction techniques. Experimental results show a competitive performance on the synonymy and text classification tasks.
Matveeva, I., Farahat, A. and Royer, C. (2005).
16th European Conference on Machine Learning. [PDF]
Terms and document representation with generalized latent semantic analysis
Document indexing and representation of term-document relations are very important issues for document clustering and retrieval. In this paper, we present Generalized Latent Semantic Analysis as a framework for computing semantically motivated term and document vectors. Our focus on term vectors is motivated by recent success of co-occurrence based measures of semantic similarity obtained from very large corpora. Our experiments demonstrate that GLSA term vectors efficiently capture semantic relations between terms and outperform related approaches on the synonymy test. We also show that termbased document representation improves performance on the document classification test.
Royer, C., Matveeva, I. and Farahat, A. (2005).
EMNLP. [PDF]
Term representation with generalized latent semantic analysis
Document indexing and representation of term document relations are very important issues for document clustering and retrieval. In this paper, we present Generalized Latent Semantic Analysis as a framework for computing semantically motivated term and document vectors. Our focus on term vectors is motivated by recent success of co-occurrence based measures of semantic similarity obtained from very large corpora. Our experiments demonstrate that GLSA term vectors efficiently capture semantic relations between terms and outperform related approaches on the synonymy test.
Farahat, A., Matveeva, I. and Royer, C. (2005).
RANLP (Recent Advances in Natural Language Processing). [PDF]
Making UpLib useful: personal document engineering
Any new system must provide significant advantages to users for them to adopt it over their existing practices. In this paper, we discuss changes made over the last two years of use of the UpLib personal digital library system, to provide those advantages in the realm of document management. These changes are concentrated in the document acquisition phase, where document analysis is performed and databases of document information are prepared. However, some changes have been made in the areas of document management and document usage, primarily to allow user better interaction with the improved document projections.
Janssen, W., Breidenbach, J., Good, L. and Popat, A. (2005).
ACM Symposium on Document Engineering, 2005. [PDF]
ReadUp: A widget for reading
User interfaces for digital library systems must support a wide range of user activities. They include search, browsing, and curation, but perhaps the most important is actual reading of the items in the library. Support for reading, however, is usually relegated to applications which are only loosely integrated with the digital library system. One reason for this is the absence of toolkit widget support for the activity of reading. Most user interface toolkits instead provide support for either text editing or text presentation. This makes it difficult to write applications which support reading well. In this paper we describe the origins, design, and implementation of a new Java Swing toolkit widget called ReadUp, which provides support for reading page images in a digital library application, and discuss briefly how it is being used.
Janssen, W. (2005).
2005 European Conference on Digital Libraries (ECDL 2005). [PDF]
Animated dynamic highlighting for reading
A digital library system must support a sweep of document activities for its users, including acquisition, organization, retrieval, and actual reading. In this paper, we discuss how these activities are supported at the base levels of the UpLib personal digital library system. We report on two new features of UpLib which augment previous capabilities. The UpLib Portal is a tool for document acquisition and subsequent retrieval. The ReadUp document reader is a component specifically designed to support reading with annotation and search.
Gurevich, O., Janssen, W. and Karttunen, L. (2005).
ACM/IEEE 2005 Joint Conference on Digital Libraries. [PDF]
Chapter 2. The science of analytical reasoning
Literature review and research roadmap for Department of Homeland Security National Visual Analytics program.
Card, S. K. (2005).
Illuminating the Path: the Research and Development Agenda for Visual Analytics. [PDF]
UC: A fluid interface for personal digital libraries
An advanced visual interface system is presented for fluid interaction in a personal digital library system. The system employs a zoomable planar representation of a collection using hybrid continuous/quantum treemap visualizations to facilitate navigation while minimizing cognitive load. By providing both fluidity and a means of reading documents within the same visualization, the system obliterates the traditional boundary separating the acquisition of materials from their use. In addition, the system provides a means of streamlining and largely automating the addition of new documents into a collection. The system is particularly well suited to user tasks which, in the physical world, are normally carried out by laying out a set of related documents on a physical desk — namely, those tasks that require frequent and rapid transfer of attention from one document in the collection to another. Discussed are the design and implementation of the system as well as its relationship to previous work.
Good, L., Popat, A., Janssen, W. and Bier, E. (2005).
Joint Conference on Digital Libraries. [PDF]
Visual foraging of highlighted text: an eye-tracking study
The wide availability of digital reading material online is causing a major shift in everyday reading activities. Readers are increasingly skimming instead of reading in depth. Highlights are increasingly used in digital interfaces to direct attention toward relevant passages within texts. In this paper, we study the eye-tracking behavior of subjects using both keyword highlighting and a new highlighting technique called ScentHighlights, introduced recently [7]. In this first eye-tracking study of highlighting interfaces, we show that there is direct evidence of the von Restorff isolation effect [21] in the eye-tracking data, in that subjects performed better when a fact is isolated (highlighted) against a homogeneous background. Users with the ScentHighlights condition paid more attention to highlighted areas and are more accurate than with other interfaces. In addition to confirming the von Restorff effect, we found that there is great variation in subject differences in reading strategies among subjects, even in the presence of strong cues such as highlights. Some readers scan for highly profitable regions first, while others read sequentially despite the presence of strong highlight cues. The results point to future design possibilities in highlighting interfaces.
Chi, E. H., Gumbrecht, M. and Hong, L. (2005).
CHI2006. [PDF]
SNIF-ACT: A Model of User Navigation on the World Wide Web
SNIF-ACT (Scent-based Navigation and Information Foraging in the ACT Cognitive architecture) is developed to explain navigation behavior on the World Wide Web when users are engaged in unfamiliar information-seeking tasks. SNIF-ACT makes navigation choices based on assessments of activation that spreads to the users' information goal from proximal cues (information scent) that occur in Web browsers. SNIF-ACT 1.0 was matched to the detailed protocol analyses from the N = 8 participants studied in Card, S., Pirolli, P., Van Der Wege, M., Morrison, J., Reeder, R., Schraedley, P., & Boshart, J. (2001). Information scent as a driver of Web Behavior Graphs: Results of a protocol analysis method for web usability. CHI 2001, ACM Conference on Human Factors in Computing Systems, CHI Letters, 3(1), 498-505. SNIF-ACT 1.0 was found to provide good fits to participants' choice of links to follow, and to their choice of when to leave a current patch of information. SNIF-ACT 2.0 incorporates several refinements based on a rational analysis of link selection on a web page. These refinements include a satisficing assessment of link selection from a Web page and an adaptive stopping rule that decides when to leave a Web page based on past and current experiences with a Web site. SNIF-ACT 2.0 was validated on a data set obtained from N = 244 participants studied by Chi, E. H., Rosien, A., Suppattanasiri, G., Williams, A., Royer, C., Chow, C., Robles, E., Dalal, B., Chen, J., & Cousins, S. (2003). The Bloodhound Project: Automating discovery of Web usability issues using the InfoScent simulator. CHI 2003, ACM Conference on Human Factors in Computing Systems, CHI Letters, 5(1), 505-512. Monte Carlo simulations of the model provided good fits to the Web navigation data. Theoretical and practical implications of the model are discussed.
Fu, W.-T. and Pirolli, P. (2005).
Journal Cognitive Science. [DOC]
ScentHighlights: highlighting conceptually-related sentences during reading
No Abstract Available
Chi, E. H., Hong, L., Gumbrecht, M. and Card, S. K. (2005).
Proceedings of the 10th International Conference on Intelligent User Interfaces.
Introducing Wearable Force Sensors in Martial Arts
The development of sports throughout history has been essential to our health and keeping us strong physically, mentally, and spiritually. Sport, as a complex human endeavor, promotes unity and brings the world together through the Olympic Games. As an intrinsic part of our social fabric, the various sports in the Olympics are fraught with rules and regulations, and, of course, controversy. The acceptance of technology in sports in the recent past is littered with controversy. Technological developments in sports equipment must therefore take into account of social acceptance issues, such as player perception of fairness and rule changes. In this article, we examine this issue from our experience of introducing a novel wearable sensor system for martial arts.
Chi, E. H. (2005).
IEEE Pervasive Computing no 3.
Pervasive Computing in Sports Technologies: Guest Editor's Introduction
No Abstract Available
Chi, E. H., Borriello, B., Hunt, G. and Davies, N. (2005).
IEEE Pervasive Computing No. 4.
Annotating 3D Electronic Books
No Abstract Available
Hong, L., Chi, E. H. and Card, S. K. (2005).
Proceedings of the Human Factors in Computing Systems Conference (CHI2005) Conference Companion. [PDF]
Information Scent and Web Navigation: Theory, Models, and Automated Usability Evaluation
Within a more Information Foraging Theory, we have developed a rational analysis of Web use, which has shaped a cognitive model of Web navigation called SNIF-ACT. An automated and practical method for initializing the model with requisite knowledge of information scent was developed based on Pointwise Mutual Information (PMI) computations from a local document corpus with a Web back-off. An automated Web usability tool called Bloodhound was developed that implements an algorithm that approximates the operation of the cognitive model. We report on succesful empirical tests of the SNIF-ACT cognitive mode, the PMI method, and Bloodhound.
Pirolli, P., Chi, E. H. and Farahat, A. (2005).
HCI International. [DOC]
The Theory of Information Scent.
No Abstract Available
Pirolli, P. (2005).
Next Wave: The Research Associate Directorate technical trends publications.
Cognitive Models of Human-Information Interaction
No Abstract Available
Pirolli, P. (2005).
Handbook of Applied Cognition.
Icon Abacus and Ghost Icons
We present two techniques that make document collection visualizations more informative. Icon abacus uses the horizontal position of icon groups to communicate document attributes. Ghost icons show linked documents by adding temporary icons and by highlighting or dimming existing ones.
Bier, E. and Perer, A. (2005).
JCDL 2005:Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries. [PDF]
Icon Abacus: Positional Display of Document Attributes
This paper presents icon abacus, a space-efficient technique for displaying document attributes by automatic positioning of document icons. It displays the value of an attribute by using position on a single axis, allowing the other axis to display different metadata simultaneously The layout is stable enough to support navigation using spatial memory.
Bier, E. and Perer, A. (2005).
JCDL 2005:Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries. [PDF]
Negation in non-literal sentences
We investigate how people process negation in semantically distorted and metaphoric sentences. We present three experiments in which participants judged the truth of a_rmative and negative sentences that were either literal or contained semantic illusions (Erickson & Mattson, 1981) or metaphors. In all experiments, negation increases processing times; although for semantic illusions, negation preserves the ordering of judgment times for literal and nonliteral sentences, for metaphors this ordering is reversed, with nonliteral negatives taking less time than literal negatives. This result presents evidence against the traditional Clark and Chase (1972) model of negation. We propose the negation-as distortion model and discuss how this model can explain the data.
Budiu, R. and Anderson, J. (2005).
Proceedings of the 27th Annual Meeting of the Cognitive Science Society. [PDF]
GLSA Server @ PARC
No Abstract Available
Royer, C., Farahat, A., Pirolli, P. and Budiu, R. (2005).
Twelfth Annual ACT-R Workshop. [PPT]
Navigation in DOI Trees
No Abstract Available
Budiu, R. and Pirolli, P. (2005).
Twelfth Annual ACT-R Workshop. [PDF]
Fluid Interfaces for Personal Digital Libraries
An advanced visual interface system is presented for fluid interaction in a personal digital library system. The system employs a zoomable planar representation of a collection using hybrid continuous/quantum treemap visualizations to facilitate navigation while minimizing cognitive load. By providing both fluidity and a means of reading documents within the same visualization, the system obliterates the traditional boundary separating the acquisition of materials from their use. In addition, the system provides a means of streamlining and largely automating the addition of new documents into a collection. The system is particularly well suited to user tasks which, in the physical world, are normally carried out by laying out a set of related documents on a physical desk - namely, those tasks that require frequent and rapid transfer of attention from one document in the collection to another. Discussed are the design and implementation of the system as well as its relationship to previous work.
Good, L., Popat, A., Janssen, W. and Bier, E. (2005).
9th European Conference on Research and Advanced Technology for Digital Libraries. [PDF]
Issues in Construction of Document Thumbnails for Page-Image Digital Libraries
Digital libraries are increasingly based on digital page images of documents, but techniques for constructing usable versions of these page images are still largely folklore. This paper documents some issues encountered in creating document icons, page thumbnails, and page images for the UpLib digital library system, and suggests answers for each of them, based on both problem analysis and user feedback. In particular, several algorithms for determining useful sizes of both document icons and large page images are discussed.
Janssen, W. (2004).
ACM European Conference on Digital Libraries. [PDF]
Collaboration Mechanisms for the UpLib System
The UpLib personal digital library system is specifically designed for secure use by a single individual, instead of collaborative use by a number of people. However, collaborative operation of multiple UpLib repositories is still possible. This paper describes two mechanisms that have been added to UpLib to facilitate community building around individual document collections.
Janssen, W. (2004).
ACM/IEEE Joint Conference on Digital Libraries.
The new sensemakers: what's the next thing beyond search? sensemaking
Everyone engages in sense-making whether searching the Web, wading through boxes of documents or analyzing volumes of message traffic. Computers have a role to play, not just in processing ever-increasing amounts of information faster, but in augmenting our abilities to make sense of it all. Cognitive task analysis is the key to understanding sense-making activities and finding the technological leverage points.
Stefik, M. (2004).
Innovation Pipeline. [DOC]
From information visualization to sensemaking: connecting the mind's eye to the mind's muscle
Looking back on the tenth anniversary of the first Symposium on Information Visualization, much has been accomplished. The computer’s power has been exploited to give quick visual form to abstract data, to interact, and to warp detail to follow the user’s changing interest. Moreover, the design space of visualizations has been systematized with reference models, taxonomies, and monographs relating visualization to perceptual and graphical constraints. Looking ahead, however, I will argue that the era of pure information visualization is over. The path ahead depends on giving much more attention to the purposes of visualization and its use. Leaving aside communication, the purpose of information visualization is insight, or more particularly, a larger process that might be called sensemaking. I will sketch out the nature of sensemaking, exemplify it empirically in a practical, urgent setting, and suggest how theories of sensemaking could be developed. I will then describe systems that subsume information visualization as part of an emerging class of sensemaking systems combining visualization (the mind’s eye) with semantic content analysis and sensemaking operations (the mind’s muscle). Not surprisingly, a focus on sensemaking is a good generator of new visualizations. But these developments also suggest that it may be time for the information visualization field to alter its boundaries to go beyond the merely visual.
Card, S. K. (2004).
Information Visualization 2004. [PPT]
DOITrees revisited: scalable, space-constrained visualization of hierarchical data
This paper extends previous work on focus+context visualizations of tree-structured data, introducing an efficient, space-constrained, multi-focal tree layout algorithm (“TreeBlock”) and techniques at both the system and interactive levels for dealing with scale. These contributions are realized in a new version of the Degree-Of-Interest Tree browser, supporting real-time interactive visualization and exploration of data sets containing on the order of a million nodes
Heer, J. and Card, S. K. (2004).
Proceedings of the Working Conference on Advanced Visual Interfaces 2004. [PDF]
The InfoCLASS model: Conceptual richness and inter-person conceptual coherence about information collections
Previous work on information foraging theory has addressed how people navigate through information systems to find information. This paper presents a new information foraging model called InfoCLASS that models the conceptual categories that people learn while interacting with information systems. InfoCLASS is based on previous work on the rational analysis of human category formation. InfoCLASS simulations can be used to make qualitative predictions about the richness of conceptual categories that will be learned from different experiential histories, and from different kinds of user interfaces. InfoCLASS simulations can also be used to assess the conceptual coherence among a group of users of information systems. It is argued that the degree of conceptual coherence among a group of users is an important determinant of the efficiency and effectiveness of a social organization engaged in making discoveries, such as scientific communities or intelligence agencies.
Pirolli, P. (2004).
Journal of the Japanese Cognitive Science Society. [DOC]
Information foraging on the Web: Rational analyses of human-information interaction
Presents rational analyses of information foraging on the Web
Pirolli, P. (2004).
Cognitive Science. [DOC]
The use of proximal information scent to forage for distal content on the World Wide Web
Summarizes the theory of information scent and related it to Egon Brunswik's probablistic functionalism
Pirolli, P. (2004).
Working with technology in mind: Brunswikian resources for cognitive science and engineering. [DOC]
The Web as a back off corpus
A number of real world applications require the evaluation and ranking of word pair association. The three most effective techniques for measuring word association are based on statistical techniques that use corpus co-occurrence counts, latent semantic analysis techniques, or a lexical ontology. A major limitation of latent semantic analysis and lexical ontology techniques is their inability to deal with out of corpus words. This limitation has motivated researchers to use the web as a corpus for computing co-occuranece counts. However, a significant number of real world applications require a large number of co-occuranece evaluations that cannot be readily computed from the Web due to search engine limitations on the number of queries. In this paper, we use a hybrid approach for computing world pair association. This approach uses statistical similarity measures computed from co-occurence counts in a local corpus in case of common terms or from the web in the case of infrequent terms. We compare the four models of semantic similarity. The models range in their access to syntactic and semantic informativeness as well as search engine access. We evaluate the models using human ratings of semantic similarity. The outcome shows that the best correlations with human similarity judgment comes from a statistical approach that uses a local corpus and backs off to the web. Experimental results on a number of evaluation data sets show a significant reduction in the number of search engine queries while maintaining the same performance levels. The storage and computational requirements of our hybrid approach can easily fit on a modern workstation.
Farahat, A. and Pirolli, P. (2004).
Association of Computational Lnguistics. [PDF]
The prepared mind versus the beginner's mind: synthesizing opposite advice for creative design and invention
Creativity often emerges out of the tension between two seemingly irreconcilable properties that a particular design or product should have. By creating a higher order synthesis that honors the essential elements, a creative solution arises. There is also a tension of opposites in methods for creative work: cultivating a “prepared mind” versus cultivating a “beginner’s mind.” Advice for the prepared mind says “Develop and use your experience” and advice for the beginner’s mind advice says “Discard your previous experience.” How can we effectively use these seemingly opposite pieces of advice?
Stefik, M. and Stefik, B. (2004).
.
Comparing Zoomable User Interfaces to Folders for a Shape Organization Task
This work compares user performance on a shape grouping task with a folder-based interface and a zooming interface. There was a significant difference in task completion times between the two interfaces, with subjects performing 30% faster with the zooming interface than with folders.
Good, L., Stefik, M. and Bederson, B. (2004).
Information Visualization Conference. [DOC]
Zoomable user interface for in-depth reading
The Instant Bookplex (IB) system includes a zoomable user interface (ZUI) for navigating through a spatial representation of a document collection. This ZUI supports extended reading in the collection using semantic zooming, graphical presentation of metadata, animated transitions, and an integrated reading user interface. IB helps users find and re-find documents, choose good documents to read next, and navigate between documents.
Bier, E., Good, L., Newberger, A. and Popat, A. (2004).
Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries (JCDL).
A document corpus browser for in-depth reading
Software tools, including Web browsers, e-books, electronic document formats, search engines, and digital libraries are changing the way that people read, making it easier for them to find and view documents of interest. However, we observe that while these tools provide significant help with relatively short-term reading projects involving small numbers of documents, they fall short of supporting readers who are engaged in longer-term reading projects, in which a topic is to be understood in-depth by reading many documents. These readers need to find and manage many documents and citations, remember what they have read, and prioritize what to read next. In this paper, we describe a software tool designed to facilitate in-depth reading. This tool is a document collection browser that supports reading tasks that take place over days, weeks, or longer. The browser includes a zoomable user interface that helps the reader see the details of a document corpus in context. We describe the browser’s design and the design principles that motivated it.
Bier, E., Good, L., Popat, A. and Newberger, A. (2004).
Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries (JCDL 2004). [PDF]
Page Turning Design for 3D Electronic Books
Taking the form of physical books, virtual 3D books can be used as basic components of e-book systems, information workspaces, and digital libraries. This paper describes the page turning design of 3Book, a 3D book that we recently developed based on our previous experiences with WebBook. Our design achieves interactive page turning by employing page textures of multiple resolutions at different stages of page turning, solving the scalability problem. By modeling all the faces of the book and synchronizing the movements of various portions of the book during page turning, our design helps to convey the impression of reading or viewing an actual physical book.
Hong, L., Card, S. K. and Mackinlay, J. D. (2004).
UIST 2004 Conference. [PDF]
Deforming Pages of 3D Electronic Books
We describe a novel technique for deforming the pages of virtual 3D books to produce a realistic page turning effect.
Hong, L., Card, S. K. and Chen, J. (2004).
Siggraph'04 Sketches. [PDF]
"Killer app" of wearable computing: wireless force sensing body protectors for marial arts
Ubiquitous and Wearable Computing both have the goal of pushing the computer into the background, supporting all kinds of human activities. Application areas include areas such as everyday environments (e.g. clothing, home, office), promoting new forms of creative learning via physical/virtual objects, and new tools for interactive design. In this paper, we thrust ubiquitous computing into the extremely hostile environment of the sparring ring of a martial art competition. Our system uses piezoelectric force sensors that transmit signals wirelessly to enable the detection of when a significant impact has been delivered to a competitor’s body. The objective is to support the judges in scoring the sparring matches accurately, while preserving the goal of merging and blending into the background of the activity. The system therefore must take into account of the rules of the game, be responsive in real-time asynchronously, and often cope with untrained operators of the system. We present a user study of the finished prototype and detail our experience.
Chi, E. H., Song, J. and Corbin, G. (2004).
Proceedings of the 17th Annual ACM Conference on User Interface Software and Technology. [PDF]
The Use of Proximal Information Scent to Forage for Distal Content on the World Wide Web.
The particular focus of this chapter will be on a psychological theory of information scent (Pirolli, 1997, 2003; Pirolli & Card, 1999) that is embedded in a broader model (Pirolli & Fu, 2003) of information foraging on the Web. The notion of information scent also has been used in developing models of people seeking information in document-clustering browsers (Pirolli, 1997) and highly interactive information visualizations (Pirolli, Card, & Van Der Wege, 2003). Information scent refers to the cues used by information foragers to make judgments related to the selection of information sources to pursue and consume. These cues include items such as Web links or bibliographic citations that provide users with concise information about content that is not immediately available. The information scent cues play an important role in guiding users to the information they seek, and they also play a role in providing users with an overall sense of the contents of collections. The purpose of this paper is to present a theoretical account of information scent that supports the development of models of navigation choice.
Pirolli, P. (2004).
Working with Technology in Mind: Brunswikian Resources for Cognitive Science and Engineering. Oxford University Press. [PDF]
Log-based Longitudinal Study Finds Window Thrashing
Although large displays are becoming more cost effective, most user interfaces are optimized for a single monitor of modest size even though many traditional workspaces such as desks and workbenches are much larger and some studies have found benefits from large displays. This paper explores whether a single monitor is sufficient for information work using standard software. A log-based longitudinal field study finds that most of the time a single monitor allows skilled information analysts to have a reasonable pattern of window activity. However, a novel visualization of the data shows that windows typically fill the monitor and the pattern is occasionally interrupted by window thrashing, the rapid manipulation of windows caused by limited display resource. Given these findings, we identify some common tasks that justify the development and the expense of wideband visual interfaces that are optimized for larger displays.
Mackinlay, J. D. and Royer, C. (2004).
. [PDF]
Transient User Profiling
Our work in the past five years on modeling user actions on the Web has shown that a great deal of information about user actions can be recovered from the informational cues processed by the user during navigation. We call these informational cues by the name of "Information Scent." We have shown in various papers that Information Scent can be used as a methodology for clustering a group of user profiles [Chi02], simulating a collection of users navigating thru the Web with an information need [Chi03], and provid-ing navigational cues to users with transient information goals [Olston03]. We argue in this position paper for a CHI2004 workshop that more research in user profiling should be done for user goals that are transient in nature.
Chi, E. H. (2004).
Proceedings of the Workshop on User Profiling (CHI2004), Vienna, Austria. [PDF]
Wideband Displays: Mitigating Multiple Monitor Seams
Wideband displays fill our field of view, creating new opportunities to develop effective visual interfaces. Although multiple monitors are becoming an affordable way to create wideband displays, the resulting seams create gaps in words and divide diagonal lines into nonaligned segments. We present several novel user interface techniques for creating seam-aware applications, proving that vendors need not wait for affordable seamless displays to exploit the potential of wideband displays.
Mackinlay, J. D. and Heer, J. (2004).
Proceedings of the Human Factors in Computing Systems Conference (CHI2004), Vienna, Austria. [PDF]
3Book: a 3D Electronic Smart Book
This paper describes the 3Book, a 3D interactive visualization of a codex book as a component for various digital library and sensemaking systems. The book is designed to hold large books and to support sensemaking operations by readers. The book includes methods in which the automatic semantic analysis of the book’s content is used to dynamically tailor access.
Card, S. K., Hong, L., Mackinlay, J. D. and Chi, E. H. (2004).
Advanced Visual Interfaces (AVI) 2004. [PDF]
3Book: A Scalable 3D Virtual Book
This paper describes the 3Book, a 3D interactive visualization of a codex book as a component for digital library and information-intensive applications. The 3Book is able to represent books of almost unlimited length, allows users to read large format books, and has features to enhance reading and sensemaking.
Card, S. K., Hong, L., Mackinlay, J. D. and Chi, E. H. (2004).
Proceedings of the Human Factors in Computing Systems Conference (CHI2004) Conference Companion, Vienna, Austria. [PDF]
eBooks with Indexes that Reorganize Conceptually
Subject indexes were an important step forward for books because they enabled the comparison and correlations of information without extensive reading, re-reading and memorization. In this short paper, we focus on the user interaction and usage scenario of a new system called ScentIndex that enhances the subject index of an eBook by conceptually reorganizing it to suit particular information needs. Users first enter information needs via keywords describing the concepts they are trying to retrieve and comprehend. ScentIndex then computes what index entries are conceptually related, and reorganizes and displays these index entries on a single page.
Chi, E. H., Hong, L., Heiser, J. and Card, S. K. (2004).
Proceedings of the Human Factors in Computing Systems Conference (CHI2004) Conference Companion, Vienna, Austria. [PDF]
UpLib: a universal personal digital library system
We describe the design and use of a personal digital library system, UpLib. The system consists of a full-text indexed repository accessed through an active agent via a Web interface. It is suitable for personal collections comprising tens of thousands of documents (including papers, books, photos, receipts, email, etc.), and provides for ease of document entry and access as well as high levels of security and privacy. Unlike many other systems of the sort, user access to the document collection is assured even if the UpLib system is unavailable. It is ``universal'' in the sense that documents are canonically represented as projections into the text and image domains, and uses a predominantly visual user interface based on page images. UpLib can thus handle any document format which can be rendered as pages. Provision is made for alternative representations existing alongside the text-domain and image-domain representation, either stored or generated on demand. The system is highly extensible through user scripting, and is intended to be used as a platform for further work in document engineering. UpLib is assembled largely from open-source components (the current exception being the OCR engine, which is proprietary).
Janssen, W. and Popat, A. (2003).
Proceedings of the 2003 ACM Symposium on Document Engineering. [PDF]
Design and Implementation of a Universal Personal Digital Library System
We trace through the initial design and early evolution of a personal digital library system pieced together largely from open-source components (the current exception being the OCR engine, which is proprietary). The system consists of a repository, a search engine, and a Web interface. It is suitable for personal collections comprising tens of thousands of documents (including papers, books, photos, receipts, etc.), and provides for ease of document entry and access as well as basic security and privacy. Representation of documents in an image-domain format is taken as canonical, exploiting the fact that documents are ultimately presented as visual objects to a user to achieve some degree of ``universality.'' Provision is made for alternative representations existing alongside the image-domain representation, either stored or generated on demand. At the conference we will demo the document-access portion of the system implemented self-contained on an ultra-portable laptop holding about 2,000 documents including many scanned books.
Janssen, W. and Popat, A. (2003).
European Conference on Digital Libraries. [PDF]
Validating InfoScent Models with Large Scale User Studies
In the last 5 years, one major focus of the User Interface Research Group has been understanding how Information Foraging Theory could be used to explain user optimization behavior on the Web. My primary research focus has been on developing the Information Scent Model to describe local behaviors of a group of users around a Web locality. In order to validate this model, we have performed two major validation studies---one on the server end, and the other on the client end: -- We were interested in validating how Information Scent could be used to explain how users will move from page to page by following information cues that are present for each link, and evaluating the probabilities of finding the desired information by clicking through that link. We needed a large amount of user session data that is paired with user goals, and then simulating those goals to see if the user session data match. We performed this study _in situ_ with over 400 subjects at the client end using a remote user logging program, obtaining several thousand usable session data. -- We were also interested in how Information Scent could be used to perform more accurate Web usage data mining. We wanted to construct more accurate user profiles by applying Information Scent principles to Web usage log mining. We have obtained several large scale server logs, of which some sessions are associated with known user goals. We then built user profiles out of these logs. We measured the accuracy of our clustering algorithm by trying to match the user sessions with the known category of the user goals.
Chi, E. H. (2003).
IBM NPUC Workshop (New Paradigms in User Computing. [PDF]
A theory of information scent
Information scent is a psychological theory of how people use perceptual cues, such as World Wide Web (WWW) links in order to make information-seeking decisions and to gain an overall sense of the contents of information collections. A spreading activation theory of information scent is presented, as well as a theory of learning categories (e.g., genres) of available information. The SNIF-ACT model incorporates information scent to model World Wide Web browsing. The InfoCLASS model simulates the learning of information categories from browsing.
Pirolli, P. (2003).
10th International Conference on Human-Computer Interation.
Use of Critical Analysis Method to Conduct a Cognitive Task Analysis of Intelligence Analysts
Intelligence analysts (IAs) engage in information seeking, evaluation, prediction, and reporting behavior in an information-intensive work environment. A Cognitive Task Analysis (CTA) was conducted to capture data that will provide input to support development of a model of the analyst's processes, biases, and analytic strategies. A hybrid method was used to conduct the CTA, including a modified version of the critical decision method - the critical analysis method. The essential distinction was that participants were asked to describe an example of a strategic analysis problem. Procedures used to conduct the critical analysis method are described in this paper. Several factors contribute to making the IA's task challenging: (i) time pressure to produce reports in a shorter timeframe, (ii) a high cognitive workload, and (iii) difficult human judgments that are required regarding uncertain validity and reliability of the data. Human judgments are involved in considering the plausibility of information, deciding what information to trust, and determining how much weight to place on specific pieces of data. Intelligence analysis involves a complex process of assessing the reliability of information from a wide variety of sources and combining seemingly unrelated events. This problem is challenging because it involves aspects of data mining, data correlation and human judgment.
Card, S. K., Hutchins, S. and Pirolli, P. (2003).
6th International Conference on Naturalistic Decision Making.
Efficient User Interest Estimation in Fisheye Views
We present a new technique for efficiently computing Degree-of-Interest distributions to inform the visualization of graph-structured data. The technique is independent of the interest distribution used, and enables fluid interaction with very large data sets (over 100,000 nodes).
Heer, J. and Card, S. K. (2003).
Extended Abstracts of CHI 2003, Conference on Human Factors in Computing Systems, Fort Lauderdale, FL. [PDF]
Wideband Visual Interfaces: Sensemaking on Multiple Monitors
Although vendors have made multiple-monitor systems for many years, our interfaces have been stuck in a 30-year old windows paradigm focused on displays much smaller than the desktops we use when working with paper. Advances in flat panel displays and graphics cards now enable affordable personal computers with 6-8 monitors and may someday eliminate seams. This paper argues that vendors should be developing wideband visual interfaces that are designed for displays that fill the human visual field. We describe a longitudinal field study of window activity that found that windows almost always filled a typical single monitor display and that subjects occasionally struggled with window thrashing when they needed to work with two or more windows at the same time. Vendors need not wait for affordable seamless wideband displays before addressing these findings. We have implemented several novel user interface techniques for creating seam-aware applications that target wideband displays based on multiple monitors.
Mackinlay, J. D., Heer, J. and Royer, C. (2003).
Technical Report. [PDF]
AVID: Supporting the creation of scalable, responsive visualizations
In this paper we describe a visualization architecture (AVID) that employs a dynamic model of user interest to support the design and creation of highly responsive, scalable visualizations of hierarchical data. We present evidence of the architecture's efficacy, showcasing dynamic visualizations with near-immediate (<100ms) update times, even on structures of over 100,000 nodes. We discuss how the key concepts used generalize to arbitrary graph structures. Additionally, we present the results of a user study comparing a prototypical visualization built using AVID to a more traditional file-browser interface, showcasing up to 20% improvement in information access times.
Heer, J., Card, S. K., Heiser, J. and Pirolli, P. (2003).
Working Paper.
ScentTrails: Integrating Browsing and Searching on the Web
The two predominant paradigms for finding information on the Web are browsing and keyword searching. While they exhibit complementary advantages, neither paradigm alone is adequate for complex information goals that lend themselves partially to browsing and partially to searching. To integrate browsing and searching smoothly into a single interface, we introduce a novel approach called ScentTrails. Based on the concept of information scent developed in the context of information foraging theory, ScentTrails highlights hyperlinks to indicate paths to search results. This interface enables users to interpolate smoothly between searching and browsing to locate content matching complex information goals effectively. In a preliminary user study, ScentTrails enabled subjects to find information more quickly than by either searching or browsing alone.
Olston, C. and Chi, E. H. (2003).
ACM Transaction on Computer-Human Interaction. [PDF]
SNIF-ACT: A model of information foraging on the world wide web.
No Abstract Available
Pirolli, P. and Fu, W.-T. (2003).
Ninth International Conference on User Modeling, Johnstown, PA. [PDF]
Innovation and the Next Generation of Human-Computer Interacftion
No Abstract Available
Card, S. K. (2003).
Ann Arbor, MI: School of Business, University of Michigan. [PPT]
The Bloodhound Project: Automating Discovery of Web Usability Issues using the InfoScent™ Simulator
According to usability experts, the top user issue for Web sites is difficult navigation. We have been developing auto-mated usability tools for several years, and here we describe a prototype service called InfoScent™ Bloodhound Simula-tor, a push-button navigation analysis system, which auto-matically analyzes the information cues on a Web site to produce a usability report. We further build upon previous algorithms to create a method called Information Scent Absorption Rate, which measures the navigability of a site by computing the probability of users reaching the desired destinations on the site. Lastly, we present a user study involving 244 subjects over 1385 user sessions that show how Bloodhound correlates with real users surfing for in-formation on four Web sites. The hope is that, by using a simulation of user surfing behavior, we can reduce the need for human labor during usability testing, thus dramatically lower testing costs, and ultimately improving user experience. The Bloodhound Project is unique in that we apply a concrete HCI theory directly to a real-world prob-lem. The lack of empirically validated HCI theoretical model has plagued the development of our field, and this is a step toward that direction.
Chi, E. H., Rosien, A., Supattanasiri, G., Williams, A., Royer, C., Chow, C., Robles, E., Dalal, B., Chen, J. and Cousins, S. (2003).
CHI 2003, Fort Lauderdale, FL. [PDF]
Information Scent and the Adaptive Conceptual Lens on Information
Theoretical integration of cognitive models of memory, concept formation, Random Utility Model, and navigation
Pirolli, P. (2002).
Cognitive Science. [DOC]
Stochastic Models of Information Foraging by Information Scent
Theory for predicting user flow and inferring user goals on the WWW and similar systems
Pirolli, P. (2002).
ACM Transactions on Computer Human Interaction. [DOC]
A theory of information scent
Briefly summarize the new/novel technical contribution: A thoery of how users process proximal cues to make navigation decision and develop conceptual representations of information sources.
Pirolli, P. (2002).
10th International Conference on Human Computer Interaction 2003. [DOC]
Adaptive Information Interaction
No Abstract Available
Pirolli, P. (2002).
ONR Workshop. [PPT]
Breakthrough: stories and strategies of radical innovation
A breakthrough creates something new or satisfies a previously undiscovered need. Radical breakthroughs often have uses and effects far beyond what their inventors had in mind and can launch new industries or transform existing ones. Our capacity for breakthroughs depends on a combination of science, imagination and business; the next waves of innovation will come from organizations that get this combination right. The stories in this book, told by repeat inventors and managers of technology, uncover the best practices for inventing the future.
Stefik, M. and Stefik, B. (2002).
Book.
Degree-of-Interest Trees: A Component of an Attention-Reactive User Interface
This paper proposes Degree-of-Interest trees. These trees use degree-of-interest calculations and focus+context visualization methods, together with bounding constraints, to fit within pre-established bounds. The method is an instance of an emerging "attention-reactive" user interface whose components are designed to snap together in bounded spaces.
Card, S. K. (2002).
. [PDF]
Browse Hierarchical Data with the Degree of Interest Tree
This demonstration shows a method and implementation to interactively display large hierarchies (up to 10,000 nodes) within a web browser. This software computes a degree of interest (DOI) for each node in the hierarchy and displays an overview of the complete hierarchy while showing more detail for nodes with a higher DOI value.
Nation, D., Roberts, D. and Card, S. K. (2002).
ACM CHI 2002 Conference on Human Factors in Computing Systems. [PDF]
Degree-of-Interest Trees: A Component of an Attention-Reactive User Interface
This paper proposes Degree-of-Interest trees. These trees use degree-of-interest calculations and focus+context visualization methods, together with bounding constraints, to fit within pre-established bounds. The method is an instance of an emerging “attention-reactive” user interface whose components are de-signed to snap together in bounded spaces.
Card, S. K. and Nation, D. (2002).
Advanced Visual Interfaces, Trento, Italy. [PDF]
LumberJack: Intelligent Discovery and Analysis of Web User Traffic Composition
Web Usage Mining enables new understanding of user goals on the Web. This understanding has broad applications, and traditional mining techniques such as association rules have been used in business applications. We have developed an automated method to directly infer the major groupings of user traffic on a Web site [Heer01]. We do this by utilizing multiple data features in a clustering analysis. We have performed an extensive, systematic evaluation of the proposed approach, and have discovered that certain clustering schemes can achieve categorization accuracies as high as 99% [Heer02b]. In this paper, we describe the further development of this work into a prototype service called LumberJack, a push-button analysis system that is both more automated and accurate than past systems.
Chi, E. H., Rosien, A. and Heer, J. (2002).
ACM-SIGKDD Workshop on Web Mining for Usage Patterns and User Profiles, Edmonton, Canada. [PDF]
Expressiveness of the Data Flow and Data State Models in Visualization Systems
Visualization can be viewed as a process that transforms raw data (value) into views. There has been two major category of data process models that have been proposed to model the visualization transformation process. This paper seeks to compare the Data Flow Models and the Data State Models. Specifically, it proves that, in terms of expressiveness, anything that can represented using the Data Flow Model can also be represented using the Data State Model, and vice versa.
Chi, E. H. (2002).
Advanced Visual Interfaces Conference, Trento, Italy 375-378. [PDF]
Scent of the Web
No Abstract Available
Chi, E. H. (2002).
Human Factors and Web Development. Erlbaum, Hilsdale, New Jersey: pp. 265-285.
A User-Tracing Architecture for Modeling Interaction with the World Wide Web
No Abstract Available
Pirolli, P., Fu, W.-T., Reeder, R. and Card, S. K. (2002).
Advanced Visual Interfaces, Trento, Italy. [PDF]
Automatic Centerline Extraction for Virtual Colonoscopy
In this paper, we introduce a concise and concrete definition of an accurate colon centerline and provide an efficient automatic means to extract the centerline and its associated branches (caused by a forceful touching of colon and small bowel or a deep fold in twisted colon lumen). We further discuss its applications on fly-through path planning and endoscopic simulation, as well as its potential to solve the challenging touching and colon collapse problems in virtual colonoscopy. Experimental results demonstrated its centeredness, robustness, and efficiency.
Wan, M., Liang, Z., Ke, Q., Hong, L., Bitter, I. and Kaufman, A. (2002).
IEEE Transactions on Medical Imaging 21(12): 1450-1460. [PDF]
Improving Web Usability Through Visualization
Predictive Web usage visualizations can help analysts uncover traffic patterns and usability problems.
Chi, E. H. (2002).
IEEE Internet Computing: 64-71. [PDF]
A Framework for Visualizing Information
Information visualization is the design and creation of interactive graphic depictions of information by combining principles in the disciplines of graphic design, cognitive science, and interactive computer graphics. This book describes a framework to make information visualization systems easier to develop through the creation of a reference model. It develops and discusses the general utility of this Data State Model, and validates it by applying it to various visualization techniques and showing several systems that illustrate issues such as how to model operators and interactions in visualization systems. The book also applies this reference model to make information visualization more accessible to potential users by creating a `Visualization Spreadsheet', where each cell can contain an entire set of data represented using interactive graphics.
Chi, E. H. (2002).
Human-Computer Interaction Kluwer Academic Publishers, Netherlands: 176.
What Did They Do? Understanding Clickstreams with the WebQuilt Visualization System
This paper describes the visual analysis tool WebQuilt, a web usability logging and visualization system that helps web design teams record and analyze usability tests. The logging portion of WebQuilt unobtrusively gathers clickstream data as users complete specified tasks. This data is then aggregated and presented as an interactive graph, where nodes of the graph are images of the web pages visited, and arrows are the transitions between pages. To aid analysis of the gathered usability test data, the WebQuilt visualization provides filtering capabilities and semantic zooming, allowing the designer to understand the test results at the gestalt view of the entire graph, and then drill down to sub-paths and single pages. The visualization highlights important usability issues, such as pages where users spent a lot of time, pages where users get off track during the task, navigation patterns, and exit pages, all within the context of a specific task. WebQuilt is designed to conduct remote usability testing on a variety of Internet-enabled devices and provide a way to identify potential usability problems when the tester cannot be present to observe and record user actions.
Waterson, S., Hong, J. I., Sohn, T., Heer, J., Matthews, T. and Landay, J. A. (2002).
Advanced Visual Interfaces, Trento, Italy. [PDF]
Mining the Structure of User Activity using Cluster Stability
Recent research has explored web user session clustering as a means of understanding user activity and interests on the World Wide Web. Though the proposed techniques have proven to be useful and effective, they require that one either specify the number of clusters in advance or browse a large hierarchy of clusters to find the optimal depth at which to describe user activity. In this paper, we examine the utility of a stability-based technique for automatically determining the optimal number of clusters in the context of web user session clustering. We present two case studies evaluating the technique’s effectiveness.
Heer, J. and Chi, E. H. (2002).
SIAM International Conference on Data Mining, Workshop on Web Analytics, Arlington, VA. [PDF]
Fluid Annotations Through Open Hypermedia: Using and Extending Emerging Web Standards
The Fluid Documents project has developed various research prototypes that show that powerful annotation techniques based on animated typographical changes can help readers utilize annotations more effectively. Our recently-developed Fluid Open Hypermedia prototype supports the authoring and browsing of fluid annotations on third-party Web pages. This prototype is an extension of the Arakne Environment, an open hypermedia application that can augment Web pages with externally stored hypermedia structures. This paper describes how various Web standards, including DOM, CSS, XLink, XPointer, and RDF, can be used and extended to support fluid annotations.
Bouvin, N. O., Zellweger, P. T., Grønbæk, K. and Mackinlay, J. D. (2002).
WWW2002, Hawaii. [PDF]
CHI@20: Fighting Our Way from Marginality to Power
The Special Interest Group on Computer Human Interaction (SIGCHI) has had a successful history of 20 years of growth in its numbers and influence. To help guide the continued evolution of the academic discipline and professional community, we invite several senior members to offer their visions for what the field of CHI actually accomplished over the past several decades, and what do we still need to accomplish? What do we need to do differently/better/smarter? What haven't we tried because the technology, the money or the will wasn't there in the past, but perhaps is now? The CHI field is more than just technology. We understand that our work can have a profound effect on individuals, families, neighborhoods, corporations, and countries. We know that we can influence education, commerce, healthcare, and government. How can we contribute to bridging the digital divides in developed and developing countries? What agendas can we offer for the academic, research, industrial, and civic spheres for the next 20 years? How can we be more ambitious? How can we truly serve human needs?
Shneiderman, B., Card, S. K., Norman, D. A., Tremaine, M. and Waldrop, M. M. (2001).
ACM CHI 2002 Conference on Human Factors in Computing Systems. [PDF]
Using Information Scent to Model User Information Needs and Actions on the Web
On the Web, users typically forage for information by navigating from page to page along Web links. Their surfing patterns or actions are guided by their information needs. Researchers need tools to explore the complex interactions between user needs, user actions, and the structures and contents of the Web. In this paper, we describe two computational methods for understanding the relationship between user needs and user actions. First, for a particular pattern of surfing, we seek to infer the associated information need. Second, given an information need, and some pages as starting points, we attempt to predict the expected surfing patterns. The algorithms use a concept called “information scent”, which is the subjective sense of value and cost of accessing a page based on perceptual cues. We present an empirical evaluation of these two algorithms, and show their effectiveness.
Chi, E. H., Pirolli, P., Chen, K. and Pitkow, J. (2001).
ACM CHI 2001 Conference on Human Factors in Computing Systems, Seattle, WA 490--497. [PDF]
Reliable Path for Virtual Endoscopy: Ensuring Complete Examination of Human Organs
Virtual endoscopy is a computerized, noninvasive procedure for detecting anomalies inside human organs. Several preliminary studies have demonstrated the benefits and effectiveness of this modality. Unfortunately, previous work cannot guarantee that an existing anomaly will be detected, especially for complex organs with multiple branches. In this paper, we introduce the concept of reliable navigation, which ensures the interior organ surface is fully examined by the physician performing the virtual endoscopy procedure. To achieve this, we propose computing a reliable fly-through path that ensures no blind area during the navigation. Theoretically, we discuss the criteria of evaluating a reliable path and prove that the problem of generating an optimal reliable path for virtual endoscopy is NP-complete. In practice, we develop an efficient method for the calculation of an effective reliable path. First, a small set of center observation points are automatically located inside the hollow organ. For each observation point, there exists at least one patch of interior surface visible to it, but that cannot be seen from any of the other observation points. These chosen points are then linked with a path that stays in the center of the organ. Finally, new points inside the organ are recursively selected and connected into the path until the entire organ surface is visible from the path. We present encouraging results from experiments on several data sets. For a medium size volumetric model with several hundred thousand inner voxels, an effective reliable path can be generated in several minutes.
He, T., Hong, L., Chen, D. and Liang, Z. (2001).
IEEE Transactions on Visualization and Computer Graphics 7(4): 333-342. [PDF]
Separating the Swarm: Categorization Methods for User Access Sessions on the Web
Understanding user behaviors on Web sites enables site owners to make sites more usable, ultimately helping users to achieve their goals more quickly. Accordingly, researchers have devised methods for categorizing user sessions in hopes of revealing user interests. These techniques build user profiles by combining users' navigation paths with other data features, such as page viewing time, hyperlink structure, and page content. Previously, we have presented complex techniques of combining many of these data features to cluster user profiles. In this paper, we introduce a user study and a systematic evaluation of these different data features and their associated weighting schemes. We present the results of our study, including accuracy measures for a number of clustering approaches, and offer recommendations for Web analysts. While further investigation over more sites is needed to definitively settle on a robust scheme, we have characterized this analytic space.
Heer, J. and Chi, E. H. (2002).
Proc. of the Human Factor in Computing Systems Conference (CHI 2002), Minneapolis, MN. [PDF]
WebQuilt: A Proxy-based Approach to Remote Web Usability Testing
WebQuilt is a web logging and visualization system that helps web design teams run usability tests (both local and remote) and analyze the collected data. Logging is done through a proxy, overcoming many of the problems with server-side and client-side logging. Captured usage traces can be aggregated and visualized in a zooming interface that shows the web pages people viewed. The visualization also shows the most common paths taken through the web site for a given task, as well as the optimal path for that task, as designated by the designer. This paper discusses the architecture of WebQuilt and also describes how it can be extended for new kinds of analyses and visualizations.
Hong, J. I., Heer, J., Waterson, S. and Landay, J. A. (2001).
ACM Transactions on Information Systems. [PDF]
Large-Scale Cognition: The Psychology of Informavores
No Abstract Available
Card, S. K. (2001).
Stanford, CA: Dept. of Psychology, Stanford University. [PPT]
Identification of Web User Traffic Composition using Multi-Modal Clustering and Information Scent
On the Web, users typically forage for information by navigating from page to page along Web links. Their surfing patterns or actions are guided by their information needs. Researchers need tools to explore the complex interactions between user needs, user actions, and the structures and contents of the Web. In this paper, we describe two computational methods for understanding the relationship between user needs and user actions. First, for a particular pattern of surfing, we seek to infer the associated information need. Second, given an information need, and some pages as starting points, we attempt to predict the expected surfing patterns. The algorithms use a concept called “information scent”, which is the subjective sense of value and cost of accessing a page based on perceptual cues. We present an empirical evaluation of these two algorithms, and show their effectiveness.
Heer, J. and Chi, E. H. (2001).
Proceedings of the Workshop on Web Mining, SIAM Conference on Data Mining, Chicago, IL 51-58. [PDF]
WebEyeMapper and WebLogger: Tools for Analyzing Eye Tracking Data Collected in Web-use Studies
Eye trackers output a stream of points at which the eye was looking. To give these points meaning, researchers analyzing eye tracking data need to map the points onto the meaningful objects at which the eye was looking. Performing this mapping has proven to be a tedious, time-consuming task. We present a software system that automates this task for Web usability studies that incorporate eye tracking.
Reeder, R. W., Pirolli, P. and Card, S. K. (2001).
CHI 2001, Seattle. [PDF]
Using Thumbnails to Search the Web
We introduce a technique for creating novel, textually-enhanced thumbnails of Web pages. These thumbnails combine the advantages of image thumbnails and text summaries to provide consistent performance on a variety of tasks. We conducted a study in which participants used three different types of summaries (enhanced thumbnails, plain thumbnails, and text summaries) to search Web pages to find several different types of information. Participants took an average of 67, 86, and 95 seconds to find the answer with enhanced thumbnails, plain thumbnails, and text summaries, respectively. We found a strong effect of question category. For some questions, text outperformed plain thumbnails, while for other questions, plain thumbnails outperformed text. Enhanced thumbnails (which combine the features of text summaries and plain thumbnails) were more consistent than either text summaries or plain thumbnails, having for all categories the best performance or performance that was statistically indistinguishable from the best.
Woodruff, A., Faulring, A., Rosenholtz, R., Morrison, J. and Pirolli, P. (2001).
CHI 2001, Seattle. [PDF]
A Taxonomic Analysis of What World Wide Web Activities Significantly Impact People's Decisions and Actions
In this paper, we present three taxonomic classification schemes based on Web users' responses to what Web activities significantly impacted their decisions and actions. The taxonomic classifications focus on three variables: the Purpose of people's search on the Web, the Method people use to find information, and the Content of the information for which they are searching. These taxonomies are useful for understanding people's activity on the Web and for developing ecologically-valid tasks to be used when studying web behavior.
Morrison, J. B., Pirolli, P. and Card, S. K. (2001).
CHI 2001, Seattle. [PDF]
The Guidebook, the Friend, and the Room: Visitor Experience in a Historic House
In this paper, we describe an electronic guidebook prototype and report on a study of its use in a historic house. Supported by mechanisms in the guidebook, visitors constructed experiences that had a high degree of interaction with three entities: the guidebook, their companions, and the house and its contents. For example, we found that most visitors played audio descriptions through speakers (rather than using headphones or reading textual descriptions) to facilitate communication with their companions.
Woodruff, A., Aoki, P. M., Hurst, A. and Szymanski, M. H. (2001).
CHI 2001, Seattle. [PDF]
Tap Tips: Lightweight Discovery of Touchscreen Targets
We describe tap tips, a technique for providing touch-screen target location hints. Tap tips are lightweight in that they are non-modal, appear only when needed, require a minimal number of user gestures, and do not add to the standard touchscreen gesture vocabulary. We discuss our implementation of tap tips in an electronic guidebook system and some usability test results.
Aoki, P. M., Hurst, A. and Woodruff, A. (2001).
CHI 2001, Seattle. [PDF]
Visual Information Foraging in a Focus+Context Visualization
Eye tracking studies of the Hyperbolic Tree browser suggest that visual search in focus+context displays is highly affected by information scent (i.e., local cues, such as text summaries, used to assess and navigate towards distal information sources). When users detected a strong information scent, they were able to reach their goal faster with the Hyperbolic Tree browser than with a conventional browser. When users detected a weak scent or no scent, users exhibited less efficient search of areas with a high density of visual items. In order to interpret these results we present an integration of the CODE Theory of Visual Attention (CTVA) with information foraging theory. Development of the CTVA-foraging theory could lead to deeper analysis of interaction with visual displays of content, such as the World Wide Web or information visualizations.
Pirolli, P., Card, S. K. and Van Der Wege, M. (2001).
CHI 2001, Seattle. [PDF]
Information Scent as a Driver of Web Behavior Graphs: Results of a Protocol Analysis Method for Web Usability
The purpose of this paper is to introduce a replicable WWW protocol analysis methodology illustrated by application to data collected in the laboratory. The methodology uses instrumentation to obtain detailed recordings of user actions with a browser, caches Web pages encountered, and videotapes talk-aloud protocols. We apply the current form of the method to the analysis of eight Web protocols, visualizing the structure of the interaction and showing the strong effect of information scent in determining the path followed.
Card, S. K., Pirolli, P., Van Der Wege, M., Morrison, J., Reeder, R. W., Schraedley, P. and Boshart, J. (2001).
CHI 2001, Seattle. [PDF]
The Future of Software: Visualization+Computation Tools
The complexity of software has continuously risen since the invention of the computer, and while Moore's law predicts the growth in processor speed, it fails to take into account our ability in managing complex software processes. Our utilization of the increase in processor speed is very much dependent on our ability to manage this complexity.
Chi, E. H. (2000).
Future of Software Special Issue. [PDF]
Visualization Method for Biological Sequence Similarity Reports
Previously, we presented a system called AlignmentViewer that uses information visualization techniques to visualize similarities between a single DNA sequence and a large database of other sequences. In this paper, we extend, summarize, and describe the system using several interesting case studies. We present our comb glyph technique for visualizing alignments between sequences. In this paper, we also extend the original system by incorporating computational steering, and the visualization of differences between data sets. The case studies and the new extended system present our novel approach of extracting significant relationships in the biological data set.
Chi, E. H., Riedl, J. T., Shoop, E. and Barry, P. (2000).
Journal of Electronic Imaging: Special Issue on Visualization and Data Analysis. [PDF]
Getting Portals to Behave
Data visualization environments help users understand and analyze their data by permitting interactive browsing of graphical representations of the data. To further facilitate understanding and analysis, many visualization environments have special features known as portals, which are sub-windows of a data canvas. Portals provide a way to display multiple graphical representations simultaneously, in a nested fashion. This makes portals an extremely powerful and flexible paradigm for data visualization. Unfortunately, with this flexibility comes complexity. There are over a hundred possible ways each portal can be configured to exhibit different behaviors. Many of these behaviors are confusing and certain behaviors can be inappropriate for a particular setting. It is desirable to eliminate confusing and inappropriate behaviors. In this paper, we construct a taxonomy of portal behaviors and give recommendations to help designers of visualization systems decide which behaviors are intuitive and appropriate for a particular setting. We apply these recommendations to an example setting that is fully visually programmable and analyze the resulting reduced set of behaviors. Finally, we consider a real visualization environment and demonstrate some problems associated with behaviors that do not follow our recommendations.
Olston, C. and Woodruff, A. (2000).
InfoVis 2000, Salt Lake City 15-25. [PDF]
A Taxonomy of Visualization Techniques Using the Data State Reference Model
In previous work, researchers have attempted to construct taxonomies of information visualization techniques by examining the data domains that are compatible with these techniques. This is useful because implementers can quickly identify various techniques that can be applied to their domain of interest. However, these taxonomies do not help the implementers understand how to apply and implement these techniques. In this paper, we will extend and then propose a new way to taxonomize information visualization techniques by using the Data State Model. In fact, as the taxonomic analysis in this paper will show, many of the techniques share similar operating steps that can easily be reused. The paper shows that the Data State Model not only helps researchers understand the space of design, but also helps implementers understand how information visualization techniques can be applied more broadly.
Chi, E. H. (2000).
InfoVis 2000, Salt Lake City 69-75. [PDF]
Improving Electronic Guidebook Interfaces Using a Task-Oriented Design Approach
Item selection is a key problem in electronic guidebook design. Many systems do not apply so-called "context-awareness" technologies to infer user interest, placing the entire burden of selection on the user. Conversely, to make selection easier, many systems automatically eliminate information that they infer is not of interest to the user. However, such systems often eliminate too much information, preventing the user from finding what they want. To realize the full potential of electronic guidebooks, designers must strike the right balance between automatic context-based inference and manual selection. In this paper, we introduce a task-oriented model of item selection for electronic guidebooks to help designers explore this continuum. We argue that item selection contains three sub-tasks and that these sub-tasks should be considered explicitly in system design. We apply our model to existing systems, demonstrating pitfalls of combining sub-tasks, and discuss how our model has improved the design of our own guidebook prototype.
Aoki, P. M. and Woodruff, A. (2000).
DIS 2000, New York 319-325. [PDF]
Opportunities for Information Visualization
No Abstract Available
Mackinlay, J. D. (2000).
IEEE Computer Graphics and Applications 20(1).
WebLogger: A Data Collection Tool for Web-use Studies
Considering the amount of interest in studying Web-browsing behavior, there is a relative lack of tools for data collection in this area. Those tools that do exist have significant limitations on the data they are able to collect or on their suitability for efficient analysis. We present WebLogger, a tool which instruments Microsoft's Internet Explorer Web browser. We have found that WebLogger alleviates some of the problems associated with other approaches to browser-based data collection methods.
Reeder, R. W., Pirolli, P. and Card, S. K. (2000).
Xerox PARC, Palo Alto, CA. [PDF]
Guidelines for Using Multiple Views in Information Visualization
A multiple view system uses two or more distinct views to support the investigation of a single conceptual entity. Many such systems exist, ranging from computer-aided design (CAD) systems for chip design that display both the logical structure and the actual geometry of the integrated circuit to overview-plus-detail systems that show both an overview for context and a zoomed-in-view for detail. Designers of these systems must make a variety of design decisions, ranging from determining layout to constructing sophisticated coordination mechanisms. Surprisingly, little work has been done to characterize these systems or to express guidelines for their design. Based on a workshop discussion of multiple views, and based on our own design and implementation experience with these systems, we present eight guidelines for the design of multiple view systems.
Baldonado, M. Q. W., Woodruff, A. and Kuchinsky, A. (2000).
AVI 2000, Palermo, Italy. [PDF]
The Effect of Information Scent on Searching Information Visualizations of Large Tree Structures
Focus + context information visualizations have sought to amplify human cognition by increasing the amount of information immediately available to the user. We study how the focus + context distortion of the Hyperbolic Tree browser affects information foraging behavior in a task similar to the CHI '97 Browse Off. In comparison to a more conventional browser, Hyperbolic users searched more nodes, searched at a faster rate, and showed more learning. However, the performance of the Hyperbolic was found to be highly affected by "information scent", proximal cues to the value of distal information. Strong information scent made hyperbolic search faster than with a conventional browser. Conversely, weak scent put the hyperbolic tree at a disadvantage. There appears to be two countervailing processes affecting visual attention in these displays: strong information scent expands the spotlight of attention whereas crowding of targets in the compressed region of the Hyperbolic narrows it. The results suggest design improvements.
Pirolli, P., Card, S. K. and Van Der Wege, M. (2000).
AVI 2000, Palermo, Italy. [PDF]
Case Study: Resource Steering in a Visualization System
Visual computational steering environments extend traditional visualization environments by enabling the user to interactively steer the computations applied to the data. In this paper, we develop a new type of computational steering. "Resource steering" extends current visual steering techniques by providing machine resource estimation and control to the user. With resource steering, the user controls the execution of the computation on a parallel or distributed computer based on experimentally or theoretically derived estimates of the parallel performance of the computation. We demonstrate this extended steering model by applying it to an information visualization system that analyzes genetic sequence similarity reports. We show how our extended steering model enhances the user's ability to control visualization computations.
Chi, E. H. and Riedl, J. T. (2000).
Proceedings of the Joint Eurographics IEEE TCVG Symposium on Visualization (VisSym '00), Amsterdam, The Netherlands. [PDF]
Enhancing a Digital Book with a Reading Recommender
Digital books can significantly enhance the reading experience, providing many functions not available in printed books. In this paper we study a particular augmentation of digital books that provides readers with customized recommendations. We systematically explore the application of spreading activation over text and citation data to generate useful recommendations. Our findings reveal that for the tasks performed in our corpus, spreading activation over text is more useful than citation data. Further, fusing text and citation data via spreading activation results in the most useful recommendations. The fused spreading activation techniques outperform traditional text-based retrieval methods. Finally, we introduce a preliminary user interface for the display of recommendations from these algorithms.
Woodruff, A., Gossweiler, R., Pitkow, J., Chi, E. H. and Card, S. K. (2000).
CHI 2000, The Hague, The Netherlands 153-160. [PDF]
The Scent of a Site: A System for Analyzing and Predicting Information Scent, Usage, and Usability of a Web Site
Designers and researchers of users' interactions with the World Wide Web need tools that permit the rapid exploration of hypotheses about complex interactions of user goals, user behaviors, and Web site designs. We present an architecture and system for the analysis and prediction of user behavior and Web site usability. The system integrates research on human information foraging theory, a reference model of information visualization and Web data-mining techniques. The system also incorporates new methods of Web site visualization (Dome Tree, Usage Based Layouts), a new predictive modeling technique for Web site use (Web User Flow by Information Scent, WUFIS), and new Web usability metrics.
Chi, E. H., Pirolli, P. and Pitkow, J. (2000).
CHI 2000, The Hague, The Netherlands 161-168. [PDF]
Sensemaking of Evolving Web Sites using Visualization Spreadsheets
In the process of knowledge discovery, workers examine available information in order to make sense of it. By sensemaking, we mean interacting with and operating on the information with a variety of information processing mechanisms [3, 18]. Previously, we introduced a concept that uses the spreadsheet metaphor with cells containing visualizations of complex data. In this paper, we extend and apply a cognitive model called "visual sensemaking" to the Visualization Spreadsheet. We use the task of making sense of a large Web site as a concrete example throughout the paper for demonstration. Using a variety of visualization techniques, such as the Disk Tree and Cone Tree, we show that the interactions of the Visualization Spreadsheet help users draw conclusions from the overall relationships of the entire information set.
Chi, E. H. and Card, S. K. (1999).
Symposium on Information Visualization (InfoVis '99), San Francisco. [PDF]
Web Analysis Visualization Spreadsheet
In this paper, we present methods in information visualization that apply to the discovery of patterns in World-Wide Web sites. We hope to use techniques of information visualization to help in the organization and categorization of Web sites. We present a detailed case study of using the spreadsheet to analyze the content, usage, and structure of a large Web site. We demonstrate how the visualization spreadsheet principles apply in this specific data domain.
Chi, E. H. (1999).
ACM Digital Library Workshop on Organizing Web Space (WOWS '99), Berkeley, CA 24-31. [PDF]
Mining longest repeated subsequences to predict World Wide Web surfing
Modeling and predicting user surfing paths involves tradeoffs between model complexity and predictive accuracy. In this paper we explore predictive modeling techniques that appempt ot reduce model complexity while retaining predictive accuracy. We show that compared to various Markov models, longest repeating subsequence models are able to significantly reduce model size while retaining the ability to make accurate predictions. Sharp increases in the overall predictive capabilities of these models are achievable by models increases to the size of the number of predictions made.
Pitkow, J. E. and Pirolli, P. (1999).
Second USENIX Symposium on Internet Technologies and Systems. [PDF]
The Internet Edge: A Book of Changes
No Abstract Available
Stefik, M. J. (1999).
MIT Press.
Information Foraging
Information Foraging Theory is an approach to understanding how strategies and technologies for information seeking, gathering and consumption are adapted to the flux of information in the environment. The theory assumes that people, when possible, will modify their strategies or the structure of the environment to maximize their rate of gaining valuable information. Field studies inform the theory by illustrating that people do freely structure their environments and their strategies to yield high gains in information foraging. The theory is developed by (a) adaptation (rational) analysis of information foraging problems and (b) a detailed process model (ACT-IF). The adaptation analysis develops (a) information patch models, which deal with time allocation and information filtering and enrichment activities in environments in which information is encountered in clusters (e.g. bibliographic collections), (b) information scent models which address the identification of information value from proximal cues, and (c) information diet models which address decisions about the selection and pursuit of information items. ACT-IF is developed to instantiate these rational models and to fit the moment-by-moment behavior of people interacting with complex information technology. ACT-IF is a production system in which the information scent of bibliographic stimuli is calculated by spreading activation mechanisms. Time allocation and item selection heuristics make use of information sent to select production rules in ways that maximize information foraging activities.
Pirolli, P. and Card, S. K. (1999).
Psychological Review 106(4): 643-675. [PDF]
Cognitive Architectures and Cognitive Engineering Models in Human-Computer Interaction
(First paragraph) We engage our physical and social environments through highly evolved technologies, in interactions that often require sophisticated knowledge and virtuoso performance. Finding the order underlying the complexity of these interactions may be one of the most daunting challenges facing science. Over the past few decades, the study of human-computer interaction (HCI) has become an increasingly important arena for pursuing this challenge. HCI is a discipline concerned with the study and design of interactive computing systems used by people towards satisfying their goals. HCI has become an arena in which new computer applications can benefit from new cognitive engineering models that synthesize results from sound cognitive science. It has also become a useful testbed for cognitive architectures, which are integrated theories of psychological mechanisms that aim to predict complex learning, cognition, and performance. This chapter provides an overview of cognitive engineering models and cognitive architectures in the context of HCI.
Pirolli, P. (1999).
The handbook of applied cognition. John T. Wiley, Sussex, England. [PDF]
VIDA (Visual Information Density Adjuster)
Multiple studies have shown that clutter or sparsity in visual representations can have negative effects ranging from decreased user performance to diminished visual appeal. We have developed a system that assists users in the construction and navigation of visualizations with appropriate visual information density. This system, VIDA (Visual Information Density Adjuster), applies a cartographic principle to minimize clutter and sparsity in visual displays of information.
Woodruff, A., Landay, J. and Stonebraker, M. (1999).
CHI '99, Pittsburgh, PA. [PDF]
Distributions of Surfers' Paths Through the World Wide Web: Empirical Characterization
Surfing the World Wide Web (WWW) involves traversing hyperlink connections among documents. The ability to predict surfing patterns could solve many problems facing producers and consumers of WWW content. We analyzed WWW server logs for a WWW site, collected over ten days, to compare different path reconstruction methods and to investigate how past surfing behavior predicts future surfing choices. Since log files do not explicitly contain user paths, various methods have evolved to reconstruct user paths. Session times, number of clicks per visit, and Levenshtein Distance analyses were performed to show the impact of various reconstruction methods. Different methods for measuring surfing patterns were also compared. Markov model approximations were used to model the probability of users choosing links conditional on past surfing paths. Information-theoretic (entropy) measurements suggest that information is gained by using longer paths to estimate the conditional probability of link choice given surf path. The improvements diminish, however, as one increases the length of path beyond one. Information-theoretic (Total Divergence to the Average entropy) measurements suggest that the conditional probabilities of link choice given surf path are more stable over time for shorter paths than longer paths. Direct examination of the accuracy of the conditional probability models in predicting test data also suggests that shorter paths yield more stable models and can be estimated reliably with less data than longer paths.
Pirolli, P. and Pitkow, J. E. (1999).
World Wide Web 2(1-2): 29-45. [PDF]
A Framework for Information Visualization Spreadsheets
Information has become interactive. Information visualization is the design and creation of interactive graphic depictions of information by combining principles in the disciplines of graphic design, cognitive science and interactive computer graphics. As the volume and complexity of the data increases, users require more powerful visualization tools that allow them to more effectively explore large abstract datasets. This thesis seeks to make information visualization more accessible to potential users by creating a "Visualization Spreadsheet", where each cell can contain an entire set of data represented using interactive graphics. Just as a numeric spreadsheet enables exploration of numbers, a visualization spreadsheet enables exploration of visual forms of information. Unlike numerical spreadsheets, which store only simple data elements and formulas in each cell, a cell in the Visualization Spreadsheet can hold an entire abstract data set, selection criteria, viewing specifications, and other information needed for a full-fledged information visualization. Similarly, intra-cell and inter-cell operations are far more complex, stretching beyond simple arithmetic and string operations to encompass a range of domain-specific operators. The complexity of operations and interactions requires a visualization framework that is easily understandable to both end-users and visualization designers. This thesis develops and discusses the general utility of a novel visualization framework, and validates the framework by applying it to various visualization techniques and showing several systems that illustrate some of these research issues. We show that the spreadsheet approach facilitates certain visual user tasks that are more difficult using other approaches. The underlying approach in our work allows domain experts to define new data types and data operations and enable visualization experts to incorporate new visualizations, viewing parameters, and view operations.
Chi, E. H. (1999).
University of Minnesota.
Summary of WWW Characterizations
To date there have been a number of efforts that attempt to characterize various aspects of the World Wide Web. This paper presents a summary of these efforts, highlighting regularities and insights that have been discovered across the variety of access points available for instrumentation. Characterizations that are derived from client, proxy, and server instrumentation are reviewed as well as effort to characterize the entire structure of the WWW. Given the dynamic nature of the Web, it may be surprising for some readers to find that many properties of the Web follow regular and predictable patterns that have not changed in form over the Web's lifetime. Understanding theses aspects as well as those that vary is critical to designing a better Web, and as a direct consequence, creating amore enjoyable user experience.
Pitkow, J. E. (1998).
Web Journal 2(1-2): 3-13. [PDF]
Perception and Its Application in Computer Graphics
No Abstract Available
Ferwerda, J., Gossweiler, R., Healey, C., Interrante, V. and Reingans, P. (1998).
.
Information Visualization: Using Vision to Think
No Abstract Available
Card, S. K., Mackinlay, J. D. and Shneiderman, B. (1998).
Morgan-Kaufmann, San Francisco, California.
Fluid Links for Informed and Incremental Link Transitions
We have developed a novel user interface technique for hypertext, called fluid links, that has several advantages over current methods. Fluid links provide additional information at a link source to support readers in choosing among links and understanding the structure of hypertext. Fluid links present this information in a convenient location that does not obscure the content or layout of the source material. The technique uses perceptually-based animation to provide a natural and lightweight feeling to readers. In their richer forms, fluid links can provide a novel hypertext navigation paradigm that blurs the boundaries of hypertext nodes and can allow readers to fluidly control the focus on the material to support their current reading goals.
Zellweger, P. T., Chang, B.-W. and Mackinlay, J. D. (1998).
Proceedings of Hypertext'98 50-57. [PDF]
Goal-Directed Zoom
We introduce a novel zoom method, goal-directed zoom. In a goal directed zoom system, users specify which representation of an object they wish to see. The system automatically zooms to the elevation at which that representation appears at appropriate detail. We have extended a database visualization environment to support end-user construction of visualization that have goal directed zoom. We present a sample visualization we have constructed using this environment.
Woodruff, A., Landay, J. and Stonebraker, M. (1998).
CHI '98, Los Angeles 305-6. [PDF]
Constant Information Density in Zoomable Interfaces
We introduce a system that helps users construct interactive visualizations with constant information density. This work is an extension of the DataSplash database visualization environment. DataSplash is a direct manipulation system in which users can construct and navigate visualizations. Objects' appearances change as users zoom closer to or further away from the visualization. Users specify graphically the point at which these changes occur. Our experience with DataSplash indicates that users find it difficult to construct visualizations that display an appropriate amount of detail. In this paper, we introduce an extension to DataSplash based on the Principle of Constant Information Density. Thins extension gives users feedback about the density of visualizations as they create them. We also introduce an extension that suggests improvements to existing visualizations. We have performed an informal study of user navigation in applications with and without constant information density. We suggest that designers take density into account when designing applications to avoid biasing user navigation in unexpected ways.
Woodruff, A., Landay, J. and Stonebraker, M. (1998).
Advanced Visual Interfaces '98, L'Aquila, Italy 57-65. [PDF]
Constant Density Visualizations of Non-Uniform Distributions of Data
The cartographic Principle of Constant Information Density suggests that the amount of information in an interactive visualization should remain constant as the user pans and zooms. In previous work, we presented a system, VIDA (Visual Information Density Adjuster), which helps users manually construct applications in which overall display density remains constant. In the context of semantic zoom systems, this approach ensures uniformity in the z dimension, but does not extend naturally to ensuring uniformity in the x and y dimensions. In this paper, we present a new approach that automatically creates displays that are uniform in the x, y, and z dimensions. In the new system, users express constraints about visual representations that should appear in the display. The system applies these constraints to subdivisions of the display such that each subdivision meets a target density value. We have implemented our technique in the DataSplash/VIDA database visualization environment. We describe our algorithm, implementation, and the advantages and disadvantages of our approach.
Woodruff, A., Landay, J. and Stonebraker, M. (1998).
UIST '98, San Francisco 19-28. [PDF]
Summary of WWW Characterizations
To date there have been a number of efforts that attempt to characterize various aspects of the World Wide Web. This paper represents a summary of these efforts, highlighting regularities and invariants that have been discovered.
Pitkow, J. E. (1998).
The Seventh International World Wide Web Conference, Brisbane, Australia. [PDF]
A Theory of the Measurement of Knowledge Content, Access, and Learning
We develop an approach to the measurement of knowledge content, knowledge access and knowledge learning. This approach has two elements: First we describe a theoretical view of cognition, called the Newell-Dennett framework, which we see as being particularly favorable to the development of a measurement approach. Then, we describe a class of measurement models, based on Rasch modeling, which we see as being particularly favorable to the development of cognitive theories. Knowledge content and access are viewed as determining the observable actions selected by an agent in order to achieve desired goals in observable situations. To the degree that models within the theory fit the data at hand, one considers measures of observed behavior to be manifestations of intelligent agents having specific classes of knowledge content and varying degrees of access to that knowledge. Although agents, environment, and knowledge are constitutively defined (in terms of one another), successful application of our theory affords separation of parameters associated with the person from those associated with the environment. We present and discuss two examples of measurement models developed within our approach that address the evolution of cognitive skill, strategy choice and application, and developmental changes in mixtures of strategy use.
Pirolli, P. and Wilson, M. (1998).
Psychological Review 105: 58-82. [PDF]
Information Foraging Models of Browsers for Very Large Document Spaces
Information Foraging (IF) Theory addresses user strategies and technology for seeking, gathering, and using on-line information. We present IF-based models and evaluations of two interfaces: the Scatter/Gather browser for large document collections, and the Butterfly interface for surfing the citation link structure of scientific literatures. A computational cognitive model, ACT-IF, models observed users by assuming that they have heuristics that optimize their information foraging behavior in accordance with IF theory.
Pirolli, P. and Card, S. K. (1998).
AVI '98, L'Aquilla, Italy 83-93. [PDF]
Exploring Browser Design Trade-Offs Using a Dynamical Model of Optimal Information Foraging
Designers and researchers of human-computer interaction need tools that permit the rapid exploration and management of hypotheses about complex interactions of designs, task conditions, and user strategies. Dynamic programming is introduced as a such a tool for the analysis of information foraging technologies. The technique is illustrated in the context of the Scatter/Gather text clustering browser. Hypothetical improvements in browser speed and text clustering are examined in the context of variations in task deadlines and the quality of the document repository. A complex and non-intuitive set of tradeoffs emerge from even this simple space of factors, illustrating the general utility of the approach.
Pirolli, P. (1998).
Conference on Human Factors in Computing Systems, CHI '98, Los Angeles 33-40. [PDF]
Report of the 7-8 May 1998 ONR/CNMOC Interactive METOC Working Group
(First paragraph) The purpose of this workshop was to develop an appreciation and understanding of the METOC function for Naval Operations. Then, using a scenario-based group discussion method, develop recommendations to facilitate the METOC function.
ONR/CNMOC Working Group (1998).
Office of Naval Research, San Diego, California. [PDF]
Fluid Visualization of Spreadsheet Structures
Spreadsheets augment a visible tabular layout with invisible formulas. Direct manipulations of the tabular layout may or may not result in the desired changes to the formulas. The user is forced to explore the individual cells to find, verify, and modify the formulas, which causes heavy cognitive overhead. We present a set of techniques that make these formulas and their resulting dataflow structure easily accessible while maintaining the natural appearance of the spreadsheet. Transient local views visualize dataflow structures associated with individual cells, while static global views and animated global explanations visually present the entire dataflow structure at once. Semantic navigation enables the user to navigate through the dataflow structure interactively, and visual editing techniques make it possible to construct formulas using graphical editing techniques. Central to these techniques is the use of animation and lightweight interaction for rapid and non-intrusive visualization. Our prototype implementation suggests that these techniques can greatly improve the expressive power of current spreadsheets as well as other applications that have rich underlying structures.
Igarashi, T., Zellweger, P. T., Chang, B.-W. and Mackinlay, J. D. (1998).
Proceedings of Visual Languages'98. [PDF]
Strong Regularities in World Wide Web Surfing
One of the most common modes of accessing information in the World Wide Web is surfing from one document to another along hyperlinks. Several large empirical studies have revealed common patterns of surfing behavior. A model that assumes that users make a sequence of decisions to proceed to another page, continuing as long as the value of the current page exceeds some threshold, yields the probability distribution for the number of pages that a user visits within a given Web site. This model was verified by comparing its predictions with detailed measurements of surfing patterns. The model also explains the observed Zipf-like distributions in page hits observed at Web sites.
Huberman, B. A., Pirolli, P., Pitkow, J. and Lukose, R. J. (1998).
Science 280: 95-97. [PDF]
Information Visualization
No Abstract Available
Gershon, N., Eick, S. and Card, S. K. (1998).
Interactions March-April.
An Operator Interaction Framework for Visualization Systems
Information visualization encounters a wide variety of different date domains. The visualization community has developed representation methods and interactive techniques. As a community, we have realized that the requirements in each domain are often dramatically different. In order to easily apply existing methods, researchers have developed a semiology of graphic representation. We have extended this research into a framework that includes operators and interactions in visualization systems, such as a visualization spreadsheet. We discuss properties of this framework and use it to characterize operations spanning a variety of different visualization techniques. The framework developed in this paper enables a new way of exploring and evaluating the design space of visualization operators, and helps end-users in their analysis tasks.
Chi, E. H. and Riedl, J. T. (1998).
Symposium on Information Visualization (InfoVis '98), Research Triangle Park, North Carolina 63-70. [PDF]
Principles for Information Visualization Spreadsheets
Spreadsheets have proven highly successful for interacting with numerical data, such as applying algebraic operations, refining data propagation relationships, manipulating rows or columns, and exploring "what-if" scenarios. Spreadsheet techniques have recently been extended from numeric domains to other domains. Here we present a spreadsheet approach to displaying and exploring information visualizations, with large, abstract, multidimensional data sets that are visually represented in multiple ways. We illustrate how spreadsheet techniques provide a structured, intuitive, and powerful interface for investigating information visualizations. An earlier version of this article appeared in the proceedings of the 1997 Information Visualization Symposium. Here we refocus the discussion to illustrate principles that make the spreadsheet approach powerful. These principles show how we can perform many user tasks easily in the visualization spreadsheet that prove much more difficult using other approaches.
Chi, E. H., Riedl, J., Barry, P. and Konstan, J. (1998).
Computer Graphics and Applications: 30-38. [PDF]
Visualizing the Evolution of Web Ecologies
Several visualizations have emerged which attempt to visualize all or part of the World Wide Web. Those visualizations, however, fail to present the dynamically changing ecology of users and documents on the Web. We present new techniques for Web Ecology and Evolution Visualization (WEEV). Disk Trees represent a discrete time slice of the Web ecology. A collection of Disk Tress forms a Time Tube, representing the evolution of the Web over longer periods of time. These visualizations are intended to aid authors and webmasters with the production and organization of content, assist Web surfers making sense of information, and help researchers understand the Web.
Chi, E. H., Pitkow, J., Mackinlay, J., Pirolli, P., Gossweiler, R. and Card, S. K. (1998).
ACM Conference on Human Factors in Software (CHI '98), Los Angeles 400-407, 644-645. [PDF]
A Negotiation Architecture for Fluid Documents
The information presented in a document often consists of primary content as well as supporting material such as explanatory notes, detailed derivations, illustrations, and the like. We introduce a class of user interface techniques for fluid documents that supports the reader's shift to supporting material while maintaining the context of the primary material. Our approach initially minimizes the intrusion of supporting material by presenting it as a small visual cue near the annotated primary material. When the user expresses interest in the annotation, it expands smoothly to a readable size. At the same time, the primary material makes space for the expanded annotation. The expanded supporting material must be given space to occupy, and it must be made salient with respect to the surrounding primary material. These two aspects, space and salience, are subject to a negotiation between the primary and supporting material. This paper presents the components of our fluid document techniques and describes the negotiation architecture for ensuring that the presentations of both primary and supporting material are honored.
Chang, B.-W., Mackinlay, J. D., Zellweger, P. T. and Igarashi, T. (1998).
Proceedings of UIST'98, ACM Symposium on User Interface Software and Technology 123-132. [PDF]
A Spreadsheet Approach to Information Visualization
In information visualization, as the volume and complexity of the date increases, researchers require more powerful visualization tools that enable them to more effectively explore multidimensional datasets. In this paper, we discuss the general utility of a novel visualization spreadsheet framework. Just as a numerical spreadsheet enables exploration of numbers, a visualization spreadsheet enables exploration of visual forms of information. We show that the spreadsheet approach facilitates certain information visualization tasks that are more difficult using other approaches. Unlike traditional spreadsheets, which store only simple data elements and formulas in each cell, a visualization spreadsheet cell can hold an entire complex data set, selection criteria, viewing specifications, and other information needed for a full-fledged information visualization. Similarly, inter-cell operations are far more complex, stretching beyond simple arithmetic and string operations to encompass a range of domain-specific operators. We have built two prototype systems that illustrate some of these research issues. The underlying approach in our work allows domain experts to define new data types and data operations, and enables visualization experts to incorporate new visualizations, viewing parameters, and view operations.
Chi, E. H., Barry, P., Riedl, J. and Konstan, J. (1997).
Symposium on Information Visualization (InfoVis '97), Phoenix, AZ 17-24. [PDF]
Supporting Fine-Grained Data Lineage in a Database Visualization Environment
The lineage of a datum records its processing history. Because such information can be used to trace the source of anomalies and errors in processed data sets, it is valuable to users for a variety of applications including investigation of anomalies and debugging. Traditional data lineage approaches rely on metadata. However, metadata does not scale well to fine-grained lineage, especially in large data sets. For example, it is not feasible to store all of the information necessary to trace from a specific floating point value in a processed data set to a particular satellite image pixel in a source data set. In this paper, we propose a novel method to support fine-grained data lineage. Rather than relying on metadata, our approach lazily computes lineage using a limited amount of information about the processing operators and the base data. We introduce the notions of weak inversion and verification. While our system does not perfectly invert the data, it uses weak inversion and verification to provide a number of guarantees about the lineage it generates. We propose a design for the implementation of weak inversion and verification in an object-relational database management system.
Woodruff, A. and Stonebraker, M. (1997).
13th Int'l Conf. on Data Engineering, Birmingham, England 91-102. [PDF]
In Search of Reliable Usage Data on the WWW
The WWW is currently the hottest test-bed for future interactive digital systems. While much is understood technically about how the WWW functions, substantially less is known about how this technology is used collectively and on an individual basis. This disparity of knowledge exists largely as a direct consequence of the decentralized nature of Web. Since each user of the Web is not uniquely identifiable across the system and the system employs various levels of caching, measurement of actual usage is problematic. This paper establishes terminology to frame the problem of reliably determining usage of WWW resources while reviewing current practice and their shortcomings. A review of the various metrics and analyses that can be performed to determine usage is then presented. This is followed by a discussion of the strengths and weaknesses of the hit-metering proposal [Mogul and Leach 1997] currently in consideration by the HTTP working group. Lastly, new proposals, based upon server-side sampling are introduced and assessed against the other proposal. It is argued that server-side sampling provides more reliable and useful usage data while requiring no change to the current HTTP protocol and enhancing user privacy.
Pitkow, J. E. (1997).
The Sixth International World Wide Web Conference, Santa Clara, California. [PDF]
A Spreadsheet Approach to Information Visualization
In information visualization, as the volume and complexity of the data increases, researchers require more powerful visualization tools that allow them to more effectively explore multi-dimensional datasets. In this paper, we show a novel new visualization framework built upon the spreadsheet metaphor, where each cell can contain an entire dataset. Just as a numerical spreadsheet enables exploration of numbers, a visualization spreadsheet enables exploration of visualizations of data. Our prototype spreadsheet enabled users to compare visualizations in cells using the tabular layout. Users can use the spreadsheet to display, manipulate, and explore multiple visual representation techniques for their data. By applying different operations to the cells, we showed how visualization spreadsheets afford the construction of 'what-if' scenarios. The possible set of operations that users can apply consists of animation, filtering, and algebraic operators.
Chi, E. H., Konstan, J., Barry, P. and Riedl, J. (1997).
ACM Symposium on User Interface Software and Technology (UIST '97) 79-80. [PDF]
Characterizing World Wide Web Ecologies
One of the fastest growing sources of information today is the World Wide Web (WWW), having grown from only fifty sources of information in January of 1993 to over a half million four years later. The exponential growth of information within the Web has created an overabundance of information and a poverty of human attention, with users citing the inability to navigate and find relevant information on the Web as one of the biggest problems facing the Web today. The primary goal of the research presented here is to put forth new techniques and models that can be used to help efficiently manage people's attentional processes when dealing with large, unstructured, heterogeneous information environments. The primary model is based upon the desirability of items on the Web. This research searches for lawful patterns of structure, content, and use. Methods are developed to exploit these patterns to organize and optimize users' information foraging and sensemaking activities. The enhancements rely on predicting, categorization and allocation of attention. Several methods are explored for inducing categorical structures for the WWW. Some of these enhancements involve clustering in a high-dimensional space of content, use, and structural features. Others derive from cocitation analysis methods used in the study of scientific communities. A user would also be aided by retrieval mechanisms that predicted and returned the most likely needed WWW pages, given that the user is attending to some given page(s). The approach of this research uses a spreading activation mechanism to predict the needed, relevant information, computed using past usage patterns, degree of shared content, and WWW hyperlink structure.
Pitkow, J. E. (1997).
Xerox PARC, Palo Alto, CA. [PDF]
The Evolutionary Ecology of Information Foraging
We present Information Foraging Theory as an approach to understanding how strategies and technologies for information seeking, gathering, and consumption are adapted to the flux of information in the cultural environment. The theory is developed within an evolutionary-ecological framework that includes analysis of adaptation, knowledge, and cognition. The theory is applied to field studies, controlled experiments, and technology design. We present the Information Diet Model and Information Patch Residence Time Model as optimization models of information foraging under some strong constraints. These are used to develop a specific production system model called ACT-IF that predicts the fine-grained information seeking and gathering behavior of participants using a sophisticated document browsing system. We also present the Overlapped Patch Foraging with Queueing Model to address situations in which information search and information handling may occur in parallel, the Extreme Variance Rule which deals with information foraging under deadlines and uncertainty, a general class of Dynamic Information Foraging Models, and the Hogg-Huberman Model of the phase space of cost functions for heuristic information search.
Pirolli, P. and Card, S. (1997).
Xerox PARC, Palo Alto, CA. [PDF]
Analyzing Differences Between Internet Information System Software Architectures
No Abstract Available
Abowd, G., Pitkow, J. and Kazman, R. (1996).
International Communications Conference 1996 (ICC 96), Dallas, Texas.
The Human, the Computer, the Task, and their Interaction-Analytic Models and Use-Centered Design
No Abstract Available
Card, S. K. (1996).
Mind Matters: A Tribute to Allen Newell. Erlbaum, Hillsdale, New Jersey.
An Investigation of Documents on the World World Web
We report on our examination of pages from the World Wide Web. We have analyzed data collected by the Inktomi Web crawler (this data currently comprises over 2.6 million HTML documents). We have examined many characteristics of these documents, including: document size; number and types of tags, attributes, file extensions, protocols, and ports; the number of in-links; and the ratio of document size to the number of tags and attributes. For a more limited set of documents, we have examined the following: the number and types of syntax errors and readability scores. These data have been aggregated to create a number of ranked lists, e.g., the ten most-used tags, the ten most common HTML errors.
Woodruff, A., Aoki, P. M., Brewer, E., Gauthier, P. and Rowe, L. A. (1996).
5th Int'l Conf. on the World Wide Web, Paris 963-980. [PDF]
Predicting Document Access in Large Multimedia Repositories
No Abstract Available
Recker, M. M. and Pitkow, J. E. (1996).
Transactions on Computer-Human Interaction 3(4): 352-375.
Emerging Trends in the WWW User Population
Vast amounts of attention and resources have recently been devoted toward the World Wide Web (WWW) [1], but relatively little research has been conducted examining Web usage and societal implications. With the goals of understanding the Web user population and promoting the Web as a viable surveying medium, the WWW User Surveys were initially conducted by Georgia Institute of Technology's Graphics, Visualization, and Usability center during January of 1994. Subsequent surveys have been administered approximately every six months thereafter. Each survey is conducted for one month using the limited interactivity of the Web, where users point and click on responses within their Web browsers and submit results to a centralized server for processing. The first survey [6] was administered during January 1994 and received over 1,500 responses, which was a considerable amount at that time. This response rate, along with tremendous positive feedback from the Web community, justified continuing the surveys. The second survey (October 1994) [7] employed an extended and refined question base, which included a set of questions developed by the University of Michigan's Hermes Team regarding consumer attitudes toward electronic commerce. The response rate continued to grow significantly, recording over 4,500 unique users. This tremendous growth has continued through the third and fourth surveys (April and October 1995) [4,5], with 13,000 and 23,300 users responding, respectively. Based upon current estimates, the last two surveys were completed by nearly one out of every thousand web users [2,3]. We expect this trend to continue for the fifth survey, the results of which will be available in mid-June 1996.
Pitkow, J. E. and Kehoe, C. M. (1996).
Communications of the ACM 39(6). [PDF]
Supporting the Web: A Distributed Hyperlink Database System
In its current implementation, the World-Wide Web lacks much of the explicit structure and strong typing found in many closed hypertext systems. While this property has directly fueled the explosive acceptance of the Web, it further complicates the already difficult problem of identifying usable structures and aggregates in large hypertext collections. These reduced structures, or localities, form the basis to simplifying visualizations of and navigation through complex hypertext systems. Much of the previous research into identifying aggregates utilize graph theoretic algorithms based upon structural topology, i.e., the linkages between items. Other research has focused on content analysis to form document collections. This paper presents our exploration into techniques that harness both the topology and textual similarity between items as well as integrate new analyses based upon actual usage of the Xerox's WWW space. Linear equations and spreading activation models are employed to arrange Web pages based upon functional categories, node types, and relevancy.
Pitkow, J. E. and Jones, R. K. (1996).
The Fifth International World Wide Web Conference, Paris, France. [PDF]
Silk from a Sow's Ear: Extracting Usable Structures from the Web
In its current implementation, the World Wide Web lacks much of the explicit structure and strong typing found in many closed hypertext systems. While this property probably relates to the explosive acceptance of the Web, it further complicates the already difficult problem of identifying usable structures and aggregates in large hypertext collections. These reduced structures, or localities, form the basis for simplifying visualizations of and navigation through complex hypertext systems. Much of the previous research into identifying aggregates utilize graph theoretic algorithms based upon structural typology, i.e. the linkage between items. Other research has focused on content analysis to form document collections. This paper presents our exploration into techniques that utilize both the topology and textual similarity between items as well as usage data collected by servers and page meta-information like title and size. Linear equations and spreading activation models are employed to arrange Web pages based upon functional categories, node types and relevancy.
Pirolli, P., Pitkow, J. and Rao, R. (1996).
Conference on Human Factors in Computing Systems, CHI '96, Vancouver, Canada. [PDF]
Surveying the Territory: GVU's Five WWW User Surveys
Five years is not very long on most historical scales, but for the World Wide Web (WWW) it constitutes a lifetime. A question almost as old as the web itself is, "Who is using it, and for what?" One way to answer this question is to use paper surveys, telephone survey, or diaries which are some of the same methods used to measure the audiences of other one-way media such as television and radio. However, something interesting happened in early 1994: the implementation of HTML Forms turned the web into a two-way medium which made it possible to contact the audience directly. To test the viability of the web as a survey medium and collect preliminary data on the web population, the first GVU WWW User Survey was conducted in January 1994. Subsequent surveys have been conducted approximately every six months. The collection of responses from over 55,000 Web users over five surveys has given us a unique perspective on the advances in surveying technology and methodology and changes in the web population itself. In the following sections we discuss what we have learned in each of these areas.
Kehoe, C. M. and Pitkow, J. E. (1996).
The World Wide Web Journal 1(3). [PDF]
Flexible Information Visualization of Multivariate Data from Biological Sequence Similarity Searches
Information visualization faces challenges presented by the need to represent abstract data and the relationships within the data. Previously, we presented a system for visualizing similarities between a single DNA sequence and a large database of other DNA sequences [6]. Similarity algorithms generate similarity information in textual reports that can be hundreds or thousands of pages long. Our original system visualized the most important variables from these reports. However, the biologists we work with found this system so useful they requested visual representations of other variables. We present an enhanced system for interactive exploration of this multivariate data. We identify a larger set of useful variables in the information space. The new system involves more variables, so it focuses on exploring subsets of the data. We present an interactive system allowing mapping of different variables to different axes, incorporating animation using a time-axis, and providing tools for viewing subsets of the data. Detail-on-demand is preserved by hyperlinks to the analysis reports. We present three case studies illustrating the use of these techniques. The combined technique of applying a time axis with a 3D scatter plot and query filters to visualization of biological sequenced similarity data is both powerful and novel.
Chi, E. H., Riedl, J., Shoop, E., Carlis, J., Retzel, E. and Barry, P. (1996).
IEEE Visualization '96 133-140, 477. [PDF]
The WebBook and the Web Forager: An Information Workspace for the World-Wide Web
The World-Wide Web has achieved global connectivity stimulating the transition of computers from knowledge processors to knowledge sources. But the Web and its client software are seriously deficient for supporting users' interactive use of this information. This paper presents two related designs with which to evolve the Web and its clients. The first is the WebBook, a 3D interactive book of HTML pages. The WebBook allows rapid interaction with objects at a higher level of aggregation than pages. The second is the Web Forager, an application that embeds the WebBook and other objects in a hierarchical 3D workspace. Both designs are intended as exercises to play off against analytical studies of information workspaces.
Card, S. K., Robertson, G. G. and York, W. (1996).
ACM Conference on Human Factors in Software (CHI '96) 111-117. [PDF]
Tioga-2: A Direct Manipulation Database Visualization Environment
This paper reports on user experience with Tioga, a DBMS-centric visualization tool developed at Berkeley. Based on this experience, we have designed Tioga-2 as a direct manipulation system that is more powerful and much easier to program. A detailed design of the revised system is presented, together with an extensive example of its application.
Aiken, A., Chen, J., Stonebraker, M. and Woodruff, A. (1996).
12th Int'l Conf. on Data Engineering, New Orleans, LA 208-17. [PDF]
Shifting the Possible
No Abstract Available
Stefik, M. (1996).
Xerox PARC, Palo Alto, CA.
Letting Loose the Light: Igniting Commerce in Electronic Publication
In "The Digital Library Project: The World of Knowbots" in Part 1, Robert Kahn and Vinton Cerf ask, "If a thousand books are combined on a single CD-ROM and the acquirer of the CD-ROM only intends to read one of them, what sort of royalty arrangement is appropriate to compensate the copyright owners? How would compensation be extended for cases in which electronic copies are provided to users?" Their questions show how, in 1988, issues about copyright protection and payment for using information arose in the context of early CD-ROM distribution. By 1994 copyright issues had not only not been settled, they were coming to a boil. Laura Fillmore's effort to build a successful publishing business on the Internet reveals the limitations of what was practical in May of 1994. Although digital works were being sold on the Internet, provisions for commerce were primitive. Furthermore, the ease of copying digital works had led many people to believe that digital information should be free. Fast access to the network had made trading programs or other data as easy as mixing songs on audio tape. In short, it had become much simpler for network users to infringe copyright than to uphold it. This is the context for the oft-quoted statement by John Perry Barlow of the Electronic Freedom Foundation, "Copyright is dead." Advocates of free information argue that because you don't lose the original when you make a copy of a digital work, there should be no charge for copying information. The conventional wisdom among publishers in late-1994, when this article was written, was that digital containers for software were inherently leaky vessels and that no viable solution would ever be found.
Stefik, M. (1996).
Internet dreams: Archetypes, myths, and metaphors. MIT Press, Cambridge, MA. [PDF]
The Spreadcube: Visualizing Multidimensional Data
No Abstract Available
Tenev, T. and Rao, R. (1996).
Xerox PARC, Palo Alto, CA.
Scatter/Gather Browsing Communicates the Topic Structure of a Very Large Text Collection
Scatter/Gather is a cluster-based browsing technique for large text collections. Users are presented with automatically computed summaries of the contents of clusters of similar documents and provided with a method for navigating through these summaries at different levels of granularity. The aim of the technique is to communicate information about the topic structure of very large collections. We tested the effectiveness of Scatter/Gather as a simple pure document retrieval tool, and studied its effects on the incidental learning of topic structure. When compared to interactions involving simple keyword-based search, the results suggest that Scatter/Gather induces a more coherent conceptual image of a text collection, a richer vocabulary for constructing search queries, and communicates the distribution of relevant documents over clusters of documents in the collection.
Pirolli, P., Schank, P., Hearst, M. and Diehl, C. (1996).
Conference on Human Factors in Computing Systems (CHI '96), Vancouver, BC. [PDF]
Measuring Learning in LISP: An Application of the Random Coefficients Multinomial Logic Model
No Abstract Available
Draney, K. L., Wilson, M. and Pirolli, P. (1996).
Objective measurement: Theory in practice (Vol. 3): 195-218.
Table Lens as a Tool for Making Sense of Data
The Table Lens is a visualization for searching for patterns and outliers in multivariate datasets. It supports a lightweight form of exploratory data analysis (EDA) by integrating a familiar organization, the table, with graphical representations and a small set of direct manipulation operators. We examine the EDA process as a special case of a generic process, which we call sensemaking. Using a GOMS methodology, we characterize a few central EDA tasks and compare performance of the Table Lens and one of the best of the more traditional graphical tools for EDA i.e. Splus. This analysis reveals that Table Lens is more or less on par with the power of Splus, while requiring the use of fewer specialized graphical representations. It essentially combines the graphical power of Splus with the direct manipulation and generic properties of spreadsheets and relational database front ends. We also propose a number of design refinements that are suggested by our task characterizations and analyses.
Pirolli, P. and Rao, R. (1996).
Advanced Visual Interfaces Workshop, AVI '96, Gubbio, Italy. [PDF]
The Knowledge Crystallization Task: Information Foraging in the Wild
No Abstract Available
Pirolli, P., Card, S. K. and Gigch, J. P. v. (1996).
Xerox PARC, Palo Alto, CA.
Life, Death, and Lawfulness on the Electronic Frontier
To facilitate users' ability to make sense of large collections of hypertext we present two new techniques for inducing clusters of related documents on the World Wide Web. Users' ability to find relevant information might also be enhanced by finding lawful properties of document behavior and use. We present models and analyses of document use and change for the World Wide Web.
Pitkow, J. and Pirolli, P. (1997).
Conference on Human Factors in Computing Systems, CHI '97, Atlanta, GA 383-390. [PDF]
Computational Models of Information Scent-Following in a very Large Browsable Text Collection
An ecological-cognitive framework of analysis and a model-tracing architecture are presented and used in the analysis of data recorded from users browsing a large document collection. The users interacted with the Scatter/Gather browser, which clusters documents into groups of similar content and presents users with summaries of cluster content. Predictions made by a computational model of navigation and information foraging are matched against the observed activity.
Pirolli, P. (1997).
Conference on Human Factors in Computing Systems, CHI '97, Atlanta, GA 3-10. [PDF]
The Structure of the Information Visualization Design Space
Research on information visualization has reached the place where a number of successful point designs have been proposed and a number of techniques have been discovered. It is now appropriate to begin to describe and analyze portions of the design space so as to understand the differences among designs and to suggest new possibilities. This paper proposes an organization of the information visualization literature and illustrates it with a series of examples. The result is a framework for designing new visualizations and augmenting existing designs.
Card, S. K. and Mackinlay, J. (1997).
IEEE Symposium on Information Visualization, Phoenix, AZ 92-99. [PDF]
Visualizing Retrieved Information: A Survey
Clearly, the presentation method for information retrieved from the global information infrastructure (GII) makes a big difference to users. The NCSA Mosaic interface, for example, with its point-and-click multimedia page presentation, swelled popular interest in the World Wide Web. This experience suggests the possibility of increased usefulness if we apply visualization techniques to information retrieved from the GII. Note that, although the term "information visualization" is coming into use, the goal is really "information perceptualization." The latter implies a richer use of many senses, including sound and touch, to increase the rate at which people can assimilate and understand information. In discussing the visualization of retrieved information, it helps to consider four functional levels: (1) the infosphere, (2) the workspace, (3) sensemaking tools, and (4) the document. This simple classification lets us separate the functions served by the visualization from the techniques themselves, which can be applied across functional levels.
Card, S. K. (1996).
IEEE Computer Graphics and Applications 16(2, March): 63-67. [PDF]
Exploring Large Tables with the Table Lens
No Abstract Available
Rao, R. and Card, S. K. (1995).
ACM SIGCHI '95, Denver.
A Measurement Model for a Complex Cognitive Skill
No Abstract Available
Draney, K., Pirolli, P. and Wilson, M. (1995).
Cognitively diagnostic assessment. Lawrence Erlbaum Associates, Hillsdale, NJ: 103-125.
Pioneers and Settlers: Methods Used in Successful User Interface Design
(First paragraph): User interfaces, it has been said (Card, this volume), are an ineluctable part of interactive software systems. They are typically more than half the code, often far more. They typically cause more than half the problems, too - often far more. In short, they represent the sort of troublesome engineering problem that organizations would like to do something about.
Card, S. K. (1995).
Human-computer interface design: Success stories, emerging methods, real-world context. Morgan Kaufmann Publishers, Inc., San Francisco, California: 122-169. [PDF]
Navigation and Coordination Primitives for Multidimensional Browsers
This paper describes extensions to the Tioga flight-simulator browsing protocol presented by Stonebraker et al. (1993a). These extensions allow users to navigate a multidimensional data space using sophisticated zooming capabilities. This design also allows users to move easily between different multidimensional spaces. Tunneling between different data spaces is shown to be a substantial generalization of hyperlinks in a hypermedia system. Finally, our design provides for the coordination of multiple browsers. This preserves context and allows users to explore multiple paths simultaneously. In concert, these extensions incorporate the functionality of many information management paradigms as well as introducing new constructs. These powerful mechanisms for relating data provide users with great flexibility. For example, users can create magnifying glasses which show an enhanced view of the underlying data.
Woodruff, A., Su, A., Stonebraker, M., Paxson, C., Chen, J., Aiken, A., Wisnovsky, P. and Taylor, C. (1995).
3rd Conf. on Vis. Database Sys., Lausanne, Switzerland 360-371. [PDF]
Buffering of Intermediate Results in Dataflow Diagrams
Buffering of intermediate results in dataflow diagrams can significantly reduce latency when a user browses these results or re-executes a diagram with slightly different inputs. We define the optimal buffer allocation problem of determining the buffer contents which minimize the average response time to such user requests. We show that this problem has several characteristics which render traditional latency reduction techniques ineffective. Since optimal buffer allocation is NP-hard, we propose heuristic methods for buffer management of intermediate results. We present a simulation of the behavior of these heuristics under a variety of conditions, varying graph structure and access pattern. We argue that history mechanisms which track user access patterns can be used to improve performance. We further show that graph structure and access pattern determine the factor of improvement which is possible. The performance enhancements we describe can be applied to minimize query response time in visual dataflow languages.
Woodruff, A. and Stonebraker, M. (1995).
11th IEEE Symp. on Visual Languages, Darmstadt, Germany 187-194. [PDF]
The Relationship between the Rasch Model and Conjoint Measurement Structures
No Abstract Available
Wilson, M. and Pirolli, P. (1995).
University of California, Berkeley, CA.
Rich Interaction in the Digital Library
Effective information access involves rich interactions between users and information residing in diverse locations. Users seek and retrieve information from the sources-for example, file servers, databases, and digital libraries and use various tools to browse manipulate, reuse, and generally process the information. We have developed a number of techniques that support various aspects of the process of user/information interaction. These techniques can be considered attempts to increase the bandwidth and quality of the interactions between users and information in an information workspace - an environment designed to support information work (see Figure 1).
Rao, R., Pedersen, J. O., Hearst, M. A., Mackinlay, J. D., Card, S. K., Masinter, L., Halvorsen, P. K. and Robertson, G. G. (1995).
Communications of the ACM 38(4): 29-39. [PDF]
Using the Web as a Survey Tool: Results from the Second WWW User
No Abstract Available
Pitkow, J. E. and Recker, M. M. (1995).
Computer Networks and ISDN Systems 27(6).
Results from the Third World Wide Web User Survey
No Abstract Available
Pitkow, J. E. and Kehoe, C. M. (1995).
The World Wide Web Journal 1(1).
Towards an Intelligent Publishing Environment
No Abstract Available
Pitkow, J. E. and Jones, R. K. (1995).
Computer Networks and ISDN Systems 27(6).
Information Foraging and Sensemaking in Strategic Management: An Analysis of an in-situ Study
No Abstract Available
Pirolli, P. and van Gigch, J. P. (1995).
Xerox PARC, Palo Alto, CA.
Information Foraging in Information Access Environments
Information foraging theory is an approach to the analysis of human activities involving information access technologies. The theory derives from optimal foraging theory in biology and anthropology, which analyzes the adaptive value of food-foraging strategies. Information foraging theory analyzes trade-offs in the value of information gained against the costs of performing activity in human-computer interaction tasks. The theory is illustrated by application to information-seeking tasks involving a Scatter/Gather interface, which present users with a navigable, automatically computer overview of the contents of a document collection arranged as a cluster hierarchy.
Pirolli, P. and Card, S. K. (1995).
ACM Conference on Human Factors in Software (CHI '95), Denver, Colorado 51–58. [PDF]
An Organic User Interface for Searching Citation Links
This paper describes Butterfly, an Information Visualizer application for accessing DIALOG's Science Citation databases across the Internet. Network information often involves slow access that conflicts with the use of highly-interactive information visualization. Butterfly addresses this problem, integrating search, browsing, and access management via four techniques: 1) visualization supports the assimilation of retrieved information and integrates search and browsing activity, 2) automatically-created "link-generating" queries assemble bibliographic records that contain reference information into citation graphs, 3) asynchronous query processes explore the resulting graphs for the user, and 4) process controllers allow the user to manage these processes. We use our positive experience with the Butterfly implementation to propose a general information access approach, called Organic User Interfaces For Information Access, in which a virtual landscape grows under user control as information is accessed automatically.
Mackinlay, J., Rao, R. and Card, S. K. (1995).
ACM Conference on Human Factors in Software (CHI '95), Denver, Colorado 67-73. [PDF]
The Sequoia 2000 Electronic Repository
A major effort in the Sequoia 2000 project was to build a very large database of earth science information. Without providing the means for scientists to efficiently and effectively locate required information and to browse its contents, however, this vast database would rapidly become unmanageable and eventually unusable. The Sequoia 2000 Electronic Repository addresses theses problems though indexing and retrieval software that is incorporated into the POSTGRES database management system. The Electronic Repository effort involved the design of probabilistic indexing and retrieval for text documents in POSTGRES, and the development of algorithms for automatic georeferencing of text documents and segmentation of full texts into topically coherent segments for improved retrieval. Various graphical interfaces support these retrieval features.
Larson, R. R., Plaunt, C., Woodruff, A. G. and Hearst, M. A. (1995).
Digital Technical Journal 7(3): 50-65. [PDF]
A Focus + Context Technique Based on Hyperbolic Geometry for Visualizing Large Hierarchies
We present a new focus+context (fisheye) technique for visualizing and manipulating large hierarchies. Our technique assigns more display space to a portion of the hierarchy while still embedding it in the context of the entire hierarchy. The essence of this scheme is to lay out the hierarchy in a uniform way on a hyperbolic plane and map this plane onto a circular display region. This supports a smooth blending between focus and context, as well as continuous redirection of the focus. We have developed effective procedures for manipulating the focus using pointer clicks as well as interactive dragging, and for smoothly animating transitions across such manipulation. A laboratory experiment comparing the hyperbolic browser with a conventional hierarchy browser was conducted.
Lamping, J., Rao, R. and Pirolli, P. (1995).
ACM Conference on Human Factors in Computing Systems (CHI '95), Denver, Colorado. [PDF]
Visualization of Biological Sequence Similarity Search Results
Biological sequence similarity analysis presents visualization challenges, primarily because of the massive amounts of discrete, multi-dimensional data. Genomic data generated by molecular biologists is analyzed by algorithms that search for similarity to known sequences in large genomic databases. The output from these algorithms can be several thousand pages of text, and is difficult to analyze because of its length and complexity. We developed and implemented a novel graphical representation for sequence similarity search results, which visually reveals features that are difficult to find in textual reports. The method opens new possibilities in the interpretation of this discrete, multi-dimensional data by enabling interactive investigation of the graphical representation.
Chi, E. H., Barry, P., Shoop, E., Carlis, J., Retzel, E. and Riedl, J. (1995).
IEEE Visualization '95 44-51. [PDF]
Characterizing Browsing Strategies in the World-Wide Web
No Abstract Available
Catledge, L. D. and Pitkow, J. E. (1995).
Computer Networks and ISDN Systems 26(6): 1065-1073.
Training in Self-Explanation and Self-Regulation Strategies: Investigating the Effects of Knowledge Acquisition Activities on Problem Solving
No Abstract Available
Bielaczyc, K., Pirolli, P. and Brown, A. L. (1995).
Cognition and Instruction 13: 221-252.
A Simple Yet Robust Caching Algorithm Based Upon Dynamic Access Patterns
No Abstract Available
Pitkow, J. E. and Recker, M. M. (1994).
Proceedings of the Second International World Wide Web Conference, Chicago, IL.
Integrating Bottom-Up and Top-Down Analysis for Intelligent Hypertext
No Abstract Available
Pitkow, J. E. and Recker, M. M. (1994).
Third International Conference on Intelligent Knowledge Management, Maryland, Maryland.
Living in Augmented Reality: Ubiquitous Media and Reactive Environments
No Abstract Available
Buxton, B. (1995).
Imagina, Monte Carlo.
Human Skills in Interface Design
No Abstract Available
Buxton, W. A. S. (1994).
Interacting with virtual environments. Wiley, New York: 1-12.
Zooming and Tunneling in Tioga: Supporting Navigation in Multidimensional Space
In [7] we proposed a visual programming system called Tioga. The Tioga system applies a boxes and arrows programming notation to allow nonexpert users to graphically construct database applications. Users connect database procedures using a dataflow model. Browsers are used to visualize the resulting data. This paper describes extensions to the Tioga Browser protocol. These extensions allow sophisticated, flight-simulator navigation through a multidimensional data space. This design also incorporates wormholes to allow tunneling between different multidimensional spaces. Wormholes are shown to be substantial generalizations of hyperlinks in a hypertext system. These powerful mechanisms for relating data provide users with great flexibility. For example, users can create magnifying glasses that provide an enhanced view of the underlying data.
Woodruff, A., Wisnovsky, P., Taylor, C., Su, A., Stonebraker, M., Paxson, C., Chen, J. and A.Aiken (1994).
10th IEEE Symp. on Visual Languages, St. Louis 191-193. [PDF]
GIPSY: Automated Geographic Indexing of Text Documents
No Abstract Available
Woodruff, A. and Plaunt, C. (1994).
Journal of the American Society for Information Science (JASIS) 45(9): 645-655.
Modelling Individual Differences in Students' Learning Strategies
No Abstract Available
Recker, M. and Pirolli, P. (1994).
Journal of the Learning Sciences 4: 1-38.
Protofoil: Storing and Finding the Information Worker's Paper Documents in an Electronic File Cabinet
Although the document imaging industry has taken off in the last few years, document image filing for the individual information worker is still not widespread or effective. In this paper, we focus on building an electronic filing system for paper documents that supports the ad hoc, multifarious work of information workers. Motivated by interviews with researchers and a survey of descriptive studies of paper document filing, we have focussed on minimizing or delaying costs of document filing and supporting a rich variety of methods for accessing and using stored documents. We have implemented a prototype system called Protofoil for storing, retrieving, and manipulating paper documents as electronic images that integrates many user interface - paper and workstation - and information retrieval technologies. Protfoil has been tested through use in our laboratory, and has been deployed in a field study at a lawyer's office.
Rao, R., Card, S. K., Johnson, W., Klotz, L. and Trigg, R. (1994).
ACM Conference on Human Factors in Software (CHI '94), Boston, Massachusetts 180-185. [PDF]
The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus + Context Visualization for Tabular Information
We present a new visualization, called the Table Lens, for visualizing and making sense of large tables. The visualization uses a focus+context (fisheye) technique that works effectively on tabular information because it allows display of crucial label information and multiple distal focal areas. In addition, a graphical mapping scheme for depicting table contents has been developed for the most widespread kind of tables, the cases-by -variables table. The Table Lens fuses symbolic and graphical representation into a single coherent view that can be fluidly adjusted by the user. This fusion and interactivity enables an extremely rich and natural style of direct manipulation exploratory data analysis.
Rao, R. and Card, S. (1994).
ACM Conference on Human Factors in Software (CHI '94), Boston, Massachusetts. [PDF]
Results From the First World Wide Web User Survey
No Abstract Available
Pitkow, J. E. and Recker, M. M. (1994).
Computer Networks and ISDN Systems 27(2).
WebViz: A Tool for WWW Access Log Visualization
No Abstract Available
Pitkow, J. E. and Bharat, K. A. (1994).
The First International World Wide Web Conference, Geneva, Switzerland.
Learning Strategies and Transfer in the Domain of Programming
No Abstract Available
Pirolli, P. and Recker, M. (1994).
Cognition and Instruction 12: 235-275. [PDF]
Developing Calendar Visualizers for the Information Visualizer
Increasing masses of information confronting a business or an individual have created a demand for information management applications. Time-based information, in particular, is an important part of many information access tasks. This paper explores how to use 3D graphics and interactive animation to design visualizers that improve access to large masses of time-based information. Two new visualizers have been developed for the Information Visualizer: 1) the Spiral Calendar was designed for rapid access to an individual's daily schedule, and 2) the Time Lattice was designed for analyzing the time relationships among the schedules of groups of people. The Spiral calendar embodies a new 3D graphics technique for integrating detail and context by placing objects in a 3D spiral. It demonstrates that advanced graphic techniques can enhance routine office information tasks. The Spiral Calendar development process involved three major phases: 1) progressive design based on our experience using advanced graphics for user interfaces, 2) an implementation/evaluation cycle based on a new method for characterizing information access from dynamic displays, and 3) reuse of the spiral technique for a decision support visualization where the spiral helps the user keep track of decisions while designing the layout of aircraft cockpit controls. The Time Lattice, on the other hand has only been informally evaluated. However, it illustrates how different tasks require different advanced graphics technology. In particular, the Time Lattice uses of 2D transparent shadows to provide interactive access to a complex 3D object. Our experience developing theses visualizations should prove useful to others developing user interface that use advanced graphics.
Mackinlay, J. D., Robertson, G. G. and DeLine, R. (1994).
Proceedings of UIST'94, ACM Symposium on User Interface Software and Technology 109-118. [PDF]
The Cost-of-Knowledge Characteristic Function: Display Evaluation for Direct Walk Information Visualizations
In this paper we present a method, the Cost-of-Knowledge Characteristic Function, for characterizing information access from dynamic displays. The paper works out this method for a simple, but important, class of dynamic displays called direct-walk interactive information visualizations, in which information is accessed through a sequence of mouse selections and key selections. The method is used to characterize a simple calendar task for an application of the Information Visualizer, to compute the changes in characterization as the result of possible program variants, and to conduct empirical comparison between different systems with the same function.
Card, S. K., Pirolli, P. and Mackinlay, J. (1994).
CHI '94 Conference on Human Factors in Computing Systems, Boston, MA 238-244.
Collaborative Explanations and Metacognition: Identifying Successful Learning Activities in the Acquisition of Cognitive Skills
No Abstract Available
Bielaczyc, K., Pirolli, P. and Brown, A. (1994).
Sixteenth Annual Conference of the Cognitive Science Society, Atlanta, GA 39-44.
Forward
No Abstract Available
Card, S. K. (1993).
Human-Computer Interaction. Prentice-Hall, New York, New York.
The Cost Structure of Sensemaking
Making sense of a body of data is a common activity in any kind of analysis. Sensemaking is the process of searching for a representation and encoding data in that representation to answer task-specific questions. Different operations during sensemaking require different cognitive and external resources. Representation are chosen and changed to reduce the cost of operations in an information processing task. The power of theses representational shifts is generally under-appreciated as is the relation between sensemaking and information retrieval. We analyze sensemaking tasks and develop a model of the cost structure of sensemaking. We discuss implications for the integrated design of user interfaces, representational tools, and information retrieval systems.
Russell, D. M., Stefik, M. J., Pirolli, P. and Card, S. K. (1993).
INTERCHI '93 Conference on Human Factors in Computing Systems, Amsterdam 269-276. [PDF]
The Document Lens
This paper describes a general visualization technique based on a common strategy for understanding paper documents when their structure is not known, which is to lay the pages of a document in a rectangular array on a large table where the overall structure and distinguishing features can be seen. Given such a presentation, the user wants to quickly view parts of the presentation in detail while remaining in context. A fisheye view or a magnifying lens might be used for this, but they fail to adequately show the global context. The Document Lens is a 3D visualization for large rectangular presentations that allows the user to quickly focus on a part of a presentation while continuously remaining in context. The user grabs a rectangular lens and pulls it around to focus on the desired area at the desired magnification. The presentation outside the lens is stretched to provide a continuous display of the global context. This stretching is efficiently implemented with affine transformations, allowing text documents to be viewed as a whole with an interactive visualization.
Robertson, G. G. and Mackinlay, J. D. (1993).
Proceedings of UIST '93, Atlanta, Georgia 101-108. [PDF]
Non-Immersive Virtual Reality
The goal of virtual reality systems is to place the user in a three-dimensional environment that can be directly manipulated. Ideally, users cease to think of themselves as interacting with a computer and interact instead with the 2D environment. The usual definition of VR involves full immersion. That is, users wear head-mounted stereo displays to provide full visual immersion and special gloves that allow six-degree-of-freedom input for directly manipulating the environment. An alternative form of VR is being explored in a number of research labs. Nonimmersive VR also places the user in a 3D environment that can be directly manipulated, but it does so with a conventional graphics workstation using a monitor, a keyboard, and a mouse. The scene is displayed with the same 3D depth cues used in immersive VR: perspective view, hidden-surface elimination, color texture, lighting, shading, and shadows. As in immersive VR, animation and simulation are interactively controlled in response to the user's direct manipulation. Much of the technology used to support immersive and nonimmersive VR is the same. They use the same 3D modeling and rendering and many of the same interaction techniques.
Robertson, G. G., Card, S. K. and Mackinlay, J. D. (1993).
IEEE Computer 26(2 February): 81-83. [PDF]
Information Visualization Using 3D Interactive Animation
UI Innovations are often driven by a combination of technology advances and application demands. On the technology side, advances in interactive computer graphics hardware, coupled with low-cost mass storage, have created new possibilities for information retrieval systems in which UIs could play a more central role. On the application side, increasing masses of information confronting a business or an individual have created a demand for information management applications. In the 1980s, text-editing forced the shaping of the desktop metaphor and the now standard GUI paradigm. In the 1990s, it is likely that information access will be a primary force in shaping the successor to the desktop metaphor. This article presents an experimental system, the Information Visualizer (See Figure 1), which explores a UI paradigm that goes beyond the desktop metaphor to exploit the emerging generation of graphical personal computers and to support the emerging application demand to retrieve, store, manipulate, and understand large amounts of information. The basic problem is how to utilize advancing graphics technology to lower the cost of finding information and accessing it once found (the information's "cost structure").
Robertson, G. G., Card, S. K. and Mackinlay, J. D. (1993).
Communications of the ACM 36(4 April): 57-71. [PDF]
System Components for Embedded Information Retrieval from Multiple Disparate Information Sources
Current information retrieval interfaces only address a small part of the reality of rich interactions amongst user, task and information sources. We view information gathering as an interactive, iterative activity involving multiple disparate information sources and embedded in the context of broader processes of information use. We have developed two key system components that enable information workspaces that adhere to this reformulation of information retrieval. The first is a design for a user/system interaction model for retrieval from multiple, disparate information sources. The second is a repository modeling system, called Repo, that represents meta-information about different information repositories in a manner that supports system operation as well as provides a direct information resource to the user. To test these ideas, we have utilized Repo and embodied the interaction model in the user interface of a system called Labrador.
Rao, R., Russell, D. M. and Mackinlay, J. D. (1993).
Proceedings of UIST'93, Atlanta, Georgia, November 23-33. [PDF]
Knowledge and the Simultaneous Conjoint Measurement of Activity, Agents, and Situations
No Abstract Available
Pirolli, P. and Wilson, M. (1993).
Annual Meeting of the Cognitive Science Society, Boulder, CO.
Computer Supported Cooperative Work: Research Issues for the 90s
No Abstract Available
Olson, J., Card, S., Landauer, T., Malone, T. and Leggett, J. (1993).
Behavior and Information Technology 12(2): 115-129.
Bridging the Paper and Electronic Worlds: The Paper User Interface
Since its invention millennia ago, paper has served as one of our primary communications media. Its inherent physical properties make it easy to use, transport, and store, and cheap to manufacture. Despite these advantages, paper remains a second class citizen in the electronic world. In this paper, we present a new technology for bridging the paper and electronic worlds. In the new technology, the user interface moves beyond the workstation and onto paper itself. We describe paper user interface technology and its implementation in a particular system called XAX.
Johnson, W., Jellinek, H. D., Klotz, L., Jr., Rao, R. and Card, S. K. (1993).
ACM/IFIPS InterCHI '93 Conference on Human Factors in Software, Amsterdam 507-512. [PDF]
ACM SIGCHI Curricula for Human-Computer Interaction
(First paragraph of introduction): Over the last twenty years, computer science has developed rapidly as a discipline. As the content of computer science has continued to evolve and gain new substance, the curriculum of computer science has continually been revised to reflect the new content and deeper understandings of the nature of computers and information. This active reorganization is as would be expected for a new, vigorously developing discipline.
ACM (1992).
ACM, New York, New York. [PDF]
Student Strategies for Learning Programming from a Computational Environment
No Abstract Available
Recker, M. and Pirolli, P. (1992).
Intelligent Tutoring Systems. Springer-Verlag, Berlin: 382-394.
Explanation, Analogy and Transfer in an Intelligent Tutoring System Paradigm
No Abstract Available
Pirolli, P. (1992).
. Taylor and Francis, New York: 65-84.
Measuring Learning Strategies and Understanding: A Research Framework
No Abstract Available
Pirolli, P. and Wilson, M. (1992).
Intelligent Tutoring Systems. Springer-Verlag, Berlin: 539-558.
Generative Knowledge and the Development of Situated Know-How in Intelligent Tutoring Stems: Implications for Student Modelling
No Abstract Available
Pirolli, P. (1992).
Cognitive modelling and interactive environments in language learning. Springer-Verlag, Berlin: 43-51.
The Information Visualizer: A 3D User Interface for Information Retrieval
Advances in computer technology have created new possibilities for information retrieval systems in which user interfaces could play a more central role. Our analysis of the problem suggests that what is needed from the user's point of view is not so much information retrieval itself, but rather, the amplification of information-based work processes. User interfaces enabled by this technology may be able to amplify work by modifying the cost structure of information used in work. As a consequence, we have attempted to go beyond the usual notion of an information retrieval systems to develop an "Information Workspace" that encompasses the cost structure of information from secondary storage to immediate use. As an implementation of the concept, we describe an experimental system, called the Information Visualizer, and its rationale. The system is based on the use of (1) 3D/Rooms for increasing the capacity of immediate storage available to the user, (2) an animated scheduler-based user interface interaction architecture, called the Cognitive Coprocessor, for coupling the user to information agents, and (3) information visualization for interacting with the information structure. The system and its rationale are described.
Mackinlay, J. D., Robertson, G. G. and Card, S. K. (1992).
Advanced Visual Interfaces. World Scientific Publishing, Singapore: 173-179. [PDF]
The Design Space of Input Devices
No Abstract Available
Card, S. K., Mackinlay, J. D. and Robertson, G. G. (1992).
Multimedia Interface Design. ACM, New York, New York: 217-232.
The Information Grid: A Framework for Information Retrieval and Retrieval-Centered Applications
The Information Grid (InfoGrid) is a framework for building information access applications that provides a user interface design and an interaction model. It focuses on retrieval of application objects as its top level mechanism for accessing user information, document, or services. We have embodied the InfoGrid design in an object-oriented application framework that supports rapid construction applications. This application framework has been used to build a number of applications, some that are classically characterized as information retrieval applications, others that are more typically viewed as personal work tools.
Rao, R., Card, S. K., Jellinek, H. D. and Mackinlay, J. D. (1992).
Proceedings of UIST'92, Monterey, California, November 23-32. [PDF]
Towards a Unified Model of Learning to Program
No Abstract Available
Pirolli, P. (1992).
NATO Advanced Research Workshop: Cognitive models and Intelligent Environments for Learning Programming, Santa Margherita, Italy 13-25.
The Structure of Design Problem Spaces
No Abstract Available
Goel, V. and Pirolli, P. (1992).
Cognitive Science 16: 395-429.
Reframing the Problem of Intelligent Behavior: Commentary on Newell's Unified Theories of Cognition
No Abstract Available
Card, S. K. (1992).
Behavior and Brain Sciences 14(3): 438-439.
Computer-Aided Instructional Design
No Abstract Available
Pirolli, P. (1991).
Intelligent tutoring systems: Evolutions in design. Lawrence Erlbaum: 105-125.
Search Architectures for the Automatic Design of Graphical Presentations
One of the responsibilities of an intelligent interface is to design presentations of application information. An effective approach to this problem is to automatically search a space of possible designs. Since these design spaces are often large and complex, an important issue is how to control the search. This paper describes the search architecture used by the APT (A Presentation Tool) system for the automatic design of graphical presentations of relational information and discusses how it might be modified to handle more difficult design spaces.
Mackinlay, J. D. (1991).
Intelligent User Interfaces. ACM: 281-292. [PDF]
Cone Trees: Animated 3D Visualizations of Hierarchical Information
The task of managing and accessing large information spaces is a problem in large scale cognition. Emerging technologies for 3D visualization and interactive animation offer potential solutions to this problem, especially when the structure of the information can be visualized. We describe one of these Information Visualization techniques, called the Cone Tree, which is used for visualizing hierarchical information structures. The hierarchy is presented in 3D to maximize effective use of available screen space and enable visualization of the whole structure. Interactive animation is used to shift some of the user's cognitive load to the human perceptual system.
Robertson, G. G., Mackinlay, J. D. and Card, S. K. (1991).
ACM Conference on Human Factors in Computing Systems (CHI '91) 189-194. [PDF]
Mind Bugs: The Origins of Procedural Misconceptions (book review)
No Abstract Available
Pirolli, P. (1991).
Artificial Intelligence 52: 329-340.
Effects of Examples and their Explanations in a Lesson on Recursion: A Production System Analysis
No Abstract Available
Pirolli, P. (1991).
Cognition and Instruction 8: 207-259.
The Perspective Wall: Detail and Context Smoothly Integrated
Tasks that involve large information spaces overwhelm workspaces that do not support efficient use of space and time. For example, case studies indicate that information often contains linear components, which can result in 2D layouts with wide, inefficient aspect rations. This paper describes a technique called Perspective Wall for visualizing linear information by smoothly integrating detailed and contextual views. It uses hardware support for 3D interactive animation to fold wide 2D layouts into intuitive 3D visualizations that have a center panel for detail and two perspective panels for context. The resulting visualization supports efficient use of space and time.
Mackinlay, J. D., Robertson, G. G. and Card, S. K. (1991).
ACM Conference on Human Factors in Computing Systems (CHI '91) 173-179. [PDF]
A Morphological Analysis of the Design Space of Input Devices
The market now contains a bewildering variety of input devices for communication from humans to computers. This paper discusses a means to systematize these devices through morphological design space analysis, in which different input device designs are taken as points in a parametrically described design space. The design space is characterized by finding methods to generate and test design points. In a previous paper, we discussed a method for generating the space of input device designs using primitive and compositional movement operators. This allowed us to propose a taxonomy of input devices. In this paper, we summarize the generation method and explore the use of device footprint and Fitts's law as a test. We then use calculations to reason about the design space. Calculations are used to show why the mouse is a more effective device than the headmouse and where in the design space there is likely to be a more effective device than the mouse.
Card, S. K., Mackinlay, J. D. and Robertson, G. G. (1991).
ACM Transactions on Information Systems 9(2 April): 99-122. [PDF]
The Information Visualizer: An Information Workspace
This paper proposes a concept for the user interface of information retrieval system called an information workspace. The concept goes beyond the usual notion of an information retrieval system to encompass the cost structure of information from secondary storage to immediate use. As an implementation of the concept, the paper describes an experimental system, called the Information Visualizer, and its rationale. The system is based on (1) the use of 3D/Rooms for increasing the capacity of immediate storage available to the user, (2) the Cognitive Co-Processor scheduler-based user interface interaction architecture for coupling the user to information agent, and (3) the use of information visualization for interacting with information structure.
Card, S. K., Mackinlay, J. D. and Robertson, G. G. (1991).
ACM Conference on Human Factors in Computing Systems (CHI '91) 181-188. [PDF]
Human Performance Models for Computer-Aided Engineering
No Abstract Available
Elkind, J., Jerome, I., Card, S. K., Hochberg and Julian, H. B. (1990).
Academic Press, Boston, Massachusetts.
Special Issue on Foundations of Human-Computer Interaction.
No Abstract Available
Card, S. K. and Polson, P. G. (1990).
.
Cognitive Architectures
No Abstract Available
Card, S. K. and Newell, A. (1990).
Human Performance Models for Computer-Aided Engineering. Academic Press, Boston, Massachusetts: 173-179.
Models of Working Memory
No Abstract Available
Card, S. K. (1990).
Human Performance Models for Computer-Aided Engineering. Academic Press, Boston, Massachusetts: 203-214.
Modeling Scenarios for Action
No Abstract Available
Card, S. K. (1990).
Human Performance Models for Computer-Aided Engineering. Academic Press, Boston, Massachusetts: 233-247.
Use and Integration of Models
No Abstract Available
Card, S. K. and Wickens, C. (1990).
Human Performance Models for Computer-Aided Engineering. Academic Press, Boston, Massachusetts: 23-54.
The Instructional Design Environment: Technology to Support Design Problem Solving
No Abstract Available
Pirolli, P. and Russell, D. (1990).
Instructional Science Kluwer Academic Publishers, Dordrecht Printed in the Netherlands 19: 121-144.
You Can't Get There from Here: Comments on R. W. Sperry's Resolution of Science and Ethics
No Abstract Available
Pirolli, P. and Goel, V. (1990).
American Psychologist 45: 71-73.
Beyond WIMP Interfaces: Exploiting Human Abilities and Experience
No Abstract Available
Mackinlay, J. D., Card, S. K. and Robertson, G. G. (1990).
Proceedings of Siggraph'90 Workshop on Software Architectures and Metaphors for Non-WIMP User Interfaces.
A Semantic Analysis of the Design Space of Input Devices
A bewildering variety of devices for communication from humans to computers now exists on the market. In this article, we propose a descriptive framework for analyzing the design space of these input devices. We begin with Buxton's (1983) idea that input devices are transducers of physical properties in one, two, or three dimensions. Following Mackinlay's semantic analysis of the design space for graphical presentations, we extend this idea to more comprehensive descriptions of physical properties, space, and transducer mappings. In our reformulation, input devices are transducers of any combination of linear and rotary, absolute and relative, position and force, in any of the six spatial degrees of freedom. Simple input devices are described in terms of semantic mappings from the transducers of physical properties into the parameters of the applications. One of these mappings, the resolution function, allows us to describe the range of possibilities from continuous devices to discrete devices, including possibilities in between. Complex input controls are described in terms of hierarchical families of generic devices and in terms of composition operators on simpler devices. The description that emerges is used to produce a new taxonomy of input devices. The taxonomy is compared with previous taxonomies of Foley, Wallace, and Chan (1984) and of Buxton (1983) by reclassifying the devices previously analyzed by these authors. The descriptive techniques are further applied to the design of complex mouse-based virtual input controls for simulated three-dimensional (3D) egocentric motion. One result is the design of a new virtual egocentric motion control.
Mackinlay, J. D., Card, S. K. and Robertson, G. G. (1990).
Human-Computer Interaction 5(2-3): 145-190. [PDF]
Rapid, Controlled Movement Through a Virtual 3D Workspace
Computer graphics hardware supporting real-time interactive 3D animation has the potential to support effective user interfaces by enabling virtual 3D workspaces. However, this potential requires development of viewpoint movement techniques that support rapid and controlled movement through workspaces. Rapid movement through large distances avoids wasted work time; controlled movement near target objects allows the user to examine and interact with objects in the workspace. Current techniques for viewpoint movement typically use high velocities to cover distances rapidly, but high velocities are hard to control near objects. This paper describes a new technique for targeted viewpoint movement that solves this problem. The key idea is to have the user indicate a point of interest (target) on a 3D object and use the distance to this target to move the viewpoint logarithmically, by moving the same relative percentage of distance to the target on every animation cycle. The result is rapid motion over distances that slows as the viewpoint approaches the target object. The technique can be used with 2D and multidimensional input devices. We also extend the technologies to move objects in the workspace.
Mackinlay, J. D., Card, S. K. and Robertson, G. G. (1990).
Computer Graphics 24(4, August): 1971-1976. [PDF]
Representations for Knowledge-Based Graphical User Interfaces
No Abstract Available
Mackinlay, J. D. (1990).
Proceedings of AAAI Symposium on Knowledge-based Human Computer Communication, Stanford, California.
Introduction to this Special Issue on Foundations of Human-Computer Interaction
No Abstract Available
Card, S. K. and Polson, P. P. (1990).
Human-Computer Interaction 5(2 & 3): 119-123.
The Design Space of Input Devices
A bewildering variety of devices for communication from humans to computers now exists on the market. In order to make sense of this variety, and to aid in the design of new input devices, we propose a framework for describing and analyzing input devices. Following Mackinlay's semantic analysis of the design space for graphical presentations, our goal is to provide tools for the generation and test of input device designs. The descriptive tools we have created allow us to describe the semantics of a device and measure its expressiveness. Using these tools, we have built a taxonomy of input devices that goes beyond earlier taxonomies of Buxton & Baecker and Foley, Wallace, & Chan. In this paper, we build on these descriptive tools, and proceed to the use of human performance theories and data for evaluation of the effectiveness of points in this design space. We focus on two figures of merit, footprint and bandwidth, to illustrate this evaluation. The result is the systematic integration of methods for both generating and testing the design space of input devices.
Card, S. K., Mackinlay, J. D. and Robertson, G. G. (1990).
ACM Conference on Human Factors in Computing Systems (CHI '90), Seattle, Washington 117-124. [PDF]
Human Factors and the Intelligent Interface
No Abstract Available
Card, S. K. (1989).
Intelligent Interfaces: Theory, Research, and Design. Elsevier Science Publishers, B. V. (North-Holland), New York, New York: 270-280.
Learning from a Visualized Garbage Collector
No Abstract Available
Weiser, M., Hayes, B. and Mackinlay, J. D. (1989).
Proceedings of 5th USENIX Computer Graphics Workshop, Monterey, California, November.
The Cognitive Co-Processor for Interactive User Interfaces
No Abstract Available
Robertson, G. G., Card, S. K. and Mackinlay, J. D. (1989).
Proceedings of the ACM Conference on User Interface Software and Technology 10-18.
The Explanation of Programming Examples
No Abstract Available
Pirolli, P. and Recker, M. (1989).
Workshop on Models of Complex Human Learning, Cornell University, Ithaca, NY.
Empirical Analyses of Self-Explanation and Transfer in Learning to Program
No Abstract Available
Pirolli, P. and Bielaczyc, K. (1989).
Proceedings of the Annual Conference of the Cognitive Science Society, Hillsdale, NJ 450-457.
Strategies and Mechanisms for the Construction and Refinement of Programming Knowledge: A Unified Computational Model of Learning
No Abstract Available
Pirolli, P. (1989).
Army Research Institute.
On the Art of Building: Putting a New Instructional Design into Practice
No Abstract Available
Pirolli, P. (1989).
Proceedings of the 2nd Intelligent Tutoring Systems Research Forum 129-141.
Motivating the Notion of Generic Design Within Information-Processing Theory: The Design Problem Space
No Abstract Available
Goel, V. and Pirolli, P. (1989).
AI Magazine 10: 19-36.
The Problem Space of Instructional Design
No Abstract Available
Pirolli, P. L. and Greeno, J. G. (1988).
Intelligent Tutoring Systems: Lessons Learned. Lawrence Erlbaum, Hillsdale, NJ: 181-201.
Applying a Theory of Graphical Presentation to the Graphic Design of User Interfaces
The increasing availability of computers with high-quality graphics and fonts has created an opportunity and an obligation for user interface designers. The opportunity is that designers can use graphical techniques to design more effective user interfaces. The obligation is that they must become experts at the design of graphical user interfaces. Current user interface toolkits provide very little design assistance. This paper describes a theory that supports automatic design of graphical presentations of relational information and shows how to extend it to support theory-driven design of graphical users.
Mackinlay, J. D. (1988).
Proceedings of the ACM SIGGRAPH Symposium on User Interface Software (UIST '88) 179-189. [PDF]
Learning to Program Recursive Functions
No Abstract Available
Anderson, J. R., Pirolli, P. L. and Farrell, R. (1988).
The Nature of Expertise. Lawrence Erlbaum, Hillsdale, NJ: 153-183.
Towards Theory and Technology for the Design of Intelligent Tutoring Systems
No Abstract Available
Pirolli, P. and Russell, D. (1988).
Proceedings of the International Conference on Intelligent Tutoring Systems 350-356.
Search Architectures for the Automatic Design of Graphical Presentations
No Abstract Available
Mackinlay, J. D. (1988).
Workshop notes for Architectures for Intelligent Interfaces: Elements and Prototypes, Monterey, California, March 1988.
A Model of Purpose-Driven Analogy and Skill Acquisition in Programming
No Abstract Available
Pirolli, P. (1987).
Proceedings of the Tenth Annual Conference of the Cognitive Science Society 609-621.
Intelligent Tutoring Systems Technology: An Interface Between Instructional Theory and Practice
No Abstract Available
Pirolli, P. (1987).
Technology and Education 1: 2-4.
A Multiple, Virtual-Workspace Interface to Support User Task Switching
An Interface is presented that is designed to help users switch among tasks on which they are concurrently working. Nine desirable properties for such an interface are derived. It is argued that a key constraint to building interfaces that support task switching is that low user-overhead switching among tasks required a large amount of display space, whereas actual display space is limited. A virtual workspace design is presented that greatly speeds the inevitable task-switching induced window faulting. The resulting interface is presented as a study in theory-based human-interface design. It is shown how in this case theory is important in inspiring a design, but design entailments outside the theory raise new issues that must be faced to make the design viable. These design experiences, in turn, help inspire new theory.
Card, S. K. and Henderson, D. A., Jr. (1987).
Proceedings of HCI+GI, Toronto, Canada 53-59. [PDF]
Catalogues: A Metaphor for Computer Application Delivery
This paper presents the mail-order catalogue as a metaphor for the delivery of application software in an integrated work environment. It also describes, Catalogue, an adjunct to the Rooms multiple virtual workspace environment, which employs this metaphor. This mechanism can be used (1) to give users "instant starts" by letting the user's select a standard setup, (2) to allow users to assemble their own environment from standard components, (3) to parameterize a standard component, and (4) to load applications ready to run.
Card, S. K. and Henderson, D. A., Jr. (1987).
Proceedings of the Interact '87 Conference on Human-Computer Interaction (Bullinger, H-J. & Shackel, B. (Eds.)), Stuttgart 969-964. [PDF]
The Model Human Processor: An Engineering Model of Human Performance
It can be argued that one of the chief impediments to progress in human engineering is the lack of a model of the human information-processor posed in such a way as to enable approximate engineering calculations to be made of human performance. The Model Human Processor is an attempt at such a model. The use of the model is illustrated through sample computations.
Card, S. K., Moran, T. P. and Newell, A. (1986).
Handbook of Perception and Human Performance. John Wiley and Sons, New York, New York: Chapter 45, 1- 35. [PDF]
A Cognitive Model and Computer Tutor for Programming Recursion
No Abstract Available
Pirolli, P. (1986).
Human-Computer Interaction 2: 319-355.
Straightening Out Softening Up: Response to Carroll and Campbell
No Abstract Available
Newell, A. and Card, S. K. (1986).
Human-Computer Interaction 2(3): 251-267.
Automating the Design of Graphical Presentations of Relational Information
The goal of the research described in this paper is to develop an application-independent presentation tool that automatically designs effective graphical presentations (such as bar charts, scatter plots, and connected graphs) of relational information. Two problems are raised by this goal: the codification of graphic design criteria in a form that can be used by the presentation tool, and the generation of a wide variation of designs so that the presentation tool can accommodate a wide variety of information. The approach described in this paper is based on the view that graphical presentations are sentences of graphical languages. The graphic design issues are codified as expressiveness and effectiveness criteria for graphical languages. Expressiveness criteria determine whether a graphical language can express the desired information. Effectiveness criteria determine whether a graphical language exploits the capabilities of the output medium and the human visual system. A wide variation of designs are systematically generated by using a "composition algebra" that composes a small set of primitive graphical languages. Artificial intelligence techniques are used to implement a prototype presentation tool called APT (A Presentation Tool), which is based on the composition algebra and the graphic design criteria.
Mackinlay, J. D. (1986).
ACM Transactions on Graphics 5(2, April): 110-141. [PDF]
Rooms: The Use of Multiple Virtual Workspaces to Reduce Space Contention in a Window-Based Graphical User Interface
A key constraint on the effectiveness of window-based human-computer interfaces is that the display screen is too small for many applications. This results in "window thrashing," in which the user must expend considerable effort to keep desired windows visible. Rooms is a window manager that overcomes small screen size by exploiting the statistics of window access, dividing the user's workspace into a suite of virtual workspaces with transitions among them. Mechanisms are described for solving the problems of navigation and simultaneous access to separated information that arise from multiple workspaces.
Henderson, D. A., Jr. and Card, S., K. (1986).
ACM Transactions on Graphics 5(3, July): 211-241. [PDF]
The Role of Practice in Fact Retrieval
No Abstract Available
Pirolli, P. L. and Anderson, J. R. (1985).
Journal of Experimental Psychology: Learning, Memory, and Cognition 11: 136-153.
The Role of Learning from Examples in the Acquisition of Recursive Programming Skills
No Abstract Available
Pirolli, P. L. and Anderson, J. R. (1985).
Canadian Journal of Psychology 39: 240-272.
The Prospects for Psychological Science in Human-Computer Interaction
No Abstract Available
Newell, A. and Card, S. K. (1985).
Human-Computer Interaction 1(3): 209-242.
Expressiveness and Language Choice
Specialized languages are often more appropriate than general languages for expressing certain information. However, specialized languages must be chosen carefully because they do not allow all sets of facts to be stated. This paper considers the problems associated with choosing among specialized languages. Methods are presented for determining that a set of facts is expressible in a language, for identifying when additional facts are stated accidentally, and for choosing among languages that can express a set of facts. This research is being used to build a system that automatically chooses an appropriate graphical language to present a given set of facts.
Mackinlay, J. D. and Genesereth, M. R. (1985).
Data and Knowledge Engineering 1(1, June): 17-29. [PDF]
Visual Search of Computer Command Menus
Two experiments explored how users perform visual search to locate a target in a computer command menu. In Experiment 1, users searched for menu items while their eye movements were monitored. Visual search was well characterized by an unsystematic search model in which the user may search the same place more than once. This model predicted the distribution of search times, the lack of an item position effect, and the frequency of saccade directions. In Experiment 2, the unsystematic search model was used to describe the effects on search time of three menu-item arrangements and of practice. Initially, the arrangement of the menu, whether alphabetic, random, or categorical, influenced the time required by the search. But with practice, the user eventually learned the location of each item in the menu, rendering the arrangement of the menu unimportant.
Card, S. K. (1984).
Attention and Performance X, Control of Language Processes. Lawrence Erlbaum Associates, Hillsdale, New Jersey. [PDF]
Human Limits and the CRT Computer Interface
No Abstract Available
Card, S. K. (1984).
Visual Display Terminals: Suitability Issues and Health Concerns. Prentice-Hall, Englewood Cliffs, New Jersey.
The Effect of Leading Questions on Prior Memory: Evidence for the Coexistence of Inconsistent Memory Traces
No Abstract Available
Pirolli, P. L. and Mitterer, J. O. (1984).
Canadian Journal of Psychology 38: 135-141.
Learning to Program Recursion
No Abstract Available
Pirolli, P. L., Anderson, J. R. and Farrell, R. (1984).
Proceedings of the Sixth Annual Conference of the Cognitive Science Society, Hillsdale, NJ 277-280.
Expressiveness of Languages
No Abstract Available
Mackinlay, J. D. and Genesereth, M. R. (1984).
Proceedings AAAI'84, Austin, Texas 226-232.
On-Line Composition of Text
The use of text editors for writing original text has been little studied despite the importance of the task. A study conducted by Gould found that the composition rate of writers using a text editor was more than 50% slower than the same writers writing by hand. We show that the source of the slownenss is the design of the text editor and that using a display-oriented editor writers can write as fast and good typists faster than by hand. Like Gould, there was no quality difference based on the source of the letters and users of the text editor made many more modifications. Fewer than half of the modifications users made actually improved the text.
Card, S. K., Robert, J. M. and Keenan, L. N. (1984).
Interact '84 (Shackel, B. (ed.), Human-Computer Interaction), Amsterdam 51-56. [PDF]
Window-Based Computer Dialogues
No Abstract Available
Card, S. K., Pavel, M. and Farrell, J. E. (1984).
Interact '84 (Shackel, B. (ed.), Human-Computer Interaction), Amsterdam 239-243.
Spread of Activation
No Abstract Available
Anderson, J. R. and Pirolli, P. L. (1984).
Journal of Experimental Psychology: Learning, Memory, and Cognition 10: 791-798.
The Psychology of Human-Computer Interaction
No Abstract Available
Card, S. K., Moran, T. P. and Newell, A. (1983).
Erlbaum Associates, Hillsdale, New Jersey.
SIMPLE: a Simple Language for Research in Programmer Psychology
No Abstract Available
Shrager, J. and Pirolli, P. L. (1983).
Carnegie-Mellon University, Department of Psychology, Pittsburgh, PA.
Research Needs for Human Factors
No Abstract Available
National Research, C. (1983).
National Academy Press, Washington, D. C.
The Psychological Study of Computing (Review of Information Technology and Psychology by R. A. Rasschau, R. Lachau, and K. R. Laughery (Eds.))
No Abstract Available
Card, S. K. (1983).
Contemporary Psychology 28: 841-842.
Automation in Combat Aircraft
No Abstract Available
Council, N. R. (1982).
National Academy Press, Washington, D. C.
User Perceptual Mechanisms in the Search of Computer Command Menus
No Abstract Available
Card, S. K. (1982).
Proceedings of the Conference on Human Factors in Computer Systems, Gathersburg, Maryland 190-196.
The Keystroke-Level Model for User Performance Time with Interactive Systems
No Abstract Available
Card, S. K., Moran, T. P. and Newell, A. (1980).
Communications of the ACM 23: 396-410.
A Method for Calculating Performance Times for Users of Interactive Computing Systems
No Abstract Available
Card, S. K. (1979).
, Denver, Colorado.
Evaluation of Mouse, Rate-Controlled Isometric Joystick, Step Keys, and Text Keys for the Text Selection on a CRT
No Abstract Available
Card, S. K., English, W. K. and Burr, B. J. (1978).
Ergonomics 21: 601-613.
The Role of Semantics in Remembering Comparative Sentences
No Abstract Available
Clark, H. H. and Card, S. K. (1969).
Journal of Experimental Psychology 82: 545-553.