Date of Award

Spring 1-1-2025

Document Type

Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Psychology

First Advisor

Scholl, Brian

Abstract

What is the purpose of visual perception? The most common explanations suggest that “seeing” is for answering the question ‘What’s out there?’ — giving us information about the features and objects in the current local environment. This dissertation, in contrast, suggests a different approach: seeing is also for answering the question ‘What’s happening?’ — and the related questions ‘What just happened?’ and ‘What’s about to happen?’. The critical difference is that while the more common answer seems implicitly static, these newer answers are intrinsically dynamic. This dissertation proposes that visual processing represents even certain static images in terms of rich, dynamic representations. I make this point via three empirical case studies, involving (1) intuitive physics, (2) causal history, and (3) navigational affordances. The first example explores how intuitive physics is taken into account even in visual processing: when viewing static images of objects covered by soft materials (e.g. cloths), observers spontaneously form dynamic representations of underlying physical interactions (between gravity, the cloth, and the rigid object beneath the cloth). These representations then have powerful influences on visual attention and memory: observers are better at detecting changes to the deep underlying scene structure (the object beneath the cloth), compared to changes involving only the superficial folds of the cloth — even when the latter are objectively more extreme along several dimensions. A second example explores how perceiving collections of shapes also involves representing their causal history.
When viewing static images of blocks stacked on top of each other, observers spontaneously form dynamic representations of the past, which then dramatically influence current percepts in accord with intuitive physics (where such structures must be built ‘from the ground up’): when blocks appear sequentially from top to bottom, observers mistakenly perceive them as appearing simultaneously — and when blocks appear simultaneously, observers mistakenly perceive them as appearing from bottom to top. In a final example, I explore novel connections between two prominent themes in our field: affordances and visual routines. When viewing static images of maze-like stimuli (or scenes filled with obstacles), observers spontaneously engage in ‘mental path tracing’: when comparing two probes in such scenes, response times depend on their ‘pathwise’ distance from each other, and not simply their Euclidean separation — even when the paths are completely task-irrelevant. Collectively, this work demonstrates that perception forms rich dynamic representations even of static scenes: we see what matters — visual representations of a scene’s deep underlying structure, its inferred past, and its likely future.