Stanford Encyclopedia of Philosophy

Notes to Mental Imagery

1. One might ask, however, why closing one's eyes or turning one's head away should not count as choosing not to see the elephant. Also, our voluntary control over our imagery should not be overstated. Notoriously, if someone is commanded not to imagine an elephant, almost inevitably they will imagine one; and some images, very often particularly distressing ones, can be very hard to voluntarily dismiss.

2. It is also surely in conflict with the arguments that have been made for the ineliminable epistemological role of imagination and imagery in scientific thought experiments (e.g., Brown, 1991; Gendler, 1998; Georgiou, 2007).

3. On this point I have relied upon the expertise of Professor Edward Slingerland of the Department of Asian Studies at the University of British Columbia, who supplied the reference. Further corroboration comes from the story of the 16th century Jesuit missionary Matteo Ricci, who attempted to introduce European imagery based mnemonic techniques (see Supplement: Ancient Imagery Mnemonics) into China (Spence, 1985). Although the Chinese scholars were not especially impressed with the practical utility of the elaborate system Ricci presented (they were prepared to believe that it would work, but thought it unlikely to repay the effort needed to learn it) there is no hint that they found the concept of imagery itself in the least unfamiliar or puzzling.

4. Modern scholarly discussions of Aristotle's concept of phantasia include those of Brentano (1886/1977), Beare (1906), Ross (1923), Benardete (1975), Engmann (1976), Frede (1992), Lycos (1964), Modrak (1986, 1987), Nussbaum (1978), Philippe (1971), Rees (1971), Roth (1963), Schofield (1978), Todd (1981), Watson (1982, 1988), Wedin (1988), White (1985), Turnbull (1994), Caston (1996), Niarchos (1997-8), and Birondo (2001). General histories of the concept of imagination, that cover Aristotle's seminal contribution, include those of Brann (1991), Bundy (1927), Cocking (1991), Kearney (1988), White (1990), and Thomas (1999a).

5. Although the Treatise of Man was written relatively early in Descartes’ philosophical career, before the Meditations, and was not published during his lifetime, there is ample evidence in later material such as the Passions of the Soul (1649) and the Conversation with Burman (1648), that he did not repudiate these views, and that they continued to shape his thinking to the end. Furthermore, when the treatise was posthumously published it seems to have had a considerable influence, and, for a while, may well have been more widely read than more purely philosophical works such as the Meditations, where the "clear and distinct ideas" of the intellect are treated.

6. Animal spirit may be identified with what is now called cerebro-spinal fluid, a clear liquid that bathes the brain and spinal cord. The name animal spiritand the notion that it is centrally involved in brain function, and, indeed, in the processes of thought, goes back at least to the Roman physician Galen, and was widely accepted through the middle ages and Renaissance (Woolam, 1957; Finger, 1994). Thus, although Descartes' neural hydraulics may have been fairly original, the idea of animal spirit was quite conventional. Today, cerebro-spinal fluid is held to have an essentially nutritive function for the brain, and not to play any direct role in any cognitive or otherwise mental processes.

7. The retinal image had, at this time, only relatively recently been discovered. Johannes Kepler first described it in 1604 (Lindberg, 1976). Descartes had the Treatise of Man ready for publication in 1633.

8. Descartes held that the pineal gland was full of animal spirit, under pressure, and that the optic nerve (and other sensory nerves) acted upon it via small threads running down the center of each nerve fibre. The pull of a thread would cause a pore to open in the surface of the gland, so that animal spirit was allowed to flow out of the gland, and through the hollow nerve fibre, into the brain. It is the pattern of such open pores in the gland surface that supposedly comprises the image there.

9. In fact, he says it is the seat of "imagination and common sense". "Common sense," however, is not used here in its modern colloquial sense, but refers instead to the Aristotelian notion of a faculty or power of the soul (explained mainly in De Anima III, 2) whereby the deliverance of the various sense organs are brought together into a coherent, unified perceptual experience of the world (Beare, 1906; Ross, 1923; Kahn, 1966; Kosman, 1975; Nussbaum, 1978; Modrak,1981, 1987; Everson, 1997). Likewise, Descartes held that not only visual experience, but other sensory experiences too, were mediated by movements of the pineal gland that were in turn caused by the flow of animal spirit through its surface, under control of the sensory nerves. Aristotle is often, and plausibly, interpreted as understanding this common sense to be very closely bound up with, if not identical to, the phantasia or imagination (De Insomniis 459a 15-20), which is said to be responsible for imagery. When it functions in perception it is appropriately called the common sense, and when it functions in remembering or imagining something, producing imagery, it is called the imagination. Thus, although Descartes may be highly original in the way he mechanizes and anatomically locates these functions, he is not being at all original or controversial in linking them.

10. Spelling, capitalization and emphasis are all as in the original.

11. Very much the same is said in Hobbes' De Corpore (Chap. XXV, 7, in Calkins, 1963).

12. See also De Corpore (Chap. XXV, 7, in Calkins, 1963). Perhaps Hobbes thought of these motions as vibrations that could die away gradually: at any rate, he seems to have believed light to be a form of vibration (Human Nature, chap. 2, in Calkins, 1963), so it would make sense for it to induce vibratory motion in the brain.

13. Scientists would continue to be called "natural philosophers" for some time. The Oxford English Dictionary does not record any use of the word "scientist" before 1834, and its coinage and popularization is often attributed to Whewell (1840). However, the distinction between what we now respectively call science and philosophy was recognized in practice long before that (Gjertsen, 1989). Indeed, it was marked (and perhaps in part initiated) by Locke himself, when he promised not to "meddle with the physical consideration of the mind," (Essay I.i.2), and described himself as a mere "under-labourer" to Newton and the other great scientists of his age (Essay: Epistle to the Reader).

14. The word "imagery" occurs just once in the Essay (II.x.5), and its direct reference is to carvings or other designs on tombstones. However, the point of the passage is to draw an analogy between the "mouldering away" of this imagery with time and the gradual fading of ideas from our memory.

15. It is true that Locke holds that ideas originate not only from "sense" but also from "reflection," i.e., introspection upon the workings of our mind itself. However, in all relevant respects he understands this reflection as just another perceptual sense, an "internal sense" (Essay II.1.4) that just happens to be directed inwards toward the workings of our minds rather than outwards toward the world around us (Ayers, 1991 p. 49).

16. Berkeley did think that we could use our inherently particular ideas to represent generals, i.e., classes of things, as a geometrician might draw a diagram containing some particular straight line of some particular length to illustrate a theorem intended to apply to all straight lines of any length (Principles, Introduction XII). Thus, I might use a picture of Winston Churchill (or a pictorial idea of him, in my mind) to represent all British prime ministers, or all politicians, or all men, or whatever else I might chose. However, it is not clear that this move solves the problem of how we are able to think about classes of things. In order to be able to decide to use my mental picture of Churchill to represent all prime ministers, surely I must already have a general concept of prime-ministerhood, or of the class of all prime ministers. If I do, it is not clear what I would gain by using an imaginal idea to represent this, and Berkeley's story does nothing to illuminate the nature and origin of my (presumably non-imaginal) general concept of prime-ministerhood (or of straight line of arbitrary length, or whatever).

17. Hume seems to use 'fancy' (derived from the Greek phantasia) as nothing more than an alternative name for the mental faculty more commonly known by the Latinate term 'imagination.' Although some medieval and renaissance authors may have suggested distinct cognitive roles for imaginatio and phantasia the distinctions they were attempting to draw never really seem to have caught on (Clarke & Dewhurst, 1972; Harvey, 1975). After Hume's time, Coleridge (1817) used the two terms to draw a distinction between the truly creative imagination and the merely associative fancy, but, despite its fame, this distinction never caught on very widely either, and 'fancy,' in the relevant sense, is now largely obsolete.

18. Another passage in the Enquiry is, in a way, even more explicit:

Our mental vision or conception of ideas is nothing but a revelation made to us by our Maker. When we voluntarily turn our thoughts to any object, and raise up its image in the fancy, it is not the will which creates that idea: It is the universal Creator, who discovers it to the mind, and renders it present to us. (Enquiry VII.1)

Unfortunately exegesis of this is complicated by the fact that Hume is here describing a theory (probably Malebranche's) that he wants to reject. Hume certainly does not think that God is directly responsible for rendering our ideas present to us. However, there is no hint that he is in anything but full agreement with the metaphor of "mental vision" and the notion that to think of an object is to "raise up its image in the fancy."

19. Some of Reid's arguments may require only that ideas be understood as representational entities. However, he frequently states that his predecessors did think of their ideas as images (e.g., 1764 2.III), and chapters 7 and 9 of the second Essay on the Intellectual Powers of Man (1785), go to some pains to pin this doctrine on Locke and others.

20. Bloor (1983) goes so far as to suggest (though without citing any evidence) that the later work of Wittgenstein largely grew out of the reaction to the imageless thought affair. Certainly one of the leading Würzburg school researchers, Karl Bühler, became a prominent psychology Professor in Vienna during the inter-war years, and Wittgenstein is known to have met him there, and seems to have been very struck (probably unfavorably) by his views (Toulmin, 1969; Bartley, 1973). Bühler also taught, and deeply influenced, the young Karl Popper (Popper, 1976), and undoubtedly his views would also have been quite familiar to the Vienna Circle positivists. (In the German speaking world at this time, the fields of philosophy and psychology were still closely institutionally as well as intellectually entwined.)

21. Ever since Galton (1880, 1883) published his pioneering work on individual differences in imagery experience, it has been “well known” that some small minority of otherwise normal people report that they do not experience visual imagery at all. However, this phenomenon remains very poorly understood. Galton's evidence is essentially anecdotal, and his interpretation of his findings seems to be distinctly suspect (Brewer & Schommer-Aikins, 2006; Burbridge, 1994). Since his time, no systematic, quantitative research seems to have been done on “non-imagers” (although see Sommer (1978) and Thomas (1989) for case studies of individuals, and Faw (1997, 2009) for personal testimony and discussion). Even their incidence in the general population remains unknown (figures ranging from 10-12% (Abelson, 1979) to “less than 2%” have been suggested, but there is no sign of an objective basis for any of these estimates). Despite this, both Danto (1958) and Lawrie (1970) insinuate that Ryle's views on imagery are only explicable if he was a “non-imager” of the sort described by Galton. Hannay (1971) (although he soon retreats) floats the same notion with regard not only to Ryle, but also to Shorter and Sartre. Similar ad hominem dismissals of anti-pictorialist arguments can also be found amongst contemporary psychologists. Kosslyn, for example, speculates that the leading anti-pictorialist psychologist, Pylyshyn, may be “one of the few (…) people who does not experience imagery” (see “What Shape Are a German Shepherd's Ears?” in Other Internet Resources, which is also the source for the “less than 2%” estimate for the proportion of non-imagers in the general population). In fact, however, throughout his writings on imagery Pylyshyn is frequently at considerable pains to make it clear that he in no way intends to cast doubt on the reality of imagery experience. There is some evidence that when people know relatively little about the theoretical issues and empirical evidence concerning imagery there is a correlation between their inclination to believe in inner pictures and the (introspectively reported) vividness of their imagery (i.e., people with more vivid imagery are more likely to believe in inner pictures). However, this correlation disappears once they know more about the issues and evidence (Reisberg, Pearson, & Kosslyn, 2003).

22. Sartre, indeed, is said to have experimented with the hallucinogenic drug mescaline as a young man, and to have not only experienced vivid hallucinations during his actual “trip,” but also distressing hallucinatory “afterflashes” for a considerable time following it (de Beauvoir, 1960; Riedlinger, 1982). As for Ryle, it is quite clear, particularly from chapter 3 of his On Thinking (1979), that although he did not regard all thought, or even all imaginative thought, as necessarily involving quasi-visual experience, he certainly accepted that it sometimes does.

23. Bugelski himself was a significant participant in this research activity (e.g. Bugelski, 1970, 1971; Bugelski, Kidd, & Segmen, 1968).

24. Imagery values were determined by averaging the results of numerical ratings produced by many subjects filling out a specially designed questionnaire (Paivio, Yuille, & Madigan, 1968).

25. The difference in memorability of the two sets of words is not in dispute. What has been much disputed, however, is whether the difference in imagery value (or imagability) really causes the differences in memory performance, or whether some other semantic feature of the words (something highly correlated with, but not identical to, imagery value) is the true cause. For example, Richardson (1975) once held concreteness to be the main causally effective variable, but later changed his mind in the face of further evidence (Richardson, 1980 p. 93). Another suggestion is that the predicability of the nouns in question (i.e., “the ease with which what they refer to can be described by simple factual statements”) might be the true causally effective parameter (Jones, 1986). Measures of predicability correlate quite well with imagery value, and it is quite plausible that more predicable words might spontaneously give rise to more extensive networks of verbal associations that might account for their greater memorability. In fact, upwards of thirty such alternatives to imagery value have been proposed over the past few decades, but the consensus seems to be that none of them has proven able to account for the full range of the empirical evidence as accurately and comprehensively as imagery value itself (Marks, 1997; see also Paivio, 1983a, 1991, 2007). Nevertheless, it is clear, if only from the fact that so many alternative explanations have been proposed, that doubts about Paivio's imagery based explanation of his experimental data remain persistent and deep rooted. These doubts may be partly motivated by lingering doubts about the very reality of imagery, but the more important motivation is probably a commitment (itself motivated by broader theoretical or metatheoretical considerations) to some version of a common coding view of mental representation (see Supplement: Dual Coding and Common Coding Theories of Memory).

26. In the article that kicked off the analog/propositional debate (Pylyshyn, 1973) the expression "propositional representation" was restricted to computational representational schemes based on predicate calculus notation. However, the expression came to be applied, by Pylyshyn and others, much more widely in later discussions. The advantage of the term 'proposition,' in this context, is that it emphasizes the point that what is being contemplated are not natural language sentences (i.e., sentences of English, Russian, Chinese, Latin, or whatever), not the sort of natural language verbal mental representations advocated by theorists such as Paivio (1971, 1986), but, rather, unconscious computational representations of unknown lexicon and syntax (perhaps identifiable with the deep structure representations of sentences of Chomskian linguistic theory). Unfortunately, as noted, this usage is importantly different from what 'proposition' has long meant to philosophers (Gale, 1967).

27. The word 'analog' was originally mainly associated with the views of Shepard, who played an important role in the earlier stages of the debate, and who may have been envisaging truly analog processes, as in an analog computer (Shepard, 1975, 1978b, 1981, 1984; Shepard & Cooper et al., 1982). However, although his experimental findings remain important, Shepard's theoretical ideas have long since been overshadowed, in the development of the debate, by those of Kosslyn. Since the later 1980s Kosslyn has been more concerned with the neuroscience of mental imagery than with the computational model that was central to his earlier work, but there is no reason to think that this has made his theory any more analog, and, in fact, he rarely uses the term.

28. In the same year, some very similar arguments were independently made by Anderson & Bower (1973).

29. Most defenders of the analog/pictorial theory of imagery, such as Kosslyn, Thompson & Ganis (2002), do not dispute that imagery is cognitively penetrable. Their quarrel is with other aspects of Pylyshyn's argument.

30. Of course, there had previously been other suggestions for non-pictorial accounts of imagery, most notably the motor theories of Dunlap (1914) and Washburn (1916), the behaviorist theory of Taylor (1973), and the enactive (or perceptual activity) theories of Hebb (1968), Hochberg (1968), and Sarbin & Juhasz (1970; Sarbin, 1972). However, none of these ideas had had much impact, and even today few are aware of them. Enactive theories will be discussed in section 4.5.1.

31. Pylyshyn (1973) also makes this argument, although he does not cite either Shorter or Dennett. They all argue that mental images can be indeterminate or inexplicit in what they represent, in a way that pictures cannot be. Perhaps the clearest and best known example of the argument is that Dennett (1969 pp. 135-7; cf. Chisholm, 1942), who claims that a mental image of a tiger may be quite inexplicit as to the number of stripes it shows, whereas any picture of a tiger would have to show a particular, countable number of stripes. Of course, a description of a tiger could perfectly easily be inexplicit in the relevant way, perhaps by just saying something like, "It has several stripes." Thus, it is argued, mental images are in some important way, more like descriptions than like pictures.
For some reason, this argument continues to get trotted out, even though numerous critics have pointed out several serious flaws in it (Price, 1953; Hannay, 1971; Fodor, 1975; Shepard, 1978b; Block, 1981a, 1983b; Lyons, 1984; Tye, 1991; Cohen, 1996), and few if any attempts have been made to rebut these critiques (Simpson (1985) is an exception, but of limited scope). For one thing, is by no means clear that mental images really are inexplicit or indeterminate in the relevant sort of way. Just because one may not, in practice, be able to count the stripes on one's tiger image, it does not follow that it does not have a determinate number of stripes. Several things well short of actual indeterminacy, such as the tendency of images to fade from consciousness quite quickly, might prevent one from counting. Furthermore, it is not clear that it is really the case that pictures cannot be inexplicit in the relevant way (consider, for example, a rough, impressionistic sketch of a tiger, or a blurred photograph). Frankly, given the complicated and irregular pattern of tiger markings, it is not at all clear to me that even an actual tiger has a determinate number of stripes (certainly I do not want the job of counting them). Fortunately for Pylyshyn, he is able to marshal several other arguments against picture theory, so neither his case against picture theories of imagery, nor his case for the description theory, depends upon this argument from image indeterminacy.

32. Pylyshyn (1973) also discusses alternative suggestions from other sources about how visuo-spatial descriptions might be represented computationally, and does not attempt to advocate one such scheme over another. However, to the best of my knowledge, only Simon, Baylor, and Moran were, at this time, explicitly and publicly claiming that such representations were models of mental imagery. Pylyshyn makes it clear that it was Baylor's work, especially, that inspired him.

33. In fact Fodor (1975) did not address Pylyshyn's work at all. However, he argues against Dennett's (1969) version of the "inexplicitness" (striped tiger) objection to mental pictures at some length (see note 31, above), and briefly criticizes the homunculus objection.

34. Emphasis in original.

35. Although Finke's work in this area is perhaps the most striking, other experimenters have also found that certain specifically visual effects, such as illusions and aftereffects, normally induced by specific visual stimuli, can be induced by appropriate mental imagery (Erickson & Erickson, 1938; Barber, 1959; Pressey & Wilson, 1974; Kunen & May, 1980; Wallace, 1980, 1984; Berbaum & Chung, 1981; Kosslyn, Sukel, & Bly, 1999). However, some of these findings (including some of Finke's) have not replicated reliably (Broerse & Crassini, 1980; Reisberg & Morris, 1985), and many if not most of the results are very vulnerable to the criticism that they may reflect the demand characteristics (see supplement) of the experiment rather than genuine cognitive mechanisms (Harris, 1982; Broerse & Crassini, 1984; Predebon & Wenderoth, 1985), or else they are open to alternative explanations that are more consistent with description theory (Pylyshyn, 2002a, 2003b), a point that Finke himself now seems to accept (1989 ch. 2). Despite this, however, the threat these results pose to description theory may not have been entirely defused (Thomas, 1987, 1999b), especially as much neurophysiological evidence now points in the same direction.

36. See supplement: European Responses: Jaensch, Freud, and Gestalt Psychology.

37. Neisser (1967) had been an influential pioneer of the application of the computer metaphor to psychology, but by or before 1976 he had come to have serious reservations about it.

38. At this time virtually all computer monitors were CRT (Cathode Ray Tube) displays, and Kosslyn sometimes describes the theory as based upon a "CRT display metaphor."

39. Since the mid 1980s Kosslyn has turned more and more to neuroscience rather than computational modeling, both as a theoretical idiom and as a research strategy (see Supplement: The Quasi-Pictorial Theory of Imagery). However, there is little sign that he has thereby abandoned the functionalist commitments of his earlier work. Kosslyn & Hatfield (1984), reaffirm a commitment to functionalism, and even to computationalism broadly conceived, even while critiquing the symbolic computer programming idiom in which Kosslyn's earlier theoretical work had been couched. Kosslyn's more recent work on imagery can be seen as an exploration of just how and where the various component functions of his original computational theory (revised in certain details) might actually be implemented in the brain.

40. The underlying similarity of these diversely expressed theories is recognized by Morris & Hampson (1983), Thomas (1987, 1999b), and Hochberg (2001). Unfortunately, however, it was little recognized at the time they were being proposed, so the terms suggested here to describe them are all more or less anachronistic: "enactive" comes from Noë (2004) (although he does not apply it explicitly to imagery); "sensorimotor" from Newton (1993, 1996) and O'Regan & Noë (2001); and "perceptual activity" from Thomas (1999b) and Blain (2006). Morris & Hampson (1983), who were perhaps the first to recognize the common theme in some of these theoretical proposals, refer to them as "role playing theories," but this is not a very happy choice of terminology. Although Sarbin (1972; Sarbin & Juhasz, 1970) explicitly makes theoretical use of the concept of role playing, this is not true of any of the other theorists in question (including Neisser, whose work is actually the main focus of Morris & Hampson's discussion).

41. Others, such as Janssen (1976b) and Morgan (1979) show signs that they were thinking along similar lines, but they never developed the relevant ideas very far. Shepard (1978b), with his remarks about imagery as perceptual anticipation, and Paivio, when he says that imagery is akin to "active perception" (1977), may also be leaning in this direction. Certainly both Shepard and Paivio repudiate computational models.

42. Perhaps relevant answers may also be sought in the information carried by in internal formations, such as the sensorially generated activation pattern in the retina, or the cochlea of the ear, or in the brain's retinotopically mapped visual cortex, (Thomas, 1999b, 2009). Such activation patterns are not (pace Kosslyn) mental representations (Meijsing, 2006): there is no reason to believe (and good reason to disbelieve) that they bear intentionality or that they are suitable for manipulation as representational tokens in thought processes or computations, and, as recent neuroscience shows (Lamme et al., 2000; Rees, Krieman,& Koch, 2002; Koch, 2003 ch. 6) they are not inherently conscious states. Nevertheless, they bear information about the layout of the environment, much as a shadow does, or an optical image in a camera obscura, and thus may be interrogated by internal active perceptual processes in a way analogous to that in which our external sense organs interrogate the information bearing formations (such as the optic array) in the external environment itself (Thomas, 1999b, 2009). There is growing evidence that this sort of internalized active perception really does occur (e.g.: Li et al., 2004; Reynolds, J.H. & Chelazzi, 2004; Kveraga et al., 2007; Bressler et al., 2008; Kelly et al., 2008; Munneke et al., 2008), and good reason to believe that it plays a key role in generating conscious perceptual experience (Marcel, 1983; Lamme & Roelfsema, 2000; Pascual-Leone & Walsh, 2001; Gaillard et al., 2009).

43. Although perceptual information processing presumably entails much activity in a perceiver's brain, this is conceived to be mainly driven "bottom-up" by the sensory input. It is thus not action, not something done by the organism, but rather something that happens to it. Thus it would not lend itself to being intentionally re-enacted in the sense envisaged by enactive imagery theories. (In practice most versions of information processing theory make some appeal to a top-down as well as a bottom-up flow of information, but the latter is always seen as primary, with the top-down component merely modulating the bottom-up flow. The notion that the process is driven by the information flowing inward, from periphery to center, remains the defining characteristic of theories of this sort.)

44. It is no accident that Neisser's enactive theory of imagery draws heavily on Gibson's (1966) theory of perception, which firmly rejected the information processing perspective (and, indeed, cognitivism in general). However, although Gibson had some enthusiastic followers, they were, and have remained, a small and embattled minority amongst perceptual psychologists (and few of them, apart from Neisser, have taken much interest in imagery).

45. It has been shown that "active perception" behavioral routines of the sort discussed here can (under conditions of simulated natural selection) spontaneously evolve in simple robotic systems, enabling the robot to use a quite rudimentary endowment of sensory transducers to make surprisingly subtle and complex perceptual distinctions (Nolfi & Floreano, 2002). The sense of perceptual presence that such a system is claimed to support has been called the "immanence illusion" (Minsky, 1986), the "refrigerator light illusion" (Thomas, 1999b; Block, 2001, 2007), and even the "grand illusion" (Noë & O'Regan, 2000; Noë et al., 2000). However, Noë, while fully endorsing the underlying concept, in fact argues that it is misleading to think of this as an illusion (2002, 2004). This is what perceptual presence is.

46. A number of related findings point in the same direction:
(i) Similar, though disputed, claims have been made regarding the eye movements of REM sleep and the concurrent dream imagery (Dement & Kleitman, 1957; Roffwarg et al., 1962; Moskowitz & Berger, 1969; Jacobs, Feldman & Bender, 1972; Koulack, 1972; Firth & Oswald, 1975; Hong et al., 1997);
(ii) Bensafi et al. (2003) found that when people are imagining particular odors they spontaneously sniff in a way that re-enacts the distinctive sniffing pattern observed when they actually smell an odor of that type;
(iii) Some researchers have found that, under the right conditions, the tracking (as opposed to saccadic) eye movements made when visually following a moving stimulus are also re-enacted when imagining such a motion (Antrobus, Antrobus, & Singer, 1964; Deckert, 1964; Zikmund, 1972) (however, others (e.g., Brown, 1968) dispute these claims);
(iv) Ruggieri & Alfieri (1992) found that the crystalline lens of the eye behaves in imagery as in perception, focusing further away when objects are imagined as far away, and closer when objects are imagined as close.

47. Mast & Kosslyn (2002b) point out that the fovea of the eye, the part of the retina with the greatest visual acuity, where the light sensitive cells are packed most closely together, and where most of the color sensitive cone cells are found, only receives light from about 2° of visual angle. Indeed, although the fovea occupies only about 1% of the entire retinal surface, about half of the retinotopic map in primary visual cortex (V1) is devoted to it (Petersen & Diamond, 2001), and our ability to recognize objects and colors in peripheral vision (i.e. using just parts of the retina outside the fovea) is quite limited. We only experience our visual world as rich in detail and color because we normally move our eyes to new fixation points, to focus new information onto the fovea, several times every second (Richardson & Spivey, 2004). Thus, if quasi-pictorial mental images were retinal (or retinotopic) snapshots (as Kosslyn usually seems to hold), only a very small part of an image would appear detailed and colored, which does not seem to match most people's imagery experience. On the face of it, this raises yet another problem for quasi-pictorial theory, but Mast & Kosslyn argue that the recent results on imagery and eye movements contain the seeds of its solution:

One possibility is that eye movements are stored in memory along with the images [i.e., quasi-pictures] that were encoded at each fixation. If so, then eye movements might play an important role in allowing one to visualize a montage, a composite created on the basis of memories of multiple fixations. That is, eye movements could trigger sequences of memories and could also help us to position correctly each image of a part of the field relative to other parts. (Mast & Kosslyn, 2002b p. 271).

Apparently they are suggesting that our ordinarily experienced visual mental images (which according to Kosslyn (1978a) have a visual angle of about 30°), are not single quasi-pictures, but composite mosaics made up of numerous small (foveal) quasi-pictures. This proposal appears to be a significant departure from earlier versions of quasi-pictorialism, but Kosslyn does not mention it in his more recent restatement of his theory (2005). (Hochberg (1968) and Hebb (1968) have sometimes been read as suggesting something quite similar to the Mast & Kosslyn proposal (Farley, 1974, 1976; Laeng & Teodorescu, 2002). However, other versions of enactive theory will not bear this interpretation.)

48. The points are not unconnected. Thomas (2006) argues that the concepts of consciousness and imagination are much more closely connected with one another than is commonly realized; that, indeed, before the 17th century, imagination occupied a very large part of the conceptual space since taken over by consciousness.

49. Aristotle, who may well be said to have invented the concept of imagination (Juhasz, 1971; Schofield, 1978; and see Rees, 1971 n.7), defined it [phantasia] as "(apart from any metaphorical sense of the word) the process by which we say that an image is presented to us" (De Anima 428a). During later antiquity, the Aristotelian concept of phantasia developed into the richer Latin concept of imaginatio (which became our imagination) (Watson, 1988; Cocking, 1991). Virtually every philosopher, from ancient times until well into the 20th century, implicitly accepted Aristotle's definition (White, 1990 part 1; Brann, 1991; Thomas, 1999a), and most modern dictionaries still say something similar. The currency of the idea that imagination is the source of artistic creativity (and, latterly, creativity in other spheres, such as science (e.g., Tyndall, 1872; Van't Hoff, 1878; Brown, 1991; Holton, 1996; Gendler, 1998; Georgiou, 2007)) is largely due to the 18th and 19th century Romantic movement; although the idea has been traced back as far as Flavius Philostratus' Life of Apollonius of Tyana (6 ยง16, in Conybeare, 1912), written in the 3rd century A.D. (Saintsbury, 1900; Osbourne, 1968; Cocking, 1991; see also Kearney, 1988). The connection here with imagery remains strong. Philostratus (or Apollonius, whose words are supposedly being reported) argues that, as opposed to simply producing a likeness of a living model, when a sculptor produces a statue of a god whom he has never actually seen, he must be relying upon his imagination (i.e., he must form a mental image to stand in place of the living model). It is clear that both Philostratus and the Romantic theorists of imagination continued to believe that 'imagination' primarily means the power to produce or experience imagery (Robson, 1989; Brann, 1991). For Coleridge (1817) the power of artistic creativity is, quite explicitly, "secondary imagination."

50. In The Concept of Mind (1949), Gilbert Ryle declared that "There is no special Faculty of Imagination, occupying itself single-mindedly in fancied viewings and hearings", and in the latter half of the 20th century such deflationary views of the imagination (perhaps in reaction to its earlier inflation by Romanticism) became commonplace amongst analytic philosophers. Not only was the faculty as such called into question, but the very notion of imagining. Many late 20th century philosophers held, in effect, that 'imagine' is a polysemous verb, used to mean different, sometimes quite unrelated, things on different occasions: visualize, think creatively, believe falsely, suppose, pretend, and so on (e.g.: Flew, 1953; Strawson, 1971; Gross, 1973; Dix, 1985; Newton, 1989; Sparshott, 1990; Walton, 1990). By contrast, White (1990 part 2) argued that 'imagine' does have a true meaning – it means something like "think of as possibly being so" – but that it has no conceptual connection whatsoever with imagery. Similar views now seem to be quite widespread, though they are rarely defended in any detail (if at all) (e.g.: Rorty, 1988; Nozick, 1993; Johnson, 1993; Nichols & Stich, 2000, Byrne, 2005). Of course, other contemporary philosophers of imagination do not share these deflationary views (e.g.: Warnock, 1976, 1994; Thomas, 1999a; McGinn, 2004), and the view that imagination is a coherent concept, and that it is conceptually related to imagery, has been explicitly defended by Thomas (1997a, 1999b) and Kind (2001).

51. This view also finds some support in the arguments of Rowlands (2006a, 2006b) and Lycan (2006). They argue that (despite what Noë (2004) and other enactive theorists, such as Gibson (1966, 1979), have said) it makes sense to think of enactive perception as giving rise to intentional representations. These need not, however, be the sort of internal symbolic tokens to which enactivists object. Sensorimotor activity itself, it is suggested, can function representationally (Rowlands, 2006a, 2006b; see also Bickhard & Richie, 1983; Heil, 1998 ch. 6; Meijsing, 2006).

52. But Paivio (1971, 1986), amongst others, holds that imagery can be at work in our cognitive processes without our being consciously aware of it. The question is whether such unconscious perceptual representations deserve to be called mental images, and the answer will depend, at least in part, on whether one has a representational or an experiential concept of imagery (see section 1).

53. Abell & Currie (1999), however, argue, as simulationists, against quasi-pictorial theory, and there is little sign that they are any more sympathetic to “propositional” description theory (see also Abell, 2005). Furthermore, Johansson et al. (2006) interpret Barsalou's remarks about simulation as committing him to some form of enactive theory.

54. Fodor, of course, is the leading proponent of the view that all mental representation is, or depends upon, mentalese, the amodal language of thought. However, according to Cummins (1989) Locke (who, of course, shares Prinz's empiricist/perceptual view of concepts) also held a covariational theory of representational content.


Notes to Supplement: Other Quasi-Perceptual Phenomena

1. Random dot stereograms were introduced by Julesz (1971) as a tool for studying stereoscopic vision. An illustrative example (somewhat imperfect, owing to the limitations of web graphics) is shown in figure 1. The two sides of the stereogram are identical except that the dots in a square region in the center of the left-hand image have been shifted a little to the right as compared with those in the right-hand image. This difference is too slight to be visible to even quite careful inspection. However, when the stereogram is viewed through a stereoscope so that the left-hand pattern is presented only to the left eye, and the right-hand only to the right, they will fuse into a single square percept with the central, shifted, square region appearing raised in depth. Of course, the shifted area does not have to be square, so other raised (or depressed) shapes can also be obtained.

Julesz random dot stereogram
Figure 1 (notes)
Example of a Random Dot Stereogram (Julesz, 1971).

According to Stromeyer (1970; Stromeyer & Psotka, 1970), the subject Elizabeth viewed one side of a million-dot stereogram, and then was able to use her eidetic image of it to see the three dimensional shape when presented with the other half of the stereogram some four hours later. This would require, at a minumum, a pixel perfect memory for the exact relative positions of several tens of thousands of dots. She was also shown four different versions of the right-hand part of a 10,000 dot stereogram (each with a different shaped region shifted relative to the left-hand image), and then the next day was shown the corresponding left-hand pattern. It is reported that she was then able to use her eidetic images of each of the four versions of the right-hand pattern, in turn, to experience four differently shaped regions standing out in depth. Given what is now known about the (relatively low) resolving power of the various components of the human visual system (retina, primary visual cortex, etc.) (Van Hooser & Nelson, 2005; Petersen & Diamond, 2002), and given also that other accounts of eidetic imagery describe the images as fading, and becoming irretrievable, a minute or so after the original stimulus has been removed from view (Haber, 1979), it is hard to understand how such feats could truly be possible. Although there are well attested reports in the psychology literature of other people with remarkable mnemonic abilities (with what are popularly, though misleadingly, called "photographic memories"), none of their genuinely impressive accomplishments remotely resemble this feat ascribed to Elizabeth (Neisser, 1982 Part 7).


Notes to Supplement: From the Hellenistic to the Early Modern Era

1. Not to be confused with his uncle, also a philosopher, Giovanni Pico della Mirandola.


Notes to Supplement: Edward B. Titchener: The Complete Iconophile

1. Although many philosophers now associate this view mainly with Berkeley's immediate target, Locke (1700), in fact it goes back at least to Aristotle (De Intrepretatione 16a; De Anima 420b), and, indeed, went virtually unquestioned by language theorists throughout the intervening centuries (Wollock, 1997).


Notes to Supplement: The Perky Experiment

1. But see Schwitzgebel (2004) for a qualified defense (and description) of Titchener's introspective methodology. The reputation of Wundt, who did not use introspection of this now deprecated kind (Danziger, 1979, 1980), has seen a considerable revival in recent decades (e.g. Blumenthal, 1975; Bringman & Tweney, 1980; Fancher, 1996).

2. It is not apparent that Perky took any deliberate steps to induce relaxation in her subjects, but Segal found it essential. Segal attributes this difference in their findings to the fact that her subjects, unlike Perky's, were drawn from among “the suspicious, pragmatic students who populated our campuses in the late 1950s and early 1960s” (Segal, 1971b). Thanks to Orne (1962) and others, psychologists today are very much more alert than they once were to the need to take account of the social psychology and social context of the experimental situation when working with human subjects (see supplement: The Problem of Demand Characteristics in Imagery Experiments).


Notes to Supplement: European Responses: Jaensch, Freud, and Gestalt Psychology

1. Furthermore, according to Brewer & Schommer-Aikins (2006; see also Burbridge, 1994), Galton's claims in this regard are not truly supported by the actual data he gathered (as well as being contradicted by the results of their own attempt to replicate his study). They conclude that there is no valid reason to believe that scientists tend to be deficient in imagery.

2. More recently, reflective scientists and engineers seem more likely to claim that imagery plays a vital role in their professional thought processes (Ferguson, 1977, 1992; Shepard, 1978a,b; Deutsch, 1981; Miller, 1984; Barsalou, 1999; Damasio, 2003).


Notes to Supplement: The American Response: Behaviorist Iconophobia and Motor Theories of Imagery

1. Watson also mentions Fernald's (1912) work on imagery types in this connection. Many psychologists in this period believed that people people could usefully be classified into psychological types according to the preferred or predominant sensory mode of their imagery. Thus some people ("visiles") might think mostly in visual imagery, others ("audiles") in auditory imagery, others in motor or kinaesthetic imagery ("motiles"), and so on, and these types might be expected to be found to correlate with individual differences in cognitive abilities or personality (see Angell, 1906 chap. 8, p. 165). Fernald's findings indicate that the classification of people into imagery types was likely to less straightforward than had previously been realized, as the modes of imagery people report are sensitive to situational and contextual variables. I know of no evidence, however, that, when Watson was writing, anyone but he saw Fernald's work as throwing the theory of imagery types (let alone the very concept of imagery, or psychology itself) into crisis. Neither Fernald herself, nor Titchener (1914) seem to have thought it did, and certainly her work aroused nothing like the widespread controversy of the imageless thought debate. Fernald's work did perhaps pave the way for quite devastating critiques of imagery type theory by the likes of Thorndike (1914 ch. 16) and Griffits (1927). The theory has never recovered from those critiques, but they came later.

2. Skinner, it should be said, resisted the reification of experiences in general quite as much as he resisted that of images in particular.


Notes to Supplement: Dual Coding and Common Coding Theories of Memory

1. That is not to say that Dual Coding theory provides the only viable account of the selective interference phenomenon. In fact, selective interference is also often discussed in the light of the Working Memory theory of Baddeley (1976, 1994; Baddeley & Hitch, 1974; Logie & Baddeley, 1990). According to this theory, the working memory system (roughly equivalent to what other psychologists call short term memory) consists of a central executive that controls a number of semi-independent “slave” subsystems, the most important of which are the articulatory (and/or phonological) loop (that holds and manipulates verbal material in consciousness) and the visuo-spatial scratchpad (that does the same for imagery). Two tasks that simultaneously call upon the same subsystem may be expected to interfere with each other. Baddeley, unlike Paivio, does not seem to be committed to the storage of imaginal and verbal memories in separate long term memory stores. However, it is arguable that for many purposes, including the explanation of selective interference and many mnemonic effects, Working Memory theory and Dual Coding theory function not so much as rival theories, but as alternative formulations of essentially the same theoretical model (Marschark & Hunt, 1989).


Notes to Supplement: Conceptual Issues in Dual Coding Theory

1. Codes are systems of representations, and in the context of the discussion of Dual Coding theory, we are talking about types of mental representation of which we can, at least potentially, be directly conscious, as we often are conscious of both imagery and of “inner speech”. None of the arguments made in this section are intended to rule out the possibility that cognition might also involve other sorts of non-conscious representational codes. These might inlcude not only mentalese (conceived as the underlying code of our thought processes), but also, perhaps, specialized representational systems involved in the processing that (on some views) goes on within informationally encapsulated mental modules, including, perhaps, sensory processing modules (Fodor, 1983; Pylyshyn, 2003 b).

2. A fully consistent defense of the unity and cohesiveness of the imagery code may not be entirely compatible with this sort of Empiricism. But although Paivio (1986) explicitly claims to be an Empiricist, this seems to be mainly a way of indicating his commitment to the primacy of the experimental method in science, as opposed to “rationalistic” methods like computer simulation. In fact, he explicitly rejects the wax impression metaphor that is central to the traditional Empiricist understanding of perception and imagery (and from which, of course, the very notion of a “sense impression” derives) (Paivio, 1977).

3. The rejection of the Empiricist picture of perception, amongst both philosophers and cognitive scientists, takes many and various forms, but many recent theorists now view perception as an essentially active process of environmental exploration, and firmly reject the view of the senses as passive input channels (Gibson, 1966, 1979; Neisser, 1976; Bajcsy, 1988; Ballard, 1991; O'Regan, 1992; Blake & Yuille, 1992; Aloimonos, 1993; Churchland et al., 1994; Akins, 1996; Landy et al., 1996; Thomas, 1999b; O'Regan & Noë, 2001; Noë, 2002a, 2004, 2009). As applied to imagery (Thomas, 1999b), this appears to be consistent with Paivio's characterization of it as “a dynamic process more like active perception than a passive recorder of experience” (Paivio, 1977).

4. McGinn (2004 pp. 27-8) claims that it is impossible to experience both a visual and an auditory image simultaneously, because imagery depends on attention and we cannot give the right sort of focal attention to two different things at once. However, it seems to me that I can and do have imagery experiences that are simultaneously visual and auditory. For example, if I imagine a metal pan falling to the ground, I can both "see" the fall and "hear" the crash. McGinn may have made a case for thinking it is impossible (or very difficult) to have two separate, unrelated images at the same time (regardless of mode), but it does not necessarily follow that we cannot have unitary imagery experiences with more than one sensory aspect.

5. Aristotle (on a plausible and widely accepted interpretation of his texts) associated the imagination [phantasia], “the process by which we say that an image [phantasma] is presented to us” (De Anima 428a 1-4), not with the five special senses (vision, hearing, smell, taste, touch) but with the so called “common sense” [koine aisthesis], the faculty of the soul through which the various deliverances of the special senses are integrated into a coherent and meaningful whole (Beare, 1906; Ando, 1965; Nussbaum, 1978; Modrak, 1987). (Note that the Aristotelian concept of koine aisthesis, although it translates literally as “common sense,” is unrelated to the modern colloquial meaning of that expression.) The most direct textual evidence that Aristotle considered the imagination and the common sense to be fundamentally the same faculty is to be found in De Insomniis (459a) and De Memoria (450a). Kant's view of the relationship between imagination and perception seems, in the relevant respects, to have been broadly similar to this (as, of course, are the views of a myriad other thinkers, past and present, directly or indirectly influenced by Aristotle). The Kantian imagination, the einbildungskraft, is primarily responsible for the synthesis of the manifold of experience (i.e., the deliverances of the individual senses) into a form that the understanding can grasp (Kant, 1781/1787 A120, B180-181, A141-142; see Strawson, 1971; Warnock, 1976). It should be noted that Paivio explicitly denies that there is an Aristotelian common sense (Paivio, 1986 p. 580). However, this is apparently because he interprets Aristotle as proposing something like the “amodal” mentalese representations of the sort championed by Pylyshyn (1978; 1984). There are alternative (and, arguably, less anachronistic) interpretations of Aristotle's theory that Paivio might well find less objectionable (see, e.g., Kosman, 1975; Modrak, 1987).

6. 1975 was the year in which Fodor published his book The Language of Thought, which introduced the concept of mentalese. It is sometimes suggested (though not, I think, by Fodor) that the familiar feeling of having something, such as the answer to a question, “on the tip of one's tongue” amounts to a direct experience of a mentalese representation, equivalent to our experience of imagery. This does not stand up to examination, however. It is true that the hypothesis of the mentalese code enables a simple and appealing explanation of an otherwise somewhat puzzling phenomenon. If information is stored in and retrieved from memory in the form of mentalese, it will clearly need to be translated into a natural language (e.g., English) before it can be spoken or written down. On this basis, the tip of the tongue phenomenon may readily be explained as arising from a successful episode of retrieval from storage, followed by a failure of the stage of translation into natural language. However, even if this is right, mentalese is still functioning as an explanans here, and not an explanandum. It may be arguable that, because the explanation is principled and fairly simple, the tip of the tongue phenomenon provides relatively direct experiential evidence for the reality of mentalese (as, say, Brownian motion provides relatively direct, easily observed evidence for the reality of molecules), but that is not at all the same as having direct experience of mentalese per se (any more than seeing Brownian motion through the microscope is seeing molecules). It is a far cry from the sort of direct, quasi-sensuous phenomenal experience that we enjoy of imagery and inner speech. The tip of the tongue feeling may perhaps be best understood as one of those vague and contentless fringe consciousness phenomena discussed by Mangan (2001). When we experience imagery (or inner speech) we are normally (perhaps even ipso facto) aware of its intentional content, of what is being represented. By contrast, it is of the essence of the tip of the tongue phenomenon that we are not aware of what (if anything) is getting represented.


Notes to Supplement: Mental Rotation

1. Quite aside from Behaviorist scruples, there are still good grounds for skepticism toward peoples' subjective, introspective descriptions of their own cognitive processes (Nisbett & Wilson, 1977; Schwitzgebel, 2002a,b).


Notes to Supplement: The Problem of Demand Characteristics in Imagery Experiments

1. Whether the findings on the "visual angle of the mind's eye" stand up in the same way is much more questionable, although Finke (1989 ch. 2) is still prepared to defend them.


Notes to Supplement: The Quasi-Pictorial Theory of Imagery

1. Kosslyn does not claim that the specific, LISP-like format of deep representation shown in figure 4.4.2_1 is psychologically real. It is just his illustration. Even today, little is known about how such information might actually be represented (if it actually is).

2. Kosslyn explicitly draws the analogy with the 2½D sketch (as described by Marr & Nishihara (1978)) "except that [the 2½D sketch] is far too primitive and unorganized; a parsed 2½D sketch would more closely approximate our surface representation" (Kosslyn, 1980 p. 464; cf Kosslyn, 1994 p. 86).

3. It is perhaps worth noting again that not only is the functional architecture of Descartes' theory of visual perception and imagination (as presented in the Treatise of Man and the Optics (Descartes, 1664, 1637)) virtually identical to that of Kosslyn's theory, but also that the Cartesian mental image functions as a merely quasi picture. Descartes insists that it is of no consequence whether the inner images of his theory (formed on the surface of the pineal gland, the Cartesian visual buffer) actually resemble the things they represent. All that really matters is that they are such as to stimulate the soul (functionally equivalent, in this respect, to Kosslyn's mind's eye function) in such a way as to cause it to have the appropriate experiences (Optics: Discourse 4; see Hyman, 1986). Of course, it turns out that, despite these caveats, the Cartesian inner image is in fact a picture that resembles what it represents, being (during vision) a point-by-point projection of the retinal image upon the pineal surface. However, it also turns out that Kosslyn holds that when his quasi-picture is actually instantiated in the brain, it is rather more literally picture-like than the functional level theory demands (see next note). Pylyshyn (1981, 2002a, 2003b) argues that much of the superficial plausibility of theories of this type depends upon this equivocation. They slide between making appeal to pictures in a fairly literal sense, and appealing to some notion of a quasi-picture (something that is not really a picture at all). The literal picture in the head theory is intuitively appealing, but clearly false, whereas the quasi-picture (not-really-a-picture) theory, while not so obviously false, has no more intrinsic intuitive appeal than explicitly non-pictorial views like Pylyshyn's own.

4. Retinotopic mapping refers to the way in which several visual processing areas of the brain (most notably, but not only, V1, the primary visual cortex), are structured as (rather distorted and low resolution) maps of the light sensitive retina upon which light is focused at the back of the eye (see Petersen & Diamond, 2001). Adjacent regions of the visual cortex correspond to adjacent areas on the retina such that, during vision, the two-dimensional spatial pattern of excitation of the cortical neurons corresponds topologically to the pattern of illumination in the optical image formed on the retina. If, as Kosslyn now holds, these retinotopic maps are the embodiment of his theory's visual buffer, this means that loci in the brain that represent the presence of the same sorts of features (e.g., the color, or the state of motion) of adjacent areas of an imagined visual scene will, after all, be (for the most part) physically adjacent in the brain.

5. Almost half of the retinotopic map in primary visual cortex (V1) is devoted to mapping the fovea, the small central part of the retina where the light sensitive cells are packed most densely and where most of the color-sensitive cone cells are found. However, the fovea is only about 1% of the entire light sensitive retinal surface (Petersen & Diamond, 2001).

6. The mnemonic effects of imagery have also been demonstrated in the congenitally blind (Craig, 1973; Jonides et al., 1975; Cornoldi et al., 1979; Zimler & Keenan, 1983). However, the congenitally blind cannot do absolutely everything that the sighted can do with their imagery (Arditi, Holtzman, & Kosslyn, 1988). Also, much of what they can do often takes more effort and is done more slowly (De Beni & Cornoldi, 1998; Vanlierde & Wanet-Defalque, 2004). Some argue that the congenitally blind may even experience visual imagery, especially in dreams (Keilkopf, 1968; Kaski, 2002; Aleman et al., 2001; Lopes da Silva, 2003; Bértolo, 2005). The more generally accepted view, however, is that although people blinded later in life often continue to experience visual imagery for many years, those blind from birth or early childhood do not. When they use visual language in describing their dreams and other imaginings, they are speaking metaphorically about their spatial understanding, or understanding in general (Schlaegel, 1953; Berger et al., 1962; Kirtley, 1975; Kerr, 1993, 2000; Hurovitz et al., 1999; Kerr & Domhoff, 2004). There is evidence that visual cortex can become active when the congenitally blind experience imagery (Lambert et al., 2004; De Volder et al., 2001), but this may be because so called visual cortex takes on non-visual responsibilities in the blind (Kujala et al., 1995; Röder et al., 1995; Raz et al., 2005). What is clear is that the blind are quite capable of perceiving and understanding spatial structure and spatial relations through the use of their haptic and other senses (Brodey, 1969; Morgan, 1977; Casey, 1978; Landau, Gleitman, & Spelke, 1981; Acredolo, 1983). Absent a prior commitment to picture theory, this seems quite sufficient to account for their spatial imagination abilities, and even their dreams (Kerr, Foulkes, & Schmidt, 1982; Thomas, 1987 §II.B.6; Kerr & Domhoff, 2004).