The role of spatial frequency channels in letter identification

被引:194
作者
Majaj, NJ [1 ]
Pelli, DG [1 ]
Kurshan, P [1 ]
Palomares, M [1 ]
机构
[1] NYU, New York, NY 10003 USA
关键词
letters; spatial frequency; channels; masking; noise additivity; identification; object recognition; spatial vision; scale invariance; scale dependence; contrast sensitivity function; low-frequency cut; squarewaves; sinewaves; most sensitive channel;
D O I
10.1016/S0042-6989(02)00045-7
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
How we see is today explained by physical optics and retinal transduction, followed by feature detection, in the cortex, by a bank of parallel independent spatial-frequency-selective channels. It is assumed that the observer uses whichever channels are best for the task at hand. Our current results demand a revision of this framework: Observers are not free to choose which channels they use. We used critical-band masking to characterize the channels mediating identification of broadband signals: letters in a wide range of fonts (Sloan, Bookman, Kunstler, Yung), alphabets (Roman and Chinese), and sizes (0.1-55degrees). We also tested sinewave and squarewave gratings. Masking always revealed a single channel, 1.6 +/- 0.7 octaves wide, with a center frequency that depends on letter size and alphabet. We define an alphabet's stroke frequency as the average number of lines crossed by a slice through a letter, divided by the letter width. For sharp-edged (i.e. broadband) signals, we find that stroke frequency completely determines channel frequency, independent of alphabet, font, and size. Moreover, even though observers have multiple channels, they always use the same channel for the same signals, even after hundreds of trials, regardless of whether the noise is low-pass, high-pass, or all-pass. This shows that observers identify letters through a single channel that is selected bottom-up, by the signal, not top-down by the observer. We thought shape would be processed similarly at all sizes. Bandlimited signals conform more to this expectation than do broadband signals. Here, we characterize processing by channel frequency. For sinewave gratings, as expected, channel frequency equals sinewave frequency f(channel) = f. For bandpass-filtered letters, channel frequency is proportional to center frequency f(channel) proportional to f(center) (log-log slope 1) when size is varied and the band (c/letter) is fixed, but channel frequency is less than proportional to center frequency f(channel) proportional to f(center)(2/3) (log-log slope 2/3) when the band is varied and size is fixed. Finally, our main result, for sharp-edged (i.e. broadband) letters and squarewaves, channel frequency depends solely on stroke frequency, f(channel)/c/deg = (f(stroke)/10 c/deg)(2/3) , with a log-log slope of 2/3. Thus, large letters (and coarse squarewaves) are identified by their edges small letters (and fine squarewaves) are identified by their gross strokes. (C) 2002 Published by Elsevier Science Ltd.
引用
收藏
页码:1165 / 1184
页数:20
相关论文
共 57 条
[1]  
Ahumada AJ, 1997, J OPT SOC AM A, V14, P2471, DOI 10.1364/JOSAA.14.002471
[2]   SPATIAL-FREQUENCY CHARACTERISTICS OF LETTER IDENTIFICATION [J].
ALEXANDER, KR ;
XIE, W ;
DERLACKI, DJ .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1994, 11 (09) :2375-2382
[3]  
[Anonymous], 1991, J MARK CHANNELS, DOI [DOI 10.1300/J049V01N02_02, DOI 10.1300/J049v01n02_02]
[4]   THE PHYSICAL LIMITS OF GRATING VISIBILITY [J].
BANKS, MS ;
GEISLER, WS ;
BENNETT, PJ .
VISION RESEARCH, 1987, 27 (11) :1915-1924
[5]   SIZE INVARIANCE IN VISUAL OBJECT PRIMING [J].
BIEDERMAN, I ;
COOPER, EE .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1992, 18 (01) :121-133
[6]  
BLAKEMORE C, 1969, J PHYSIOL-LONDON, V200, pP11
[7]   The psychophysics toolbox [J].
Brainard, DH .
SPATIAL VISION, 1997, 10 (04) :433-436
[8]  
BREITMEYER B, 1984, VISUAL MAKING INTEGR
[9]   Visual signal detectability with two noise components: Anomalous masking effects [J].
Burgess, AE ;
Li, X ;
Abbey, CK .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1997, 14 (09) :2420-2442
[10]   IMPLICATIONS OF THE CRAIK-OBRIEN ILLUSION FOR BRIGHTNESS PERCEPTION [J].
BURR, DC .
VISION RESEARCH, 1987, 27 (11) :1903-&