|
|
||||||||
1 From the Department of Optometry and Vision Sciences, Cardiff University, Cardiff, Wales, United Kingdom; and the 2 Department of Biosciences, University of Helsinki, Helsinki, Finland.
| Abstract |
|---|
|
|
|---|
METHODS. Contrast energy thresholds (Eth) were measured for flicker at 1.25 to 20 Hz in strong, purely temporal (spatially uniform), additive, external noise. The masking power of white external noise, characterized by its spectral density at zero frequency N0, increases with the duration of the noise frame.
RESULTS. For short noise frame durations, Eth increased in direct proportion to N0, keeping the nominal signal-to-noise ratio [SNR = (Eth/N0)0.5] constant at threshold. The masking effect thus increased with the duration of the noise frame and the noise mimicked white noise. When noise frame duration and N0 increased further, the nominal SNR at threshold started to decrease, indicating that noise no longer mimicked white noise. The minimum number of noise frames per flicker cycle needed to mimic white noise decreased with increasing flicker frequency from 8.3 at 1.25 Hz to 1.6 at 20 Hz.
CONCLUSIONS. The critical high-frequency cutoff of detection-limiting temporal noise in terms of noise frames per signal cycle depends on the temporal frequency of the signal. This is opposite to the situation in the spatial domain and must be taken into consideration when temporal signals are masked with temporal noise.
| Introduction |
|---|
|
|
|---|
The masking power of external noise is described by its spectral density. The spectral density of white spatiotemporal noise is the product of the spatial dimensions of the noise checks, duration of each noise frame, and root mean square (RMS) contrast of noise squared.6 8 Spatial parameters are held constant for purely temporal noise, and its spectral density can be expressed simply as the product of the duration of the noise frame and RMS contrast squared.1 2 9 The masking effect of white temporal noise is thus dependent on its contrast and the duration of each noise frame.
The frame rate of the display limits the temporal bandwidth of noise. Hence, so-called white noise always has a specific high-frequency cutoff, beyond which it is no longer white. Previous research indicates, however, that noise reduces detectability only at the frequencies of the signal and its immediate neighborhood.8 10 11 Regardless of whether the detection mechanism is a filter matched to the signal1 or consists of channels tuned to different temporal frequencies,3 4 the variation of noise is thus relevant only at the frequencies of the matched filter or within the channel detecting the signal. This means that as long as the noise power spectrum is "flat" over the frequency range of the filter or channel detecting the signal, noise can be regarded as white. This seems to be the case both in spatial10 11 12 and temporal2 vision.
When investigating pathologic vision with reduced flicker sensitivity or detection of low or high temporal frequencies at which flicker sensitivity is low, high signal contrasts have to be used together with high spectral density of external noise. Because of the limited contrast range, the noise-masking effect is then increased by increasing the duration of the noise frame. This, however, lowers the high-frequency cutoff (see Kukkonen et al.10 ), and a limit will be reached at which the white noise bandwidth no longer fully covers the temporal frequency spectrum used for signal detection.
In the present study, we determined the lowest high-frequency cutoffthat is, the minimum bandwidth of temporal noise that still acts as white in masking sinusoidal flicker (1.2520 Hz) by determining the longest noise frame duration that still mimics white noise.
| Methods |
|---|
|
|
|---|
The average luminance of the display was 50 cd/m2. To increase the number of gray levels available, the red, green, and blue outputs of the graphics board were combined by means of a video summation device.13 The luminance response of the monitor was linearized by gamma correction. For further details, see Rovamo et al.2
Stimuli
The sinusoidally flickering signal was a sharp-edged circular field 8 cm in diameter. A black cardboard mask limited the equiluminous surround to a circular area 20 cm in diameter. Viewing distance was constant at 115 cm, producing a foveal stimulus 4° in diameter within a surround of 10° in diameter.
Flicker was produced by changing the color look-up table of the graphics board during each vertical retrace period of the display. The total duration of the signal was 2.4, 2, 1, 0.5, and 0.25 seconds at 1.25, 2.5, 5, 10, and 20 Hz, respectively. The number of cycles displayed was thus five cycles for 2.5- to 20-Hz signals. To keep the exposure time and experiment duration reasonable, only three cycles were displayed at 1.25 Hz.
Spatially uniform purely temporal noise with the widest temporal bandwidth was produced by adding a random luminance to the signal at each frame of the display lasting 7.5 ms. The duration of the noise frame was increased by allowing this random increment or decrement to last for several frames. The longest noise frame duration was 545 ms (72 frames) at 1.25 Hz, 455 ms (60 frames) at 2.5 Hz, 227 ms (30 frames) at 5 Hz, and 167 ms (22 frames) at 10 and 20 Hz. At all noise frame durations, the signal luminance varied in every display frame according to the sinusoidal waveform.
The random luminances producing the noise were drawn independently from a Gaussian luminance distribution with zero mean and truncation at ±2.5 SD units. The RMS contrast of temporal noise was 0.15, except at 1.25 Hz (0.25, subject AR; 0.20, subject MT). These contrasts were chosen so that external noise always limited detection. At all flicker frequencies the contrast energy threshold measured with the lowest spectral density of external white noise was always more than 10 times higher than the threshold measured without added noisethat is, when internal neural noise limited detection.
The contrast energy of the flickering signal was calculated by numerical integration across time6
as
![]() | (1) |
t is the nominal duration of each temporal frame of the displaythat is, 1/132 seconds.
The power spectrum of temporal noise is given by6
![]() | (2) |
tn is the duration of each noise frame. At low temporal frequencies, at which noise is white and the spectral density of temporal noise is the same as at zero frequency N0, equation 2
reduces to10
![]() | (3) |
Equation 3
shows that increasing the duration of the noise frame increases the spectral density of temporal noise at zero frequency and within the white bandwidth of noise. The temporal frequency at which the spectral density of noise starts to decreasethat is, noise ceases to be whiteis consequently reduced. Above this corner frequency (2
tn)-1 the average power spectrum begins ringing, according to the Fourier spectrum of a sinc function of equation 2
. For a graphical presentation of the relationship between N0, noise frame duration, and cutoff frequency, see the spatial equivalent.10
![]() | (4) |
![]() | (5) |
|
Thresholds were determined by a two-alternative forced-choice algorithm at the probability level of 84% correct (see Mustonen et al.14 ). Each trial consisted of two exposures of equal duration indicated by a sound signal. The two exposures contained different samples of noise, but only one exposure contained the signal. The subjects task was to indicate which exposure contained the signal by pressing one of two chosen keys on a computer keyboard. Auditory feedback was provided.
The threshold contrast was calculated as the arithmetic mean of the last eight reversal contrasts. Every data point shown is the median of three to five thresholds measured. For three to five threshold measurements, the median is more robust than the mean in estimating the true threshold, because the median is less affected by occasional outlying values.
Subjects
Two experienced subjects (aged 51 and 43 years) with normal vision participated in the experiments. AR was an uncorrected hypermetrope and MT was a corrected astigmatic myope (-0.50/-0.50 ax 80). Both had visual acuity of 6/5. Psychophysical data for subjects with normal vision tend not to vary from one subject to another more than the normal variation within each subjects data (as in the current experiments). It is, therefore, an accepted practice to use just two to three subjects to represent the normal performance of a healthy visual system.
The tenets of the Declaration of Helsinki were observed, and informed consent was obtained from both subjects before the experiments were conducted.
| Results |
|---|
|
|
|---|
With further increase in duration of the noise frame, the nominal SNR started to decrease at all temporal frequencies. The decreasing nominal SNR means that the spectral density of external noise at some frequencies affecting signal detection started to decrease, increasing the true SNR within the detection filter, despite the continuing increase in N0. The slopes of decrease of the nominal SNR ranged from -1.7 to -2.2. They are similar to slopes found previously in spatial experiments,10 probably reflecting the decrease in effective noise within the frequency range used for detection. Ultimately, for long enough noise frame durations, internal neural noise would become dominant, and Eth would settle at a constant level. However, this situation (SNR slope of -0.5) was not reached within the range of frame durations used.
The continuous line is the least-square fit of equation 5 to the data in each panel (Fig. 1) . The critical duration (nc) refers to the point at which SNR started to decrease, thus marking the longest noise frame duration that still mimicked white noise. The nc values determined from the least-square fit (±95% confidence intervals) were 0.12 ± 0.02, 0.22 ± 0.03, 0.30 ± 0.06, 0.37 ± 0.03, and 0.62 ± 0.09 cycles, corresponding to 96, 88, 60, 37, and 31 ms for flicker frequencies 1.25, 2.5, 5, 10, and 20 Hz, respectively.
In Figure 2A ,(nc) is plotted as a function of flicker frequency (f). Because log nc increased more or less linearly with log f, nc was described by nc = afb, where a and b are constants. The best fit of this equation to the data were achieved with a = 0.10 and b = 0.58.
|
The absolute cutoff frequency (Fc) in hertz is calculated as the product of relative cutoff frequency fc and the signal frequency f. Fc was found to be 5.25 ± 0.9, 5.75 ± 0.8, 8.5 ± 1.5, 14 ± 1, and 16 ± 2 Hz at 1.25, 2.5, 5, 10, and 20 Hz, respectively. Fc, plotted in Figure 2B as a function of flicker frequency, was described by Fc = mfp, where m and p are constants. The best fit of this equation to the data was obtained with m = 4.88 and p = 0.42.
| Discussion |
|---|
|
|
|---|
At the cutoff frequency corresponding to the duration of the critical noise frame, the average power spectrum of noise was 41% of its maximum (see equation 2 ), and at higher frequencies it decreased further in an undulating fashion. When the cutoff frequency of the noise fell below the temporal frequency corresponding to the critical noise frame duration, the spectral density of noise averaged over the frequency range of the detector decreased below N0. Hence, the zero-frequency spectral density given by equation 3 is no longer a valid measure of the effective noise. Noise has become nonwhite within the bandwidth of the detection filter.
The increase of the masking effect of external temporal noise with noise frame duration broadens the use of noise in experimental and clinical research. Both diseases of the visual system and the choice of signal parameters can produce poor sensitivity to temporal contrast, which implies that the contrast of external noise is also attenuated. Because high signal contrast is needed for threshold measurement, the remaining contrast range is not sufficient to produce external noise that is stronger than internal noise. If the spectral density of noise is increased by increasing the duration of the noise frame, however, noise contrast can be kept low, and more of the dynamic range of the display is left for the signal. The critical noise frame duration for the signal used must not be exceeded, however.
The critical duration expressed in terms of a fraction of a cycle increased with flicker frequency. The higher the frequency, the fewer noise frames were needed per cycle to mimic white noise. This differs from spatial vision, for which the critical noise check size expressed as a fraction of spatial cycle is constant, independent of the spatial frequency of the signal with constant bandwidth.10 11
In the chain of processes1 2 determining flicker sensitivity, the source of the frequency dependence described herein may be the early physiological transformations shaping neural signals or/and the central signal detector. It would be easy to accommodate the results by ad hoc assumptions about the bandwidths of the matched filter or multiple channels. This is not, however, in agreement with the finding that temporal integration is independent of temporal frequency.15 16 It is, therefore, more likely that the frequency dependence of critical duration originates from known physiological properties of peripheral processing that modify the neural signals before detection.
In a matched filter model, the detector is assumed to be a replica of the signal. If the bandwidth and center frequency of the neural signal at the level of detection were equal to the external signal, the critical number of noise frames per signal cycle should be constant for our signals with constant bandwidth. The results, however, show that detection of flicker at 1.25 Hz was affected by noise up to approximately 4.2 times the nominal signal frequency. The corresponding factors were approximately 2.3, 1.7, and 1.4 at 2.5, 5, and 10 Hz. This means that the lower the nominal center frequency, the more its bandwidth will spread toward higher temporal frequencies.
Considering the known retinal mechanisms, it seems plausible that the power spectrum of the neural signal could spread to frequencies significantly higher than the nominal signal frequency, especially at low flicker frequencies. There are good grounds for thinking that human achromatic flicker detection is based on signals in M-type retinal ganglion cells.2 9 17 The nonlinear MY cells, such as cat Y cells, give both fundamental and frequency-doubled responses to temporal modulation. The relative amplitudes of these components depend on the parameters of stimulation,18 19 but the frequency-doubled component is relatively favored by large-field stimuli that extend far into the receptive field surround, as in the present experiments. Moreover, linear responses at the fundamental frequency (whether in MY cells or in the larger population20 of Mx cells) would be comparatively weak for large-field flicker at low frequencies.21 22 Additional neural modulation above the nominal frequency could come from the convergence of inputs from ON-center and OFF-center cells. We do not know how the brain integrates inputs from different cells and cell types, but a simple assumption would be that detection occurs whenever any type of neural response exceeds a certain SNR. The relative weight of different response types will automatically vary with flicker frequency, and as signal frequencies approach the temporal high-frequency roll-off of the ganglion cell, higher harmonics will be particularly strongly attenuated, and the fundamental will become dominant. With respect to the nature of the neural signal, it is worth pointing out that the threshold percept of flicker without external noise is quite indeterminate and, specifically, does not correspond to the objective signal frequency.23
Retinal low-pass filtering can in principle shift the spectral peak of the neural response somewhat below the nominal frequency at high temporal frequencies. This would occur if the flicker frequency is on the falling high-frequency limb of the modulation transfer function (of an M-cell, for example) resulting in such severe filtering at and higher than 20 Hz that the amplitude of the signal (and noise) is relevant mainly at frequencies below the nominal frequency of the signal. In agreement with this, at 20 Hz the critical cutoff frequency of the noise appears to be slightly below the expected minimum value of two noise frames per flicker cycle (i.e., the cutoff frequency of noise equal to the flicker frequency).
Flicker detection has also been modeled3 4 using a multiple channel approach in which detection is mediated by one broadly tuned low-pass channel and one broadly tuned band-pass channel that includes quite low frequencies but rolls off sharply at higher than 20 Hz. If any subset of our signals were detected using just one of these channels, the critical cutoff frequencies of these signals would correspond to the cutoff of the channel. This could be the case for our signals at 1.25 and 2.5 Hz, because their critical noise cutoff frequencies were almost the same (5.3 and 5.8 Hz) and roughly correspond to the cutoff of the broadly tuned low-pass channel.4 The increase of critical noise cutoff frequency to 14 Hz when our signal frequency increased to 10 Hz could reflect a combination of the responses from the two channels.4 Our 20-Hz signal could be detected using only the band-pass channel, because its peak response was at approximately 20 Hz, and at this frequency the response of the low-pass channel had dropped to one tenth of its maximum. The total amount of noise that affects detection should therefore start decreasing as soon as the noise cutoff frequency decreases below the cutoff frequency of the band-pass channel. The critical cutoff frequency of noise therefore should be above the nominal signal frequency, in disagreement with our experimental finding of 16 Hz.
We conclude that the masking effect of external, purely temporal noise can be increased by increasing the noise frame duration up to a critical value. However, the critical duration, in terms of a fraction of flicker cycle, increases with flicker frequency, in contrast to the situation in analogous experiments on spatial masking.10 11
| Footnotes |
|---|
Submitted for publication September 7, 2001; revised March 18, 2002; accepted April 3, 2002.
Commercial relationships policy: N.
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Jyrki M. Rovamo, Department of Biosciences, Division of Animal Physiology, PO Box 65, University of Helsinki, FIN-00014 Helsinki, Finland; rovamo{at}cardiff.ac.uk.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. Rovamo, H. Kukkonen, A. Raninen, and K. Donner Efficiency of Temporal Integration of Sinusoidal Flicker Invest. Ophthalmol. Vis. Sci., November 1, 2003; 44(11): 5049 - 5055. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |