|
|
||||||||
1From the State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China; the 2Institute of Neuroscience, Chinese Academy of Sciences, Shanghai, China; the 3EENT Hospital, Fudan University, Shanghai, China; and the 4School of Optometry, University of Alabama at Birmingham, Birmingham, Alabama.
| Abstract |
|---|
|
|
|---|
METHODS. To illustrate legibility variations in Chinese characters, visual acuities for six groups of Chinese characters from low to high spatial complexities and one group of Sloan letters were determined in six normal-sighted Chinese observers. The relationship between legibility and optical defocus were then determined for the Landolt C, the Snellen E, and three groups of Chinese characters representing low, medium, and high spatial complexities in 26 normal-sighted Chinese readers.
RESULTS. The acuity size of Chinese characters increased steadily with stimulus complexity, though at a slower rate than would be expected if visual acuity were based on the finest details of the stimuli. The acuity size versus optical defocus functions of three Chinese character groups and the Snellen E had similar slopes and differed only by a vertical shift, depending on the optotype spatial complexity. The function of the Landolt C was significantly steeper.
CONCLUSIONS. The findings indicate that visual acuity assessment in Chinese readers is complicated by the spatial complexity of Chinese characters, but the fact that the Snellen E, which is the current national standard of acuity measurement in China, and Chinese characters showed similar dependence on optical defocus may indicate a potentially valid way to infer functional vision in Chinese readers with Snellen E acuity.
However, there are situations in which the optical conjugation between image plane and the retina is not the main concern.4 Even when refractive errors are optimally corrected, visual performance can be impaired by diseases and trauma of the eye or of the visual neural pathway or by demanding operating conditions, such as low luminance, low contrast, and disabling glare. Therefore, for purposes such as diagnosis of ocular diseases, visual rehabilitation, job qualification, and disability benefit, visual acuity is used as a measurement of functional visionthat is, how well a person can perform vision-related activities under certain conditions. For a visual acuity measurement to be functionally relevant, the stimuli should be closely related to the observers visual tasks. For literate observers, the most important visual task is undoubtedly reading text of the observers native language.
Because of the importance of reading text in functional vision, letter charts have become the dominant way of clinical evaluation of functional vision around the world. However, letters are complex spatial patterns, and what constitutes the "finest detail" in letters cannot be precisely defined. Therefore, uniting visual acuity for refraction and for functional vision has always been a challenge.1 Both international and U.S. standards stipulate that alternative optotypes, such as letters, should be equivalent to the Landolt C in test results. ISO 8597 (International Organization for Standardization) 2 stipulates that a set of optotypes is equivalent to the Landolt C if they differ less than 0.05 log unit. In the U.S. standard, the allowed difference is within 5%. If an alternative set of optotypes is not equivalent to the Landolt C, a size conversion factor should be determined to scale the optotypes. In phonics-based languages, at least in those using Roman alphabets, an agreement between refraction oriented Landolt C acuity and functional vision oriented letter acuity is relatively easy to achieve. Sloan et al.5 tested acuity in 214 eyes with various refractive errors by using the Landolt C and the uppercase English letters CDHKNORSVZ and found that the two measurements correlated highly (Pearson r = 0.90). When such agreement between primary and alternative optotypes can be established, the same unit for visual resolution, MAR (minimum angle of resolution) or logMAR, can be used for functional vision measurement, even though for letter stimuli, what is to be resolved may differ from letter to letter.
Because many phonics-based languages employ only a small number of letters that often have simple and relatively uniform spatial complexities, a subset of 8 to 10 letters can adequately represent the spatial complexity of the entire alphabet. The letters to be used as acuity optotypes can be determined by an exhaustive testing of the legibility of the entire alphabet, to select letters with approximately identical legibility (The identical legibility requirement is essential to modern visual acuity chart design, in which multiple optotypes of nominally identical legibility are arranged on the same line to increase testing accuracy.) Once selected, the set of optotypes can be tested against the standard, such as the Landolt C, to determine acuity equivalence and a conversion factor, if necessary. This method has been used in creating visual acuity charts in English, German, Hindi and Gujarati, Thai, and Arabic.5 6 7 8 9
This optotype selection method, however, may not be applicable to nonalphabetic languages like Chinese. An enormous number of characters are used in written Chinese. Primary school graduates are required to learn more than 2500 frequently used characters during their 6 years of schooling. The criterion for literacy is the ability to recognize 2000 or more characters. It is thus impractical to test the legibility equivalence of even the frequently used characters. In addition, Chinese characters have a wide range of spatial complexities. While all written Chinese characters of the same font type and font size occupy the same square area, a character may consist of 1 to 52 strokes. The wide range of spatial complexity makes it difficult to use a small number of Chinese optotypes to quantify the visual demand for reading Chinese text. Efforts have been made to create visual acuity charts in Chinese. Woo and Lo10 selected a small set of Chinese characters to construct a chart for distance acuity measurement. Because the authors intended to draw the characters according to the Snellen principle (stroke width = one-fifth character height), only simple characters with a few strokes could be used. Even among these simple characters, a large variation of character irradiation was found. For example, the ink area of
![]()
![]()
In this article, we present a thorough investigation of the legibility of written Chinese characters to lay the groundwork for developing a more reliable way to evaluate functional vision in Chinese readers. We first measured the legibility of six groups of representative Chinese optotypes that covered the full range of spatial complexities of frequently used Chinese characters, by applying rigorous psychophysical procedures. The results showed that acuity sizes of Chinese characters increased linearly with their spatial complexity. We then studied the effects of optical defocus on acuities of the Landolt C, the Snellen E, and three Chinese character groups representing low, medium, and high spatial complexities, using a clinically accepted, forced-choice letter recognition method. We found that acuity-versus-defocus functions for the Snellen E and all three Chinese character groups, but not for the Landolt C, were parallel to each other. These results suggest that Snellen E acuity could be used to infer functional vision in Chinese readers.
| Methods |
|---|
|
|
|---|
A unique method was developed to select Chinese optotypes (see the Appendix). Six groups, 10 characters in each, were selected according to the number of strokes per letter (24, 56, 89, 1112, 1315, and 1618 strokes/letter). Characters in each group had sufficient physical similarity, judged by the Euclidian distances among them, and thus might have similar legibility.15 16 17 These Chinese optotypes were drawn in standard bold Heiti (black) font face, which has a relatively uniform stroke width and no flaring or tapering at the beginning or end. The resultant Chinese optotypes, CC1 through CC6, are shown in Figure 1a . For further quantification of the spatial complexity of these Chinese optotypes further, their stroke frequencies were calculated (see the Appendix and Fig. 1b ).
|
Apparatus
The stimuli were generated by a MatLab-based (The MathWorks, Natick, MA) WinVis program (Neurometrics Institute, Oakland, CA) and were presented on a 21-in. color monitor (2048 x 1536 pixels, 0.189 x 0.189 mm/pixel, 75-Hz frame rate). The maximum and minimum luminance of the monitor was 89 and 0.02 cd/m2. All optotypes were minimal-luminance, black figures on a maximum-luminance white background. Observers viewed the displays binocularly in a dimly lit room. An instrument table with a chinheadrest combo was used to maintain correct viewing distance.
Observers and Procedures
In the two parts of the study, determination of the legibility of Chinese characters and the measurement of the effects of optical defocus on Chinese character legibility, different sets of observers and different procedures were used.
Chinese Character Legibility
Six young (mean age, 22.8 years) native Chinese speakers with normal or corrected-to-normal vision served as observers. All observers had a college education and at least 7 years of training in reading and writing English. Except ZJ, a coauthor, all were new to psychophysical observations and were unaware of the purpose of the study. Each observer first underwent refractive testing by a trained technician using a Snellen E light box at the designated viewing distance of 5 m. Most observers were slightly myopic and wore corrections. The average best corrected acuity was 0.114 ± 0.049 logMAR.
Sloan letters and the Chinese optotypes CC1 through CC6 were used. A method of constant stimuli was used to measure the acuity size of a stimulus group. A single letter stimulus was presented at the center of the screen with unrestricted duration. The observers task was to report the stimulus letter from a 10-letter list with a key-press (09). Auditory feedback was given on incorrect responses. Six levels of stimulus angular sizes were tested for each stimulus group. The stimulus angular size was changed by changing viewing distance without changing the physical stimuli on the screen (40 x 40 pixels for Sloan letter and 50 x 50 pixels for Chinese characters). Letter recognition for a specific stimulus group at a specific angular size formed one experimental session (10 letters x 5 trials). One round of experiments consisted of 42 sessions (seven stimulus groups x six stimulus sizes), which were run according to a randomly permuted table for each observer and were typically completed in several days. Four observers performed 10 rounds of experiments, and the other two performed six rounds.
The percentage of correct data for each stimulus group was fitted with a Weibull function: P=1(1
) e(x/th)ß, where P was the percentage correct,
was the guessing rate (0.1 in a 10-AFC trial), x was the stimulus size in arc minutes, ß was the slope of the psychometric function, and th was the threshold size at a 66.9% correct level. In the rest of this article we use "acuity size" to refer to the full height of the threshold optotype in arc minutes, and use "MAR" to refer to one fifth of the acuity size, even though for complex Chinese characters, MAR was not a well-defined stimulus feature.
The Effects of Optical Defocus on Chinese Character Legibility
Twenty-six observers (mean age, 25.9 years) with normal or corrected-to-normal vision served as observers. Landolt Cs and Snellen Es at four orientations and three groups of Chinese optotypes representing low, medium, and high spatial complexities were used. Each stimulus group contained five optotypes (see Fig. 4a ). The Chinese optotypes were extracted from the original CC1, CC3, and CC6 groups shown in Figure 1 . They were shown to have intermediate empiric legibility within their groups (Table 1) . The largest optotypes were 125 x 125 pixels, and the smallest were 25 x 25 pixels. This range was used to ensure good pixel image quality and a close match of logarithmic steps.
|
|
|
|
All observers were first refracted with a Snellen E light box at the designated viewing distance of 5 m. The mean best corrected visual acuity was 0.126 ± 0.054 logMAR. An observer used his or her dominant eye to view the stimulus display from a distance of 6 m, with his or her best correction in a trial frame. Acuity sizes were measured under four conditions: best corrected vision with an additional 0-, +0.5-, +1.0-, and +2.0-D spherical lens in a random testing order. The order of the five optotype groups under each condition was also randomized. If an observer failed to recognize the largest optotype at 6 m due to the addition of a +1.0- or +2.0-D lens, the viewing distance was halved. The experiment was conducted in a dimly lit room. A front surface mirror was used to increase optical distance.
The observers task was to report the five optotypes from left to right. The stimulus was shown continuously until all five optotypes were reported. The observers knew the tested letters well, and they had a list of the tested optotypes in large print at hand. The test began with a large optotype size (usually 0.2 or 0.3 log unit larger than the threshold size estimated in pilot studies). If the observer reported four or five optotypes of a possible five correctly, the optotype size was reduced by approximately 0.05 log unit. If the observer reported three or fewer optotypes correctly, the same optotype size was repeated with a new line of optotypes. If the observer was then able to report four or five optotypes correctly, the optotypes size was reduced, and the test continued. Otherwise, the mean number of correctly read optotypes of this size was recorded, and their contribution to acuity was considered according to the letter-by-letter scoring principle. Acuity was recorded in terms of one fifth of the threshold letter size.
Written informed consent was obtained from all observers. The research adhered to the tenets of the Declaration of Helsinki.
| Results |
|---|
|
|
|---|
Figure 2b shows that Sloan letters had significantly smaller acuity sizes than CC1 stimuli (average 3.41 vs. 4.68 arcmin; F1,5 = 96.29, P < 0.0005), even though both groups had similar stroke frequencies (2.02 vs. 2.22 strokes/letter). This difference may be accounted for by thicker strokes of Sloan letters (Fig. 1) . To test this hypothesis, eight Chinese characters were selected that had structures similar to that of eight of the Sloan letters, with the exception of maybe a rotation and some minor differences, such as the straightness of strokes or stroke endings. These Chinese characters were rendered in two ways, with normal bold Heiti strokes and with graphically thickened strokes similar to those of Sloan letters (CCthin and CCthick in Fig. 3a ). CCthin and CCthick stimuli had stroke frequencies similar to those of the corresponding Sloan letters.
Acuity sizes of these groups were obtained from three young observers in the same way as in the main experiment, and the average acuity sizes of individual optotypes are shown in Figure 3b . The acuity size difference between Sloan letters and regular Chinese bold Heiti characters was reproduced (mean = 3.88 and 4.66 arcmin for Sloan and CCthin, respectively; F1,2 =30.09, P = 0.032). Thickening the strokes reduced the threshold size (mean acuity size from 4.66 arcmin for CCthin to 4.31 arcmin for CCthick; F1,2 = 17.87, P = 0.052). Moreover, the mean acuity size of CCthick was not significantly different from that of Sloan (3.88 arcmin; F1,2 = 9.35, P = 0.092). These results indicate that the smaller acuity sizes of Sloan letters shown in Figure 2b were at least partially due to their thicker stroke width. Other factors that may have contributed to the acuity size difference will be analyzed in the Discussion section.
Effects of Optical Defocus on Chinese Character Legibility
The variation of legibility of Chinese characters shown in Figure 2b was wide. However, we argue (see the Discussion section) that the task of deriving a simple visual function measure would become much more complicated if the pattern of variation changes from one viewing condition to another. To estimate changes of legibility of Chinese characters quantitatively, we tested legibility under a set of most common clinical conditionsnamely, different refractive errors. We added plus lenses to simulate various degrees of refractive errors and measured changes of acuity sizes of the Landolt C, the Snellen E, and the three groups of Chinese characters representing simple, medium, and complex spatial complexities (Fig. 4a) .
When no plus lenses were added (Fig. 4b , at 0.0 D), there was a strong effect of stimulus type (F4.22 = 56.17, P < 0.0005). The Landolt C and the Snellen E had similar logMARs (0.159 and 0.167; F1,25 = 0.334, P = 0.568). The three CC groups all had significantly larger logMARs (0.044, 0.037, and 0.137, for CC1, CC3 and CC6, respectively) than the Landolt C and the Snellen E (F1,25 = 206.40, P < 0.0005). The differences among CC groups were also significant (CC1 vs. CC3: F1,25 = 27.78, P < 0.0005; and CC3 vs. CC6: F1,25 = 43.19, P < 0.0005). CC1, CC3, and CC6 acuity sizes were 0.115, 0.195, and 0.296 logMAR larger than the Landolt C acuity size, respectively. According to ISO 8597,2 these Chinese optotypes were not equivalent to the Landolt C.
When optical defocus was introduced, the logMARs of all five stimulus groups increased linearly with optical defocus (F3,23 = 44.34, P < 0.0005; Fig. 4b ). A linear function, logMAR = a + bD, where D is the added optical defocus in diopters, b is the slope, and a is the logMAR at 0-D optical defocus, provided excellent fits (coefficient of determination, r2 > 0.995) for all five sets of data. Notice that the fitting lines for the Snellen E, CC1, CC3, and CC6 in Figure 4b were almost parallel, as demonstrated by the best-fitting slopes in Figure 4c . The main difference between the Snellen E and the three groups of Chinese characters groups was an upward shift of the line with increasing optotype complexity. Specifically, calculated by the y-intercepts of the best fitting lines, acuities of CC1, CC3, and CC6 were 0.117, 0.210, and 0.291 log units higher than that of the Snellen E, respectively. In other words, acuities of CC1, CC3, and CC6 were roughly 1, 2, and 3 lines of a Bailey-Lovie chart higher than that of the Snellen E, regardless of the amount of optical defocus (up to +2.0 D). On the one hand, the common slopes of these optotypes indicated that a common mechanism might underlie their recognition. The line for the Landolt C optotype, on the other hand, had a steeper slope than others (Figs. 4b 4c) , indicated by a significant interaction between the stimulus groups and optical defocus (F12,14 = 20.92, P < 0.0005).
| Discussion |
|---|
|
|
|---|
Relative Legibility
Relative legibility of optotypes within each stimulus group was calculated and summarized in Table 1 . (Relative legibility indicates the ranking of legibility among a group of letters involved in an acuity test. We used the proportion of correct recognition as the measure of relative legibility. Note that relative legibility is experiment specific. The rankings shown in Table 1 were derived from experiments in which10 stimulus letters were used, and thus may not be generalized to other situations. For example, S was the third least legible among the 10 Sloan letters, but when all 26 uppercase letters were tested together, S was almost always the least legible.18 ) The
2 tests showed that the legibility of optotypes within each stimulus group was not homogeneous (P < 0.0005), indicating that optotypes within each group were not equally legible. In contrast, the variances of relative legibility in our data (Table 1) and that of Ferris et al.19 were not significantly different (F9,9 = 1.228, P = 0.382), indicating the same level of test accuracy. The variances of relative legibility in our CC1 through CC5 data were not significantly different from that of our Sloan data (F9,9 = 2.746, P = 0.427). CC6 had a significantly smaller variance than did the Sloan letters (F9,9 = 7.996, P = 0.037). Therefore, although each Chinese character group contained more legible and less legible characters, the scattering of relative legibility within these stimulus groups was no greater than that of the Sloan letters observed in field tests.19 Therefore, these CC optotypes are acceptable stimuli for assessing legibility at different spatial complexities. These results also suggest that computer evaluation of Euclidean distances among bitmaps is an acceptable way to preselect optotypes of similar legibility.
This study is the first of a series of studies on the visual psychophysics of Chinese character recognition. In later experiments including the optical defocus experiment (Fig. 4) , five optotypes with the intermediate relative legibility from each group were used as representative optotypes of that group. These five optotypes can be treated as equally legible and interchangeable optotypes.
Further Analysis of Acuity Differences between Sloan Letters and Chinese Characters
In Figure 3 , some acuity differences between Chinese characters and Sloan letters remained even when stroke width was equalized. Such differences may be explained by the structural similarity among the eight Chinese characters in the CCthick group. To demonstrate, raw data from the Sloan and CCthick experimental sessions that yielded correct rates between 45% and 85% were organized into letter confusion matrices (Fig. 5) , in which prominent confusions are highlighted by boxes. In a typical letter acuity experiment, the correct rate of a letter stimulus depends on how confusable it is with its fellow members in the stimulus group.20 Among the eight Sloan letters used in our experiment, the three most legible letters were V, H, and Z (91%, 88%, and 76% correct, respectively). The chalice, the two vertical bars, and the two horizontal bars were unique within the group. These letters were not confused consistently with other letters (Fig. 5a) . In CCthick, however, the chalice of V was replaced by a blade of
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
|
A Chinese character expresses a meaning and is thus functionally equivalent to an English word. Chinese can be read twice as fast as English.23 Recognition of Chinese characters is four times as efficient as recognition of five-letter English words (five letters is the average length of English words).22 However, this advantage of Chinese character over English words does not hold at acuity. Our result showed that CC1 was 37.2% less legible than Sloan letters. Meanwhile, Sheedy et al.24 compared acuities of single Sloan letters and lowercase English words (five to six letters) in four font faces and found that lowercase words were 4.5% to 7% less legible than Sloan letters. These data together indicate that CC1 is approximately 33% less legible than lowercase English words. We speculate that in an acuity test, the global properties of the physical structure of the stimulus, whether it is a letter, a character or a word, determine the acuity size. Familiarity with the stimulus or the meaning of the stimulus may have little effect.
Deriving a Visual Function Measurement from Snellen E Acuity
The difference between the easiest among our 60 Chinese characters (
![]()
![]()
It is worth noting that the Chinese characters used in this study were "simplified" characters, which are standard in mainland China and Singapore. Many simplified characters have fewer strokes than the corresponding traditional characters. For instance, the simplified character
![]()
![]()
Finally, it is also worth pointing out that the parallel relationship between the Snellen E and the Chinese characters shown in Figure 4b was obtained by introducing optical defocus. While this method (dioptric blur) has been widely used to simulate refractive errors in the eye, subtle differences may exist. We are currently working on a clinical population to see whether the same relationship holds in naturally occurring refractive errors.
| Conclusions |
|---|
|
|
|---|
| Appendix 1 |
|---|
|
|
|---|
![]() |
Calculation of Optotype Stroke Frequency
Although the number of strokes has been used in many studies of letter recognition to index the complexity of stimuli, it is not a good measure of spatial complexity because the total number of strokes ignores the spatial arrangement of strokes, and thus may not provide a good measurement of stroke density. For example, although the character
![]()
![]()
![]()
![]()
Another measurement of spatial complexity of patterns is perimetric complexity.22 28 Perimetric complexity is a size-invariant measure of dispersion and is defined as the square of inside-and-outside perimeter of a pattern, divided by the ink area. We calculated perimetric complexities of all the optotypes used in our study and found they correlated highly with stroke frequencies (r = 0.956). Because stroke frequency is the more intuitive of the two measurements, we chose to use stroke frequency as our measurement of optotype complexity.
| Acknowledgements |
|---|
| Footnotes |
|---|
Submitted for publication October 5, 2006; revised December 31, 2006 and January 17, 2007; accepted March 16, 2007.
Disclosure: J.-Y. Zhang, None; T. Zhang, None; F. Xue, None; L. Liu, None; C. Yu, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
* Each of the following is a corresponding author: Cong Yu, State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China; yucong{at}bnu.edu.cn. Lei Liu, School of Optometry, University of Alabama at Birmingham, Birmingham, AL; liul7788{at}uab.edu. ![]()
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |