英文摘要 |
Radicals, as components of Chinese characters, and configurations are integral parts of Chinese orthography. Current studies have proven the psychological entity as well as the pedagogical meaning of radicals; however, little research has been done on the properties of radicals. The present study aims to develop a data-driven and exhaustive searching knowledge base –Chinese Orthography Database-which consists of a radical set and a traditional Chinese character set. Four hundred and thirty-nine radicals are used, with 11 symbols of configurations, to take apart 6097 frequent characters, which is the union of two sets of frequent characters defined by the Big-5 encoding method and the Chinese Knowledge and Information Processing group. These freguent characters are computed by the parameters of Chinese character constituent and exhaustively analyzed, while several orthographic indices are created: (a) radical frequency by type/token, (b) configuration frequency, (c) position-based radical frequency and, (d) neighborhood sizes of radicals. To assist researchers in constructing experimental materials and educators in teaching Chinese, several the applications of the Chinese Orthography Database are discussed. |