English Abstract |
This paper addresses two issues. First, despite the importance of the semantic information in HTML pages, search engines often ignore it because of various technical difficulties. Second, ambiguity sometimes makes the results returned by search engines much less useful. The paper proposes OOSM, a schema model together with a set of information processing tools. OOSM develops the concept of coarse mapping: users may (but are not required to) associate a grammar node with a subsection of an HTML page rather than with a single node. As a result, minor modifications to the annotated HTML page can be tolerated. We believe that OOSM is a suitable solution to the issues presented in this paper. |