英文摘要 |
In this paper, I would like to examine the differences in the extraction of features from AI text mining results in relation to the genre of the text, in order to find new perspectives and features for reading and understanding of the work, rather than the conventional data mining direction based on quantitative methods to extract general trends. In the past, I have mainly focused on the genres of literature and editorials, but this time I will expand the scope to include the humanities and social sciences, including philosophy, thought, religion, history, psychology, society, and education, as well as nature discourse. As a result, for humanities and social sciences papers in written form, which are sentences expressing the speaker's thoughts, the multidimensional scaling method can extract the important points of the content quite accurately, but it is difficult to find anything other than keywords in the co-occurrence network. Compared with editorials such as newspaper editorials and novels, editorials and articles are similar in the multidimensional scaling method, but articles are similar to novels in the co-occurrence network. In addition, compared with the case of discourse materials, the two methods of text mining can extract the important keywords of discourse contents respectively, but they cannot guess the details of the contents. However, there is a possibility that the richness of the conversation can be inferred from the difference in the distribution of the elements. In the case of discourse, there is a different tendency from that of written text, and it can be said that there is a possibility that the qualitative difference between written and spoken language can be examined in the future using text mining as a standard. |