英文摘要 |
"Given the profusion of research information contained in prosecution texts, if applied properly, they can not only help crime prevention and forensic research answer key research questions but also serve as a means for the government to promote evidencebased criminal justice policies. However, in the past, legal texts were often manually coded for analysis, which often consumed considerable labor and time costs. Therefore, this study considers the development of an artificial intelligence algorithm model for the automatic interpretation, identification, and coding of prosecutorial texts as an important tool to improve the effectiveness of prosecutorial practice and crime research. This pilot study is based on relatively stable text structure and low complexity of vocabulary information in drug offense indictments. Besides, this study explores the feasibility of automatic text interpretation and coding of indictments on drug charges through natural language analysis and attempts to construct word segmentation and tagging models.This study accomplished three specific results: 1. establishing a Chinese word segmentation model for indictments on illicit drug use; 2. developing an automatic tagging tool for word features of indictments for drug use; 3. designing a manual tagging and viewing interface tool for indictments for drug use offenses. In addition, the research results also indicate that the possibility of training artificial intelligence to automatically deconstruct and decode the contents of indictments is very high. If the standard threshold of the inconsistency rate is less than 10%, the compliance rate of machine tagging is 85.2%, while the rate of manual tagging is 70.4%, which shows that the performance of machine tagging is better than that of manual tagging. In terms of the cost of tagging time, after taking into account the hours of supervised machine training and testing, the time spent on machine tagging can be reduced by 50% compared to manual tagging. Therefore, it is expected that if we continue to accumulate and expand the data, the development of automatic tagging will be able to significantly reduce the tagging time and improve the consistency rate of tagging.This study proposes to continue to devote resources to the development of research on the automatic interpretation and tagging of criminal indictments through artificial intelligence so that in the future, prosecutors can save large costs from manual reading and coding for indictments with the help of AI model. In addition, the use of natural language processing can also facilitate the accumulation of crime data by research units, so as to improve the accuracy and accessibility of crime data as well as the effectiveness of predicting crime patterns and criminal policy recommendations." |