英文摘要 |
At present, disease classification mainly relies on humans to read a large amount of text data as a basis for classification. A professional disease classifier requires long-term professional training to perform the complex tasks of ICD-10 (The International Statistical Classification of Diseases and Related Health Problems 10th Revision, ICD-10) disease classification. It takes a lot of time for the classification to make a correct ICD-10 coding for each patient. This study contains 140,000 labeled data and different types of text data, such as discharge note or history. We hope to train an automatic ICD-10 coding system from these text data and have the ability to read and understand the information written by doctors. In this study, the data included discharge note from National Taiwan University Hospital from 2016 to July 2017. We have 0.85 F1-score in ICD-10-CM 21-categories classification and 0.65 F1-score in all code classifications. The results of this study prove that deep learning is worthy of further research in medical system. |