| 英文摘要 |
To align with international standards and accurately reflect contemporary medical technologies and procedures, the National Health Insurance Administration (NHIA) in Taiwan adopted the 2014 edition of the International Classification of Diseases, 10th Revision, Clinical Modification and Procedure Coding System (ICD-10-CM/PCS) on January 1, 2016. This coding system is now the basis for claims related to outpatient, emergency, and inpatient diagnoses and procedures across contracted healthcare institutions, and underpins the Taiwan Diagnosis-Related Groups payment system (Tw-DRGs). Compared to ICD-9-CM, ICD-10-CM/PCS is more complex, incorporating greater specificity such as laterality, anatomical details, and care circumstances for diagnoses, as well as surgical approaches, device usage, and materials for procedures. This complexity increases the workload for coders, who must spend significantly more time identifying appropriate codes. This study utilizes ICD-10-CM codes assigned by professional coders at a public medical center, along with structured data extracted from electronic discharge summaries, as input features. By leveraging Bio-BERT, a domain-specific natural language processing model, we developed machine learning models to learn and replicate coders’coding patterns. We also evaluated the predictive performance of different models to identify optimal approaches for automated coding. To improve the adoption and accuracy of AI-assisted coding, we implemented an AI-based automated coding web service integrated into the hospital’s disease classification system. This service serves as a decision-support tool for coders, aiming to enhance both the quality and efficiency of ICD-10-CM coding. |