This study aims to utilize machine learning in the detection of accounting fraud in textual information, focusing on the analysis performance and interpretability of the model. This study considers a manager’s motivation to conceal the fraudulent financial statement for content functions by applying the obfuscation hypothesis, for text functions by applying the information manipulation theory and for interpersonal functions by applying interpersonal deception theory. The analysis shows that the rates of katakana and alphabet characters are higher in fraudulent firms than in nonfraudulent firms, supporting the obfuscation hypothesis. In addition, the rates of numbers and proper nouns are lower in fraudulent firms than in nonfraudulent firms, supported by the information manipulation theory. Furthermore, when the performance of fraud detection models (decision tree, random forest, XGBoost, LightGBM, CatBoost) is compared, it was found that CatBoost had the highest performance. Data Availability: Data are available from sources identified in the paper. JEL Classifications: M41; M42; C45; C55; K42.