Minimalist DCT-based Depthwise Separable Convolutional Neural Network Approach for Tangut Script

Agi Prasetiadi; Julian Saputra; Imada Ramadhanti; Asti Dwi Sripamuji; Risa Riski Amalia

doi:10.20895/dinda.v3i2.1106

Agi Prasetiadi
Julian Saputra
Imada Ramadhanti
Asti Dwi Sripamuji
Risa Riski Amalia

DOI: https://doi.org/10.20895/dinda.v3i2.1106

Keywords: Tangut script, Optical character recognition, Depthwise separable convolutional neural network, Discrete cosine transform

Abstract

The Tangut script, a lesser-explored dead script comprising numerous characters, has received limited attention in deep learning research, particularly in the field of optical character recognition (OCR). Existing OCR studies primarily focus on widely-used characters like Chinese characters and employ deep convolutional neural networks (CNNs) or combinations with recurrent neural networks (RNNs) to enhance accuracy in character recognition. In contrast, this study takes a counterintuitive approach to develop an OCR model specifically for the Tangut script. We utilize shorter layers with slimmer filters using a depthwise separable convolutional neural network (DSCNN) architecture. Furthermore, we preprocess the dataset using a frequency-based transformation, namely the Discrete Cosine Transform (DCT). The results demonstrate successful training of the model, showcasing faster convergence and higher accuracy compared to traditional deep neural networks commonly used in OCR applications.