Enhancing Film Genre Classification Using FastText Embeddings, Bidirectional GRU (BiGRU), and Attention Mechanisms
DOI:
https://doi.org/10.58776/ijitcsa.v2i3.169Keywords:
Film Genre Classification, FastText Embeddings, Bidirectional GRU (BiGRU), Attention Mechanisms, Natural Language Processing (NLP)Abstract
This research aims to enhance the classification of film genres using advanced natural language processing techniques. By integrating FastText embeddings with Bidirectional Gated Recurrent Units (Bi-GRU) and attention mechanisms, the proposed model addresses the limitations of existing methods that struggle with capturing both local and global dependencies within textual data. The model's performance is evaluated on a dataset from IMDb, demonstrating its capability to predict film genres from textual descriptions accurately. Key contributions include the development of a robust model architecture that effectively handles out-of-vocabulary words and contextual nuances, implementing regularization techniques such as DropConnect to improve generalization, and using advanced embeddings to enhance semantic representation. The results indicate significant improvements in genre classification accuracy, particularly for frequent genres, showcasing the model's potential for practical applications in media content analysis. Future work will address data imbalance and explore more sophisticated architectures to enhance performanc.
References
L. Luo, “Network text sentiment analysis method combining LDA text representation and GRU-CNN,” Pers. Ubiquitous Comput., vol. 23, no. 3–4, pp. 405–412, Jul. 2019, doi: 10.1007/s00779-018-1183-9.
M. Zulqarnain, R. Ghazali, M. G. Ghouse, and M. F. Mushtaq, “Efficient processing of GRU based on word embedding for text classification,” JOIV Int. J. Informatics Vis., vol. 3, no. 4, pp. 377–383, Nov. 2019, doi: 10.30630/joiv.3.4.289.
J. Zhang, F. Liu, W. Xu, and H. Yu, “Feature Fusion Text Classification Model Combining CNN and BiGRU with Multi-Attention Mechanism,” Futur. Internet, vol. 11, no. 11, p. 237, Nov. 2019, doi: 10.3390/fi11110237.
Y. Han, M. Liu, and W. Jing, “Aspect-Level Drug Reviews Sentiment Analysis Based on Double BiGRU and Knowledge Transfer,” IEEE Access, vol. 8, pp. 21314–21325, 2020, doi: 10.1109/ACCESS.2020.2969473.
N. Gruber and A. Jockisch, “Are GRU Cells More Specific and LSTM Cells More Sensitive in Motive Classification of Text?,” Front. Artif. Intell., vol. 3, Jun. 2020, doi: 10.3389/frai.2020.00040.
W. Li, Y. Xu, and G. Wang, “Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network,” IEEE Access, vol. 7, pp. 145944–145952, 2019, doi: 10.1109/ACCESS.2019.2944136.
Y. Cheng, L. Yao, G. Xiang, G. Zhang, T. Tang, and L. Zhong, “Text Sentiment Orientation Analysis Based on Multi-Channel CNN and Bidirectional GRU With Attention Mechanism,” IEEE Access, vol. 8, pp. 134964–134975, 2020, doi: 10.1109/ACCESS.2020.3005823.
E. I. Setiawan, F. Ferry, J. Santoso, S. Sumpeno, K. Fujisawa, and M. H. Purnomo, “Bidirectional GRU for Targeted Aspect-Based Sentiment Analysis Based on Character-Enhanced Token-Embedding and Multi-Level Attention,” Int. J. Intell. Eng. Syst., vol. 13, no. 5, pp. 392–407, Oct. 2020, doi: 10.22266/ijies2020.1031.35.
L. Li, L. Yang, and Y. Zeng, “Improving Sentiment Classification of Restaurant Reviews with Attention-Based Bi-GRU Neural Network,” Symmetry (Basel)., vol. 13, no. 8, p. 1517, Aug. 2021, doi: 10.3390/sym13081517.
W. Ali, Y. Yang, X. Qiu, Y. Ke, and Y. Wang, “Aspect-Level Sentiment Analysis Based on Bidirectional-GRU in SIoT,” IEEE Access, vol. 9, pp. 69938–69950, 2021, doi: 10.1109/ACCESS.2021.3078114.
W. Gu, S. Zheng, R. Wang, and C. Dong, “Forecasting Realized Volatility Based on Sentiment Index and GRU Model,” J. Adv. Comput. Intell. Intell. Informatics, vol. 24, no. 3, pp. 299–306, May 2020, doi: 10.20965/jaciii.2020.p0299.
Z. Liu, B. Zhou, L. Meng, and G. Huang, “Multimodal Sentiment Analysis Using BiGRU and Attention-Based Hybrid Fusion Strategy,” Intell. Autom. Soft Comput., vol. 37, no. 2, pp. 1963–1981, 2023, doi: 10.32604/iasc.2023.038835.
X. Wang, X. Chen, M. Tang, T. Yang, and Z. Wang, “Aspect-Level Sentiment Analysis Based on Position Features Using Multilevel Interactive Bidirectional GRU and Attention Mechanism,” Discret. Dyn. Nat. Soc., vol. 2020, pp. 1–13, Jul. 2020, doi: 10.1155/2020/5824873.
J. Ravichandran, M. Kaden, S. Saralajew, and T. Villmann, “Variants of DropConnect in Learning vector quantization networks for evaluation of classification stability,” Neurocomputing, vol. 403, pp. 121–132, 2020, doi: 10.1016/j.neucom.2019.12.131.
Z. Lian, X. Jing, X. Wang, H. Huang, Y. Tan, and Y. Cui, “DropConnect Regularization Method with Sparsity Constraint for Neural Networks,” Chinese J. Electron., vol. 25, pp. 152–158, 2016, doi: 10.1049/CJE.2016.01.023.
H. Lim, “A Study on the Effect of DropConnect to Control Overfitting in Designing Neural Networks,” pp. 178–183, 2020, doi: 10.3233/faia200780.
J. Choi and S.-W. Lee, “Improving FastText with inverse document frequency of subwords,” Pattern Recognit. Lett., vol. 133, pp. 165–172, 2020, doi: 10.1016/j.patrec.2020.03.003.
E. Dynomant et al., “Word Embedding for the French Natural Language in Health Care: Comparative Study,” JMIR Med. Informatics, vol. 7, 2019, doi: 10.2196/12310.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Muhammad Fairuzabadi

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.