INDONESIAN MULTI-DOCUMENT SUMMARIZATION USING FEATURE BASED ARTIFICIAL BEE COLONY
Keywords:
Natural Languange Processing, Multi-Document Summarization, Artificial Bee Colony, Swarm IntelligenceAbstract
Multi-document summarization is one of the problems in natural language processing that aims to produce a summary of several documents that have the same topic. Document summarization technology has now developed rapidly. Various approaches have been applied to obtain more optimal results. One of the superior approaches used is to utilize the swarm intelligence algorithm. This study proposes a multi-document summarization method for Indonesian language texts using the Artificial Bee Colony (ABC) algorithm which uses a sentence feature-based approach. ABC is used to optimize feature weights for sentence weighting. The dataset used in this study is a collection of Indonesian language news documents sourced from various online news media. The ABC-based approach and the use of sentence features have proven effective in handling the complexity of sentence selection and have made a significant contribution in assessing the relevance of the system's summary sentences to expert summaries. The test results using the Rouge matrix show that the proposed method produces a summary with superior quality as seen from the Rouge model value which is better than several other comparative methods. The results of the K-Fold Cross Validation show that the performance of the ABC model built is proven to be consistent across all document variations.
References
Al-Saleh, A.B. and El Bachir Menai, M. 2018. Ant colony system for multi-document summarization. COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings. pp. 734–744.
Allahyari, M., Pouriyeh, S., Assefi, M., Safaei, S., D., E., B., J. and Kochut, K. 2017. Text Summarization Techniques: A Brief Survey. International Journal of Advanced Computer Science and Applications. 8(10).
Bhargav, S., Choudhury, A., Kaushik, S., Shukla, R. and Dutt, V. 2022. A Comparison Study of Abstractive and Extractive Methods for Text Summarization. (April), pp. 601–610.
Chin-Yew Lin 2004. ROUGE: A Package for Automatic Evaluation of Summaries. in Proceedings of the ACL 2004 Workshop on Text Summarization Branches Out, Association for Computational Linguistics. Barcelona. pp. 74–81.
El-Kassas, W.S., Salama, C.R., Rafea, A.A. and Mohamed, H.K. 2021. Automatic text summarization: A comprehensive survey. Expert Systems with Applications. 165, p. 113679.
Karaboga, D. and Akay, B. 2009. A comparative study of Artificial Bee Colony algorithm. Applied Mathematics and Computation. 214(1), pp. 108–132.
Karaboga, D. and Basturk, B. 2008. On the performance of artificial bee colony (ABC) algorithm. Applied Soft Computing Journal. 8(1), pp. 687–697.
Kogilavani, A. and Balasubramani, P. 2010. Clustering and Feature Specific Sentence Extraction Based Summarization of Multiple Documents. International Journal of Computer Science and Information Technology. 2(4), pp. 99–111.
Ma, C., Zhang, W.E., Guo, M., Wang, H. and Sheng, Q.Z. 2022. Multi-document Summarization via Deep Learning Techniques: A Survey. ACM Computing Surveys. 55(5), pp. 1–35.
Mirshojaei, S.H. and Masoomi, B. 2015. Text Summarization Using Cuckoo Search Optimization Algorithm. Journal of Computer & Robotics. 8(2), pp. 19–24.
Muthiah, A. and Rajkumar, R. 2014. A comparison of artificial bee colony algorithm and genetic algorithm to minimize the makespan for job shop scheduling. Procedia Engineering. 97, pp. 1745–1754.
Qi, Y.S., Gan, D., Wen, T.K. and Rui, S. 2021. Parameter tuning for artificial Bee Colony algorihtm. 5(1), pp. 1–4.
Rautray, R. and Balabantaray, R.C. 2017. Cat swarm optimization based evolutionary framework for multi document summarization. Physica A: Statistical Mechanics and its Applications. 477, pp. 174–186.
Rautray, R. and Balabantaray, R.C. 2018. An evolutionary framework for multi document summarization using Cuckoo search approach: MDSCSA. Applied Computing and Informatics. 14(2), pp. 134–144.
Reddy, A.N.K., Lakshmi, L., Dhana Sree Devi, K., Srinivasa Rao, K., Naga Prasad, S. and Nookala, V. 2024. Discrete Bat Algorithm for Efficient Multi-Document Summarization. Ingenierie des Systemes d’Information. 29(5), pp. 1981–1989.
Reddy, R.P., Nara, K. and Reddy, S.S. 2016. A Comparative Study of Text Summarization Based on Synchronous and Asynchronous PSO. International Journal of Advanced Engineering Research and Science. 3(11), pp. 125–130.
Sharma, G. and Sharma, D. 2023. Automatic Text Summarization Methods: A Comprehensive Review. SN Computer Science. 4(1), pp. 1–18.
Widyassari, A.P., Rustad, S., Shidik, G.F., Noersasongko, E., Syukur, A., Affandy, A. and Setiadi, D.R.I.M. 2022. Review of automatic text summarization techniques & methods. Journal of King Saud University - Computer and Information Sciences. 34(4), pp. 1029–1046.
Zhou, Q., Wei, F. and Zhou, M. 2020. At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization. COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference. pp. 5617–5628.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 I Made Widiartha, Rukmi Sari Hartati, Dewa Made Wiharta, Nyoman Putra Sastra

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.