Analisis Performa Algoritma Random Forest dan Naive Bayes Multinomial pada Dataset Ulasan Obat dan Ulasan Film
DOI:
https://doi.org/10.22441/incomtech.v12i1.14770Keywords:
Random Forest, Naïve Bayes, Klasifikasi, CountVectorizer, Preprocessing,Abstract
Kemudahan akses informasi memberikan peluang pertukaran informasi antar individu maupun kelompok. Kemudahan akses tersebut memberikan dampak dengan munculnya banyak opini terhadap suatu produk atau topik terhangat. Data opini ulasan dapat diolah menjadi data informasi baru yang memiliki nilai lebih bagi perusahaan maupun pemanfaat data. Pengolahan data ulasan dapat dilakukan dengan menggunakan machine learning dengan algoritma klasifikasi untuk mendapatkan analisis sentimen terhadap produk tertentu. Dataset yang digunakan pada penelitian ini adalah datasetulasan obat dan ulasan film untuk melakukan analisis sentimen dengan mengulas performansi algoritma Random Forestdengan menggunakan beberapa pohon keputusan yang sama yang disatukandan Naïve Bayes Multinomialmenggunakan perhitungan probabilitas pada tingkat akurasi dan waktu latih data. Dalam preprocessing untuk pengolahan data dan penyesuaian tipe data pada metode yang akan digunakan dengan menggunakan CountVectorizer untuk mengubah token kata menjadi vektor dan mengubah data fitur menjadi tipe array. Pembagian data latih dan uji dengan rasio 75:25. Dengan hasil akurasi data terbaik 0,57% dengan menggunakan algoritma Naïve Bayes Multinomial pada data ulasan film. dan latih waktu terlama pada algoritma Random Forestsehingga disarankan untuk dapat menggunakan Term Frequency-Inverse Document Frequency (TF-IDF)sebagai term pembobotan kata untuk mendapatkan hasil akurasi yang lebih baik pada penelitian selanjutnya.
Downloads
References
F. P. d. H. S. Rachmad, "Permandingan Model Deep Learning Untuk Klasifikasi Sentiment Analysis Dengan Teknik Natural Language Preprocessing," Jurnal Teknologi dan Manajemen Informatika, vol. 7, pp. 103-112, 2021.
Ruiz, Cristóbal Colón danIsabel Segura-Bedmar, "Comparing Deep Learning Architectures For Sentiment Analysis on Drug Reviews," Journal of Biomedical Informatics, vol. 110, p. 103539, 2020.
Yadav, Ashima and Vishwakarma, Dinesh Kumar, "A weighted text representation framework for sentiment analysis of medical drug reviews," in 2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM), New Delhi, India, 2020.
Moolthaisong, Kamoltep and Songpan, Wararat, "Emotion Analysis and Classification of Movie Reviews Using Data Mining," in 020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA), Khon Kaen, Thailand, 20202.
Jihad, Muhammad Asjad Adna and Adiwijaya, Adiwijaya and Astut, Widi, "Analisis Sentimen Terhadap Ulasan Film Menggunakan Algoritma Random Forest," eProceedings of Engineering, Vols. 8, No.5, 2021.
Kurniawan, Alit Fajar and Pane, Syafrial Fachri and Awangga, Rolly Maulana, "Prediksi Jumlah Penjualan Rumah di Bojongsoang ditengah Pandemi Covid-19 dengan Metode ARIMA," Jurnal Media Informatika, Vols. 5, No.4, pp. 1479--1487, 2021.
C. a. H. N. H. a. F. I. Prianto, "Analisis Sentimen Terhadap Kandidat Presiden Republik Indonesia Pada Pemilu 2019 di Media Sosial Twitter," Jurnal Media Informatika Budidarma, Vols. 3, No.4, pp. 405--413, 2019.
Farid, Muhammad and Fitrianah, Devi, "Rekomendasi Pemilihan Restoran Berdasarkan Rating Online Menggunakan Algoritma C4. 5," InComTech: Jurnal Telekomunikasi dan Komputer, Vols. 11, No.1, pp. 9-22, 2021.
Sari, Destin Eva Dila Purnama and Sari, Yuita Arum and Furqon, Muhammad Tanzil, "Pembentukan Daftar Stopword menggunakan Zipf Law dan Pembobotan Augmented TF-Probability IDF pada Klasifikasi Dokumen Ulasan Produk," Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, vol. 2548, p. 964X, 2020.
A. L. Hanifah, "Pembagunan Businee Intelligence pada Toserba Koperasi Karyawan Semen Padang (KKSP) Berbasis Dashboard System},," Universitas Andalas, 2020.
Nugroho, Arie and Husin, Abdullah, "Performance Analysis of Random Forest Using Attribute Normalization," Sistemasi: Jurnal Sistem Informasi, Vols. 11, No.1, pp. 186--196, 2022.
E. Fitri, "Analisis Sentimen Terhadap Aplikasi Ruangguru Menggunakan Algoritma Naive Bayes, Random Forest Dan Support Vector Machine," Jurnal Transformatika, Vols. 18, No. 1, pp. 71--80, 2020.
M. I. Putra, "Sistem rekomendasi kelayakan kredit menggunakan metode Random Forest pada BRI Kantor Cabang Pelaihari," UIN Sunan Ampel , Surabaya, 2019.
Sabrani, Alif and Bimantoro, Fitri and others, "Multinomial Naive Bayes untuk Klasifikasi Artikel Online tentang Gempa di Indonesia," Jurnal Teknologi Informasi, Komputer, Dan Aplikasinya (JTIKA), vol. 2, pp. 89-100, 2020.
Doloksaribu, Hiras Parasian and Samuel, Yusran Timur, "Komparasi Algoritma Data Mining untuk Analisis Sentimen Aplikasi Peduli Lindunngi," Jurnal Teknologi Informasi: Jurnal Keilmuan dan Aplikasi Bidang Teknik Informatika, Vols. 16, No.1, pp. 1--11, 2022.
Khan, Md Rafidul Hasan and Afroz, Umme Sunzida and Masum, Abu Kaisar Mohammad and Abujar, Sheikh and Hossain, Syed Akhter, "Sentiment analysis from bengali depression dataset using machine learning," in 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), 2020.
Downloads
Published
How to Cite
Issue
Section
License
The copyright to this article is transferred to Universitas Mercu Buana (UMB) if and when the article is accepted for publication. The undersigned hereby transfers any and all rights in and to the paper including without limitation all copyrights to UMB. The undersigned hereby represents and warrants that the paper is original and that he/she is the author of the paper, except for material that is clearly identified as to its original source, with permission notices from the copyright owners where required. The undersigned represents that he/she has the power and authority to make and execute this assignment.
We declare that:
1. This paper has not been published in the same form elsewhere.
2. It will not be submitted anywhere else for publication prior to acceptance/rejection by this Journal.
3. A copyright permission is obtained for materials published elsewhere and which require this permission for reproduction.
Furthermore, I/We hereby transfer the unlimited rights of publication of the above mentioned paper in whole to UMB. The copyright transfer covers the exclusive right to reproduce and distribute the article, including reprints, translations, photographic reproductions, microform, electronic form (offline, online) or any other reproductions of similar nature.
The corresponding author signs for and accepts responsibility for releasing this material on behalf of any and all co-authors. This agreement is to be signed by at least one of the authors who have obtained the assent of the co-author(s) where applicable. After submission of this agreement signed by the corresponding author, changes of authorship or in the order of the authors listed will not be accepted.
Retained Rights/Terms and Conditions
1. Authors retain all proprietary rights in any process, procedure, or article of manufacture described in the Work.
2. Authors may reproduce or authorize others to reproduce the Work or derivative works for the authors personal use or for company use, provided that the source and the UMB copyright notice are indicated, the copies are not used in any way that implies UMB endorsement of a product or service of any employer, and the copies themselves are not offered for sale.
3. Although authors are permitted to re-use all or portions of the Work in other works, this does not include granting third-party requests for reprinting, republishing, or other types of re-use.









