Path: Top -> Journal -> Telkomnika -> 2017 -> Vol.15, No.3, September
Improving DNA Barcode-based Fish Identification System on Imbalanced Data using SMOTE
Oleh : Wisnu Ananta Kusuma, Nurdevi Noviana, Lailan Sahrina Hasibuan, Mala Nurilmala, Telkomnika
Dibuat : 2017-11-07, dengan 1 file
Keyword : DNA Barcode, imbalanced dataset, mislabeled fish, smote, support vector machine
Url : http://journal.uad.ac.id/index.php/TELKOMNIKA/article/view/5011
Sumber pengambilan dokumen : WEB
Problem in imbalanced data is very common in classification or identification. The problem is raised when the number of instances of one class far exceeds the other. In the previous research, our DNA barcode-based Identification System of Tuna and Mackerel was developed in imbalanced dataset. The number of samples of Tuna and Mackerel were much more than the number of other fish samples. Therefore, the accuracy of the classification model was probably still in bias. This research aimed at to employ Synthetic Minority Oversampling Technique (SMOTE) to yield balanced dataset. We used k-mers frequencies from DNA barcode sequence as features and Support Vector Machine (SVM) as classification method. In this research we used trinuclotide (3-mers) and tetranucleotide (4-mers). The training dataset was taken from Barcode of Life Database (BOLD). For evaluating the model, we compared the accuracy of model using SMOTE and without SMOTE in order to classify DNA barcode sequences taken from Department of Aquatic Product Technology, Bogor Agricultural University. The results showed that the accuracy of the model in the species level using SMOTE was 7% and 13% higher than those of non-SMOTE for trinucleotide (3-mers) and tetranucleotide (4-mers), respectively. It is expected that the use of SMOTE, as one of data balancing technique, could increase the accuracy of DNA barcode based fish classification system, particularly in the species level which is difficult to be identified.
Beri Komentar ?#(0) | Bookmark
Properti | Nilai Properti |
---|---|
ID Publisher | gdlhub |
Organisasi | Telkomnika |
Nama Kontak | Herti Yani, S.Kom |
Alamat | Jln. Jenderal Sudirman |
Kota | Jambi |
Daerah | Jambi |
Negara | Indonesia |
Telepon | 0741-35095 |
Fax | 0741-35093 |
E-mail Administrator | elibrarystikom@gmail.com |
E-mail CKO | elibrarystikom@gmail.com |
Print ...
Kontributor...
- , Editor: sukadi
Download...
Download hanya untuk member.
5011-17068-1-PB
File : 5011-17068-1-PB.pdf
(457977 bytes)