Path: Top -> Journal -> Jurnal Internasional -> King Saud University -> 2020 -> Volume 32, Issue 1, January
Methodology for fuzzy duplicate record identification based on the semantic-syntactic information of similarity
Oleh : Djulaga Hadzic, Nermin Sarajlic, King Saud University
Dibuat : 2020-01-06, dengan 1 file
Keyword : Duplicate records, Fuzzy, Semantic, Syntactic, Blocking, Windowing
Url : http://www.sciencedirect.com/science/article/pii/S1319157817304512
Sumber pengambilan dokumen : Web
There are different methodologies for identification of fuzzy duplicate records in the process of data cleaning for data warehouse and data mining. The methodologies for duplicate record identification can be classified into three groups: blocking methods, windowing methods, and semantic methods. The article specifically focuses on semantic methods and describes Semantic-Syntactic Method for fuzzy duplicate record identification. Based on the conducted testing, comparative analysis is presented of the results obtained through the Semantic-Syntactic Method and two other standard methods over a selected data set. In the end, the article presents conclusions with regard to the quality and efficiency of the Semantic-Syntactic Method, as well as suggestions for future research in this field.
Beri Komentar ?#(0) | Bookmark
Properti | Nilai Properti |
---|---|
ID Publisher | gdlhub |
Organisasi | King Saud University |
Nama Kontak | Herti Yani, S.Kom |
Alamat | Jln. Jenderal Sudirman |
Kota | Jambi |
Daerah | Jambi |
Negara | Indonesia |
Telepon | 0741-35095 |
Fax | 0741-35093 |
E-mail Administrator | elibrarystikom@gmail.com |
E-mail CKO | elibrarystikom@gmail.com |
Print ...
Kontributor...
- , Editor: Calvin
Download...
Download hanya untuk member.
1-s2
File : 1-s2.0-S1319157817304512-main.pdf
(1140644 bytes)