Path: Top -> Journal -> Telkomnika -> 2018 -> Vol. 16, No. 2, April

Data Cleaning Service for Data Warehouse: An Experimental Comparative Study on Local Data

Journal from gdlhub / 2018-05-30 14:34:24
Oleh : Arif Bramantoro, Telkomnika
Dibuat : 2018-05-30, dengan 0 file

Keyword : Data Cleaning Service; data Warehouse; data quality; local data;
Url : http://journal.uad.ac.id/index.php/TELKOMNIKA/article/view/7669
Sumber pengambilan dokumen : WEB

Data warehouse is a collective entity of data from various data sources. Data are prone to several complications and irregularities in data warehouse. Data cleaning service involves identification of errors, removing them and improve the quality of data. Data cleaning service is non trivial activity to ensure data quality. One of the common methods is duplicate elimination. This research focuses on the service of duplicate elimination on local data. It initially surveys data quality focusing on quality problems, cleaning methodology, involved stages and services within data warehouse environment. It also provides a comparison through some experiments on different duplicate elimination services based on different spelling on different pronunciation, misspellings, name abbreviation, honorific prefixes, common nicknames, splitted name and exact match. In addition, the comparison also includes the evaluation of performance for each service based on the required response time, memory load and CPU time, so that in the future these services are reliable to handle big data in data warehouse.

Deskripsi Alternatif :

Data warehouse is a collective entity of data from various data sources. Data are prone to several complications and irregularities in data warehouse. Data cleaning service involves identification of errors, removing them and improve the quality of data. Data cleaning service is non trivial activity to ensure data quality. One of the common methods is duplicate elimination. This research focuses on the service of duplicate elimination on local data. It initially surveys data quality focusing on quality problems, cleaning methodology, involved stages and services within data warehouse environment. It also provides a comparison through some experiments on different duplicate elimination services based on different spelling on different pronunciation, misspellings, name abbreviation, honorific prefixes, common nicknames, splitted name and exact match. In addition, the comparison also includes the evaluation of performance for each service based on the required response time, memory load and CPU time, so that in the future these services are reliable to handle big data in data warehouse.

Beri Komentar ?#(0) | Bookmark

PropertiNilai Properti
ID Publishergdlhub
OrganisasiTelkomnika
Nama KontakHerti Yani, S.Kom
AlamatJln. Jenderal Sudirman
KotaJambi
DaerahJambi
NegaraIndonesia
Telepon0741-35095
Fax0741-35093
E-mail Administratorelibrarystikom@gmail.com
E-mail CKOelibrarystikom@gmail.com

Print ...

Kontributor...

  • Editor: sukadi