Journal : Jurnal Nasional Teknik Elektro dan Teknologi Informasi, Vol 7, No 3 (2018)

Keyphrase Extraction in Cluster Merging Based on Maximum Common Subgraph (Ekstraksi Frasa Kunci pada Penggabungan Klaster berdasarkan Maximum-Common-Subgraph)

Journal from gdlhub / 2019-11-15 10:57:57
By : Adhi Nurilham, Diana Purwitasari, Chastine Fatichah, JNTETI
Created : 2019-05-07, with 1 file

Keyword : cluster labeling, cluster merging, Frequent Phrase Mining, Maximum Common Subgraph, TopicRank
Url : http://ejnteti.jteti.ugm.ac.id/index.php/JNTETI/article/view/432
Document source : WEB

Document clustering based on topic similarity helps users search a collection of scientific articles. Topic labels are necessary for describing the subjects of the document clusters. Clusters with related subjects or contextual similarities can be merged to produce more descriptive labels. Relations between words in one context can be modelled as a graph. Instead of single words, this paper proposes labeling clusters of scientific articles with phrases, using graph-based cluster merging. The proposed method begins by clustering the scientific articles with K-Means++. Candidate phrases are then extracted from the document clusters using Frequent Phrase Mining, which is inspired by the Apriori algorithm. Each resulting cluster is represented by a graph built from its extracted phrases. Structural similarity between two cluster graphs is indicated by a value calculated with Maximum Common Subgraph (MCS). Clusters are merged if their graphs show structural similarity. The topic labels of the clusters are keyphrases extracted with the TopicRank algorithm from the representation graph of the merged clusters. The merging process, which is the contribution of this paper, takes the topic distribution within clusters into account for phrase extraction. The proposed method is evaluated on the topic coherence of the merged cluster labels. The results show that the proposed method can improve topic coherence on the merged clusters, with the MCS graph size percentage as the key factor. Further observation shows that the merged cluster labels are consistent with the MCS graph.
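
The MCS-based merging step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the cluster graphs are built or how the MCS value is normalised, so everything here is an assumption: nodes are unique phrases, edges link phrases that co-occur in a document, the MCS of two such uniquely labelled graphs is taken as their shared nodes and edges, and the size percentage is the MCS size divided by the size of the larger graph. The function names and the 0.5 threshold are hypothetical.

```python
from itertools import combinations


def phrase_graph(phrase_lists):
    """Build an undirected co-occurrence graph from per-document phrase lists.

    Returns (nodes, edges); each edge is a frozenset of two phrases.
    Assumption: two phrases are linked if they appear in the same document.
    """
    nodes, edges = set(), set()
    for phrases in phrase_lists:
        unique = sorted(set(phrases))
        nodes.update(unique)
        edges.update(frozenset(pair) for pair in combinations(unique, 2))
    return nodes, edges


def mcs_size_ratio(g1, g2):
    """Size of the common subgraph relative to the larger graph.

    Because node labels are unique, the maximum common subgraph is simply
    the intersection of the node sets and of the edge sets.
    """
    (n1, e1), (n2, e2) = g1, g2
    mcs_size = len(n1 & n2) + len(e1 & e2)
    largest = max(len(n1) + len(e1), len(n2) + len(e2))
    return mcs_size / largest if largest else 0.0


def should_merge(g1, g2, threshold=0.5):
    """Hypothetical merge rule: merge when the MCS ratio reaches the threshold."""
    return mcs_size_ratio(g1, g2) >= threshold


# Toy clusters: each inner list holds the phrases mined from one document.
graph_a = phrase_graph([["topic model", "cluster label"], ["cluster label", "graph"]])
graph_b = phrase_graph([["cluster label", "graph"], ["graph", "keyphrase"]])
print(mcs_size_ratio(graph_a, graph_b))
print(should_merge(graph_a, graph_b))
```

In this toy case the two graphs share two phrases and one edge, so clusters with overlapping vocabulary and overlapping phrase relations score higher than clusters that only share isolated phrases.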


Property : Property Value
Publisher ID : gdlhub
Organization : JNTETI
Contact Name : Herti Yani, S.Kom
Address : Jln. Jenderal Sudirman
City : Jambi
Region : Jambi
Country : Indonesia
Telephone : 0741-35095
Fax : 0741-35093
Administrator E-mail : elibrarystikom@gmail.com
CKO E-mail : elibrarystikom@gmail.com


Contributors...

  • Editor: sustriani

File : 432-729-1-SM.pdf (1771664 bytes), download for members only