Path: Top -> Journal -> Jurnal Internasional -> King Saud University -> 2014 -> Volume 26, Issue 4, December

An Arabic CCG approach for determining constituent types from Arabic Treebank

Journal from gdlhub / 2017-08-16 13:52:35
Oleh : Ahmed I. El-taher a , * , Hitahm M. Abo Bakr a , Ibrahim Zidan a , Khaled Shaalan, King Saud University
Dibuat : 2014-12-16, dengan 1 file

Keyword : Arabic CCGbank Treebank
Url : http://www.sciencedirect.com/science/article/pii/S1319157814000299
Sumber pengambilan dokumen : web

Converting a treebank into a CCGbank opens the respective language to the sophisticated tools developed for Combinatory Categorial Grammar (CCG) and enriches cross-linguistic development. The conversion is primarily a three-step process: determining constituentsÂ’ types, binarization, and category conversion. Usually, this process involves a preprocessing step to the Treebank of choice for correcting brackets and normalizing tags for any changes that were introduced during the manual annotation, as well as extracting morpho-syntactic information that is necessary for determining constituentsÂ’ types. In this article, we describe the required preprocessing step on the Arabic Treebank, as well as how to determine Arabic constituentsÂ’ types. We conducted an experiment on parts 1 and 2 of the Penn Arabic Treebank (PATB) aimed at converting the PATB into an Arabic CCGbank. The performance of our algorithm when applied to ATB1v2.0 & ATB2v2.0 was 99% identification of head nodes and 100% coverage over the Treebank data.

Beri Komentar ?#(0) | Bookmark

PropertiNilai Properti
ID Publishergdlhub
OrganisasiKing Saud University
Nama KontakHerti Yani, S.Kom
AlamatJln. Jenderal Sudirman
KotaJambi
DaerahJambi
NegaraIndonesia
Telepon0741-35095
Fax0741-35093
E-mail Administratorelibrarystikom@gmail.com
E-mail CKOelibrarystikom@gmail.com

Print ...

Kontributor...

  • , Editor: sukadi

Download...