PageRank based semantic similarity measure on a graph based Turkish WordNet

[ X ]

Tarih

2017

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Semantic similarity of texts is one of the important areas of Natural Language Processing, and there are several approaches to measure similarity: statistical, WordNet based, and hybrid. For all of these approaches, a lexical knowledge is used such as corpus or semantic network. WordNet is one of the most preferred and mature lexical knowledge base. In this study, we have focused on measuring semantic similarity of Turkish words with a graph based Turkish WordNet. In order to measure semantic similarities, a PageRank based application was chosen. For testing the success of the proposed system, RG65 standard similarity dataset was translated to Turkish and used as benchmark data. Similarity results of the translated RG65 dataset are computed using Turkish WordNet. Result of the computation shows ?=0.543 correlation with human judgement. Taking into account that Turkish WordNet is very limited in term of number of words and there is no study in this area for Turkish language, it is considered that also the low success for this study is acceptable. © 2017 IEEE.

Açıklama

2nd International Conference on Computer Science and Engineering, UBMK 2017 -- 5 October 2017 through 8 October 2017 -- Antalya -- 132116

Anahtar Kelimeler

BalkaNet, Graph based Turkish WordNet, Natural Language Processing, PageRank, Semantic similarity

Kaynak

2nd International Conference on Computer Science and Engineering, UBMK 2017

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye