Turkish Named Entity Discovery Based on Termsets

dc.authoridOzel, Selma Ayse/0000-0001-9201-6349
dc.contributor.authorCoban, Onder
dc.contributor.authorOzel, Selma Ayse
dc.contributor.authorIean, Ali
dc.date.accessioned2025-01-06T17:43:46Z
dc.date.available2025-01-06T17:43:46Z
dc.date.issued2019
dc.description4th International Conference on Computer Science and Engineering (UBMK) -- SEP 11-15, 2019 -- Samsun, TURKEY
dc.description.abstractNamed Entity Recognition (NER) is a subtask of the information extraction process and aims to discover named entities in unstructured texts. Previous studies on NER mostly use statistical machine learning models instead of using classifiers since solving this problem as a classification task requires to deal with quite high dimensional and sparse vector spaces. In this paper, we take NER as a classical text classification problem and extract nominal features from each token in the unstructured text sequence. We convert each token to a document transaction and then, we use frequent termset mining to extract termset features and apply termset weighting to classify named entities. Therefore we deal with lower dimensional feature spaces. Our experimental results obtained on a large Turkish dataset show that frequent termsets and their weighting scheme can be used in NER task.
dc.description.sponsorshipIEEE,IEEE Turkey Sect
dc.identifier.doi10.1109/ubmk.2019.8907039
dc.identifier.endpage32
dc.identifier.isbn978-1-7281-3964-7
dc.identifier.scopus2-s2.0-85076211334
dc.identifier.scopusquality0
dc.identifier.startpage28
dc.identifier.urihttps://doi.org/10.1109/ubmk.2019.8907039
dc.identifier.urihttps://hdl.handle.net/20.500.14669/2778
dc.identifier.wosWOS:000609879900006
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherIEEE
dc.relation.ispartof2019 4th International Conference on Computer Science and Engineering (Ubmk)
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_20241211
dc.subjectNamed entity recognition
dc.subjectfrequent itemset min-ing
dc.subjecttermsets
dc.subjecttext classification
dc.titleTurkish Named Entity Discovery Based on Termsets
dc.typeConference Object

Dosyalar