Comparison of different machine learning techniques on location extraction by utilizing geo-tagged tweets: A case study

Eliguzel, Nazmiye; Cetinkaya, Cihan; Dereli, Turkay

Comparison of different machine learning techniques on location extraction by utilizing geo-tagged tweets: A case study

dc.authorid	Eliguzel, Nazmiye/0000-0001-6354-8215
dc.authorid	Dereli, Turkay/0000-0002-2130-5503
dc.authorid	Cetinkaya, Cihan/0000-0002-5899-8438
dc.contributor.author	Eliguzel, Nazmiye
dc.contributor.author	Cetinkaya, Cihan
dc.contributor.author	Dereli, Turkay
dc.date.accessioned	2025-01-06T17:38:08Z
dc.date.available	2025-01-06T17:38:08Z
dc.date.issued	2020
dc.description.abstract	In emergencies, Twitter is an important platform to get situational awareness simultaneously. Therefore, information about Twitter users' location is a fundamental aspect to understand the disaster effects. But location extraction is a challenging task. Most of the Twitter users do not share their locations in their tweets. In that respect, there are different methods proposed for location extraction which cover different fields such as statistics, machine learning, etc. This study is a sample study that utilizes geo-tagged tweets to demonstrate the importance of the location in disaster management by taking three cases into consideration. In our study, tweets are obtained by utilizing the earthquake keyword to determine the location of Twitter users. Tweets are evaluated by utilizing the Latent Dirichlet Allocation (LDA) topic model and sentiment analysis through machine learning classification algorithms including the Multinomial and Gaussian Naive Bayes, Support Vector Machine (SVM), Decision Tree, Random Forest, Extra Trees, Neural Network, k Nearest Neighbor (kNN), Stochastic Gradient Descent (SGD), and Adaptive Boosting (AdaBoost) classifications. Therefore, 10 different machine learning algorithms are applied in our study by utilizing sentiment analysis based on location-specific disaster-related tweets by aiming fast and correct response in a disaster situation. In addition, the effectiveness of each algorithm is evaluated in order to gather the right machine learning algorithm. Moreover, topic extraction via LDA is provided to comprehend the situation after a disaster. The gathered results from the application of three cases indicate that Multinomial Naive Bayes and Extra Trees machine learning algorithms give the best results with an F-measure value over 80%. The study aims to provide a quick response to earthquakes by applying the aforementioned techniques.
dc.identifier.doi	10.1016/j.aei.2020.101151
dc.identifier.issn	1474-0346
dc.identifier.issn	1873-5320
dc.identifier.scopus	2-s2.0-85090404146
dc.identifier.scopusquality	Q1
dc.identifier.uri	https://doi.org/10.1016/j.aei.2020.101151
dc.identifier.uri	https://hdl.handle.net/20.500.14669/2496
dc.identifier.volume	46
dc.identifier.wos	WOS:000607575400007
dc.identifier.wosquality	Q1
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Elsevier Sci Ltd
dc.relation.ispartof	Advanced Engineering Informatics
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_20241211
dc.subject	Geo-tagged
dc.subject	LDA
dc.subject	Location extraction
dc.subject	Machine learning
dc.subject	Sentiment
dc.subject	Tweet
dc.title	Comparison of different machine learning techniques on location extraction by utilizing geo-tagged tweets: A case study
dc.type	Article

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Comparison of different machine learning techniques on location extraction by utilizing geo-tagged tweets: A case study

Dosyalar

Koleksiyon