Publication:
A Feature Based Simple Machine Learning Approach with Word Embeddings to Named Entity Recognition on Tweets

dc.contributor.authorGANİZ, MURAT CAN
dc.contributor.authorsTaspinar, Mete; Ganiz, Murat Can; Acarman, Tankut
dc.contributor.editorFrasincar, F
dc.contributor.editorIttoo, A
dc.contributor.editorNguyen, L
dc.contributor.editorMetais, E
dc.date.accessioned2022-03-12T16:23:39Z
dc.date.accessioned2026-01-11T09:05:41Z
dc.date.available2022-03-12T16:23:39Z
dc.date.issued2017
dc.description.abstractNamed Entity Recognition (NER) is a well-studied domain in Natural Language Processing. Traditional NER systems, such as Stanford NER system, achieve high performance with formal and grammatically well-structured texts. However, when these systems are applied to informal and noisy texts, which have mixed language with emoticons or abbreviations, there is a significant degradation in results. We attempt to fill this gap by developing a NER system with using novel term features including Word2vec based features and machine learning based classifier. We describe the features and Word2Vec implementation used in our solution and report the results obtained by our system. The system is quite efficient and scalable in terms of classification time complexity and shows promising results which can be potentially improved with larger training sets or with the use of semi-supervised classifiers.
dc.identifier.doi10.1007/978-3-319-59569-6_30
dc.identifier.eissn1611-3349
dc.identifier.isbn978-3-319-59569-6; 978-3-319-59568-9
dc.identifier.issn0302-9743
dc.identifier.urihttps://hdl.handle.net/11424/225952
dc.identifier.wosWOS:000434206800030
dc.language.isoeng
dc.publisherSPRINGER INTERNATIONAL PUBLISHING AG
dc.relation.ispartofNATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017
dc.relation.ispartofseriesLecture Notes in Computer Science
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectNamed entity recognition
dc.subjectWord2Vec
dc.subjectWord embeddings
dc.subjectClassification
dc.subjectMachine learning
dc.titleA Feature Based Simple Machine Learning Approach with Word Embeddings to Named Entity Recognition on Tweets
dc.typeconferenceObject
dspace.entity.typePublication
oaire.citation.endPage259
oaire.citation.startPage254
oaire.citation.titleNATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017
oaire.citation.volume10260

Files