Publication:
A Semantic Kernel for Text Classification Based on Iterative Higher-Order Relations between Words and Documents

dc.contributor.authorsAltinel, Berna; Ganiz, Murat Can; Diri, Banu
dc.contributor.editorRutkowski, L
dc.contributor.editorKorytkowski, M
dc.contributor.editorScherer, R
dc.contributor.editorTadeusiewicz, R
dc.contributor.editorZadeh, LA
dc.contributor.editorZurada, JM
dc.date.accessioned2022-03-12T16:14:49Z
dc.date.accessioned2026-01-11T18:34:16Z
dc.date.available2022-03-12T16:14:49Z
dc.date.issued2014
dc.description.abstractWe propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order relations between the words and between the documents. Conventional approach in text categorization systems is to represent documents as a Bag of Words (BOW) in which the relations between the words and their positions are lost. Additionally, traditional machine learning algorithms assume that instances, in our case documents, are independent and identically distributed. This approach simplifies the underlying models, but nevertheless it ignores the semantic connections between words as well as the semantic relations between documents that stem from the words. In this study, we improve the semantic knowledge capture capability of a previous work in [1], which is called chi-Sim Algorithm and use this method in the SVM as a semantic kernel. The proposed approach is evaluated on different benchmark textual datasets. Experiment results show that classification performance improves over the well-known traditional kernels used in the SVM such as the linear kernel (one of the state-of-the-art algorithms for text classification system), the polynomial kernel and the Radial Basis Function (RBF) kernel.
dc.identifier.doidoiWOS:000341246000043
dc.identifier.eissn1611-3349
dc.identifier.isbn978-3-319-07172-5; 978-3-319-07173-2
dc.identifier.issn0302-9743
dc.identifier.urihttps://hdl.handle.net/11424/225477
dc.identifier.wosWOS:000341246000043
dc.language.isoeng
dc.publisherSPRINGER-VERLAG BERLIN
dc.relation.ispartofARTIFICIAL INTELLIGENCE AND SOFT COMPUTING ICAISC 2014, PT I
dc.relation.ispartofseriesLecture Notes in Artificial Intelligence
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectmachine learning
dc.subjectsupport vector machine
dc.subjecttext classification
dc.subjecthigher-order paths
dc.subjectsemantic kernel
dc.titleA Semantic Kernel for Text Classification Based on Iterative Higher-Order Relations between Words and Documents
dc.typeconferenceObject
dspace.entity.typePublication
oaire.citation.endPage517
oaire.citation.startPage505
oaire.citation.titleARTIFICIAL INTELLIGENCE AND SOFT COMPUTING ICAISC 2014, PT I
oaire.citation.volume8467

Files