Publication:
A new hybrid semi-supervised algorithm for text classification with class-based semantics

dc.contributor.authorALTINEL GİRGİN, AYŞE BERNA
dc.contributor.authorGANİZ, MURAT CAN
dc.contributor.authorsAltinel, Berna; Ganiz, Murat Can
dc.date.accessioned2022-03-12T20:27:40Z
dc.date.accessioned2026-01-11T17:13:45Z
dc.date.available2022-03-12T20:27:40Z
dc.date.issued2016
dc.description.abstractVector Space Models (VSM) are commonly used in language processing to represent certain aspects of natural language semantics. Semantics of VSM comes from the distributional hypothesis, which states that words that occur in similar contexts usually have similar meanings. In our previous work, we proposed novel semantic smoothing kernels based on classspecific transformations. These kernels use class term matrices, which can be considered as a new type of VSM. By using the class as the context, these methods can extract class specific semantics by making use of word distributions both in documents and in different classes. In this study, we adapt two of these semantic classification approaches to build a novel and high performance semi-supervised text classification algorithm. These approaches include Helmholtz principle based calculation of term meanings in the context of classes for initial classification and a supervised term weighting based semantic kernel with Support Vector Machines (SVM) for the final classification model. The approach used in the first phase is especially good at learning with very small datasets, while the approach in the second phase is specifically good at eliminating noise in a relatively large and noisy training sets when building a classification model. Overall, as a semantic semi-supervised learning algorithm, our approach can effectively utilize abundant source of unlabeled instances to improve the classification accuracy significantly especially when the amount of labeled instances are limited. (C) 2016 Elsevier B.V. All rights reserved.
dc.identifier.doi10.1016/j.knosys.2016.06.021
dc.identifier.eissn1872-7409
dc.identifier.issn0950-7051
dc.identifier.urihttps://hdl.handle.net/11424/233750
dc.identifier.wosWOS:000382592600007
dc.language.isoeng
dc.publisherELSEVIER
dc.relation.ispartofKNOWLEDGE-BASED SYSTEMS
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectSemantics
dc.subjectSemi-supervised classification
dc.subjectText classification
dc.subjectSemantic smoothing kemel
dc.subjectClass-based transformations
dc.subjectSMOOTHING METHOD
dc.subjectKERNEL
dc.titleA new hybrid semi-supervised algorithm for text classification with class-based semantics
dc.typearticle
dspace.entity.typePublication
oaire.citation.endPage64
oaire.citation.startPage50
oaire.citation.titleKNOWLEDGE-BASED SYSTEMS
oaire.citation.volume108

Files