Publication:
Semi-Supervised Learning using Higher-Order Co-occurrence Paths to Overcome the Complexity of Data Representation

dc.contributor.authorsGaniz, Murat Can
dc.date.accessioned2022-03-12T16:16:02Z
dc.date.accessioned2026-01-11T17:17:45Z
dc.date.available2022-03-12T16:16:02Z
dc.date.issued2016
dc.description.abstractWe present a novel approach to semi-supervised learning for text classification based on the higher-order co-occurrence paths of words. We name the proposed method as Semi-Supervised Semantic Higher-Order Smoothing (S3HOS). The S3HOS is built on a tri-partite graph based data representation of labeled and unlabeled documents that allows semantics in higher order co-occurrence paths between terms (words) to be exploited. There are several graph-based techniques proposed in the literature to diffuse class labels from labeled documents to the unlabeled documents. In this study we propose a different and natural way of estimating class conditional probabilities for the terms in unlabeled documents without need to label the documents first. The proposed approach allows estimating class conditional probabilities for the terms in unlabeled documents and improve the estimation of terms in the labeled documents at the same time. We experimentally show that S3HOS can highly improve the parameter estimation and hence increase the classification accuracy particularly when the amount of the labeled data is scarce but unlabeled data is plentiful.
dc.identifier.doidoiWOS:000402634702022
dc.identifier.isbn978-1-5090-1897-0
dc.identifier.issn1062-922X
dc.identifier.urihttps://hdl.handle.net/11424/225690
dc.identifier.wosWOS:000402634702022
dc.language.isoeng
dc.publisherIEEE
dc.relation.ispartof2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC)
dc.relation.ispartofseriesIEEE International Conference on Systems Man and Cybernetics Conference Proceedings
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectSemi-Supervised Learning
dc.subjectNaive Bayes
dc.subjectSemantic Smoothing
dc.subjectHigher-Order Naive Bayes
dc.subjectText Classification
dc.titleSemi-Supervised Learning using Higher-Order Co-occurrence Paths to Overcome the Complexity of Data Representation
dc.typeconferenceObject
dspace.entity.typePublication
oaire.citation.endPage2247
oaire.citation.startPage2242
oaire.citation.title2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC)

Files