Publication:
Semi-supervised learning using higher-order co-occurrence paths to overcome the complexity of data representation

dc.contributor.authorsGaniz M.C.
dc.date.accessioned2022-03-15T02:12:47Z
dc.date.accessioned2026-01-10T19:09:43Z
dc.date.available2022-03-15T02:12:47Z
dc.date.issued2017
dc.description.abstractWe present a novel approach to semi-supervised learning for text classification based on the higher-order co-occurrence paths of words. We name the proposed method as Semi-Supervised Semantic Higher-Order Smoothing (S3HOS). The S3HOS is built on a tri-partite graph based data representation of labeled and unlabeled documents that allows semantics in higher-order co-occurrence paths between terms (words) to be exploited. There are several graph-based techniques proposed in the literature to diffuse class labels from labeled documents to the unlabeled documents. In this study we propose a different and natural way of estimating class conditional probabilities for the terms in unlabeled documents without need to label the documents first. The proposed approach allows estimating class conditional probabilities for the terms in unlabeled documents and improve the estimation of terms in the labeled documents at the same time. We experimentally show that S3HOS can highly improve the parameter estimation and hence increase the classification accuracy particularly when the amount of the labeled data is scarce but unlabeled data is plentiful. © 2016 IEEE.
dc.identifier.doi10.1109/SMC.2016.7844572
dc.identifier.isbn9781509018970
dc.identifier.urihttps://hdl.handle.net/11424/247826
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Conference Proceedings
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectHigher-Order Naive Bayes
dc.subjectNaive Bayes
dc.subjectSemantic Smoothing
dc.subjectSemi-Supervised Learning
dc.subjectText Classification
dc.titleSemi-supervised learning using higher-order co-occurrence paths to overcome the complexity of data representation
dc.typeconferenceObject
dspace.entity.typePublication
oaire.citation.endPage2247
oaire.citation.startPage2242
oaire.citation.title2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Conference Proceedings

Files