Publication: Semi-supervised learning using higher-order co-occurrence paths to overcome the complexity of data representation
| dc.contributor.authors | Ganiz M.C. | |
| dc.date.accessioned | 2022-03-15T02:12:47Z | |
| dc.date.accessioned | 2026-01-10T19:09:43Z | |
| dc.date.available | 2022-03-15T02:12:47Z | |
| dc.date.issued | 2017 | |
| dc.description.abstract | We present a novel approach to semi-supervised learning for text classification based on the higher-order co-occurrence paths of words. We name the proposed method as Semi-Supervised Semantic Higher-Order Smoothing (S3HOS). The S3HOS is built on a tri-partite graph based data representation of labeled and unlabeled documents that allows semantics in higher-order co-occurrence paths between terms (words) to be exploited. There are several graph-based techniques proposed in the literature to diffuse class labels from labeled documents to the unlabeled documents. In this study we propose a different and natural way of estimating class conditional probabilities for the terms in unlabeled documents without need to label the documents first. The proposed approach allows estimating class conditional probabilities for the terms in unlabeled documents and improve the estimation of terms in the labeled documents at the same time. We experimentally show that S3HOS can highly improve the parameter estimation and hence increase the classification accuracy particularly when the amount of the labeled data is scarce but unlabeled data is plentiful. © 2016 IEEE. | |
| dc.identifier.doi | 10.1109/SMC.2016.7844572 | |
| dc.identifier.isbn | 9781509018970 | |
| dc.identifier.uri | https://hdl.handle.net/11424/247826 | |
| dc.language.iso | eng | |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | |
| dc.relation.ispartof | 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Conference Proceedings | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.subject | Higher-Order Naive Bayes | |
| dc.subject | Naive Bayes | |
| dc.subject | Semantic Smoothing | |
| dc.subject | Semi-Supervised Learning | |
| dc.subject | Text Classification | |
| dc.title | Semi-supervised learning using higher-order co-occurrence paths to overcome the complexity of data representation | |
| dc.type | conferenceObject | |
| dspace.entity.type | Publication | |
| oaire.citation.endPage | 2247 | |
| oaire.citation.startPage | 2242 | |
| oaire.citation.title | 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Conference Proceedings |
