Publication:
An Efficient Document Categorization Approach for Turkish Based Texts

dc.contributor.authorsSevinç İlhan OMURCA;Semih BAŞ;EKİN EKİNCİ
dc.date.accessioned2022-04-04T18:32:46Z
dc.date.accessioned2026-01-11T15:10:59Z
dc.date.available2022-04-04T18:32:46Z
dc.date.issued2015
dc.description.abstract0
dc.description.abstract: Since, it is infeasible to classify all the documents with human effort due to the rapid and uncontrollable growth in textual data, automatic methods have been approached in order to organize the data. Therefore a support vector machine (SVM) classifier is used for text categorization in this study. In text categorization applications, the text representation process could take a huge computation time on weighting the huge size of terms. So far, lexicons that contain less number of terms are used for the solution in the literature. However it has been observed that these kinds of solutions reduce the accuracy of the text classification. In this paper, the term-document matrix is constructed as user dependent according to the purpose of classification. Since the number of terms is still relatively large, we used a hash table for efficient search of terms. Hereby an efficient and rapid TF-IDF method is introduced to construct a weight-matrix to represent the term-document relations and a study concerning classification of the documents in Turkish based news and Turkish columnists is conducted. With the proposed study, the computational time that is required for term-weighting process is reduced substantially; also 99% accuracy is achieved in determination of the news categories and 98% accuracy is achieved in detection of the columnists.
dc.identifier.issn2147-6799;2147-6799
dc.identifier.urihttps://hdl.handle.net/11424/263164
dc.language.isoeng
dc.relation.ispartofInternational Journal of Intelligent Systems and Applications in Engineering
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectBilgisayar Bilimleri, Yapay Zeka
dc.titleAn Efficient Document Categorization Approach for Turkish Based Texts
dc.typearticle
dspace.entity.typePublication
oaire.citation.issue1
oaire.citation.startPageJul.13
oaire.citation.titleInternational Journal of Intelligent Systems and Applications in Engineering
oaire.citation.volume3

Files