Analyzing the performance of different large language models of chatgpt on turkish homonyms

AYTEKİN, ÇİĞDEM

doi:10.17932/iau.iausbd.2021.021/iausbd_v16i3003

Publication:
Analyzing the performance of different large language models of chatgpt on turkish homonyms

dc.contributor.author	AYTEKİN, ÇİĞDEM
dc.contributor.authors	AYTEKİN Ç., Karabina T. B.
dc.date.accessioned	2024-08-02T07:49:18Z
dc.date.accessioned	2026-01-11T06:01:42Z
dc.date.available	2024-08-02T07:49:18Z
dc.date.issued	2024-07-01
dc.description.abstract	Son zamanların popüler konusu ChatGPT ve gerçekleştirdiği başarılı işler, yapay zekânın ne kadar geliştiğini ve ilerleyen yıllar için vadettiklerini bizlere göstermektedir. ChatGPT’nin hâlihazırda kullanılan Büyük Dil Modelleri arasındaki farklılıklar bu çalışmanın konusunu oluşturmaktadır. ChatGPT-3.5 ve ChatGPT-4’ün performansları Türkçedeki eş adlı kelimeler üzerinden incelenmiştir. Büyük Dil Modelleri oluşturulurken kullanılan Doğal Dil İşleme sistemlerinde aşılması en büyük zorluklardan birisi de bu sistemlerin kelimeanlam belirsizliğini ayırt edebilme becerileridir. Bu belirsizlikleri tespit etmek amacıyla Türkçede en yaygın olarak kullanılan 200 eş adlı kelime örneklem olarak seçilmiştir. Ardından tek bir eş adlı kelimenin, aynı cümle içerisinde iki farklı anlama da gelecek şekilde iki kez kullanılmasıyla cümleler oluşturulmuş ve öncelikle ChatGPT-3.5’den sonra ChatGPT-4’den farklı anlamları tespit etmesi istenmiştir. ChatGPT’ler her iki anlamdan birini bilemediği ve bazen iki anlamı da bilemediği çıktılar üretmiştir. Amaç doğrultusunda ChatGPT-3.5 ve ChatGPT-4 modellerinden alınan çıktılar karşılaştırılmıştır. ChatGPT 3.5’e kıyasla daha fazla parametreye ve veri setine sahip olan ChatGPT-4, beklendiği gibi çok daha iyi bir performans göstermiştir. Başarı oranı dağılım analizi, eş adlı kelimeye göre performans değişikliği, eş adlı kelimenin karakter sayısı ve başarı oranı, istatistiksel testler yapılan diğer analizlerdir. Anahtar Kelimeler: ChatGPT-3.5, ChatGPT-4, Büyük Dil modeli, Eş Adlı Kelime, Dil Bilimsel Belirsizlik.
dc.description.abstract	ChatGPT, the popular topic in recent periods, and its achievements show us how much artificial intelligence has developed and what it promises for the coming years. This study focuses on the differences between ChatGPT and its currently used Large Language Models. The performances of ChatGPT-3.5 and ChatGPT-4 are analyzed on Turkish homonyms. One major challenge faced by Natural Language Processing systems used in the generation of Large Language Models is identifying word-sense ambiguity. In order to detect these ambiguities, the 200 most commonly used synonyms in Turkish were selected as the sample. Then, sentences were formed by using a single homonym twice in the same sentence to convey two different meanings, and ChatGPT-3.5 and then ChatGPT-4 were asked to detect the different meanings. ChatGPTs generated outputs in which they could not know either of the two meanings and sometimes could not know both meanings. In line with the objective, the outputs from ChatGPT-3.5 and ChatGPT-4 models were compared. As expected, ChatGPT-4, with its larger parameters and datasets, outperformed ChatGPT-3.5. Success rate distribution analysis, performance variation based on the homonym, the number of characters of the homonym and the success rate are the other statistical tests carried out. Keywords: ChatGPT-3.5, ChatGPT-4, Large Language Model, Homonym, Linguistic Ambiguity.
dc.identifier.citation	AYTEKİN Ç., Karabina T. B., "ANALYZING THE PERFORMANCE OF DIFFERENT LARGE LANGUAGE MODELS OF CHATGPT ON TURKISH HOMONYMS", İstanbul Aydın Üniversitesi Sosyal Bilimler Dergisi, cilt.16, sa.3, ss.365-390, 2024
dc.identifier.doi	10.17932/iau.iausbd.2021.021/iausbd_v16i3003
dc.identifier.endpage	390
dc.identifier.issn	2757-7252
dc.identifier.issue	3
dc.identifier.startpage	365
dc.identifier.uri	https://dergipark.org.tr/tr/pub/iausos/issue/86239/1444041
dc.identifier.uri	https://hdl.handle.net/11424/297371
dc.identifier.volume	16
dc.language.iso	eng
dc.relation.ispartof	İstanbul Aydın Üniversitesi Sosyal Bilimler Dergisi
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	ChatGPT-3.5
dc.subject	ChatGPT-4
dc.subject	Büyük Dil modeli
dc.subject	Eş Adlı Kelime
dc.subject	Dil Bilimsel Belirsizlik.
dc.subject	Large Language Model
dc.subject	Homonym
dc.subject	Linguistic Ambiguity
dc.title	Analyzing the performance of different large language models of chatgpt on turkish homonyms
dc.title.alternative	Chatgpt’nin farklı büyük dil modelleri performanslarinin türkçedeki eş adli kelimeler üzerinden incelenmesi
dc.type	article
dspace.entity.type	Publication

Files

Original bundle

Now showing 1 - 1 of 1

Name:: file.pdf
Size:: 1.26 MB
Format:: Adobe Portable Document Format

Download

Collections

Araştırma Çıktıları

Publication: Analyzing the performance of different large language models of chatgpt on turkish homonyms

Files

Original bundle

Collections

Publication:
Analyzing the performance of different large language models of chatgpt on turkish homonyms