Kodlayıcı kod çözücü ve dikkat algoritmaları kullanılarak karakter tabanlı kelime üretimi

İNAN, TİMUR

doi:10.17341/gazimmfd.1206277

dc.contributor.author	İNAN, TİMUR
dc.contributor.authors	Ergin İ., İnan T.
dc.date.accessioned	2024-06-05T14:36:30Z
dc.date.accessioned	2026-01-10T21:51:55Z
dc.date.available	2024-06-05T14:36:30Z
dc.date.issued	2024-05-01
dc.description.abstract	Bu çalışmada, derin öğrenme algoritmalarından kodlayıcı-kod çözücü ve dikkat mimarisi kullanılarak karakter tabanlı Türkçe dil bilgisi kurallarına uygun anlamlı kelime üretimi amaçlanmıştır. Geliştirilen modelin sonuçları diğer derin öğrenme algoritmaları olan LSTM ve GRU modellerinin sonuçları ile karşılaştırılmaktadır. LSTM ve GRU modelleri ile oluşturulan dil modelleri 100 ve 200 epoch değerlerinde ve sıcaklık örnek alma yönteminin farklı eşik değerlerinde birbirine yakın sonuçlar verdiği görülmektedir. Bu modellerden en yüksek başarı değerini 200 epoch ve 0,5 sıcaklık eşik değerinde %88,40 ile GRU modeli vermektedir. Bu çalışma için geliştirilen kodlayıcı-kod çözücü ve dikkat dil modeli ise 100 ve 200 epoch değerlerinde ve sıcaklık örnek alma yönteminin farklı eşik değerlerinde en yüksek başarı değerini 200 epoch ve 0,5 sıcaklık eşik değerinde %91,90 ile vermektedir. Yapılan denemeler sonunda, kodlayıcı-kod çözücü ve dikkat mimarisi modeli LSTM modeline göre ortalama olarak %2,83 ve GRU modeline göre ortalama olarak %0,19 oranında daha fazla başarı göstermiştir.
dc.description.abstract	This study aims to generate meaningful words, character by character, that conform to Turkish grammar rules, using an encoder-decoder architecture with attention, a deep learning approach. The results of the developed model are compared with those of two other deep learning models, LSTM and GRU. The language models built with LSTM and GRU gave similar results at 100 and 200 epochs and at different threshold values of the temperature sampling method. Of these two, the GRU model achieved the highest success rate, 88.40%, at 200 epochs and a temperature threshold of 0.5. The encoder-decoder with attention language model developed for this study was evaluated under the same settings (100 and 200 epochs, different temperature thresholds) and achieved its highest success rate, 91.90%, at 200 epochs and a temperature threshold of 0.5. Averaged over the experiments, the encoder-decoder with attention model was 2.83% more successful than the LSTM model and 0.19% more successful than the GRU model.
dc.identifier.citation	Ergin İ., İnan T., "Kodlayıcı kod çözücü ve dikkat algoritmaları kullanılarak karakter tabanlı kelime üretimi", Gazi üniversitesi mühendislik mimarlık fakültesi dergisi, cilt.39, sa.3, ss.1999-2010, 2024
dc.identifier.doi	10.17341/gazimmfd.1206277
dc.identifier.endpage	2010
dc.identifier.issn	1300-1884
dc.identifier.issue	3
dc.identifier.startpage	1999
dc.identifier.uri	https://dergipark.org.tr/tr/pub/gazimmfd/issue/82783/1206277
dc.identifier.uri	https://hdl.handle.net/11424/296991
dc.identifier.volume	39
dc.language.iso	tur
dc.relation.ispartof	Gazi üniversitesi mühendislik mimarlık fakültesi dergisi
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Bilgisayar Bilimleri
dc.subject	Yapay Zeka, Bilgisayarda Öğrenme ve Örüntü Tanıma
dc.subject	Mühendislik ve Teknoloji
dc.subject	Computer Sciences
dc.subject	Artificial Intelligence, Computer Learning and Pattern Recognition
dc.subject	Engineering and Technology
dc.subject	Mühendislik, Bilişim ve Teknoloji (ENG)
dc.subject	Bilgisayar Bilimi
dc.subject	BİLGİSAYAR BİLİMİ, YAPAY ZEKA
dc.subject	Engineering, Computing & Technology (ENG)
dc.subject	COMPUTER SCIENCE
dc.subject	COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
dc.subject	Bilgisayarla Görme ve Örüntü Tanıma
dc.subject	Bilgisayar Bilimi Uygulamaları
dc.subject	Yapay Zeka
dc.subject	Bilgisayar Bilimi (çeşitli)
dc.subject	Genel Bilgisayar Bilimi
dc.subject	Fizik Bilimleri
dc.subject	Computer Vision and Pattern Recognition
dc.subject	Computer Science Applications
dc.subject	Artificial Intelligence
dc.subject	Computer Science (miscellaneous)
dc.subject	General Computer Science
dc.subject	Physical Sciences
dc.subject	Artificial intelligence
dc.subject	machine learning
dc.subject	natural language processing
dc.subject	language model
dc.subject	text generation
dc.subject	deep learning
dc.subject	Yapay zekâ
dc.subject	makina öğrenmesi
dc.subject	doğal dil işleme
dc.subject	dil modeli
dc.subject	metin üretimi
dc.subject	derin öğrenme
dc.title	Kodlayıcı kod çözücü ve dikkat algoritmaları kullanılarak karakter tabanlı kelime üretimi
dc.title.alternative	Character-based word generation using encoder-decoder and attention algorithms
dc.type	article
dspace.entity.type	Publication
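
The abstract reports results at different values of the temperature sampling method without describing it. As a rough illustration only, the sketch below shows the common formulation of temperature sampling over a next-character distribution: each probability is raised to the power 1/T and the result is renormalized, so T below 1 sharpens the distribution and T above 1 flattens it. The function names, the four-character alphabet, and the probabilities are hypothetical and are not taken from the paper; the authors' implementation may differ.

```python
import math
import random


def apply_temperature(probs, temperature):
    """Rescale a probability distribution as p_i^(1/T), then renormalize."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Work in log space; zero-probability entries stay at zero.
    scaled = [math.exp(math.log(p) / temperature) if p > 0 else 0.0
              for p in probs]
    total = sum(scaled)
    return [s / total for s in scaled]


def sample_char(chars, probs, temperature, rng=random):
    """Draw one character from the temperature-adjusted distribution."""
    weights = apply_temperature(probs, temperature)
    return rng.choices(chars, weights=weights, k=1)[0]


# Hypothetical next-character distribution (not data from the paper).
chars = ["a", "e", "k", "l"]
probs = [0.5, 0.3, 0.15, 0.05]

# T = 0.5 is the setting the abstract reports as giving the best results.
sharpened = apply_temperature(probs, 0.5)
next_char = sample_char(chars, probs, 0.5)
```

With T = 0.5 the most likely character becomes even more likely, which tends to favor well-formed output at the cost of variety; T = 1.0 leaves the model's distribution unchanged.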

Files

Original bundle

Name: file.pdf
Size: 803.95 KB
Format: Adobe Portable Document Format

Collections

Araştırma Çıktıları
