Context detection and identification in multi-agent reinforcement learning with non-stationary environment = Çok etmenli pekiştirmeli öğrenmede devingen ortamlarda bağlam değişim tespiti ve tanımlama

TÜMER, MUSTAFA BORAHAN

doi:10.1109/siu55565.2022.9864802

dc.contributor.author	TÜMER, MUSTAFA BORAHAN
dc.contributor.authors	Talha Selamet E., Tumer B.
dc.date.accessioned	2022-12-23T08:52:46Z
dc.date.accessioned	2026-01-11T13:47:34Z
dc.date.available	2022-12-23T08:52:46Z
dc.date.issued	2022-01-01
dc.description.abstract	© 2022 IEEE. Reinforcement learning methods are mostly built on the assumption that environments are stationary. However, most real-world environments are non-stationary; that is, we assume they are composed of several stationary components (i.e., sub-environments or contexts). Methods that rely on the stationarity assumption are therefore not capable of learning non-stationary environments. The Reinforcement Learning - Context Detection (RL-CD) method enables the agent to learn the environment without prior information, detect the environment's context change points, and create a partial model for each context. The underlying environment of this approach is single-agent, and the approach has shortcomings for multi-agent learning. In this study, we introduce a new approach called Multi-agent reinforcement learning-context detection (MARL-CD), which can both detect context change points and enable agents to learn non-stationary environments in multi-agent settings. This approach is based on the RL-CD approach. MARL-CD is more efficient at detecting context changes created by the agents in the environment as well as context changes of the environment itself. It enables an agent to detect context changes not only from changes in environment dynamics but also from policy changes of the other agents in the environment. Experimental results show that, with the proposed approach, the agents spend 16% less energy and detect the change points more accurately than with RL-CD.
dc.description.abstract	Pekiştirmeli öğrenme yaklaşımları, çoğunlukla ortamın durağan olması varsayımıyla etmenin öğrenmesini konu alır. Fakat, gerçek hayat uygulamalarında ortam durağan değildir. Birçok durağan ortamın bir araya gelmesiyle oluşan devingen ortamlardır. Ortamda birden fazla etmen bulunabilir ve bu etmenler de ortamı devingen hale getirmektedir. Pekiştirmeli öğrenme-bağlam sezme (RL-CD) [1] yöntemi, etmenin devingen ortam hakkında önsel bir bilgisi olmadan öğrenmesini ve bağlam değişimlerinin belirlenmesini sağlayan yaklaşımdır. Bu yaklaşımın temelindeki ortamda tek etmen vardır ve çok etmenli öğrenim için eksiklikleri bulunmaktadır. Bu çalışmada çok etmenli devingen ortamlarda hem bağlam değişim noktalarını sezebilen hem de etmenlerin ortamları öğrenebilmesine olanak sağlayan çok etmenli pekiştirmeli öğrenme-bağlam sezme (MARL-CD) adında yeni bir yaklaşım geliştirilmiştir. Bu yaklaşım RL-CD yöntemini temel alır. Çok etmenli öğrenmede, etmenlerin ortam üzerinde oluşturdukları devingenliği sezmesi ve bağlam değişikliğini belirlemesi yönüyle daha verimlidir. Bağlamdaki değişiklikleri yalnızca ortam dinamiklerinin değişiminin yanı sıra ortamdaki etmenlerin politika değişiklikleriyle de belirleyebilmesini sağlar. Bu çalışmadaki yaklaşımda, etmenler enerjilerini %16 daha az harcayarak ve değişim noktalarını daha doğru sezmesi açısından RL-CD’ye daha verimli olduğu, deney sonuçları ile gösterilmiştir.
dc.identifier.citation	Talha Selamet E., Tumer B., "Context Detection and Identification In Multi-Agent Reinforcement Learning With Non-Stationary Environment = Çok Etmenli Pekiştirmeli Öğrenmede Devingen Ortamlarda Bağlam Değişim Tespiti ve Tanımlama", 30th Signal Processing and Communications Applications Conference, SIU 2022, Safranbolu, Türkiye, 15 - 18 Mayıs 2022
dc.identifier.doi	10.1109/siu55565.2022.9864802
dc.identifier.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85138734884&origin=inward
dc.identifier.uri	https://hdl.handle.net/11424/283896
dc.language.iso	tur
dc.relation.ispartof	30th Signal Processing and Communications Applications Conference, SIU 2022
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Sosyal ve Beşeri Bilimler
dc.subject	Sosyoloji
dc.subject	Kütüphanecilik
dc.subject	Bilgi Sistemleri, Haberleşme ve Kontrol Mühendisliği
dc.subject	Sinyal İşleme
dc.subject	Bilgisayar Bilimleri
dc.subject	Algoritmalar
dc.subject	Veritabanı ve Veri Yapıları
dc.subject	Mühendislik ve Teknoloji
dc.subject	Social Sciences and Humanities
dc.subject	Sociology
dc.subject	Library Sciences
dc.subject	Information Systems, Communication and Control Engineering
dc.subject	Signal Processing
dc.subject	Computer Sciences
dc.subject	algorithms
dc.subject	Database and Data Structures
dc.subject	Engineering and Technology
dc.subject	Mühendislik, Bilişim ve Teknoloji (ENG)
dc.subject	Sosyal Bilimler (SOC)
dc.subject	Bilgisayar Bilimi
dc.subject	Mühendislik
dc.subject	Sosyal Bilimler Genel
dc.subject	BİLGİSAYAR BİLİMİ, YAPAY ZEKA
dc.subject	BİLGİSAYAR BİLİMİ, YAZILIM MÜHENDİSLİĞİ
dc.subject	TELEKOMÜNİKASYON
dc.subject	MÜHENDİSLİK, ELEKTRİK VE ELEKTRONİK
dc.subject	BİLGİ BİLİMİ VE KÜTÜPHANE BİLİMİ
dc.subject	Engineering, Computing & Technology (ENG)
dc.subject	Social Sciences (SOC)
dc.subject	COMPUTER SCIENCE
dc.subject	ENGINEERING
dc.subject	SOCIAL SCIENCES, GENERAL
dc.subject	COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
dc.subject	COMPUTER SCIENCE, SOFTWARE ENGINEERING
dc.subject	TELECOMMUNICATIONS
dc.subject	ENGINEERING, ELECTRICAL & ELECTRONIC
dc.subject	INFORMATION SCIENCE & LIBRARY SCIENCE
dc.subject	Bilgisayar Ağları ve İletişim
dc.subject	Fizik Bilimleri
dc.subject	Bilgisayar Bilimi Uygulamaları
dc.subject	Bilgisayarla Görme ve Örüntü Tanıma
dc.subject	Yazılım
dc.subject	Bilgi Sistemleri ve Yönetimi
dc.subject	Sosyal Bilimler ve Beşeri Bilimler
dc.subject	Computer Networks and Communications
dc.subject	Physical Sciences
dc.subject	Computer Science Applications
dc.subject	Computer Vision and Pattern Recognition
dc.subject	Software
dc.subject	Information Systems and Management
dc.subject	Social Sciences & Humanities
dc.subject	context detection
dc.subject	multi-agent
dc.subject	non-stationary environment
dc.subject	Reinforcement learning
dc.subject	Pekiştirmeli öğrenme
dc.subject	devingen ortamlar
dc.subject	bağlam sezme
dc.subject	çoklu etmenli öğrenme
dc.title	Context detection and identification in multi-agent reinforcement learning with non-stationary environment = Çok etmenli pekiştirmeli öğrenmede devingen ortamlarda bağlam değişim tespiti ve tanımlama
dc.type	conferenceObject
dspace.entity.type	Publication

Files

Original bundle

Name: file.pdf
Size: 806.21 KB
Format: Adobe Portable Document Format

Collections

Araştırma Çıktıları
