Ambient lighting controller based on reinforcement learning components of multi-agents

A. A. Bielskis, E. Guseinoviene, D. Dzemydiene, D. Drungilas, G. Gricius

Research output: Contribution to journalArticle

6 Citations (Scopus)

Abstract

The paper presents a vision of sustainable eco-social laboratory, the ESLab which might be used to speed up the process of development of the recently proposed by authors of the Smart Eco-Social Apartment. It is presented the multi-agent model of the ambient comfort measurement and environment control system to be used for the development of the ESLab. The human Ambient Lighting Affect Reward index, the ALAR index is proposed at the first time used for development of the Reinforcement Learning Based Ambient Comfort Controller, the RLBACC for the ESLab. The ALAR index is dependent on human physiological parameters: the temperature, the ECG- electrocardiogram and the EDA-electro-dermal activity. The fuzzy logic is used to approximate the ALAR index function by defining two fuzzy inference systems: the Arousal-Valence System, and the Ambient Lighting Affect Reward (ALAR) System. The goal of the RLBACC is to find such the environmental state characteristics that create an optimal comfort for people affected by this environment. The Radial Basis Neural Network is used as the main component of the RLBACC to performing of two roles - the policy structure, known as the Actor, used to select actions, and the estimated value function, known as the Critic that criticizes the actions made by the Actor. The Critic in this paper was used as a value function approximation of the continuous learning tasks of the RLBACC. Ill. 9, bibl. 7 (in English; abstracts in English and Lithuanian).

Original languageEnglish
Pages (from-to)79-84
Number of pages6
JournalElektronika ir Elektrotechnika
Issue number5
Publication statusPublished - May 2012
Externally publishedYes

Fingerprint

Reinforcement learning
Lighting
Controllers
Electrocardiography
Fuzzy inference
Fuzzy logic
Neural networks
Control systems
Temperature

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

Bielskis, A. A., Guseinoviene, E., Dzemydiene, D., Drungilas, D., & Gricius, G. (2012). Ambient lighting controller based on reinforcement learning components of multi-agents. Elektronika ir Elektrotechnika, (5), 79-84.

Ambient lighting controller based on reinforcement learning components of multi-agents. / Bielskis, A. A.; Guseinoviene, E.; Dzemydiene, D.; Drungilas, D.; Gricius, G.

In: Elektronika ir Elektrotechnika, No. 5, 05.2012, p. 79-84.

Research output: Contribution to journalArticle

Bielskis, A. A. ; Guseinoviene, E. ; Dzemydiene, D. ; Drungilas, D. ; Gricius, G. / Ambient lighting controller based on reinforcement learning components of multi-agents. In: Elektronika ir Elektrotechnika. 2012 ; No. 5. pp. 79-84.
@article{e9eaa9f83aff4944b2d865362fea1a66,
title = "Ambient lighting controller based on reinforcement learning components of multi-agents",
abstract = "The paper presents a vision of sustainable eco-social laboratory, the ESLab which might be used to speed up the process of development of the recently proposed by authors of the Smart Eco-Social Apartment. It is presented the multi-agent model of the ambient comfort measurement and environment control system to be used for the development of the ESLab. The human Ambient Lighting Affect Reward index, the ALAR index is proposed at the first time used for development of the Reinforcement Learning Based Ambient Comfort Controller, the RLBACC for the ESLab. The ALAR index is dependent on human physiological parameters: the temperature, the ECG- electrocardiogram and the EDA-electro-dermal activity. The fuzzy logic is used to approximate the ALAR index function by defining two fuzzy inference systems: the Arousal-Valence System, and the Ambient Lighting Affect Reward (ALAR) System. The goal of the RLBACC is to find such the environmental state characteristics that create an optimal comfort for people affected by this environment. The Radial Basis Neural Network is used as the main component of the RLBACC to performing of two roles - the policy structure, known as the Actor, used to select actions, and the estimated value function, known as the Critic that criticizes the actions made by the Actor. The Critic in this paper was used as a value function approximation of the continuous learning tasks of the RLBACC. Ill. 9, bibl. 7 (in English; abstracts in English and Lithuanian).",
author = "Bielskis, {A. A.} and E. Guseinoviene and D. Dzemydiene and D. Drungilas and G. Gricius",
year = "2012",
month = "5",
language = "English",
pages = "79--84",
journal = "Elektronika ir Elektrotechnika",
issn = "1392-1215",
publisher = "Kauno Technologijos Universitetas",
number = "5",

}

TY - JOUR

T1 - Ambient lighting controller based on reinforcement learning components of multi-agents

AU - Bielskis, A. A.

AU - Guseinoviene, E.

AU - Dzemydiene, D.

AU - Drungilas, D.

AU - Gricius, G.

PY - 2012/5

Y1 - 2012/5

N2 - The paper presents a vision of sustainable eco-social laboratory, the ESLab which might be used to speed up the process of development of the recently proposed by authors of the Smart Eco-Social Apartment. It is presented the multi-agent model of the ambient comfort measurement and environment control system to be used for the development of the ESLab. The human Ambient Lighting Affect Reward index, the ALAR index is proposed at the first time used for development of the Reinforcement Learning Based Ambient Comfort Controller, the RLBACC for the ESLab. The ALAR index is dependent on human physiological parameters: the temperature, the ECG- electrocardiogram and the EDA-electro-dermal activity. The fuzzy logic is used to approximate the ALAR index function by defining two fuzzy inference systems: the Arousal-Valence System, and the Ambient Lighting Affect Reward (ALAR) System. The goal of the RLBACC is to find such the environmental state characteristics that create an optimal comfort for people affected by this environment. The Radial Basis Neural Network is used as the main component of the RLBACC to performing of two roles - the policy structure, known as the Actor, used to select actions, and the estimated value function, known as the Critic that criticizes the actions made by the Actor. The Critic in this paper was used as a value function approximation of the continuous learning tasks of the RLBACC. Ill. 9, bibl. 7 (in English; abstracts in English and Lithuanian).

AB - The paper presents a vision of sustainable eco-social laboratory, the ESLab which might be used to speed up the process of development of the recently proposed by authors of the Smart Eco-Social Apartment. It is presented the multi-agent model of the ambient comfort measurement and environment control system to be used for the development of the ESLab. The human Ambient Lighting Affect Reward index, the ALAR index is proposed at the first time used for development of the Reinforcement Learning Based Ambient Comfort Controller, the RLBACC for the ESLab. The ALAR index is dependent on human physiological parameters: the temperature, the ECG- electrocardiogram and the EDA-electro-dermal activity. The fuzzy logic is used to approximate the ALAR index function by defining two fuzzy inference systems: the Arousal-Valence System, and the Ambient Lighting Affect Reward (ALAR) System. The goal of the RLBACC is to find such the environmental state characteristics that create an optimal comfort for people affected by this environment. The Radial Basis Neural Network is used as the main component of the RLBACC to performing of two roles - the policy structure, known as the Actor, used to select actions, and the estimated value function, known as the Critic that criticizes the actions made by the Actor. The Critic in this paper was used as a value function approximation of the continuous learning tasks of the RLBACC. Ill. 9, bibl. 7 (in English; abstracts in English and Lithuanian).

UR - http://www.scopus.com/inward/record.url?scp=84863706891&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84863706891&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:84863706891

SP - 79

EP - 84

JO - Elektronika ir Elektrotechnika

JF - Elektronika ir Elektrotechnika

SN - 1392-1215

IS - 5

ER -