<?xml version="1.0"?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/"><dc:title>Cost minimization in energy communities with multi-agent deep reinforcement learning and linear programming</dc:title><dc:creator>Pokorn,	Matic	(Avtor)
	</dc:creator><dc:creator>Čampa,	Andrej	(Avtor)
	</dc:creator><dc:creator>Smolnikar,	Miha	(Avtor)
	</dc:creator><dc:creator>Mohorčič,	Mihael	(Avtor)
	</dc:creator><dc:creator>Hribar,	Jernej	(Avtor)
	</dc:creator><dc:description>With energy costs on the rise and with ever growing concern for environmental impact, energy providers and regulatory bodies have been pushing for dynamic energy prices as a means to encourage load shifting to reduce daily energy demand variance. Coupled with recent advancements in photovoltaics (PV) power generation and Battery Energy Storage System (BESS) technology, this has encouraged the development of energy communities with one of the goals to mitigate the effect of dynamic prices on homeowners’ energy bills without sacrificing comfort, while at the same time utilizing aggregation of Distributed Energy Resources (DER) to contribute to grid flexibility. In this paper, we present Mathematical Optimization and Deep Reinforcement learning for Energy Cost minimization (MODREC), a decentralised Community Energy Management System (CEMS). MODREC leverages Multi-Agent Deep Reinforcement Learning (MADRL) coupled with Linear Programming (LP) to minimize cost in an energy community by intelligently charging and discharging household BESSs while assuming non-elastic consumer loads. MODREC follows an LP-guided training pipeline, where an optimal strategy computed with LP on historical data is employed to train a set of Deep Reinforcement Learning (DRL) agents, each assigned to a household in the community, that minimize a common cost function. Our main contribution lies in the system-level integration of LP-derived expert supervision with decentralized multi-agent control for community energy cost minimization under dynamic pricing. With MODREC, we manage to save up to 30% of energy costs compared to conventional approaches and efficiently shift energy load to off-peak hours.</dc:description><dc:publisher>IEEE</dc:publisher><dc:date>2026</dc:date><dc:date>2026-03-27 14:46:09</dc:date><dc:type>Neznano</dc:type><dc:identifier>28672</dc:identifier><dc:identifier>UDK: 004.8</dc:identifier><dc:identifier>ISSN pri članku: 2169-3536</dc:identifier><dc:identifier>DOI: 10.1109/ACCESS.2026.3676334</dc:identifier><dc:identifier>COBISS_ID: 273377283</dc:identifier><dc:source>ZDA</dc:source><dc:language>sl</dc:language><dc:rights>© 2026 The Authors. </dc:rights></metadata>
