Show simple item record

dc.contributor.authorMurad, Abdulmajid Abdullah Yahya
dc.contributor.authorKraemer, Frank Alexander
dc.contributor.authorBach, Kerstin
dc.contributor.authorTaylor, Gavin
dc.date.accessioned2019-08-19T08:44:03Z
dc.date.available2019-08-19T08:44:03Z
dc.date.created2019-06-20T11:51:22Z
dc.date.issued2019
dc.identifier.isbn978-1-7281-2731-6
dc.identifier.urihttp://hdl.handle.net/11250/2608932
dc.description.abstractReinforcement learning (RL) is capable of managing wireless, energy-harvesting IoT nodes by solving the problem of autonomous management in non-stationary, resource-constrained settings. We show that the state-of-the-art policy-gradient approaches to RL are appropriate for the IoT domain and that they outperform previous approaches. Due to the ability to model continuous observation and action spaces, as well as improved function approximation capability, the new approaches are able to solve harder problems, permitting reward functions that are better aligned with the actual application goals. We show such a reward function and use policy-gradient approaches to learn capable policies, leading to behavior more appropriate for IoT nodes with less manual design effort, increasing the level of autonomy in IoT.nb_NO
dc.language.isoengnb_NO
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)nb_NO
dc.relation.ispartof2019 IEEE 13th International Conference on Self-Adaptive and Self-Organizing Systems (SASO)
dc.titleAutonomous Management of Energy-Harvesting IoT Nodes Using Deep Reinforcement Learningnb_NO
dc.typeChapternb_NO
dc.description.versionacceptedVersionnb_NO
dc.source.pagenumber43-51nb_NO
dc.identifier.doi10.1109/SASO.2019.00015
dc.identifier.cristin1706390
dc.description.localcode© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.nb_NO
cristin.unitcode194,63,30,0
cristin.unitcode194,63,10,0
cristin.unitnameInstitutt for informasjonssikkerhet og kommunikasjonsteknologi
cristin.unitnameInstitutt for datateknologi og informatikk
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record