基于强化学习的鱼群自组织行为模拟

doi:10.11887/j.cn.202001027

首页 > 过刊浏览>2020年第42卷第1期 >194-202. DOI:10.11887/j.cn.202001027

基于强化学习的鱼群自组织行为模拟
DOI:
                        10.11887/j.cn.202001027
                    
作者:
                        
                        
                    
作者单位:(1. 大连海洋大学 水产与生命学院, 辽宁 大连 116023;2. 军事科学院, 北京 海淀 100071)
作者简介:杨慧慧(1999—),女,湖南长沙人,本科生,E-mail:yhhhygge@163.com； 黄万荣(通信作者),男,助理研究员,博士,E-mail:huangwr1990@163.com
通讯作者:
中图分类号:TP305
基金项目:

Simulation on self-organization behaviors of fish school based on reinforcement learning

Author:

Affiliation:

(1. College of Fisheries and Life Science, Dalian Ocean University, Dalian 116023, China;2. Academy of Military Sciences, Beijing 100071, China)

摘要

图/表

访问统计

参考文献

相似文献

引证文献()

资源附件

文章评论

摘要:

自组织行为广泛存在于自然界中。为了通过学习的方式模拟鱼群自组织行为,构建了鱼群模拟环境模型、智能体模型和奖励机制,并提出了一种基于赫布迹和行动者-评价者框架的多智能体强化学习方法。该方法利用赫布迹加强游动策略的学习记忆能力,基于同构思想实现了多智能体的分布式学习。仿真结果表明,该方法能够适用于领航跟随、自主漫游、群体导航等场景中鱼群自组织行为学习,并且基于学习方法模拟的鱼群展现的行为特性与基于博德规则计算模拟的鱼群行为类似。

Abstract:

Self-organizing behaviors are widespread in nature. In order to simulate self-organizing behaviors of the fish school through learning, the fish school simulation environment model, the agent model and the reward mechanism were built, and a multi-agent reinforcement learning approach based on Hebbian trace and actor-critic framework was proposed as well. This approach uses Hebbian trace to enhance the swimming strategy learning with memory ability and realizes the distributed learning of multi-agent based on the homogeneous hypothesis. The simulation results show that the proposed approach can be applied to self-organizing behaviors learning of the fish school in the scenarios of leader-follower, autonomous wandering and navigation. Moreover, the characteristics of the fish school based on learning methods is similar to that based on Boids rules.

参考文献

相似文献

引证文献

引用本文

杨慧慧,黄万荣,敖富江.基于强化学习的鱼群自组织行为模拟[J].国防科技大学学报,2020,42(1):194-202.
YANG Huihui, HUANG Wanrong, AO Fujiang. Simulation on self-organization behaviors of fish school based on reinforcement learning[J]. Journal of National University of Defense Technology,2020,42(1):194-202.

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2019-02-15
最后修改日期:
录用日期:
在线发布日期: 2020-01-19
出版日期: 2020-02-28

首页

期刊介绍

投稿指南

编委会

出版声明

开放获取声明

联系我们

期刊订阅

Rss

AI检索

English

引用本文

分享

文章指标

历史

文章二维码