多主体强化学习协作策略研究
多主体强化学习协作策略研究封面图

多主体强化学习协作策略研究

孙若莹, 赵刚, 著

出版社:清华大学出版社

年代:2014

定价:32.0

书籍简介:

本书在系统介绍多主体、强化学习及多主体协作的基本内容的基础上,阐述了有关多主体强化学习、协作策略研究的发展过程及最新动向。深入探讨了多主体强化学习的理论与方法、多主体的协作策略,为多主体的强化学习与协作策略研究方向提供新的理论和方法,为其在相关研究领域的应用提供新的支撑和手段。

书籍目录:

Chapter 1 Introduction

1.1 Reinforcement Learning

1.1.1 Generality of Reinforcement Learning

1.1.2 Reinforcement Learning on Markov Decision Processes

1.1.3 Integrating Reinforcement Learning into Agent Architecture

1.2 Multiagent Reinforcement Learning

1.2.1 Multiagent Systems

1.2.2 Reinforcement Learning in Multiagent Systems

1.2.3 Learning and Coordination in Multiagent Systems

1.3 Ant System for Stochastic Combinatorial Optimization

1.3.1 Ants Forage Behavior

1.3.2 Ant Colony Optimization

1.3.3 MAX-MIN Ant System

1.4 Motivations and Consequences

1.5 Book Summary

Bibliography

Chapter 2 Reinforcement Learning and Its Combination with Ant Colony System

2.1 Introduction

2.2 Investigation into Reinforcement Learning and Swarm Intelligence

2.2.1 Temporal Differences Learning Method

2.2.2 Active Exploration and Experience Replay in Reinforcement Learning

2.2.3 Ant Colony System for Traveling Salesman Problem

2.3 The Q-ACS Multiagent Learning Method

2.3.I The Q-ACS Learning Algorithm

2.3.2 Some Properties of the Q-ACS Learning Method

2.3.3 Relation with Ant-Q Learning Method

2.4 Simulat'ions and Results

2.5 Conclusions

Bibliography

Chapter 3 Multiagent Learning Methods Based on Indirect Media Information Sharing

3.1 Introduction

3.2 The Multiagent Learning Method Considering Statistics Features

3.2.I Accelerated K-certainty Exploration

3.2.2 The T-ACS Learning Algorithm

3.3 The Heterogeneous Agents Learning

3.3.1 The D-ACS Learning Algorithm

3.3.2 Some Discussions about the D-ACS Learning Algorithm

3.4 Comparisons with Related State-of-the-arts

3.5 Simulations and Results

3.5.1 Experimental Results on Hunter Game

3.5.2 Experimental Results on Traveling Salesman Problem

3.6 Conclusions

Bibliography

Chapter 4 Action Conversion Mechanism in Multiagent Reinforcement Learning

4.1 Introduction

4.2 Model-Based Reinforcement Learning

4.2.1 Dyna-Q Architecture

4.2.2 Prioritized Sweeping Method

4.2.3 Minimax Search and Reinforcement Learning

4.2.4 RTP-Q Learning

4.3 The Q-ac Multiagent Reinforcement Learning

4.3.1 Task Model

4.3.2 Converting Action

4.3.3 Multiagent Cooperation Methods

4.3.4 Q-value Update

4.3.5 The Q-ac Learning Algorithm

4.3.6 Using Adversarial Action Instead of s Probability Exploration

……

Chapter 5 Multiagent Learning Approaches Applied to Vehicle Routing Problems

Chapter 6 Multiagent learning Methods Applied to Multicast Routing Problems

Chapter 7 Multiagent Reinforcement Learning for Supply Chain Management

Chapter 8 Multiagent Learning Applied in Supply Chain Ordering Management

内容摘要:

多主体的研究与应用是近年来备受关注的热点领域,多主体强化学习理论与方法、多主体协作策略的研究是该领域重要研究方向,其理论和应用价值极为广泛,备受广大从事计算机应用、人工智能、自动控制、以及经济管理等领域研究者的关注。本书清晰地介绍了多主体、强化学习及多主体协作等基本概念和基础内容,明确地阐述了有关多主体强化学习、协作策略研究的发展过程及最新动向,深入地探讨了多主体强化学习与协作策略的理论与方法,具体地分析了多主体强化学习与协作策略在相关研究领域的应用方法。全书系统脉络清晰、基本概念清楚、图表分析直观,注重内容的体系化和实用性。通过本书的阅读和学习,读者即可掌握多主体强化学习及协作策略的理论和方法,更可了解在实际工作中应用这些研究成果的手段。本书可作为从事计算机应用、人工智能、自动控制、以及经济管理等领域研究者的学习和阅读参考,同时高等院校相关专业研究生以及人工智能爱好者也可从中获得借鉴。

书籍规格:

书籍详细信息
书名多主体强化学习协作策略研究站内查询相似图书
9787302368304
如需购买下载《多主体强化学习协作策略研究》pdf扫描版电子书或查询更多相关信息,请直接复制isbn,搜索即可全网搜索该ISBN
出版地北京出版单位清华大学出版社
版次1版印次1
定价(元)32.0语种英文
尺寸26 × 19装帧平装
页数印数 3000

书籍信息归属:

多主体强化学习协作策略研究是清华大学出版社于2014.出版的中图分类号为 G442 的主题关于 学习方法-研究-英文 的书籍。