Distributed learning of the global maximum in a two-player stochastic game with identical payoffs

abstract

Little is known about the distributed learning of the global maximum in a stochastic framework when there is no communication between the decision-makers. The case of two decision-makers is considered, and prior knowledge is assumed about the expected rewards. The prior knowledge captures the asymmetries that may be present in the rewards. It is shown that each decision-maker completely unaware of the other converges to the global optimum with arbitrary accuracy over time.

authors

Kumar, Panganamala

published proceedings

IEEE Transactions on Systems Man and Cybernetics

altmetric score

3

author list (cited authors)

Kumar, P., & Young, G.

citation count

3

publication date

November 1985

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

keywords

38 Economics
3803 Economic Theory
46 Information And Computing Sciences

Digital Object Identifier (DOI)

10.1109/tsmc.1985.6313458

start page

743

end page

753

volume

SMC-15

issue

6

URL

http://dx.doi.org/10.1109/tsmc.1985.6313458

Distributed learning of the global maximum in a two-player stochastic game with identical payoffs Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL