Some optimal strategies for bandit problems with beta prior distributions

doi:10.1023/A:1004130209258

淡江大學機構典藏 > 理學院 > 應用數學與數據科學學系 > 期刊論文 > Item 987654321/41450

請使用永久網址來引用或連結此文件: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/41450

題名:	Some optimal strategies for bandit problems with beta prior distributions
作者:	林千代;Lin, Chien-tai;Shiau, C. J.
貢獻者:	淡江大學數學學系
關鍵詞:	Bandit problems;sequential experimentation;dynamic allocation of Bernoulli processes;staying-with-a-winner;switching-on-a-loser;k-failure strategy;m-run strategy;non-recalling m-run strategy;N-learning strategy
日期:	2000-06-01
上傳時間:	2010-01-28 07:35:53 (UTC+8)
出版者:	Kluwer Academic Publishers
摘要:	A bandit problem with infinitely many Bernoulli arms is considered. The parameters of Bernoulli arms are independent and identically distributed random variables from a common distribution with beta(a, b). We investigate the k-failure strategy which is a modification of Robbins's stay-with-a-winner/switch-on-a-loser strategy and three other strategies proposed recently by Berry et al. (1997, Ann. Statist., 25, 2103–2116). We show that the k-failure strategy performs poorly when b is greater than 1, and the best strategy among the k-failure strategies is the 1-failure strategy when b is less than or equal to 1. Utilizing the formulas derived by Berry et al. (1997), we obtain the asymptotic expected failure rates of these three strategies for beta prior distributions. Numerical estimations and simulations for a variety of beta prior distributions are presented to illustrate the performances of these strategies. Bandit problemssequential experimentationdynamic allocation of Bernoulli processesstaying-with-a-winnerswitching-on-a-loserk-failure strategym-run strategynon-recalling m-run strategyN-learning strategy
關聯:	Annals of the Institute of Statistical Mathematics 52(2), pp.397-405
DOI:	10.1023/A:1004130209258
顯示於類別:	[應用數學與數據科學學系] 期刊論文

文件中的檔案:

檔案	大小	格式	瀏覽次數
index.html	0Kb	HTML	196	檢視/開啟
index.html	0Kb	HTML	202	檢視/開啟
Some optimal strategies for bandit problems with beta prior distributions.pdf	536Kb	Adobe PDF	1	檢視/開啟

在機構典藏中所有的資料項目都受到原著作權保護.

TAIR相關文章

資料載入中.....