Estimation of Distribution Algorithms
2022-6-5
Population-Based Incremental Learning
₪ Population-Based Incremental Learning (PBIL; Baluja, 1994)
₪ P ← initialize probability vector (each position = 0.5)
₪ while (generations++ < limit)
■ for each vector i do
■   for each position j do
■     generate Vi(j) according to P(j)
■   end-do
■   evaluate f(Vi)
■ end-do
■ Vmax = the vector Vi with the highest f(Vi)
■ update P according to Vmax
■ if random(0,1] < Pmutate
■   mutate P
■ end-if
₪ end-while
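The loop above can be sketched in Python as follows. This is a minimal illustration, not a reference implementation. The slide does not say how P is updated or mutated, so the sketch assumes the usual convex-combination update toward Vmax with a learning rate `learn_rate`, and a mutation that nudges every position of P toward a random bit by `mutate_shift`. Those two parameters, the default values, and the `onemax` fitness function are assumptions introduced here for illustration.

```python
import random


def onemax(bits):
    """Hypothetical fitness function: number of 1-bits in the vector."""
    return sum(bits)


def pbil(fitness, n_bits, pop_size=50, generations=100,
         learn_rate=0.1, p_mutate=0.02, mutate_shift=0.05, seed=0):
    """Sketch of PBIL maximizing `fitness` over bit vectors of length n_bits."""
    rng = random.Random(seed)
    p = [0.5] * n_bits                      # P: each position starts at 0.5
    best, best_fit = None, float("-inf")

    for _ in range(generations):
        # Generate each Vi(j) according to P(j).
        population = [
            [1 if rng.random() < p[j] else 0 for j in range(n_bits)]
            for _ in range(pop_size)
        ]

        # Vmax is the sampled vector with the highest fitness.
        v_max = max(population, key=fitness)
        f_max = fitness(v_max)
        if f_max > best_fit:
            best, best_fit = v_max, f_max

        # Assumed update rule: move P a fraction learn_rate toward Vmax.
        p = [(1.0 - learn_rate) * p[j] + learn_rate * v_max[j]
             for j in range(n_bits)]

        # Assumed mutation: with probability p_mutate, shift every
        # position of P slightly toward an independently drawn random bit.
        if rng.random() < p_mutate:
            p = [(1.0 - mutate_shift) * pj + mutate_shift * rng.randint(0, 1)
                 for pj in p]

    return p, best, best_fit
```

Calling `pbil(onemax, n_bits=20)` returns the final probability vector together with the best vector sampled and its fitness. Because both the update and the mutation are convex combinations of values in [0, 1], every entry of P stays a valid probability.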