get_action and deicde_action #21

jiadd · 2018-11-05T08:44:30Z

您好，
请教一下rlmodel.py中的 get_action 和 deicde_action 有什么区别与联系呢，用这两个函数的作用分别是什么呢？谢谢！

xuyanfu · 2018-11-06T03:26:13Z

get_action是根据概率采样。deicde_action是贪心的选择概率较大的。

jiadd · 2018-11-13T06:33:01Z

明白了非常感谢！
还有一个问题想请教，您的代码rlmodel.py中entity_embedding = tf.get_variable(name = 'entity_embedding',initializer=entity_ebd,trainable=False)，如果我想让entity_embedding参与训练即trainable=True，该怎么修改代码呢？现在计算梯度时报如下错误：
gradBuffer[index] += grad（rlmodel.py 282行）
ValueError: could not broadcast input array from shape (2,50) into shape (2)
谢谢！

xuyanfu · 2018-11-13T12:07:35Z

不好意思，最近已经很久没有看过这个项目的代码了，抱歉帮不上什么忙。

zxs1995 · 2019-10-02T05:52:49Z

您好，请问下您为什么要有两种action呢，麻烦解答了~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

get_action and deicde_action #21

get_action and deicde_action #21

jiadd commented Nov 5, 2018

xuyanfu commented Nov 6, 2018

jiadd commented Nov 13, 2018

xuyanfu commented Nov 13, 2018

zxs1995 commented Oct 2, 2019 •

edited

Loading

get_action and deicde_action #21

get_action and deicde_action #21

Comments

jiadd commented Nov 5, 2018

xuyanfu commented Nov 6, 2018

jiadd commented Nov 13, 2018

xuyanfu commented Nov 13, 2018

zxs1995 commented Oct 2, 2019 • edited Loading

zxs1995 commented Oct 2, 2019 •

edited

Loading