Fig. 1 (IMAGE)
Caption
Comparison between clustering-based bonus rewards with novelty alone (η = 1.0) and clustering-based bonus rewards (η = 0.5). Here, the collected states (blue dots) are clustered into 5 clusters and the agent is rewarded with 1 in the orange area and receives no reward in other areas.
Credit
Xiao MA, Shen-Yi ZHAO, Zhao-Heng YIN, Wu-Jun LI
Usage Restrictions
none
License
Original content