Viewing a single comment thread. View all comments

Better_Ad4061 t1_j7xyb2r wrote

I'm trying to make a decision transformer, but I can't quite figure out how to prompt it. I trained it on a chess dataset of (state, reward, move) but I don't know how to "prompt" it with the reward I would like.

1

visarga t1_j7yc08k wrote

You prompt it by reward. Let's say your top reward is 1.

you predict model(past history, state, 1) -> move

1