A few expert queries suffices for sample-efficient rl with resets and linear value approximation

Philip Amortila, Nan Jiang, Dhruv Madeka, Dean P Foster

November 2022

Type

Journal article

Publication

NeurIPS 2022