Robot Learning Final Project: Adaptive MPQ(λ)
This was my final project for Cornell’s CS 4756 Robot Learning, titled "MPQ(λ) with State Dependent Policy Weighting via Ensemble Network Uncertainty Estimation." The work extends a paper by Bhardwaj et al. Our report is below, and an explanatory video is to the right.