VQ-VAE Hyperparameter Optimization for Bimanual Policies
Troubleshooting and optimizing bimanual robotic policies on the YOR (Cone-E) hardware using AR Kit visual-inertial localization.
VQ-VAE Hyperparameter Optimization for Bimanual Policies
Overview
Troubleshooting and optimizing bimanual robotic policies on the YOR (Cone-E) hardware using AR Kit visual-inertial localization.
Project Overview
Conducted at the GRAIL Lab, this project involved troubleshooting and optimizing bimanual robotic policies on the YOR (Cone-E) hardware. Trajectories were recorded using multi-iPhone visual-inertial localization via Appleās AR Kit. To validate these raw trajectory replays, I created a custom MuJoCo simulation environment using available MJCF files.
Technical Details
During training, I ran comprehensive sweeps on the Vector Quantized Variational Autoencoder (VQ-VAE) hyperparameters within the VQ-BET behavioral cloning policy to isolate encoding bottlenecks. By analyzing variables like codebook size, sequential versus parallel codebooks, and loss weights, I identified that the primary degradation stemmed from AR Toolkit drift. To counter this, I proposed and implemented a targeted normalization strategy for the VQ-VAE input layers.
Bimanual policy execution