Posts

Comments

Comment by Taywon Min (taywon-min) on An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation · 2025-02-24T11:30:44.077Z · LW · GW

Thanks for the great work. I think that multimodal sparse auto encoders is a promising direction. Do you think it is possible / worthwhile to train SAEs on vla models like OpenVLA? I haven't seen any related work training or interpreting action models using SAE work, and am curious of your thoughts.