yeu-tong-lau

Posts
Comments

Posts

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders 2024-12-11T06:30:37.076Z

Understanding Positional Features in Layer 0 SAEs 2024-07-29T09:36:40.701Z

An adversarial example for Direct Logit Attribution: memory management in gelu-4l 2023-08-30T17:36:59.034Z

Comments