Comments

Comment by Zhu Xiaohu (zhu-xiaohu) on Towards understanding-based safety evaluations · 2023-11-27T11:30:34.945Z · LW · GW

I have a pretty fundamental concern with these sorts of techniques as a mechanism for eventually assessing alignment

That would lead to a safety or alignment Goodharting problem.

Comment by Zhu Xiaohu (zhu-xiaohu) on My Assessment of the Chinese AI Safety Community · 2023-04-28T07:46:09.548Z · LW · GW

Hi. Thanks for mentioning us.

Unlike the main labs and companies in China, we are doing fundamental research on the ontological crisis problem, using model theory from mathematical logic to build a new foundation for analyzing and preventing such crises.

Because we lack funding and have limited intellectual resources, progress is slower, but we will share our work when it is ready.

Comment by Zhu Xiaohu (zhu-xiaohu) on Decision theory and zero-sum game theory, NP and PSPACE · 2022-01-11T09:53:46.185Z · LW · GW

A recent and interesting related work worth mentioning here: "On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games" gives a related analysis of computing Markov perfect equilibria for RL agents.