About Me

Ziqi Huang | 黄子棋

Hi there! I am Ziqi Huang, a fourth-year undergraduate student at Tongji University, majoring in Information Security, Guohao Academic. I am a M.S. student Candidate at the Intelligent Information Processing Lab, Department of Computer Science and Technology, Tongji University (Shanghai, China), under the supervision of Prof. Zhihua Wei.

My research interests lie at the intersection of AI safety, AI Deception, Interpretability. Specifically, I’m passionate about:

SAE and other interpretability methods
Reinforcement Learning & Alignment
I’m not very good at CV and large multimodal models. :(

Email: 2253726 [at] tongji.edu.cn / hzq1915851440 [at] gmail.com

Github / Twitter / Wechat

News

Publication

indicates equal contribution

Toward Safer Large Language Models via Internal Mechanism Analysis

Ziqi Huang, Author B, Author C

xxxx 2026

arXiv / Code / Project

A brief summary: we study interpretable internal representations and propose a practical safety-alignment method for reducing deceptive behavior in LLMs.

𝓜𝓲𝓼𝓬𝓮𝓵𝓵𝓪𝓷𝓮𝓸𝓾𝓼

⚽ 𝓢𝓹𝓸𝓻𝓽𝓼

Badminton, soccer, and basketball
Big fan of Stephen Curry and the Golden State Warriors

🎵 𝓐𝓻𝓽𝓼 & 𝓒𝓻𝓮𝓪𝓽𝓲𝓿𝓮 𝓘𝓷𝓽𝓮𝓻𝓮𝓼𝓽𝓼

Playing the Ukulele and Violin
Photography, Singing, Cooking

wardell-H