Hao’s Homepage

Hi, I am 孙豪 (Hao Sun), an NLP researcher at miHoYo. I received my master’s degree from the Department of Computer Science and Technology, Tsinghua University, where I was affiliated with the CoAI group and fortunate to be supervised by Assoc. Prof. Minlie Huang. Before Tsinghua University, I received my bachelor’s degree from Shanghai Jiao Tong University in 2020.

My research interests include

Large-scale LLM training (Data collection, scalable model architecture, training algorithms, etc.);
Scaling laws and training dynamics of LLMs;
Safety and social implications of LLMs (mainly during my master’s studies).

Publications

A partial list of my papers is available on my Google Scholar profile.

* means equal contribution

Hao Sun, Zhexin Zhang, Fei Mi, Yasheng Wang, Wei Liu, Jianwei Cui, Bin Wang, Qun Liu, Minlie Huang. MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Constructing Moral Discussions. In the Association for Computational Linguistics: ACL 2023.
Hao Sun*, Guangxuan Xu*, Jiawen Deng, Jiale Cheng, Chujie Zheng, Hao Zhou, Nanyun Peng, Xiaoyan Zhu, and Minlie Huang. 2022. On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark. In Findings of the Association for Computational Linguistics: ACL 2022, pages 3906–3923, Dublin, Ireland. Association for Computational Linguistics.
Hao Sun*, Zhenru Lin*, Chujie Zheng, Siyang Liu, and Minlie Huang. 2021. PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health Support. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 1489–1503, Online. Association for Computational Linguistics.
Jiale Cheng, Sahand Sabour, Hao Sun, Zhuang Chen, Minlie Huang. PAL: Persona-Augmented Emotional Support Conversation Generation. In Findings of the Association for Computational Linguistics: ACL 2023.
Yuxian Gu*, Jiaxin Wen*, Hao Sun*, Yi Song, Pei Ke, Chujie Zheng, Zheng Zhang, Jianzhu Yao, Xiaoyan Zhu, Jie Tang, Minlie Huang. EVA2.0: Investigating Open-domain Chinese Dialogue Systems With Large-scale Pre-training. Mach. Intell. Res. (2023).
Jiawen Deng*, Jingyan Zhou*, Hao Sun, Fei Mi, and Minlie Huang. 2022. COLD: A Benchmark for Chinese Offensive Language Detection. In Empirical Methods in Natural Language Processing: EMNLP 2022.
Zhexin Zhang, Jiale Cheng, Hao Sun, Jiawen Deng, Fei Mi, Yasheng Wang, Lifeng Shang, Minlie Huang. 2022. Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation. In Findings of Empirical Methods in Natural Language Processing: EMNLP 2022.

Preprint

Hao Sun, Zhexin Zhang, Jiawen Deng, Jiale Cheng, Minlie Huang. Safety Assessment of Chinese Large Language Models. arXiv preprint arXiv:2304.10436.
Jiawen Deng, Hao Sun*, Zhexin Zhang, Jiale Cheng, Minlie Huang. Recent Advances towards Safe, Responsible, and Moral Dialogue Systems: A Survey. arXiv preprint arXiv:2302.09270.

Social Services

Teaching Assistant for Artificial Neural Network at Tsinghua University, 2021/2022, autumn.
Reviewer for ACL/EMNLP 2022/23/24 and ICLR/NeurIPS 2024.

Awards

Outstanding Graduate of Tsinghua University, 2023.
Outstanding Master Thesis of Tsinghua University, 2023.
Second prize, Autumn Scholarship for Graduate Students of Tsinghua University, 2022.
Second prize, Autumn Scholarship for Graduate Students of Tsinghua University, 2021.
Second prize, Tencent Games Security Technology Competition (Track: Natural Language Processing), 2021.
Excellence award, Tencent Games Security Technology Competition (Track: Machine Learning), 2019.
