- π: I am a
firstsecond-year PhD student at the University of Science and Technology of China (USTC), supervised by Professor Enhong Chen. Currently, I am working as a research intern at Tencent AI Lab, mentored by Zhirui Zhang. Prior to that, I worked as a research intern in the Language Technology Lab at Alibaba DAMO Academy. - π―: I am working in the field of NLP & Data Mining. My research interests and past experience can be summarized as follows:
- Pre-trained Language Models: Foundation Model; Multimodal LLM
- Multilingual and Multimodal Translation: End-to-End Speech Translation; Multilingual Neural Machine Translation
- Data mining in healthcare: ICD coding; Extreme Multi-Label Classification
- Knowledge Graph: Entity Linking
- [2023-10-08] Three papers are accepted by EMNLP'2023, about theoretical understanding of kNN-MT, interactive machine translation and interventional rationalization!
- [2023-07-10] Our MineTrans speech translation system is accepted by IWSLT'2023, which win the championship in the Speech-to-Speech Translation (End-to-End subtrack).
- [2023-01-21] Two papers are accepted by ICLR'2023, about federated machine translation and efficient kNN-MT!
- [2022-10-06] One paper is accepted by EMNLP'2022, about domain adaptation for speech translation!
- [2022-09-13] One paper is accepted by NeurIPS'2022, about disentangled representation learning!
- Xu Huang, Zhirui Zhang, Ruize Gao, Yichao Du, Lemao Liu, Guoping Huang, Shuming Shi, Jiajun Chen, Shujian Huang. IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems. In EMNLP'2023 (Oral, Top3%).
- Ruize Gao, Zhirui Zhang, Yichao Du, Lemao Liu, Rui Wang. Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer. In EMNLP'2023.
- Linan Yue, Qi Liu, Li Wang, Yanqing An, Yichao Du, Zhenya Huang. Interventional Rationalization. In EMNLP'2023.
- Yichao Du, Zhengsheng Guo, Jinchuan Tian, Zhirui Zhang, Xing Wang, Jianwei Yu, Zhaopeng Tu, Tong Xu and Enhong Chen. The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks. In IWSLT'2023.
- Yichao Du, Zhirui Zhang, Bingzhe Wu, Lemao Liu, Tong Xu, and Enhong Chen. Federated Nearest Neighbor Machine Translation. In ICLR'2023.
- Yuhan Dai, Zhirui Zhang, Qiuzhi Liu, Qu Cui, Weihua Li, Yichao Du and Tong Xu. Simple and Scalable Nearest Neighbor Machine Translation. In ICLR'2023.
- Yichao Du, Weizhi Wang, Zhirui Zhang, Boxing Chen, Tong Xu, Jun Xie, and Enhong Chen. Non-Parametric Domain Adaptation for End-to-end Speech Translation. In EMNLP'2022 (Oral, Top3%).
- Linan Yue, Qi Liu, Yichao Du, Yanqing An, Li Wang and Enhong Chen. DARE: Disentanglement-Augmented Rationale Extraction. In NeurIPS'2022 (Spotlight, Top8%).
- Yichao Du, Zhirui Zhang, Weizhi Wang, Boxing Chen, Jun Xie, Tong Xu, and Enhong Chen. Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement. In AAAI'2022.
- Yichao Du, Pengfei Luo, Xudong Hong, Tong Xu, Zhe Zhang, Chao Ren, Yi Zheng, Enhong Chen. Inheritance-guided Hierarchical Assignment for Clinical Automatic Diagnosis. In DASFAA'2021.
- Weizhi Wang, Zhirui Zhang, Yichao Du, Boxing Chen, Jun Xie, Weihua Luo. Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables. In EMNLP'2021.
- Zhe Zhang, Tong Xu, Le Zhang, Yichao Du, Hui Xiong, and Enhong Chen. Knowledge Powered Cooperative Semantic Fusion for Patent Classification. In CICAI'2021. (Best Student Paper Finalist)
- Yichao Du, Tong Xu, Jianhui Ma, Enhong Chen, Yi Zheng, Tongzhu Liu, Guixian Tong. An Automatic ICD Coding Method for Clinical Records Based on Deep Neural Network. In Big Data Research.
- IWSLT 2023: ranked 1st (English->Chinese) in Speech-to-Speech Translation Track (End-to-End subtrack).