- 🏫 I’m pursuing a PhD in Computer Science at Nanjing University, supervised by Prof. Shujian Huang.
- 🔬 I’m currently interested in LLM safety (jailbreak attacks and defenses, interpretability, etc.).
- 📚 My blog: https://deep1994.github.io
- 🤝 Contact me: [email protected]
- NJU (Nanjing University)
- Nanjing, China
- https://deep1994.github.io/
Pinned
- NJUNLP/ReNeLLM (Public): The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily".
- An-Attentive-Neural-Model-for-labeling-Adverse-Drug-Reactions (Public): An Attentive Neural Sequence Labeling Model for Adverse Drug Reactions Mentions Extraction
- NJUNLP/Hallu-PI (Public): The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs".
- NJUNLP/SAGE (Public): The official implementation of our ACL 2025 paper "Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement".