Skip to content
View Deep1994's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Deep1994

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Deep1994/README.md

Pinned Loading

  1. NJUNLP/ReNeLLM NJUNLP/ReNeLLM Public

    The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily".

    Python 150 16

  2. NJUNLP/Hallu-PI NJUNLP/Hallu-PI Public

    The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs".

    11

  3. NJUNLP/SAGE NJUNLP/SAGE Public

    The official implementation of our ACL 2025 paper "Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement".

    Python 8 1

  4. NJUNLP/SDGO NJUNLP/SDGO Public

    The code and datasets of our EMNLP 2025 paper "SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models".

    Jupyter Notebook 7

  5. NJUNLP/ISA NJUNLP/ISA Public

    The code and datasets of our paper "Friend or Foe: How LLMs' Safety Mind Gets Fooled by Intent Shift Attack".

    Python 3 1