Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
[pdf] [project website] Preprint (Under Review)
📰This work was exclusively reported by New York Times and many other social medias!
, 2023Redirecting to our project website…