publications

publications by categories in reversed chronological order.
* = equal contribution.

2025

  1. NAACL
    iris_thumbnail.png
    Stronger Universal and Transfer Attacks by Suppressing Refusals
    David Huang, Avidan Shah, Alexandre Araujo, and 2 more authors
    In Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
    Also appeared in Neurips Safe Generative AI Workshop 2024.
  2. ICRA
    robo_thumbnail.png
    Robo-DM: Efficient Robot Big Data Management
    Kaiyuan Chen, Letian Fu, David Huang*, and 8 more authors
    In ICRA 2025: IEEE International Conference on Robotics and Automation, Apr 2025
  3. WIP
    DOOR_diagram.png
    Improving LLM Safety Alignment with Dual-Objective Optimization
    In Review for International Conference on Machine Learning 2025, Jan 2025

2024

  1. ICLR
    pubdef_thumbnail.png
    PubDef: Defending against Transfer Attacks from Public Models
    Chawin Sitawarin, Jaewon Chang*, David Huang*, and 2 more authors
    In The Twelfth International Conference on Learning Representations, Jan 2024