publications

publications by categories in reversed chronological order.
* = equal contribution.

2026

  1. better_logo_measuring.png
    Measuring General Intelligence with Generated Games
    Vivek Verma, David Huang, William Chen, and 2 more authors
    In Under submission at ICLR, 2026

2025

  1. NAACL
    iris_thumbnail.png
    Stronger Universal and Transfer Attacks by Suppressing Refusals
    David Huang, Avidan Shah, Alexandre Araujo, and 2 more authors
    In Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
    Also appeared in Neurips Safe Generative AI Workshop 2024.
  2. ICRA
    robo_thumbnail.png
    Robo-DM: Data Management For Large Robot Datasets
    Kaiyuan Chen, Letian Fu, David Huang*, and 8 more authors
    In ICRA 2025: IEEE International Conference on Robotics and Automation, Apr 2025
  3. ICML
    DOOR_diagram.png
    Improving LLM Safety Alignment with Dual-Objective Optimization
    Xuandong Zhao*, Will Cai*, Tianneng Shi, and 4 more authors
    In Forty-second International Conference on Machine Learning, May 2025

2024

  1. ICLR
    pubdef_thumbnail.png
    PubDef: Defending against Transfer Attacks from Public Models
    Chawin Sitawarin, Jaewon Chang*, David Huang*, and 2 more authors
    In The Twelfth International Conference on Learning Representations, Jan 2024