publications

I don’t find the typical CV dump of bare publication references particularly helpful for actually getting to know someone. Instead, I’ve provided tl;drs and topic identifiers for a selected portion of my work below, which should give you a more self-contained overview of me.

For a full publication list, please refer to my Google Scholar. Be warned that a large share of my citations comes from surveys I didn’t lead.

Selected Publications

2025

  1. Under Review
    Sweeping Promptable Spoofs under the DirtyRAG: A Practical, Query-Blind RAG Attack Done Right
    May 2025
  2. EMNLP 2025 Main Oral
    Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly
    May 2025
  3. EMNLP 2025 Findings
    LoRATK: LoRA Once, Backdoor Everywhere in the Share-and-Play Ecosystem
    May 2025
  4. NeurIPS 2025
    70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float
    May 2025

2024

  1. ICLR 2025 Spotlight
    MQuAKE-Remastered: Multi-Hop Knowledge Editing Can Only Be Advanced with Reliable Evaluations
    Oct 2024
  2. EMNLP 2024 Findings
    KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
    Jun 2024
  3. ICML 2024
    KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
    Feb 2024
  4. EMNLP 2024 Main
    Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
    Jun 2024
  5. ICML 2024
    GNNs Also Deserve Editing, and They Need It More Than Once
    Feb 2024

2023

  1. NeurIPS 2023
    One Less Reason for Filter Pruning: Gaining Free Adversarial Robustness with Structured Grouped Kernel Pruning
    May 2023

2021

  1. ICLR 2022
    Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions
    Oct 2021