publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2025

  1. CRISP: Persistent Concept Unlearning via Sparse Autoencoders
    Tomer Ashuach, Dana Arad, Aaron Mueller, and 2 more authors
    arXiv preprint arXiv:2508.13650, 2025

2024

  1. REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
    Tomer Ashuach, Martin Tutek, and Yonatan Belinkov
    In Findings of the Association for Computational Linguistics: ACL 2025, 2024