Is Multilingual LLM Watermarking Truly Multilingual? A Simple Back-Translation Solution Paper β’ 2510.18019 β’ Published 21 days ago β’ 17
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation Paper β’ 2510.07959 β’ Published Oct 9 β’ 14
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models Paper β’ 2411.00154 β’ Published Oct 31, 2024 β’ 1
Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers Paper β’ 2506.15674 β’ Published Jun 18 β’ 2
Calibrating Large Language Models Using Their Generations Only Paper β’ 2403.05973 β’ Published Mar 9, 2024 β’ 1
LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity Paper β’ 2207.13129 β’ Published Jul 26, 2022
Going Further: Flatness at the Rescue of Early Stopping for Adversarial Example Transferability Paper β’ 2304.02688 β’ Published Apr 5, 2023
ProPILE: Probing Privacy Leakage in Large Language Models Paper β’ 2307.01881 β’ Published Jul 4, 2023 β’ 2
Efficient and Transferable Adversarial Examples from Bayesian Neural Networks Paper β’ 2011.05074 β’ Published Nov 10, 2020
TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification Paper β’ 2402.12991 β’ Published Feb 20, 2024 β’ 1