Recent Research
I. Safety Alignment
- A2RM: Adversarial-Augmented Reward Model
II. Advanced Evaluation
- GhostEI-Bench: Do Mobile Agent Withstand Environmental Injection in Dynamic On-Device Environments?
III. Attack and Defense
- StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data
