Security-gUided Reasoning and learnIng

university

https://henrygwb.github.io/lab.htm

https://github.com/ucsb-mlsec

AI & ML interests

Code LLMs, AI agents for security, Reasoning

Recent Activity

yuzhounie authored a paper about 2 months ago

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

yuzhounie authored a paper about 2 months ago

Humanity's Last Exam

yuzhounie authored a paper about 2 months ago

OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

View all activity

yuzhounie

authored 4 papers about 2 months ago

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Paper • 2410.11096 • Published Oct 14, 2024 • 13

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 76

OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Paper • 2505.23885 • Published May 29

AgentVigil: Generic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents

Paper • 2505.05849 • Published May 9

tangken333

updated 2 models 7 months ago

UCSB-SURFI/Co-PatcheR-Val-no-assert-14B

15B • Updated May 26 • 5 • 1

UCSB-SURFI/Co-PatcheR-Val-assert-14B

15B • Updated May 26 • 6 • 1

tangken333

updated a collection 7 months ago

Co-PatcheR

Co-PatcheR: Collaborative Software Patching with Component(s)-specific Small Reasoning Models • 3 items • Updated May 29

tangken333

published 2 models 7 months ago

UCSB-SURFI/Co-PatcheR-Val-no-assert-14B

15B • Updated May 26 • 5 • 1

UCSB-SURFI/Co-PatcheR-Val-assert-14B

15B • Updated May 26 • 6 • 1

tangken333

updated a model 7 months ago

UCSB-SURFI/Co-PatcheR-Loc-Gen-14B

15B • Updated May 26 • 4 • 1

tangken333

published a model 7 months ago

UCSB-SURFI/Co-PatcheR-Loc-Gen-14B

15B • Updated May 26 • 4 • 1

March07

authored 5 papers over 1 year ago

PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts

Paper • 2306.04528 • Published Jun 7, 2023 • 3

A Survey on Evaluation of Large Language Models

Paper • 2307.03109 • Published Jul 6, 2023 • 42

Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning

Paper • 2308.02533 • Published Aug 1, 2023

Large Language Models Understand and Can be Enhanced by Emotional Stimuli

Paper • 2307.11760 • Published Jul 14, 2023 • 1

DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks

Paper • 2309.17167 • Published Sep 29, 2023 • 1

March07

authored a paper almost 2 years ago

PromptBench: A Unified Library for Evaluation of Large Language Models

Paper • 2312.07910 • Published Dec 13, 2023 • 18