Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
- Updated a collection 4 days ago: Safety & Security
- Liked a model 4 days ago: google/gemma-scope-2
- Upvoted a collection 11 days ago: VibeVoice
Organizations
Agents
- Agent Laboratory: Using LLM Agents as Research Assistants
  Paper • 2501.04227 • Published • 95
- Search-o1: Agentic Search-Enhanced Large Reasoning Models
  Paper • 2501.05366 • Published • 102
- Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
  Paper • 2501.11425 • Published • 109
- Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
  Paper • 2501.10893 • Published • 26
Large Language Models Utils
Utils useful for LLM
- Predict Memory (Running • 100)
  🧮 Calculate memory usage for model configurations
- Model Memory Utility (Running on CPU Upgrade • Featured • 992)
  🚀 Calculate vRAM needed for model training and inference
- Transformers Timeline (Running • 64)
  🤗 Interactive timeline to explore the 🤗 Transformers models
- The Smol Training Playbook (Running on CPU Upgrade • Featured • 2.66k)
  📚 The secrets to building world-class LLMs
Reasoning
- Training Large Language Models to Reason in a Continuous Latent Space
  Paper • 2412.06769 • Published • 91
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
  Paper • 2408.03314 • Published • 63
- Evolving Deeper LLM Thinking
  Paper • 2501.09891 • Published • 115
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
  Paper • 2501.12599 • Published • 127
Safety & Security
- CyberSecEvalTest (Running • 71)
  📈 Evaluate LLMs' cybersecurity risks and capabilities
- meta-llama/Llama-Guard-3-8B
  Text Generation • 8B • Updated • 53.7k • 250
- meta-llama/Prompt-Guard-86M
  Text Classification • 0.3B • Updated • 52.1k • 288
- protectai/deberta-v3-base-prompt-injection-v2
  Text Classification • 0.2B • Updated • 196k • 82