Open to Collab


Shyam Sunder Kumar

theainerd

shyam_sunder_kr
theainerd
beingprofess

AI & ML interests

Natural Language Processing

Recent Activity

updated a collection 4 days ago

Safety & Security

liked a model 4 days ago

google/gemma-scope-2

upvoted a collection 11 days ago

VibeVoice



theainerd's collections (4)

CyberSecEvalTest

meta-llama/Llama-Guard-3-8B

meta-llama/Prompt-Guard-86M

protectai/deberta-v3-base-prompt-injection-v2

Agent Laboratory: Using LLM Agents as Research Assistants

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Predict Memory

Model Memory Utility

Transformers Timeline

The Smol Training Playbook

Training Large Language Models to Reason in a Continuous Latent Space

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Evolving Deeper LLM Thinking

Kimi k1.5: Scaling Reinforcement Learning with LLMs
