Open to Collab

Shyam Sunder Kumar

theainerd

shyam_sunder_kr
theainerd
beingprofess

AI & ML interests

Natural Language Processing

Recent Activity

new activity about 7 hours ago

pollen-robotics/reachy_mini_conversation_app:Plans for an official local LLM-backed version?

liked a Space about 7 hours ago

pollen-robotics/reachy_mini_conversation_app

liked a dataset 2 days ago

NodeLinker/deepseek-ai-Thinking-with-Visual-Primitives-deleted-repo

theainerd's collections (4)

CyberSecEvalTest

meta-llama/Llama-Guard-3-8B

meta-llama/Prompt-Guard-86M

protectai/deberta-v3-base-prompt-injection-v2

Agent Laboratory: Using LLM Agents as Research Assistants

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Predict Memory

Model Memory Utility

Transformers Timeline

The Smol Training Playbook

Training Large Language Models to Reason in a Continuous Latent Space

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Evolving Deeper LLM Thinking

Kimi k1.5: Scaling Reinforcement Learning with LLMs
