-
CyberSecEvalTest
📈72Evaluate LLMs' cybersecurity risks and capabilities
-
meta-llama/Llama-Guard-3-8B
Text Generation • 8B • Updated • 73.7k • • 298 -
meta-llama/Prompt-Guard-86M
Text Classification • 0.3B • Updated • 1.31M • • 333 -
protectai/deberta-v3-base-prompt-injection-v2
Text Classification • 0.2B • Updated • 391k • • 107
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
new activity about 7 hours ago
pollen-robotics/reachy_mini_conversation_app:Plans for an official local LLM-backed version? liked a Space about 7 hours ago
pollen-robotics/reachy_mini_conversation_app liked a dataset 2 days ago
NodeLinker/deepseek-ai-Thinking-with-Visual-Primitives-deleted-repoOrganizations
Agents
-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 96 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 104 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 109 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26
Large Language Models Utils
Utils useful for LLM
- RunningAgents109
Predict Memory
🧮109Estimate model memory usage and see detailed plots
- Runtime errorAgentsFeatured1.01k
Model Memory Utility
🚀1.01kCalculate GPU memory needed for training Hugging Face models
- RunningAgents80
Transformers Timeline
🤗80Interactive timeline to explore the 🤗Transformers models
- Running on CPU UpgradeFeatured3.19k
The Smol Training Playbook
📚3.19kThe secrets to building world-class LLMs
Reasoning
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 94 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 67 -
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 115 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper • 2501.12599 • Published • 131
Safety & Security
- Running72
CyberSecEvalTest
📈72Evaluate LLMs' cybersecurity risks and capabilities
-
meta-llama/Llama-Guard-3-8B
Text Generation • 8B • Updated • 73.7k • • 298 -
meta-llama/Prompt-Guard-86M
Text Classification • 0.3B • Updated • 1.31M • • 333 -
protectai/deberta-v3-base-prompt-injection-v2
Text Classification • 0.2B • Updated • 391k • • 107
Large Language Models Utils
Utils useful for LLM
- RunningAgents109
Predict Memory
🧮109Estimate model memory usage and see detailed plots
- Runtime errorAgentsFeatured1.01k
Model Memory Utility
🚀1.01kCalculate GPU memory needed for training Hugging Face models
- RunningAgents80
Transformers Timeline
🤗80Interactive timeline to explore the 🤗Transformers models
- Running on CPU UpgradeFeatured3.19k
The Smol Training Playbook
📚3.19kThe secrets to building world-class LLMs
Agents
-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 96 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 104 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 109 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26
Reasoning
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 94 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 67 -
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 115 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper • 2501.12599 • Published • 131