arxiv:2306.03460
Apurva Gandhi
apurvaga
AI & ML interests
Agents, LLMs, Reinforcement Learning
Recent Activity
published a model about 1 month ago
apurvaga/code-search-qwen-4b-distilled-from-14b-str-output
updated a model about 1 month ago
apurvaga/code-search-qwen-4b-distilled-from-14b-str-output
upvoted a paper 4 months ago
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents