flashresearch/FlashResearch-DS-33k
Viewer • Updated • 33.1k • 18 • 6
A 4B-parameter Qwen model distilled from Tongyi DeepResearch-30B A3B, optimized for web-scale “deep research” tasks and inference with Alibaba-NLP/DeepResearch.
flashresearch/FlashResearch-DS-33k
flashresearch/FlashResearch-DS-33kThis model is intended to be used directly with the DeepResearch repo.
git clone https://github.com/Alibaba-NLP/DeepResearch
cd DeepResearch
# Create env (example)
python -m venv .venv && source .venv/bin/activate
pip install -e . # or pip install -r requirements.txt if provided
Edit the config to add this model
MODEL_PATH=flashresearch/FlashResearch-4B-Thinking
If you use this model, please cite:
@software{cheapresearch_thinking_2025,
title = {CheapResearch 4B Thinking},
author = {Artem Y.},
year = {2025},
url = {https://huggingface.co/flashresearch/FlashResearch-4B-Thinking}
}
And the dataset:
@dataset{cheapresearch_ds_33k,
title = {CheapResearch-DS-33k},
author = {Artem Y.},
year = {2025},
url = {https://huggingface.co/datasets/flashresearch/FlashResearch-DS-33k}
}
---
language:
- en
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
tags:
- qwen
- deep-research
- browsing
- citation
- reasoning
- distillation
- agent
- vllm
- cheapresearch
datasets:
- flashresearch/FlashResearch-DS-33k
base_model:
- Qwen/Qwen3-4B-Thinking-2507
model-index:
- name: FlashResearch-4B-Thinking
results: []
---