Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR
ASLP-lab
AI & ML interests
None yet
Recent Activity
liked a model 9 days ago: mcshao/LAT-Audio
updated a dataset 12 days ago: ASLP-lab/HumDial-FDBench
updated a collection 12 days ago: Speaker-Reasoner
Organizations
None yet
spaces 8
Running on Zero
Agents
9
YingMusic-Singer-Plus
Edit lyrics, keep the melody
Runtime error
Agents
12
WenetSpeech Yue
Large-Scale Cantonese Speech Corpus
Runtime error
Agents
1
VoiceSculptor
Running on Zero
Agents
44
DiffRhythm2
Generate a full song from lyrics and style prompts
Running on Zero
Agents
22
SongFormer
State-of-the-art music analysis with multi-scale datasets
Running on Zero
Agents
Featured
687
Di♪♪Rhythm
Blazingly Fast and Embarrassingly Simple Song Generation
models 34
ASLP-lab/Speaker-Reasoner
32B • Updated • 60 • 1
ASLP-lab/Speaker-Reasoner-4194h
32B • Updated • 63
ASLP-lab/YingMusic-Singer-Plus
Updated • 1.56k • 7
ASLP-lab/OmniCodec
Feature Extraction • Updated • 1
ASLP-lab/OSUM-Pangu
Audio-to-Audio • Updated • 2
ASLP-lab/VoiceSculptor-VD
Text-to-Speech • 4B • Updated • 31 • 18
ASLP-lab/WenetSpeech-Wu-Speech-Understanding
Updated
ASLP-lab/WenetSpeech-Wu-Speech-Generation
Text-to-Speech • Updated • 2
ASLP-lab/LLasa-1B-Yue-Update
1B • Updated • 22
ASLP-lab/WSChuan-ASR
Automatic Speech Recognition • Updated • 5
datasets 18
ASLP-lab/HumDial-FDBench
Updated • 147 • 1
ASLP-lab/FastTurn-Testset
Updated • 43
ASLP-lab/WSC-Train
Preview • Updated • 352 • 120
ASLP-lab/LyricEditBench
Viewer • Updated • 7.2k • 229 • 2
ASLP-lab/WenetSpeech-Wu-Bench
Viewer • Updated • 242 • 389 • 4
ASLP-lab/WenetSpeech-Wu
Updated • 37 • 1
ASLP-lab/WenetSpeech-Yue
Updated • 389 • 41
ASLP-lab/WSC-Eval
Viewer • Updated • 1.19k • 10.4k • 7
ASLP-lab/Easy-Turn-Trainset
Viewer • Updated • 1.91k • 516 • 9
ASLP-lab/SongFormBench
Viewer • Updated • 300 • 469 • 2