Generate multilingual speech from typed text
Detect anime faces and landmarks in an image
Towards Unified Music Emotion Recognition across Dimensional