Transcribe live English speech to text
Count people in images with bounding boxes
Transcribe audio to English text with timestamps
Transcribe Yoruba or Naija English audio to text