Indian Flag
Government Of India
A-
A
A+

shuka-v1

Multilingual audio to text model

About Model

Shuka v1 is an innovative audio understanding model for Indic languages, combining Saaras v1 encoder and Meta's Llama3-8B-Instruct as the decoder. Trained on less than 100 hours of data, it outperforms larger models in audio-based question-answering tasks and supports fine-tuning for customized use cases. Shuka v1 is available open-source, marking the start of advancements in audio language models for Indic languages.

shuka-v1

Metadata Metadata

CC0 10 Public Domain

sarvamai

Audio-to-text

Transformers

Open

Sarvam AI

Science, Technology and Research

24/02/25 07:45:11

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 9
  • Views 131
  • File Size 0

Tags Tags

  • audio-llms

License Control License Control

CC0 10 Public Domain

More Models from Sarvam AI More Models from Sarvam AI

sarvam-1
India's first indic model, pretrained on 4 trillion tokens
  • See Upvoters0
  • Downloads20
  • File Size0
  • Views464
Updated 1 year(s) ago

SARVAM AI

shuka-v1
Multilingual audio to text model
audio-llms
  • See Upvoters0
  • Downloads9
  • File Size0
  • Views132
Updated 1 year(s) ago

SARVAM AI