Home/Models/shuka-v1

shuka-v1

Multilingual audio to text model

Sarvam AI
Aashay

About Model

Shuka v1 is an innovative audio understanding model for Indic languages, combining Saaras v1 encoder and Meta's Llama3-8B-Instruct as the decoder. Trained on less than 100 hours of data, it outperforms larger models in audio-based question-answering tasks and supports fine-tuning for customized use cases. Shuka v1 is available open-source, marking the start of advancements in audio language models for Indic languages.

shuka-v1

Metadata

License

CC0 10 Public Domain

Hosted By

sarvamai

Model Type

Audio-to-text

Model Format

Transformers

Visibility

Open

Source organisation

Sector

Science, Technology and Research

Updated Date & Time

24/02/25 07:45:11

Created By

Aashay Sachdeva

Size

0

Activity Overview

0
9
131
0

Tags

audio-llms

License Control

CC0 10 Public Domain

More Models from Sarvam AI

sarvam-1

India's first indic model, pretrained on 4 trillion tokens

0
20
0
464

Updated 1 year(s) ago

SARVAM AI

shuka-v1

Multilingual audio to text model

audio-llms

0
9
0
132

Updated 1 year(s) ago

SARVAM AI

© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.