Indian Flag
Government Of India
A-
A
A+

Dhwani - Multilingual Speech LLM

Dhwani is India's first end-to-end trained speech Large Language Model (LLM), capable of directly understanding speech without a separate ASR (Automatic Speech Recognition) model, avoiding cascading ASR errors. It supports speech-to-text translation across multiple Indic languages and English.

About Model

Dhwani is an end-to-end trained speech LLM designed for Indic speech-to-text and multilingual speech translation. Developed by Krutrim AI Labs, Dhwani is powered by Krutrim-1 LLM, enabling direct speech understanding without the need for ASR models. It features a dual encoder structure, utilizing Whisper's speech encoder for processing speech inputs and BEATs audio encoder for non-speech audio signals. The model employs a Window-Level Query Transformer (Q-Former) as a bridge between audio and text processing. Using Low-Rank Adaptation (LoRA) fine-tuning, Dhwani aligns audio-derived inputs with textual output, ensuring accurate speech recognition and translation. It supports English, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi, Tamil, and Telugu and excels in use cases like multilingual communication, media translation, education, healthcare, customer support, business, and legal applications. Evaluation results show high BLEU scores for English-to-Indic and Indic-to-English translations, demonstrating its efficiency in real-world scenarios.

Dhwani - Multilingual Speech LLM

Metadata Metadata

Krutrim Community License Agreement Version 1.0

Ola Krutrim

Automatic Speech Recognition

N.A.

Open

Ola Krutrim

Sector Agnostic

28/02/25 07:00:47

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 7
  • Views 118
  • File Size 0

Tags Tags

  • Translation
  • Indic Languages
  • Multilingual AI
  • Deep Learning
  • Speech-to-Text
  • conversational-AI
  • Krutrim AI
  • BharatBench
  • speech LLM
  • ASR-free speech recognition

License Control License Control

Krutrim Community License Agreement Version 1.0

More Models from Ola Krutrim More Models from Ola Krutrim

Chitrarth - Multilingual Vision-Language Model
Chitrarth is a multilingual vision-language model (VLM) integrating a Large Language Model (LLM) with a vision module. It is trained on multilingual image-text data and supports 10 Indic languages along with English.
NLP
computer vision
multimodal AI
image-text AI
vision-language model
BharatBench
generative AI
Krutrim AI
Deep Learning
Indic Languages
  • See Upvoters0
  • Downloads9
  • File Size0
  • Views134
Updated 10 month(s) ago

OLA KRUTRIM

Krutrim Translate - Indic Language Translation Model
Krutrim Translate is a multilingual machine translation model optimized for Indic languages, supporting English-to-Indic and Indic-to-English translations. It extends IndicTrans2 with a longer context length (4096 tokens) and leverages the Bharat Parallel Corpus Collection (BPCC) for training.
BharatBench
Krutrim AI
text-to-text translation
Deep Learning
Multilingual AI
IndicTrans2
Indic Languages
NLP
Machine Translation
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views94
Updated 10 month(s) ago

OLA KRUTRIM

Krutrim-1 Instruct Large Language Model
Krutrim-1 is a 7.3B parameter multilingual foundation model trained on a 2 trillion token dataset, designed for Indian linguistic and demographic needs. It supports 11 Indic languages and matches or exceeds comparable state-of-the-art models in multilingual tasks.
multilingual NLP
LLAMA-2 alternative
generative AI
Krutrim AI
AI Research
Text Generation
Large Language Model
Deep Learning
Indic Languages
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views63
Updated 10 month(s) ago

OLA KRUTRIM

Krutrim-2 Instruct
Krutrim-2 is a 12B parameter multilingual large language model built on the Mistral-NeMo 12B architecture, optimized for Indic languages and Indian cultural context. It supports long-form conversations, reasoning, coding, and translation tasks.
BharatBench
coding AI
multilingual NLP
generative AI
Krutrim AI
AI Research
Text Generation
Large Language Model
Deep Learning
Indic Languages
  • See Upvoters0
  • Downloads2
  • File Size0
  • Views64
Updated 10 month(s) ago

OLA KRUTRIM

Vyakyarth - Multilingual Sentence Embedding Model
Vyakyarth is a sentence-transformers-based model fine-tuned for Indic languages, capable of mapping text to a 768-dimensional dense vector space for semantic search, similarity, classification, and clustering tasks.
NLP
XLM-RoBERTa
sentence embedding
text similarity
semantic search
paraphrase mining
Krutrim AI
Deep Learning
Multilingual AI
Indic Languages
  • See Upvoters0
  • Downloads8
  • File Size0
  • Views98
Updated 10 month(s) ago

OLA KRUTRIM

Dhwani - Multilingual Speech LLM
Dhwani is India's first end-to-end trained speech Large Language Model (LLM), capable of directly understanding speech without a separate ASR (Automatic Speech Recognition) model, avoiding cascading ASR errors. It supports speech-to-text translation across multiple Indic languages and English.
speech LLM
BharatBench
Krutrim AI
conversational-AI
Speech-to-Text
Deep Learning
Multilingual AI
Indic Languages
Translation
ASR-free speech recognition
  • See Upvoters0
  • Downloads7
  • File Size0
  • Views118
Updated 10 month(s) ago

OLA KRUTRIM