ProLLM Benchmarks

Transcription

DescriptionEvaluates transcription models on multi-lingual, multi-speaker audio with varying levels of background noise across multiple business domains such as software-development, finance, classifieds, food-delivery, and healthcare. The dataset consists of 150 unique audio samples, with each sample being augmented to generate a low and a high noise version.Number of Samples450LanguageEnglish, Hindi, Portuguese, Polish, Afrikaans and DutchProviderToqan (Synthetically generated audio.)Evaluation MethodAccuracy is reported as 1 - WER (Word Error Rate).Data Collection PeriodAugust 2024

Language

The language of the conversation.

Noise

The level of background noise in the audio sample.

Domain

The business domain of the conversation.

Last updated: August 30, 2024

Share this view

#	Model	Provider	Accuracy
No results.