ProLLM Benchmarks

Q&A Assistant

DescriptionEvaluates an LLM's effectiveness as a team member in a business environment by assessing its ability to provide accurate and contextually relevant responses. It utilizes diverse queries covering both technical (such as coding) and non-technical areas.Number of Samples275LanguageEnglishProviderToqanEvaluation MethodAuto-evaluation with GPT4-Turbo over ground-truth.Data Collection PeriodFebruary 2022 - October 2023

Tags

Tags that describe the type of question asked.

Complexity

The complexity level of the questions.

Last updated: November 14, 2024

Share this view

#	Model	Provider	Size	Acceptance
No results.

Q&A Assistant

View Examples