Q&A Assistant
DescriptionEvaluates an LLM's effectiveness as a team member in a business environment by assessing its ability to provide accurate and contextually relevant responses. It utilizes diverse queries covering both technical (such as coding) and non-technical areas.Number of Samples275LanguageEnglishProviderToqanEvaluation MethodAuto-evaluation with GPT4-Turbo over ground-truth.Data Collection PeriodFebruary 2022 - October 2023
Tags
Tags that describe the type of question asked.
Complexity
The complexity level of the questions.
Last updated: November 14, 2024
Share this view
# | Model | Provider | Size | Acceptance |
---|---|---|---|---|
No results. |