Summarization
Description: Evaluates an LLM's ability to accurately summarize long texts from diverse sources such as YouTube video transcripts, websites, PDFs, and direct text inputs. It also assesses the model's capacity to follow detailed user instructions to extract specific data insights.
Number of Samples: 41
Language: English
Provider: Toqan
Evaluation Method: Auto-evaluation with GPT4 - Turbo
Data Collection Period: February 2022 - October 2023
Last updated: May 22, 2024
Share this view
# | Model | Provider | Size | Chunk Size | Adherence To Instructions | Accuracy of Content | Quality of Writing |
---|---|---|---|---|---|---|---|
No results. |