Everyone's favorite model from OpenAI released in November 2023.
Roumered to be 20B
Supported context length
Price for prompt tokens*
Price for response tokens*
*Note: Data based on 11/14/2023
Here's how GPT-3.5-turbo-1106 performed across all three task types
Digging deeper, here’s a look how GPT-3.5-turbo-1106 performed across specific datasets
|Tasks||Insights||Dataset Name||Dataset Performance|
|QA without RAG||The model performs well which show less bias and good factual knowledge.||Truthful QA|
|QA with RAG||The model excels at this task which demonstrates great reasoning and comprehension skills. It struggles a bit on mathematical skills as it scores relatively low on DROP compared to other dataset.||MS Marco|
|Long form text generation||The model excels at this task which demonstrates great ability to generate long text without factual errors.||Open Assistant|
💰 Cost insights
The model offers a great balance of cost and performance. It is 30x cheaper compared to GPT4 and 2x costlier compared to Llama 70b variant. OpenAI provides generous rate limits for production application.