exhaust all MCs inside your Chatwize account, additional AI usage will be charged against your own API key and you will continue to be able to use AI features. New subscriptions do not come with monthly recurring MCs. However, you still have the option to purchase add-ons for monthly recurring MCs directly from Chatwize.
Setting up BYOK
To set up your own OpenAI API key, you will first need to apply for one directly from OpenAI’s website. Sign up for an account on OpenAI, then go to this link: https://platform.openai.com/account/api-keys. Additional instructions for obtaining an API key can be found here: https://help.openai.com/en/articles/4936850-where-do-i-find-my-openai-api-key. After you obtain your API key, store it in a private and secure location. Chatwize recommends that you use a separate API key for each of your BYOK applications (including Chatwize), and to never share your API key with anyone else. Next, navigate to Chatwize. After logging in, go to the top right corner of the UI. Click the “person” icon the bring up a dropdown menu.

Budgeting for AI usage
In general, using your own API key will be more cost efficient than purchasing MC add-ons directly. To help you estimate costs associated with running your BYOK account, we provide the following references.OpenAI changes its pricing from time to time, so the information we provide may not always be up-to-date.
For latest information on OpenAI’s pricing, please visit https://openai.com/pricing.
- For GPT-3.5 and GPT-3.5-16k, the cost is ~$0.0064 USD / MC
- For GPT-4 series of LLMs, the cost is ~$0.0028 USD / MC
- System prompt and metadata
- User-defined base prompt
- Variables and definitions
- Tag definitions
- Function metadata and descriptions
- Function parameters
- Function response
- Static RAG context
- Semantic conversation memory
- LLM response
- Response metadata
Model | Reserved for Input | Reserved for Output | Cost / Input Token | Cost / Output Token | Total Cost per Query |
---|---|---|---|---|---|
GPT-3.5 | 3200 | 800 | 0.0000015 | 0.000002 | 0.0064 |
GPT-3.5-16k | 15200 | 800 | 0.000003 | 0.000004 | 0.0488 |
GPT-4-1106-1k | 800 | 200 | 0.00001 | 0.00003 | 0.014 |
GPT-4-1106-2k | 1600 | 400 | 0.00001 | 0.00003 | 0.028 |
GPT-4-1106-4k | 3200 | 800 | 0.00001 | 0.00003 | 0.056 |
GPT-4-0125-8k | 7200 | 800 | 0.00001 | 0.00003 | 0.096 |
GPT-4-1106-16k | 15200 | 800 | 0.00001 | 0.00003 | 0.176 |
As of April 2024, Chatwize still uses gpt-3.5-turbo-0613 for GPT-3.5. This is because based on our internal benchmarks, this model
still significantly outperforms its cheaper counterparts. Other variants like gpt-3.5-turbo-1106-4k and gpt-3.5-turbo-0125 suffer from
the “laziness” problem, where generated responses are overly terse and can ignore provided context.
BYOK for white-label commercial partners
In addition to the costs for MC expenditures during LLM queries, you also need to pay to run our AI multi-agent framework using your own API key. This is independent of whether your users have supplied their API key for their personal accounts. Since the official Chatwize subsidizes its users for all costs associated with running the AI framework, your white-label solution must operate with this premise as well. There are three separate workflows that require your own API key to cover your user’s AI expenditures. The conditions under which they apply are listed below.- AI Agent intent generation
- Applicable if two or more user-facing Agents are connected
- Charged whenever a new user-facing AI Agent goes live or an existing one is edited
- Query intent classification
- Applicable if two or more user-facing Agents are connected
- Charged on a per query basis
- Variable extraction
- Applicable if AI Agent has one or more variables set up
- Charged on a per query basis
Workflow | Average Estimated Input | Average Estimated Output | Cost / Input Token | Cost / Output Token | Estimated Cost per Run |
---|---|---|---|---|---|
AI Agent intent generation (gpt-4-1106-preview) | 600 | 450 | 0.00001 | 0.00003 | 0.0011 |
Query intent classification (gpt-3.5-turbo-1106) | 1000 | 50 | 0.000001 | 0.000002 | 0.0195 |
Variables extraction (gpt-3.5-turbo-1106) | 1000 | 100 | 0.000001 | 0.000002 | 0.0012 |