fix: use full Kiro payload for prompt token counting by juslintek · Pull Request #102 · jwadow/kiro-gateway

juslintek · 2026-03-20T02:32:03Z

Problem

The prompt_tokens reported to clients (used by Claude Code's /context command) were wildly inaccurate because they were derived from Kiro's contextUsagePercentage, which returns unreliable values.

Solution

Count tokens from the complete serialized Kiro request payload using tiktoken. This includes system prompt, messages, tools, and all other payload fields — matching what actually gets sent to the API.

Changes

Replace request_messages/request_tools params with pre-counted prompt_tokens across all streaming functions
Count tokens from full kiro_request_body in both OpenAI and Anthropic route handlers
Remove dependency on contextUsagePercentage for token counting
Update tests to match new function signatures

The prompt_tokens reported to clients (used by Claude Code's /context command) were wildly inaccurate because they were derived from Kiro's contextUsagePercentage, which returns unreliable values. Instead, count tokens from the complete serialized Kiro request payload using tiktoken. This includes system prompt, messages, tools, and all other payload fields — matching what actually gets sent to the API. - Replace request_messages/request_tools params with pre-counted prompt_tokens across all streaming functions - Count tokens from full kiro_request_body in both OpenAI and Anthropic route handlers - Remove dependency on contextUsagePercentage for token counting - Update tests to match new function signatures

cla-bot · 2026-03-20T02:32:06Z

Thanks for the PR! 🎉

Before merge, we need a one-time CLA confirmation.
It confirms that you have the right to contribute this code and allow the project to use it.

Full CLA text:
https://github.com/jwadow/kiro-gateway/blob/main/CLA.md

Please reply once with:

I have read the CLA and I accept its terms

You need to write once, all further messages from me can be ignored.

juslintek · 2026-03-21T01:43:56Z

I have read the CLA and I accept its terms

This was referenced Mar 20, 2026

feat: add /v1/messages/count_tokens endpoint for Anthropic API #103

Open

fix(gateway): truncate long tool names instead of rejecting them #104

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: use full Kiro payload for prompt token counting#102

fix: use full Kiro payload for prompt token counting#102
juslintek wants to merge 1 commit intojwadow:mainfrom
juslintek:fix/prompt-token-counting

juslintek commented Mar 20, 2026

cla-bot bot commented Mar 20, 2026

juslintek commented Mar 21, 2026

Labels

2 participants