feat(api): add /v1/detokenize endpoint by Dennisadira · Pull Request #9620 · mudler/LocalAI

Dennisadira · 2026-04-30T06:33:55Z

Summary

Closes #1649. Mirror of /v1/tokenize for the inverse direction: take a list of token IDs and return the detokenized text, requested by @benniekiss in the issue thread for "complete API workflow" use cases that need to turn token IDs back into text without local processing.

The proto/handler shape was discussed in #1649 (comment). @benniekiss reacted positively; landing this with the strict-mirror-of-tokenize precedent in mind. Happy to adjust if the proto naming or response shape should differ.

What's added

- Proto (backend/backend.proto): new Detokenize(DetokenizeRequest) returns (DetokenizeResponse) RPC, with DetokenizeRequest{repeated int32 tokens} and DetokenizeResponse{string content}. The Go bindings are regenerated by make protogen-go (gitignored as usual).
- llama.cpp backend (backend/cpp/llama-cpp/grpc-server.cpp): handler that calls common_token_to_piece per token and concatenates the pieces, the same primitive TokenizeString already uses in the same file.
- Other backends: inherit the default Unimplemented from pkg/grpc/base.Base, the same pattern as Detect, Rerank, etc. Backends can opt in later.
- Go plumbing: pkg/grpc/{interface,server,backend,client,embed}.go + pkg/grpc/base/base.go updated alongside their TokenizeString counterparts.
- HTTP: POST /v1/detokenize in core/http/endpoints/localai/detokenize.go and core/http/routes/localai.go. Request {"model": "...", "tokens": [...]}, response {"content": "..."}.
- Auth: entry in RouteFeatureRegistry gated by the existing FeatureTokenize, so no new feature flag is needed.
- Discovery: added under ai_functions in the routes index.
- Swagger regenerated; authentication.md updated to list the new endpoint.

Test plan

- make protogen-go regenerates clean
- go build ./core/... ./pkg/grpc/... clean
- go vet ./core/... ./pkg/grpc/... clean
- go test -c -o /dev/null ./core/services/nodes/... clean (the existing testcontainers-based suite needs Docker; only updated the two interface mocks so the test package still compiles)
- make swagger regenerates with the new endpoint visible
- Manual round-trip: POST /v1/tokenize → POST /v1/detokenize returns the original text on a llama.cpp model

Assisted-by: Claude:claude-opus-4-7

mudler · 2026-04-30T09:31:00Z

        return grpc::Status::OK;
    }

+    grpc::Status Detokenize(ServerContext* context, const backend::DetokenizeRequest* request, backend::DetokenizeResponse* response) override {


This requires a test addition to our e2e-backend test suite, where we exercise a mocked backend via the API.

Dennisadira · 2026-05-17

Done: added a Detokenize method to the mock gRPC backend and two e2e tests in the MockBackend suite (0024a9c). One POSTs known token IDs and asserts a non-empty content response; the other is a round-trip that tokenizes first, then detokenizes the returned IDs.

Dennisadira · 2026-05-21T20:23:35Z

Rebased onto current master (05e8e1e). The only conflicts were in the generated swagger files — upstream had added Diarization types; I merged both sets in alphabetical order. All other files applied cleanly. Ready for another look when you have a moment.

Dennisadira · 2026-05-31T08:21:44Z

Hi @mudler — just checking in on this one. The e2e mock backend tests you requested are in (commit 349c9d2), and the branch was rebased onto current master on May 21. Happy to address any further feedback whenever you get a chance to take another look.

Closes mudler#1649. Mirror of the existing /v1/tokenize path, requested by @benniekiss in the issue thread for "complete API workflow" use cases that need to turn token IDs back into text without local processing.

- Add Detokenize gRPC RPC with DetokenizeRequest{tokens} / DetokenizeResponse{content} messages.
- Implement in the llama.cpp backend using common_token_to_piece, the same primitive TokenizeString already uses internally.
- Other backends inherit the default Unimplemented from base.Base, in line with how Detect, Rerank, etc. are gated per-backend.
- Wire up the Go gRPC interface, server, client, and in-process embed wrapper alongside their TokenizeString counterparts.
- Add the schema types, ModelDetokenize wrapper, HTTP handler, route registration, RouteFeatureRegistry entry (gated by FeatureTokenize so no new feature flag is needed), and the discovery map entry under ai_functions.
- Regenerated swagger reflects the new endpoint and types.
- Update authentication.md to list /v1/detokenize alongside /v1/tokenize.

Assisted-by: Claude:claude-opus-4-7
Signed-off-by: Adira Denis Muhando <dennisadira@gmail.com>

Add Detokenize to the mock gRPC backend and wire up two e2e tests in the MockBackend suite: one that posts known token IDs and asserts a non-empty content response, and a round-trip that tokenizes first then detokenizes the returned IDs. Addresses reviewer feedback on mudler#9620.

Assisted-by: Claude:claude-sonnet-4-6
Signed-off-by: Adira Denis Muhando <dennisadira@gmail.com>

Dennisadira · 2026-06-03T04:11:52Z

Rebased onto current master — branch now has 2 commits (the endpoint + e2e tests) cleanly on top of master with no unrelated changes.

Also checked: no other PR has landed a /v1/detokenize implementation in master. The feature is still unaddressed upstream (related open issue: #1649), so this PR is still relevant.

Dennisadira · 2026-06-03T05:51:55Z

Hi @mudler — just flagging that the e2e tests requested in the review have been added in the second commit (c168bcba). The test suite adds a Detokenize method to the mock gRPC backend and covers both a basic POST with known token IDs and a tokenize→detokenize round-trip.

Happy to make any further changes if needed.

mudler reviewed Apr 30, 2026

mudler added the enhancement, needs-review, and waiting-from-reporter labels Apr 30, 2026

Dennisadira force-pushed the feat/detokenize-endpoint branch from 0024a9c to 5ea7612 on May 17, 2026 10:04

Dennisadira mentioned this pull request May 21, 2026

fix(middleware): repair tool_choice string mode + legacy flat shape for /v1/chat/completions #9859

Closed

2 tasks

Dennisadira force-pushed the feat/detokenize-endpoint branch from 5ea7612 to 349c9d2 on May 21, 2026 20:23

Dennisadira added 2 commits June 3, 2026 07:09

Dennisadira force-pushed the feat/detokenize-endpoint branch from 349c9d2 to c168bcb on June 3, 2026 04:09

Dennisadira wants to merge 2 commits into mudler:master from Dennisadira:feat/detokenize-endpoint
