Stuck on token counting for clinical note summaries - anyone else?
Claude token counting issues in healthcare NLP pipeline
Building an AI system that summarizes clinical notes for doctors using Claude Opus. I'm stuck on token-counting inconsistencies that break our batching pipeline. When processing 100+ patient notes daily, our local token counter reports different numbers than Claude's API, so batch jobs fail with "input too long" errors. Tried so far: Simon Willison's token counter tool, implementing our own BPE tokenizer, and widening buffer margins. Still seeing 5-10% variance on long medical narratives. Anyone dealt with this? Happy to grab coffee.
- 10:00 AM · Maya
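One workaround while waiting for a better answer: treat any local count as an estimate and bake the observed 5-10% variance into the batch budget, so a batch never gets within the error band of the hard limit. Below is a minimal sketch of that idea. The chars-per-token ratio and the helper names (`estimate_tokens`, `batch_notes`) are hypothetical illustrations, not anything from a real library; if you want an authoritative count, the Anthropic API exposes a server-side token-counting endpoint, which is the only counter guaranteed to match what the Messages API enforces.

```python
def estimate_tokens(text: str, chars_per_token: float = 3.2) -> int:
    """Rough local estimate; chars_per_token is an assumed heuristic,
    not Claude's real tokenizer."""
    return int(len(text) / chars_per_token) + 1

def batch_notes(notes: list[str], token_limit: int,
                safety_margin: float = 0.10) -> list[list[str]]:
    """Greedily pack notes into batches, reserving a safety margin
    (default 10%) to absorb estimator-vs-API variance."""
    budget = int(token_limit * (1 - safety_margin))
    batches: list[list[str]] = []
    current: list[str] = []
    used = 0
    for note in notes:
        t = estimate_tokens(note)
        if t > budget:
            # A single oversized note needs chunking before batching.
            raise ValueError("note exceeds per-batch budget; chunk it first")
        if used + t > budget:
            batches.append(current)
            current, used = [], 0
        current.append(note)
        used += t
    if current:
        batches.append(current)
    return batches
```

The point is that the margin, not the estimator, carries the correctness burden: as long as the estimator never undercounts by more than the margin, no batch can trip the API's limit, even though the local and server counts disagree.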