Skip to content

[pull] main from danny-avila:main#110

Merged
pull[bot] merged 3 commits into
innFactory:mainfrom
danny-avila:main
Jun 13, 2026
Merged

[pull] main from danny-avila:main#110
pull[bot] merged 3 commits into
innFactory:mainfrom
danny-avila:main

Conversation

@pull

@pull pull Bot commented Jun 13, 2026

Copy link
Copy Markdown

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

* feat: emit ON_CONTEXT_USAGE token budget snapshot per model call

* fix: scale post-prune remaining tokens to calibrated units

* feat: include per-tool token counts and deferred names in budget breakdown

* fix: dispatch context usage at invoke time with post-format recount

* test: live context usage verification for single, handoff, and subagent runs

* fix: gate usage dispatch behind empty-prompt guard, catch deferred definitions

* fix: refresh tool accounting on discovery, align reserve ratio, recalibrate per-tool counts

* fix: account same-length tail formatting in usage snapshot, assert discovery refresh

* test: live accuracy verification for tool loops, caching, and pruning

* test: cross-provider live accuracy matrix (google, bedrock)

* fix: coherent post-prune breakdown, trailing-batch recount, awaited dispatch, compaction budget fields

* 🩹 fix: Keep Usage Breakdown Coherent with Calibrated Budget Fields

- sync breakdown.instructionTokens/availableForMessages to the pruner's
  effective (calibrated) overhead so the aggregate agrees with the
  top-level budget fields even under the calibration variance threshold
- re-derive breakdown.messageTokens and update messageCount after the
  dispatch-time recount/tail-delta adjustments so the snapshot describes
  the payload actually sent

* 🩹 fix: Exact Legacy-Content Deltas and Reconciled Per-Tool Counts

- track the exact set of messages formatContentStrings will flatten via a
  shared isLegacyConvertible predicate and fold their token delta into the
  dispatch-time adjustment, so legacy-content rewrites before the trailing
  batch no longer skew the snapshot (zero extra tokenization when no
  convertible messages exist)
- apportion per-tool schema counts with the largest-remainder method so
  they sum exactly to the aggregate, both at initial counting and when
  calibration rescales them
- document fallback-retry snapshot semantics on ContextUsageEvent
* 💸 feat: Report Subagent Child-Run Model Usage via Usage Sink

* 🎯 fix: Per-Request Usage Events, Root-Run & Invoked-Provider Attribution

* 🧮 fix: Await Async Usage Sinks & Attribute Summarizer Model
@pull pull Bot locked and limited conversation to collaborators Jun 13, 2026
@pull pull Bot added the ⤵️ pull label Jun 13, 2026
@pull pull Bot merged commit d161adb into innFactory:main Jun 13, 2026
1 check passed
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant