Cost Tracking
Track inference cost per request including compute time, memory usage, and token counts.
cargo run --example inference_cost_tracking
Track inference cost per request including compute time, memory usage, and token counts.
cargo run --example inference_cost_tracking