AI Dashboard
Model usage, token consumption, latency, and cost intelligence.
Total Requests
1.84M
+24.8%last 30 days
Tokens Used
437M
+18.2%input + output
Avg. Latency
742ms
-48msp50 latency
Monthly Cost
$8,420
+6.4%across all models
AI Requests — Last 30 Days
Daily request volume with annotated spike events
Model Usage
Calls & token distribution
GPT-4o
42%184M
Claude 3.5
28%122M
Llama 3.1
22%96M
Custom
8%35M
Top Prompt
Summarize ticket
used 14,820 times
Avg Confidence
94.2%
+2.1% vs last week
Error Rate
1.8%
-0.4% improvement
Cache Hit Rate
62.4%
$1,820 saved
AI Insights
LiveCache hit rate could save you $4,820/mo
Routing 38% of repeat GPT-4o calls through the semantic cache would reduce your monthly bill by ~57%. Detected 4 prompt patterns responsible for 62% of total cost.
Cost trajectory (15d)
Projected next week$2,180
Recent AI Requests
Latest model invocations across the platform
Live tail
| Request ID | User | Model | Tokens | Latency | Cost | Status |
|---|---|---|---|---|---|---|
| req_8j2k4 | PI Priya Iyer | GPT-4o | 2,840 | 842ms | $0.09 | success |
| req_8j2k3 | MC Marcus Chen | Claude 3.5 | 1,820 | 628ms | $0.03 | success |
| req_8j2k2 | SG Sofia García | GPT-4o | 8,420 | 1240ms | $0.25 | success |
| req_8j2k1 | YT Yuki Tanaka | Llama 3.1 | 1,280 | 412ms | $0.00 | success |
| req_8j2k0 | AS Aaroh Sharma | Custom | 620 | 380ms | $0.00 | failed |
| req_8j2j9 | FA Fatima Al-Rashid | Claude 3.5 | 3,420 | 920ms | $0.05 | success |
| req_8j2j8 | EW Ethan Wright | GPT-4o | 5,640 | 1480ms | $0.17 | success |
| req_8j2j7 | LR Leo Romano | Llama 3.1 | 920 | 286ms | $0.00 | success |