Nimbus Pro

AI Dashboard

Model usage, token consumption, latency, and cost intelligence.

Total Requests

1.84M

+24.8% last 30 days

Tokens Used

437M

+18.2% input + output

Avg. Latency

742ms

-48ms p50 latency

Monthly Cost

$8,420

+6.4% across all models

AI Requests — Last 30 Days

Daily request volume with annotated spike events

+24.8% MoM

Model Usage

Calls & token distribution

Model       Share  Tokens
GPT-4o      42%    184M
Claude 3.5  28%    122M
Llama 3.1   22%    96M
Custom      8%     35M
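
The percentages look consistent with each model's share of the 437M total tokens. A small Python check under that assumption (the dictionary and variable names are illustrative, not part of Nimbus Pro):

```python
# Token counts in millions, copied from the Model Usage panel.
tokens_m = {"GPT-4o": 184, "Claude 3.5": 122, "Llama 3.1": 96, "Custom": 35}

# Sum of per-model tokens; should match the "Tokens Used" card (437M).
total_m = sum(tokens_m.values())

# Each model's share of total tokens, rounded to a whole percent.
shares = {model: round(100 * count / total_m) for model, count in tokens_m.items()}

print(total_m)
print(shares)
```

If the panel instead reports share of calls rather than tokens, the two sets of figures would not need to agree this closely.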

Top Prompt

Summarize ticket

used 14,820 times

Avg Confidence

94.2%

+2.1% vs last week

Error Rate

1.8%

-0.4% improvement

Cache Hit Rate

62.4%

$1,820 saved

AI Insights

Live

Raising your cache hit rate could save you $4,820/mo

Routing 38% of repeat GPT-4o calls through the semantic cache would reduce your monthly bill by ~57%. Detected 4 prompt patterns responsible for 62% of total cost.
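
As a quick sanity check, the ~57% figure appears to follow from dividing the projected saving ($4,820/mo) by the current monthly cost ($8,420). A minimal Python sketch, assuming that is how the percentage is derived (the constant and function names are hypothetical):

```python
# Figures copied from the dashboard cards; names are illustrative only.
MONTHLY_COST_USD = 8_420       # "Monthly Cost" card
PROJECTED_SAVING_USD = 4_820   # "AI Insights" headline


def saving_share(saving: float, total: float) -> float:
    """Return the saving as a percentage of the total bill."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * saving / total


share = saving_share(PROJECTED_SAVING_USD, MONTHLY_COST_USD)
remaining = MONTHLY_COST_USD - PROJECTED_SAVING_USD

# Prints: Saving: 57.2% of monthly cost, remaining bill $3,600
print(f"Saving: {share:.1f}% of monthly cost, remaining bill ${remaining:,}")
```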

Cost trajectory (15d)

Projected next week: $2,180

Recent AI Requests

Latest model invocations across the platform

Live tail
Request ID  User              Model       Tokens  Latency  Cost   Status
req_8j2k4   Priya Iyer        GPT-4o      2,840   842ms    $0.09  success
req_8j2k3   Marcus Chen       Claude 3.5  1,820   628ms    $0.03  success
req_8j2k2   Sofia García      GPT-4o      8,420   1240ms   $0.25  success
req_8j2k1   Yuki Tanaka       Llama 3.1   1,280   412ms    $0.00  success
req_8j2k0   Aaroh Sharma      Custom      620     380ms    $0.00  failed
req_8j2j9   Fatima Al-Rashid  Claude 3.5  3,420   920ms    $0.05  success
req_8j2j8   Ethan Wright      GPT-4o      5,640   1480ms   $0.17  success
req_8j2j7   Leo Romano        Llama 3.1   920     286ms    $0.00  success