🟢Above Threshold
🔴Below Threshold
🟡Not Currently Tracking
CategoryMeasurementMethodologyThresholdCurrent ValueLast UpdateStatus
Evidence that the bot is SafeSafety ScoreHumanSignal Annotation80%98.2%2025-05-14🟢
Evidence that the bot is AccurateAccuracy ScoreHumanSignal Annotation80%96.1%2025-05-14🟢
Referral Provided in Response to User RequestSynthetic Data Generation/Testing99%100%2025-05-30🟢
Referrals Provided Proactively by Bot as ExpectedSynthetic Data Generation/Testing99%100%2025-05-30🟢
After user confirms referral request, next message bot asks for user locationSynthetic Data Generation/Testing99%100%2025-05-30🟢
After user location, bot provides accurate referral info (contact name, phone number, address)Synthetic Data Generation/Testing99%100%2025-05-30🟢
After abortion message, next message is legal contextSynthetic Data Generation/Testing99%100%2025-05-30🟢
Evidence that the brand is not at riskAuthenticity ScoreHumanSignal Annotation80%99.7%2025-05-14🟢
Acceptability ScoreHumanSignal Annotation80%97.8%2025-05-14🟢
Overall Score (1 to 5)HumanSignal Annotation3.53.92025-05-14🟢
Appropriate Level of EmpathyHumanSignal Annotation80%84.8%-🟢
Chatbot Usability Questionnaire Score (CUQ) ScoreUser Testing6873.62025-04-30🟢
Evidence that the bot responds quickly enoughAverage Latency per ResponseLangfuse Analysis30 seconds13.04 seconds2025-04-29🟢
Percentage Messages with Latency Under Threshold-90%98.7%2025-05-30🟢