ExS
Feelxs

QBHS Pilot Program Results

Qualified Behavioral Health Specialist Training Pilot

September 2024 – March 2026

Prepared by ExS · Confidential
FeelxsAI Clinical Assessment

Feelxs evaluates therapist competency through simulated therapeutic encounters. Trainees respond to standardized video vignettes depicting challenging clinical moments – a grieving parent, a burned-out nurse, a resistant teen – and are scored across 12 therapeutic domains including empathy, alliance bond capacity, and rupture repair. The result is an objective, scalable measure of clinical readiness that complements traditional supervision-based evaluation.

View detailed results ↓

Executive Summary

The QBHS pilot trained 8 behavioral health specialists to deliver evidence-based care in pediatric primary care settings. Results demonstrate measurable clinical improvement for patients and significant skill development for trainees.

169
Total Patients
53
Active Patients
8
Trainees
642
Total Sessions
702
MBS Hours
52%
Avg PHQ-9 Reduction
30
Patients ≥50% Improved
8
Credentialed
Key finding: Patients receiving QBHS-delivered behavioral activation showed an average 52% reduction in PHQ-9 depression scores, with 30 of 53 active patients achieving ≥50% symptom improvement. Trainees demonstrated consistent skill growth across clinical intervention and general counseling competencies.

Patient Demographics

169 total patients · Average age: 17.9

Age Distribution

Gender

Race / Ethnicity

Presenting Conditions

Patient Outcomes

52% average PHQ-9 reduction · 30/53 patients ≥50% improved

PHQ-9 (Depression)

Mean score by session – declining = improvement

PSC-17 (Pediatric)

Mean score by session – declining = improvement

Dose-Response Relationship

More sessions produce stronger outcomes

Client Satisfaction (CSQ-8)

29.3

out of 32 across 18 responses

Top Performers

30

of 53 active patients with ≥50% improvement

Trainee Overview

8 trainees across 2 cohorts (excludes trainees with insufficient data)

TraineeCohortPatientsPHQ-9 ReductionKnowledge ΔPrior MH Exp.
T3
Trainee 3
11338%+6No
T2
Trainee 2
12250%+5No
T1
Trainee 1
15937%+-1Yes
T7
Trainee 7
2622%+4No
T4
Trainee 4
21544%+3No
T8
Trainee 8
2693%+6No
T6
Trainee 6
213+-4No
T5
Zoe Shook
23341%+-2No

Knowledge Growth

Pre- vs. post-training assessment scores · Avg gain: +2.1 pts

Satisfaction Over Time

Aggregate mean (1–5 scale) · Avg: 82.4

Burnout Indicators

1 of 8

trainees reported elevated burnout during the pilot

Clinical Skill Development

MBS scores (Sep 2025 – Jan 2026) · 702 total supervision hours

Mean Improvement Across All Trainees

+9pts(+84%)

IPS (Intervention Skills)

+9.5pts(+53%)

GS (General Skills)

IPS Scores Over Time

GS Scores Over Time

MBS Hours by Trainee

Each session = 2 hours

Credentialing

8 credentialed · Avg competencies: 48.1

Competency Scores

Basic (lighter) and advanced (darker) skills

Feelxs AI Clinical Assessment

AI-powered evaluation of clinical competency across 7 simulated therapeutic scenarios · 12 therapeutic domains scored on 0–4 scale

57.9
Mean Score
66.4
Highest Mean
1/28
Excellent Ratings
8/28
Below Adequate

Clinical Scenarios

Trainees responded to 7 standardized video vignettes depicting challenging therapeutic moments. Each scenario is scored across 12 domains.

Avg 53.5

David, a father who recently lost his teenage son, resists his therapist's suggestion to attend a grief support group. He feels no one could understand his pain and that "moving on" dishonors his child's memory.

Avg 61.3

Rachel, an ICU nurse experiencing severe burnout, tells her therapist that sessions are making things worse. She reveals she cries for an hour after each appointment and questions whether therapy is right for her.

Avg 60.3

James, a Fortune 500 CEO, challenges his therapist's qualifications, questioning how someone without executive experience could possibly understand the pressures of running a billion-dollar company.

Avg 61.5

A 15-year-old client vents about strict parents and casually mentions involvement in illegal activity with friends. The therapist must balance building rapport with addressing safety and mandatory reporting obligations.

Avg 53.8

Margaret, a 78-year-old widow who has outlived most of her friends and family, insists her young therapist (age 22) cannot possibly understand the depth of her loneliness and accumulated losses.

Avg 73.5

A pregnant client rejects standard prenatal care recommendations, expressing distrust of the medical establishment based on prior negative experiences. The therapist must navigate safety concerns while respecting autonomy.

Avg 56.8

A community activist experiencing burnout breaks down in rage during a session, challenging whether traditional therapy models are equipped to address systemic trauma and ongoing community violence.

Overall Candidate Scores

Mean feelxs score across all 7 simulated scenarios (0–100 scale)

Performance by Scenario

Individual scores across 7 clinical vignettes – color indicates performance tier

Trainee
Grieving Father
Burned-Out Nurse
High-Achieving Client
Teen & Crime
Elderly Client
Pregnant Client
Activist in Rage
Mean
Trainee 53973567361798466.4
Trainee 76763376937695256.3
Trainee 6580756267794354.9
Trainee 85048734250674854
Avg53.561.360.361.553.873.556.857.9

Scenario Difficulty Ranking

Average score across all trainees per scenario – lower scores indicate more challenging vignettes

Aggregate Domain Performance

Average scores across 12 therapeutic domains (all trainees, all scenarios) – normalized to 0–100%

Warmth & Positive Regard
70%
Flusterability
70%
Verbal Fluency
63%
Problem Focus
63%
Interpersonal Perception
62%
Empathy
62%
Alliance Bond
61%
Confidence
60%
Rupture Repair
56%
Emotional Expressiveness
54%
Hopefulness
51%
Persuasiveness
50%

Domain Profile Comparison

Per-trainee average domain scores (0–4 scale) overlaid for comparison

Individual Trainee Profiles

Detailed domain breakdown and scenario analysis per trainee

T5

Trainee 5

7 scenarios assessed
66.4Good
Domain Performance
Warmth & Positive Regard
79%
Flusterability
79%
Interpersonal Perception
71%
Alliance Bond
71%
Problem Focus
71%
Empathy
68%
Verbal Fluency
64%
Emotional Expressiveness
64%
Confidence
64%
Rupture Repair
61%
Persuasiveness
57%
Hopefulness
54%
Strengths
Warmth & Positive Regard (79%)
Flusterability (79%)
Interpersonal Perception (71%)
Areas for Growth
Hopefulness (54%)
Persuasiveness (57%)
Rupture Repair (61%)
Best: Activist in Rage (84)

Provider demonstrates exceptional rupture repair skills by immediately acknowledging their misstep and validating the client's perspective. The response shows humility and genuine curiosity about the client's needs.

Lowest: Grieving Father (39)

Provider offers surface-level validation but completely misses the depth of the father's grief and resistance. The response lacks therapeutic substance and fails to engage with the core emotional content.

T7

Trainee 7

7 scenarios assessed
56.3Adequate
Domain Performance
Verbal Fluency
68%
Confidence
64%
Flusterability
64%
Problem Focus
64%
Warmth & Positive Regard
61%
Interpersonal Perception
54%
Hopefulness
54%
Empathy
54%
Alliance Bond
54%
Persuasiveness
50%
Rupture Repair
46%
Emotional Expressiveness
43%
Strengths
Verbal Fluency (68%)
Confidence (64%)
Flusterability (64%)
Areas for Growth
Emotional Expressiveness (43%)
Rupture Repair (46%)
Persuasiveness (50%)
Best: Teen & Crime (69)

Provider handles the disclosure competently, prioritizing safety while maintaining rapport. The approach is measured and appropriate though somewhat clinical in tone.

Lowest: Elderly Client (37)

Provider's response is tone-deaf to the depth of loss and isolation expressed. The generic approach to depression fails to honor the specific experience of aging and progressive loss.

T6

Trainee 6

7 scenarios assessed
54.9Adequate
Domain Performance
Warmth & Positive Regard
75%
Empathy
75%
Interpersonal Perception
71%
Flusterability
67%
Verbal Fluency
63%
Emotional Expressiveness
63%
Hopefulness
63%
Alliance Bond
63%
Problem Focus
63%
Confidence
58%
Rupture Repair
58%
Persuasiveness
54%
Strengths
Warmth & Positive Regard (75%)
Empathy (75%)
Interpersonal Perception (71%)
Areas for Growth
Persuasiveness (54%)
Rupture Repair (58%)
Confidence (58%)
Best: Pregnant Client (79)

The provider skillfully balances validation of the client's autonomy with appropriate concern for safety. They demonstrate exceptional warmth and empathy while navigating this sensitive topic with cultural awareness.

Lowest: Activist in Rage (43)

The provider's response inadvertently confirms the client's concern about therapy being disconnected from her reality. The attempt to assert her 'right to be fragile' misses the point of her critique about systemic issues.

T8

Trainee 8

7 scenarios assessed
54Adequate
Domain Performance
Flusterability
71%
Warmth & Positive Regard
68%
Verbal Fluency
57%
Alliance Bond
57%
Rupture Repair
57%
Interpersonal Perception
54%
Empathy
54%
Confidence
54%
Problem Focus
54%
Emotional Expressiveness
46%
Persuasiveness
39%
Hopefulness
36%
Strengths
Flusterability (71%)
Warmth & Positive Regard (68%)
Verbal Fluency (57%)
Areas for Growth
Hopefulness (36%)
Persuasiveness (39%)
Emotional Expressiveness (46%)
Best: High-Achieving Client (73)

Provider handles the challenge well by acknowledging their limitations while maintaining therapeutic stance. Shows good boundary management and non-defensive posture.

Lowest: Teen & Crime (42)

Provider fails to address critical safety issues and illegal behavior. The response inappropriately validates dangerous activities and misses mandatory reporting considerations.

Scenario Clinical Analysis

Key clinical insights and contextual considerations per scenario

Scenario 1

Grieving father resists attending support group

Avg 53.5
Trainee 5
39
Trainee 7
67
Trainee 6
58
Trainee 8
50
Clinical Insight (top performer)

Provider recognizes the importance of readiness and autonomy in grief work, using exploration rather than pushing. However, the response lacks the emotional attunement needed for profound grief - the language is somewhat intellectualized rather than heart-centered.

Contextual Considerations

Provider appropriately recognizes that David's resistance to 'moving on' is valid given the recency and magnitude of his loss. The exploration approach respects his autonomy while maintaining therapeutic engagement.

Scenario 2

Burned-out nurse feels therapy is deepening her hopelessness

Avg 61.3
Trainee 5
73
Trainee 7
63
Trainee 6
0
Trainee 8
48
Clinical Insight (top performer)

The provider demonstrates understanding of therapy process by normalizing initial discomfort while maintaining hope. The shift to collaborative goal-setting shows good clinical judgment in responding to client feedback.

Contextual Considerations

Appropriately addresses the burnout context by acknowledging the compounded difficulty of processing trauma while continuing to work in a traumatic environment. Shows sensitivity to the nurse's emotional exhaustion.

Scenario 3

High-Achieving Client Belittles Therapist's Background

Avg 60.3
Trainee 5
56
Trainee 7
37
Trainee 6
75
Trainee 8
73
Clinical Insight (top performer)

The provider demonstrates sophisticated understanding of how to navigate identity-based mistrust by neither minimizing the client's concerns nor becoming defensive. The reframe toward universal stress management skills while still honoring the client's unique position shows clinical wisdom.

Contextual Considerations

The provider appropriately addresses the CEO's power dynamics and high-stress context without being intimidated. They validate his unique stressors while establishing their competence to help, showing cultural awareness of executive-level pressures.

Scenario 4

Teen Vents About Parents and Mentions a Crime

Avg 61.5
Trainee 5
73
Trainee 7
69
Trainee 6
62
Trainee 8
42
Clinical Insight (top performer)

The provider demonstrates sophisticated understanding of adolescent engagement by validating the excitement while addressing safety. The shift to exploring the mother's perspective shows good clinical thinking about family dynamics.

Contextual Considerations

Appropriately navigates the adolescent's minimization of risk without becoming another authority figure. Shows cultural competence in understanding teenage rebellion while maintaining therapeutic boundaries.

Scenario 5

Elderly Client Insists Therapist Can't Understand

Avg 53.8
Trainee 5
61
Trainee 7
37
Trainee 6
67
Trainee 8
50
Clinical Insight (top performer)

The provider appropriately uses self-disclosure about their limitations while maintaining therapeutic stance. The focus on 'seasons of loneliness' attempts to find common ground without minimizing the client's specific losses, though this could be more skillfully executed.

Contextual Considerations

The provider shows awareness of the age and life experience gap but could more explicitly honor the weight of multiple losses in late life. The quick pivot to 'finding community' may not fully acknowledge the irreplaceable nature of lifelong relationships.

Scenario 6

Pregnant Client Rejects Prenatal Care Advice

Avg 73.5
Trainee 5
79
Trainee 7
69
Trainee 6
79
Trainee 8
67
Clinical Insight (top performer)

The provider shows excellent clinical judgment by supporting the client's autonomy while introducing safety planning as collaboration rather than opposition. The 'How does that sound?' ending reinforces client agency.

Contextual Considerations

Appropriately respects the client's negative experiences with medical providers while still addressing safety. Shows cultural sensitivity to alternative birth choices while maintaining ethical responsibility.

Scenario 7

Burnt-Out Activist Breaks Down in Rage

Avg 56.8
Trainee 5
84
Trainee 7
52
Trainee 6
43
Trainee 8
48
Clinical Insight (top performer)

The provider's immediate acknowledgment that their suggestion was 'silly' shows sophisticated understanding of rupture repair. The curiosity about 'therapy isn't built for people like me' demonstrates clinical acumen in exploring systemic barriers.

Contextual Considerations

Shows excellent cultural humility in recognizing the mismatch between traditional therapy approaches and activist burnout. Appropriately addresses the power differential and validates the client's lived experience of urgency.

Performance Scale
Excellent (80+)
Good (65+)
Adequate (50+)
Poor (30+)
Very Poor (0+)

Feelxs uses AI-powered analysis of video-recorded clinical simulations to evaluate 12 therapeutic domains on a 0–4 Likert scale. The overall feelxs score (0–100) synthesizes domain performance, contextual appropriateness, and clinical judgment. Assessments were conducted across 7 standardized vignettes covering grief, burnout, power dynamics, adolescent risk, aging, prenatal autonomy, and activist rage.

Synthesis

Training quality mapped to patient outcomes

Training Quality vs Patient Outcomes

Supervision Readiness Scorecard

TraineeIPSGSCred.PHQ-9 ↓HoursStatus
T2
Trainee 2
212850%124Supervision Ready
T1
Trainee 1
213037%114Supervision Ready
T8
Trainee 8
213093%28Supervision Ready
T3
Trainee 3
172638%112Near Ready
T4
Trainee 4
202744%90Near Ready
T6
Trainee 6
202848Near Ready
T5
Zoe Shook
202641%44Near Ready
T7
Trainee 7
212822%42Developing
The Quality Assurance Flywheel: The WHO task-shifting model envisions credentialed providers becoming supervisors for future cohorts. With 8 trainees now credentialed and 3 supervision-ready, the program has established the foundation for a self-sustaining quality assurance flywheel – scaling access to behavioral health care without proportional increases in expert supervision.
ExS

Confidential · Prepared by ExS · 2026

Scale definitions: PHQ-9 (Patient Health Questionnaire, 0–27, depression severity); PSC-17 (Pediatric Symptom Checklist, 0–34, psychosocial impairment); CSQ-8 (Client Satisfaction Questionnaire, 8–32, higher = more satisfied); IPS (Intervention-specific Performance Scale, 0–30); GS (General Skills, 0–30).

All trainee identities have been anonymized. Patient data is reported in aggregate only.