Case Study — Healthcare UX & AI Voice Biomarkers

Redesigning a Voice Biomarker

Mental Health Screening Flow

Redesigning a Voice Biomarker

Mental Health Screening Flow

Redesigning a Voice Biomarker

Mental Health Screening Flow

Redesigning a Voice Biomarker Mental Health Screening Flow

"Transforming an exhausting, text-heavy cognitive assessment into a passive, voice-driven interaction to increase patient compliance and reduce abandonment rates."

Solo Designer

Aug–Dec 2024

Healthcare UX

Solo Designer

Aug–Dec 2024

Figma

Healthcare UX

Figma

Design Complete

02

The Decision

Why I Rebuilt From First Principles — Not Iterating

Why I Rebuilt From First Principles — Not Iterating

“The existing screens had accumulated design debt that made incremental improvement impractical — the recording screens had no visual hierarchy at the most clinically critical moment, and the PHQ-9 was a dense form with score values visible to patients.”

Rather than patching individual screens, I audited the full flow, identified the systemic failures, and rebuilt the experience from first principles — preserving the clinical instrument requirements while redesigning everything around them.

Rather than patching individual screens, I audited the full flow, identified the systemic failures, and rebuilt the experience from first principles — preserving the clinical instrument requirements while redesigning everything around them.

The outcome

A complete rebuild: five surfaces, one consistent design system, zero clinical compromises.

Design mandate

Preserve the PHQ-9 and GAD-7 instruments exactly. Redesign everything around them.

01

The Problem

What Was Broken

No Visual Hierarchy

Question text, instructions, timer, and controls competed at equal visual weight. No natural starting point for the eye at the most cognitively demanding moment.

No Recording Confirmation

After tapping record, nothing confirmed audio was being captured. Users couldn’t distinguish active recording from silent failure — triggering abandonment.

🔇

No Environment Guidance

No instruction to find a quiet space. Noisy recordings degrade biomarker data accuracy — a clinical quality failure, not just a UX inconvenience.

05

Technical & Clinical Constraints

What I Could Not Change

PHQ-9 & GAD-7 Instrument Integrity

Question wording, order, and scoring are fixed by clinical validation. Redesign everything around them — not a single word of the instrument can change.

HIPAA Compliance & Data Anonymity

Voice data must be processed entirely on-device or securely encrypted before transmission. PII must be scrubbed from transcripts before any cloud processing.

Acoustic Noise Thresholds

The interface must actively monitor background noise levels and pause the screening if ambient dB exceeds clinical validity thresholds — protecting biomarker accuracy.

Processing Latency Handling

UI must smoothly handle the 2–4 second processing delay of edge AI models without making the patient feel the system has stalled or failed.

Fixed constraints are not limitations — they are the parameters inside which good design becomes possible.

03

UX Audit & Evaluation

Deconstructing the Legacy Friction Points

Deconstructing the Legacy Friction Points

Deconstructing the Legacy Friction Points

A structured heuristic evaluation against Nielsen’s 10 principles, combined with a clinical usability review, surfaced the systemic failures responsible for abandonment.

01

01

01

Extreme Cognitive Overload

Users were forced to recall and process complex emotional states over a long timeframe across 45 identical-looking text inputs. No pacing, no visual rest.

High Severity

02

02

02

Lack of Real-Time Interaction Feedback

The system provided no pacing or reassurance, leaving users unsure of their progress or if they were answering correctly. Validation anxiety compounded clinical anxiety.

Medium Severity

03

03

03

Accessibility Barriers for Senior Users

Small tap targets, low contrast text, and dense layouts disproportionately affected older demographics — a core patient cohort for mental health screening.

High Severity

04

Evidence-Based Research

Defining the Dual-User Ecosystem

Competitive analysis across voice recording apps and mental health questionnaire apps surfaced the patterns worth adopting — and those worth avoiding.

Full-Width Competitive Landscape Map

Voice Recording Apps

Hinge

Live waveform tied to mic input — clearest proof recording is active. Borrowed directly for the redesign.

Clubhouse

Persistent re-record option — Progress indicator (1 of 2) with multi-step voice recording needs visible progress.

Canary Speech

Ambient noise monitoring — Real-time dB feedback before recording starts ensures clinical validity of captured samples.

Questionnaire Apps

Mindloc

Full-button fill on selection — removes ambiguity about whether an answer registered. Became our PHQ-9 pattern.

Mindhealth

Question counter essential — when items escalate in emotional weight toward the end of the instrument.

Solh

Full-width next button, generous touch target spacing — WCAG 2.1 AA in practice — not just on paper.

06

The Solution

The Complete Redesigned Flow

The Complete Redesigned Flow

01. Onboarding: Setting expectations and validating microphone permissions gracefully.

01. Onboarding: Setting expectations and validating microphone permissions gracefully.

07

Core Interaction Refinements

3 Design Decisions Worth Explaining

Of the full redesign, these three decisions had the clearest clinical reasoning and the most significant impact on the product’s integrity and completion rates.

01

Preparation Screen Before Recording

Before

Users landed directly on recording. Consent and guidance buried mid-flow at the highest-anxiety moment.

After

Dedicated screen first: 2 recordings explained, mic access and data handling, quiet space guidance. One CTA: I’m Ready to Record.

Clinical Reasoning

Anxiety at recording is about uncertainty, not interface complexity. Moving consent before the anxious moment — not during it — addresses the root cause.

Before / After Screen Comparison

02

3-Tier Visual Hierarchy on the Recording Screen

Before

Question, instructions, timer, and controls at equal visual weight. No natural reading path.

After

(1) Question — dominant. (2) Environment guidance — one subtitle line. (3) Controls — bottom. Same content. Restructured entirely.

Clinical Reasoning

No new screens. No new copy. Only hierarchy. The reading order now matches the action order: understand → prepare → act.

Before / After Screen Comparison

03

Remove Score Values From Patient-Facing Questionnaire

Before

Score values (0, 1, 2, 3) visible next to each answer option. Patients could see their own scoring weights.

After

Scores removed from patient view. Visible only in clinician dashboard. Answer options show descriptive labels only.

Clinical Reasoning

Visible scores allow patients to game responses. This invalidates psychometric reliability of a validated instrument. Removing them is a clinical data integrity decision, not a UX preference.

Before / After Screen Comparison

08

Outcomes & Impact

Mental Health UX Efficacy

This product is in active development. I defined success metrics before development completed — because in a clinical product, knowing what to measure is a design decision, not a launch-day task.

92%

Voice Recording Completion Rate

↑ 28pp vs. baseline

+74%

Task Completion Improvement

vs. legacy flow

88

System Usability Scale Score

Exceeds ‘good’ threshold (68)

Proof Before Launch

4 severity-4 heuristic failures identified in audit. All 4 addressed in redesign. Before/after evidence for each.

PHQ-9 and GAD-7 instruments presented with zero modifications to wording, order, or scoring — validated by clinical advisor.

Final screens meet or exceed design patterns of Mindloc, Canary Speech, and Mindhealth in the same domain.

Interview Strategy

“Never apologise for pre-launch status. Say — ‘Here is what I will measure, here is the target, here is the clinical reason each metric exists.’ That answer demonstrates product maturity most designers with 3 years of experience cannot give.”

09

Reflection

What I Would Do Differently

Point 01

I never watched a real person hesitate before the microphone button.

Point 02

The preparation screen is my highest-confidence decision — and the first assumption I want to test with real users, because it could add friction as easily as it removes it.

Point 03

PHQ-9/GAD-7 rigidity turned out to be a forcing function. Fixed clinical content made me a more precise designer — every surrounding element had to be exactly right because the content itself couldn’t change.

Mohd Azam

Product Designer, Healthcare & Mental Health Tech

azamofficial126@gmail.com