Patent Pending · U.S. 64/016,001

Stuttering is a
timing problem.
Now it's visible.

OpenMic measures every syllable in real time — scripted reading, live conversation, or two-way exchange. Sound Bridge and Articulation Trainer turn the measurement into structured practice. One engine, browser only, no wearables.

Subscribe See it in action →

Cancel anytime · works in any browser
● Two-way Conversation now in early access for OpenMic Pass subscribers

See It In Action

OpenMic — live

Subscribe View Capabilities Deck → SLP Dashboard → cancel anytime

✓ HIPAA-compliant by architecture — no PHI created, collected, or stored. Audio processed anonymously with no retention.

Now in Early Access · Conversation Analyze Mode

Two-way Conversation — analysis that runs while you talk.

OpenMic's analyze layer extended into live conversation. Same per-syllable scoring, same layered analysis pipeline — running on natural back-and-forth speech, at session pace.

Where the scripted console gives controlled measurement on prepared text, Two-way Conversation gives measurement during the moments that carry the heaviest speech-motor load: unstructured exchange. No teleprompter, no pause-and-wait, no read-aloud script.

Same engine. Conversation mode. Currently rolling out to OpenMic Pass subscribers as conversation-mode calibration completes.

Subscribe Early access · OpenMic Pass

Live

Conversation mode

Same

PAD engine

Per-syl

Scoring resolution

Script needed

The Console

OpenMic

The browser-based speech analysis console. OpenMic listens through your microphone and runs a layered analysis pipeline on every syllable — acoustic feature extraction, speech recognition, and weighted scoring — producing a per-syllable signal profile in real time, session over session. Scripted reading, teleprompter, free recording. Two-way Conversation mode now in early access for Pass subscribers.

📊

Per-syllable scoring

Acoustic events — onset repetitions, prolonged voicing, intensity anomalies — surfaced from the audio signal in real time. Every syllable scored for acoustic stability across multiple analysis layers.

📄

Teleprompter mode

Practice any text. Words advance as you speak. The scoring engine runs in parallel on every word.

📈

Multi-session comparison

Session history, trend lines, and per-word scoring maps that update every session.

📷

Camera mirror

Your camera stays on while you speak so you can watch your own delivery in real time.

🌐

Any language

Language-agnostic acoustic engine with cloud-based speech recognition. Supports dozens of languages and regional variants.

Subscribe cancel anytime

OpenMic · Session 14 · Engine v10

The

PAD 97

presentation

PAD 91

b-b-begins ⚑

PAD 42

PAD 96

t———en ⚑

PAD 37

o'clock

PAD 89

Session PAD

2 events · syllable-weighted

From Analyze to Practice

The analyze layer surfaces the signal. The practice tools train what it shows — phoneme transitions, articulatory effort, rhythm, exposure. Each tool runs on the same engine.

Now Live · Practice Tool · Sound Transitions

Sound Bridge — practice lives in the bridge

The space between two sounds is where voicing continuity is most fragile — and where it can be practiced. Sound Bridge measures voicing continuity across every phoneme transition you produce: frame-accurate, microphone-only, no wearables.

Twenty-eight sound pairs across four difficulty tiers. The PAD engine measures the bridge as three independent phases — hold, slide, landing — and surfaces exactly where the bridge held and where it didn't. Pitch trajectory and energy envelope drawn over every attempt.

▶ See how it works Try Sound Bridge →

Sound pairs

Difficulty tiers

Phases scored

16ms

Frame resolution

Now Live · Practice Tool · Per-Syllable Effort

Articulation Trainer — effort, made visible

Excessive articulatory co-contraction — antagonist muscles fighting each other when a sound should flow — is observable in the acoustic signal. Articulation Trainer makes that visible.

Pick a word or type your own. Speak it naturally. Each sound slot lights up in sequence — blue for passed, red for repeated attempts or high effort. The target is all green: minimal-effort production, phoneme by phoneme.

Try Articulation Trainer →

Live

Per-syllable scoring

Slot states

Place

Articulator feedback

Min

Effort target

Practice Ecosystem

The games

Eight voice-driven practice modes, all running on the same layered engine. Each targets a different aspect of speech-motor control. Microphone and browser only — no downloads.

🌈

Rainbow Syllables

Full phrases

Phrase practice

How to play

Choose a phrase. Press the mic. Speak naturally. Syllable blobs light up in real time — blue for soft, green for medium, orange for loud. Your PAD score shows after each round. Difficulty adjusts automatically.

Method

Practice Principle

Easy onset and prolonged speech are well-studied speech-production patterns. Rainbow Syllables structures practice around sustained, controlled voicing across an entire phrase.

Neuroscience

Phrase-level production exercises the full basal ganglia–thalamocortical loop. Each syllable requires the putamen to gate the next motor plan in sequence.

🌉

Sound Bridge

Sound transitions

Coarticulation See how it works ↗

How to play

Two sounds appear on screen. Say both without breaking your voice between them. 28 sound pairs across four difficulty levels. Start easy, work up.

Method

Practice Principle

Continuous phonation and coarticulation are well-studied speech-motor patterns. Sound Bridge isolates the transition between two sounds and measures voicing continuity across it.

Neuroscience

Coarticulation is controlled by the premotor cortex. Sound Bridge directly trains feedforward control described in the DIVA model of speech production.

⛰️

Summit

Challenge words

Feared-word practice

How to play

Type your scariest word. Hit start. Say it. Say it again. Watch your score climb and the climber rise with each repetition. The word that felt impossible becomes the word you've said fifty times.

Method

Practice Principle

Structured repetition of feared words draws on established speech-practice principles (Sheehan, Van Riper). Summit applies structured exposure — confront the word through repetition.

Neuroscience

Word-specific fear engages the amygdala, which modulates the basal ganglia gating system. Repeated voluntary production engages the amygdala–basal ganglia gating circuit through structured exposure.

🎯

Articulation Trainer

Effort visualization

Effort monitoring

How to play

Pick a word from the bank or type your own. Tap Speak and say it naturally. Each sound slot lights up in sequence — blue means it passed, red means it detected repeated attempts or high effort. Aim for all green. Less effort = better score.

Method

Practice Principle

Effort monitoring and proprioceptive awareness are well-studied speech-motor principles. Articulation Trainer surfaces per-phoneme effort visually — showing which articulators are over-engaged.

Neuroscience

Excessive co-contraction of antagonist muscles at the articulatory level is observable acoustically. Per-phoneme effort visualization surfaces motor overflow patterns that are otherwise only observable by ear.

🥁

Rhythm Pad

Volume control

Motor precision

How to play

Pads appear with a target volume zone. Make a sound and land in the zone. Green = nailed it. Difficulty increases as you improve. Three modes: hold, alternate, and burst.

Method

Practice Principle

Motor learning principles — specificity of practice, distributed practice, variable practice. Rhythm Pad trains proprioceptive control through volume targeting.

Neuroscience

Volume regulation targets the M1 orofacial region and cerebellar-cortical coordination for sound intensity mapping.

🎵

Cadence

Speech rhythm

Rhythm & timing

How to play

Watch the beat indicator. When it hits the zone, make your sound. Start slow, speed up. The game scores how close you land to each beat.

Method

Practice Principle

Rhythmic cueing externalizes the timing signal that the basal ganglia typically provides internally — providing an alternative timing input pathway.

Neuroscience

External rhythm engages the supplementary motor area via the cerebellum, providing an alternative timing pathway alongside the basal ganglia–SMA loop.

🫧

Bubble Hunt

Precision under pressure

Amplitude + timing

How to play

Watch bubbles move in wave patterns. When one enters the green zone, make a sound at the right volume. Miss the zone and it floats away. Speed changes as you level up.

Method

Practice Principle

Dual-task practice — producing controlled speech while tracking a moving target — trains attention resource management under cognitive load.

Neuroscience

Simultaneous visuomotor tracking and vocal output engages prefrontal executive control alongside the speech-motor circuit, training the system to perform under divided attention.

The Engine

Layered analysis.
One score per syllable.

OpenMic doesn't run a single algorithm on your speech. It runs a layered analysis pipeline — independent systems that each see different things in the audio signal. Their outputs converge through weighted scoring into a single per-syllable signal score. That score is what you see. The layers underneath are what make session-over-session score comparison reliable.

U.S. Provisional 64/016,001 · Filed March 24, 2026

DFS

Acoustic Analysis

The Disfluency Feature Stream runs at ~60 frames per second directly on the audio signal. Every frame is classified — silent, building, or voiced — producing a continuous stream of acoustic features: intensity, onset count, voiced duration, signal-stability flags. This is the raw measurement layer. No transcription, no interpretation — just signal.

Speech Recognition

A cloud-based speech recognition layer operates independently from DFS. It returns what was said — phoneme identity, phoneme-level accuracy scores, word boundaries, and timing. Where DFS tells you how the speech sounded acoustically, SR tells you what was produced linguistically. Two independent verdicts on the same utterance.

PAD

Per-Syllable Score

DFS features and SR output converge through a weighted scoring layer with defined, calibrated weights. The result is a single per-syllable signal score — the PAD score — that reflects acoustic stability, articulatory accuracy, and timing. This is what the speaker sees. The layers underneath are what make session-over-session comparison reliable.

Microphone → DFS · 60fps + SR · cloud → Scoring Weights → PAD Score

What the engine measures per syllable

Voice onset time (VOT)

Formant trajectories (F1, F2)

Fundamental frequency variability (F0)

Pre-speech silence duration

Amplitude contour and RMS intensity

Per-phoneme identity and accuracy

Onset repetition patterns and voicing anomalies

Onset count and restart patterns

Why two upstream layers matter

DFS and SR see different things. DFS detects that something acoustically unstable happened — a repeated onset, an intensity spike, a voiced duration anomaly. SR detects that the phoneme produced didn't match the target, or that a word was repeated. When both layers flag the same syllable, the scoring confidence is high. When they disagree, the system surfaces the disagreement for review.

Two independent signals → weighted convergence → one score per syllable

This is what makes session-over-session score comparison reliable. The score isn't one algorithm's opinion — it's the convergence of two.

Six-axis signal profile

Each session produces a six-axis signal profile — Acoustic Pressure, Vocal Steadiness, Onset Consistency, Speech Rate, Session Consistency, and Phoneme Range. Stacked across sessions, the shape shows session-by-session trajectory. Drag to rotate, scroll to zoom.

ILLUSTRATIVE DATA · NO CLINICAL CLAIMS Open full screen ↗

For Clinicians & Partners

Extend your sessions.

Practice happens between sessions, not just during them. OpenMic gives your clients an independent practice tool and gives you the session data from between appointments.

Assign and review

Share OpenMic with clients. Review per-syllable session data, trend lines, and challenge-word PAD maps between appointments.

Multi-session analytics

Session-over-session PAD scores, acoustic event logs, and per-word history. Data export in CSV.

SLP Dashboard

Caseload management, session-over-session score tracking, per-phoneme analysis, and a six-axis signal profile — all from the engine's scoring output.

View SLP dashboard ↗

Platform partnerships

Integrating the OpenMic engine inside your platform or product? B2B licensing and white-label arrangements available. Contact for scope and pricing.

Research access

Research protocol FP-PILOT-001 · academic IRB pathway in development. Contact for research partnership inquiries.

Signal architecture walkthrough

Full four-level resolution breakdown — session, word, syllable, phoneme — with the complete OpenMic signal record. For clinical and technical partners.

View capabilities deck ↗

✓ HIPAA-compliant by architecture — no PHI created, collected, or stored. Audio processed anonymously with no retention.

Plans

Individual

OpenMic Pass

OpenMic console

PAD scoring — every session

Cancel anytime

Subscribe →

SoundBridge

Phoneme transition drills

28 sound pairs · 4 difficulty tiers

Frame-accurate scoring

Buy → Subscribe →

Articulation Trainer

Per-phoneme effort visualization

Place-of-articulation feedback

Minimal-pressure production training

Buy → Subscribe →

Clinical & Enterprise

SLP — OpenMic / PAD Engine

OpenMic console + PAD engine

Per-session scoring, clinical export

One seat per licensed SLP

Subscribe — SLP →

Enterprise & Research

OpenMic / PAD Engine

Platform integration, white-label

Contact

Research / IRB

FP-PILOT-001 · academic IRB pathway in development

Contact

Get in touch →

The Science

The pre-articulatory window.

Where the signal carries the most information

Research by Per Alm and others locates the neurological origin of stuttering in the basal ganglia–SMA timing loop — not at articulation, but in pre-articulatory motor planning. The motor plan destabilizes before the mouth moves.

Most speech tools measure at or after articulation. OpenMic's engine analyzes the pre-articulatory window acoustically, treating voice onset time and formant stability as observable signal features upstream of articulation.

IFG

→

Pre-SMA

→

SMA

→

Putamen

→

Thalamus

→

The engine targets instability at the Pre-SMA → SMA gate.

Word history over phonetic complexity

Phonetic complexity alone doesn't predict where stuttering occurs. Word-specific neural history shapes pre-articulatory motor planning. A familiar word carries a different motor planning load than a novel one — independent of phonetic difficulty.

OpenMic tracks per-word scoring across sessions. The scoring map shows per-word PAD trajectories session by session.

fNIRS pathway

FluentPlay is in initial discussions with NIRx about developing a wearable fNIRS protocol for pre-SMA hemodynamic monitoring during speech planning tasks.

Status: early-stage, contingent on equipment funding

No active protocol is underway. If this work proceeds, PAD scores would be compared against fNIRS-measured pre-SMA activation. If validated, PAD could serve as a non-invasive acoustic correlate of pre-articulatory motor activity.

Browser-side audio pipeline

Every game shares the same audio engine. The microphone feeds a real-time analysis pipeline — the DFS runs at approximately 60 frames per second, classifying every frame into a disfluency feature stream. A parallel cloud-based speech recognition layer provides phoneme-level accuracy. Both streams converge into the scoring layer. No audio is recorded. Nothing is stored.

View pipeline walkthrough ↗

HIPAA & data architecture

FluentPlay is HIPAA-compliant by architecture. No personally identifiable or protected health information is created, collected, or stored at any layer. Audio is processed in real time through cloud-based speech recognition with no retention. No accounts or identifiers are required to use OpenMic.

IRB pilot

Research protocol FP-PILOT-001 · academic IRB pathway in development. Research partnership inquiries welcome — contact below.

About

Built by someone who stutters.

Will Carbone is the founder of FluentPlay Technologies. He's stuttered since childhood. Before FluentPlay, he spent more than a decade inside clinical and commercial biotech — building, monitoring, and stress-testing the systems that take drug molecules from synthesis to verified purity. Extraction, purification, analytical measurement, quality control from bench to commercial scale. The discipline of making invisible things legible through rigorous data.

When he looked at what existed for people who stutter, he found tools that hadn't kept pace with the neuroscience. The research is clear: stuttering involves timing instability in pre-articulatory motor planning. The tools treated it as something else.

He left biotech and built the practice tool the field was missing.

FluentPlay Technologies LLC · Somerville, MA
willcarbone@fluentplaytech.com

Get In Touch

For clinicians,
partners, and researchers.

Email

willcarbone@fluentplaytech.com

linkedin.com/in/will-c-387692188

Research / IRB

Research protocol FP-PILOT-001 · academic IRB pathway in development.
Contact for research partnership inquiries.

Platform & Enterprise

Engine licensing, white-label deployments, and enterprise agreements handled directly. No RFPs — just a conversation.

Stuttering is atiming problem.Now it's visible.

OpenMic — live

Two-way Conversation — analysis that runs while you talk.

OpenMic

Per-syllable scoring

Teleprompter mode

Multi-session comparison

Camera mirror

Any language

Sound Bridge — practice lives in the bridge

Articulation Trainer — effort, made visible

The games

Rainbow Syllables

How to play

Method

Sound Bridge

How to play

Method

Summit

How to play

Method

Articulation Trainer

How to play

Method

Rhythm Pad

How to play

Method

Cadence

How to play

Method

Bubble Hunt

How to play

Method

Layered analysis.One score per syllable.

Acoustic Analysis

Speech Recognition

Per-Syllable Score

What the engine measures per syllable

Why two upstream layers matter

Extend your sessions.

Assign and review

Multi-session analytics

SLP Dashboard

Platform partnerships

Research access

Signal architecture walkthrough

The pre-articulatory window.

Where the signal carries the most information

Word history over phonetic complexity

fNIRS pathway

Status: early-stage, contingent on equipment funding

Browser-side audio pipeline

HIPAA & data architecture

IRB pilot

Built by someone who stutters.

For clinicians,partners, and researchers.

Email

LinkedIn

Research / IRB

Platform & Enterprise

Stuttering is a
timing problem.
Now it's visible.

Layered analysis.
One score per syllable.

For clinicians,
partners, and researchers.