When I built the scoring system for OT Wizard/ MyTherapyWizard, I wanted performance bands that would do three things at once: hold up psychometrically, work across a wide age range, and use language that is genuinely strengths-based. After looking at how the major standardized assessments handle this, I landed on a framework that is closely aligned with what the field already uses.
| Range | Band Label | Interpretation |
|---|---|---|
| 85-100% | Mastered | Performs skill consistently across contexts with little to no support |
| 70-84% | Proficient | Performs reliably with minimal support in most contexts |
| 60-69% | Developing | Skill is somewhat present but may require occasional prompts or support |
| 40-59% | Emerging | Skill performed inconsistently or only in structured/familiar settings |
| 20-39% | Beginning | Early attempts or partial skill components observed |
| 0-19% | Not Yet Observed | No evidence of skill use or minimal attempts |
The labels describe where a skill is in its trajectory. They do not describe the individual being assessed.
The neuroaffirming move in assessment language is not softness. It is precision and neutrality. Words like "Developing," "Emerging," and "Beginning" describe a trajectory of skill acquisition. They imply that growth is possible without making any judgment about the person being assessed.
Compare that to deficit-state language like "Inconsistent" or "Limited." Those words describe what someone is not doing. They locate the problem in the individual rather than in the skill being measured. Clients and family members who have spent years on the receiving end of deficit language tend to be especially sensitive to it, and that is true whether the client is a six year old, a teenager, or an adult.
The "Not Yet Observed" label at the bottom of the scale is doing important work too. The "Yet" signals that absence of observation is not a fixed trait. It keeps the door open without assuming a specific timeline.
OT Wizard /MyTherapyWizard is designed to be used across the full age range, from young children through adults. A common concern I hear is that words like "Developing" or "Emerging" feel too young for older clients. I understand the instinct, but the issue is usually not the labels. It is the items being scored.
A teenager being assessed on age-appropriate executive function, handwriting fluency, or self-advocacy skills will not feel infantilized by an "Emerging" rating. An adult being assessed on workplace task initiation or community mobility will not either. The mismatch comes when older clients are scored on items that were really designed for younger ones. That is an item-pool problem, not a label problem. The best way to solve this is to select guided evaluations for the appropriate age population and to skip subdomains (such as scissor skills) that aren’t relevant.This is why our platform invests so heavily in age-appropriate item development. The labels stay consistent. The items adapt to the person.
The language we use is very much in line with how the major developmental and rehabilitation assessments handle their descriptive categories. Here is a quick look:
AEPS (Assessment, Evaluation, and Programming System) uses Consistently Performed, Inconsistently Performed, and Does Not Perform. These describe pattern, not stage.
HELP (Hawaii Early Learning Profile) uses Mastered, Emerging, and Not Yet Present. The same trajectory framing we use.
Vineland-3 (birth through 90+ years), which is the gold standard for cross-age range assessments, uses High, Moderately High, Adequate, Moderately Low, and Low. These are statistical comparisons rather than developmental stages.
BOT-2 (ages 4-21) uses Well-Above Average, Above Average, Average, Below Average, and Well-Below Average. Again, statistical comparisons that work across the full age range.
Sensory Profile-2 (birth through 14) uses Much Less Than Others, Less Than Others, Just Like the Majority of Others, More Than Others, and Much More Than Others. Fully neutral frequency language.
The pattern across the field is clear. Assessments that span wide age ranges either use statistical comparison language or trajectory language. The trajectory words we use are standard psychometric vocabulary, not preschool-coded terms.
The band labels are hardcoded into our scoring system on purpose. This is not a limitation. It is an intentional architectural choice tied to psychometric integrity.
When labels stay consistent across every report the platform generates, three things happen. Inter-rater reliability is preserved. Reports remain comparable across time, across clinicians, and across settings.
If a guided template is custom built for a specific clinician or discipline, the items inside it can be tailored. The descriptors can be adjusted. The band labels themselves stay the same so that a "Proficient" rating means the same thing in every report, no matter who generated it or who the client is.
Every guided evaluation in OT Wizard / MyTherapyWizard follows this same scoring logic:
Because the bands are tied to underlying percentages, and because the items are built with psychometric scoring properties baked in, the system can produce reliable, comparable results across evaluations, across clinicians, and eventually across the entire normative dataset as it grows.
The performance bands in OT Wizard / MyTherapyWizard are not arbitrary. They are designed to be neuroaffirming, psychometrically sound, and consistent across the full age range we serve, from young children through adults. The language is strength-based without sliding into deficit framing. It mirrors what the major assessments in our field already use. And it stays consistent across every report so that what a clinician reads, what a family member reads, and what a teacher or care partner reads all carry the same meaning.