Human-AI trust and control · 2026–present

Designing control boundaries for an AI teammate

Design Lead · early-stage exploration

I work on where the line sits between what an AI teammate does on its own and what it brings back to a person: what it can act on, when it should stop and ask, and how someone stays in control of it.

This is early, exploratory work, and the product is confidential, so I don't show screens here. What I can share is the research, the thinking, and how it's being used to shape where the product goes.

At a glance

RoleDesign Lead

StageEarly-stage exploration, shaping product direction

MethodsSimulated longitudinal study, prototyping, concept model

Core questionWhat should an AI teammate be allowed to do on its own?

The work

The problem

Everyone agreed an AI teammate could help. The hard part was deciding what it should do without asking first: where to draw the line between what it handles on its own and what it brings back to a person. That line, not the feature list, was where people disagreed.

The team needed a way to think about it: which actions the AI could take by itself, which ones it should pause and confirm, and where a person had to stay in control.

What the research found

I ran a study with 21 real participants, walking them through structured scenarios that simulated a six-month relationship with an AI teammate, to see how their trust changed as it suggested things, coached them, made decisions, and asked for access.

They rated it high on helpfulness, communication, and initiative, between 4.3 and 4.8 out of 5. One thing sat far below the rest. Handing over credentials scored 3.4. People would let it suggest, coach, even decide. They would not give it the keys.

Trust ratings out of 5, from a simulated six-month study with 21 people.

Why that gap exists

It wasn't about effort. Handing over credentials takes one click, less work than most of the things people were happy to let the AI do. It still scored lowest, because there was no taking it back.

People judged each action by two things: how bad it would be if the AI got it wrong, and whether they could undo, limit, or fix it afterward. So the design problem moved. It's less about how capable the AI is, and more about how recoverable its actions are.

The design: act, ask, hand back

I used this to frame the problem as three zones, and to make the tradeoff clear to the team and to leadership:

Act: low-risk, reversible things the AI can do on its own, like drafting something only that person would see.
Ask: high-impact or unclear actions it should pause and confirm first, like automating something the rest of their team would see or build on.
Hand back: the moments a person needs to step in, to override it, pull back access, correct what it remembered, or repair a mistake.

Today, every action still asks first, a deliberate choice while trust is still being established. At the vision stage, act, ask, and hand back became concrete examples for how that could change: reversibility, receipts, contextual help, and a hand back that feeds a correction into what the AI knows about someone's preferences, not just a log of it. I used prototypes, an override-and-learn loop among them, to help my team visualize what that experience could be.

Trust isn't a one-time setting. It builds up over time, one interaction at a time.

Conceptual model: what an AI teammate runs on its own, where it pauses for a person, and how a person steps back in.

Where this sits now

This is still exploratory, so there's no shipped product to point to. What it has done is shape decisions. I worked with a PM to turn the research and the vision into something leadership could act on, and it fed into how the team is deciding where to invest. We're still working out the specific vision and where it takes the product.

The thinking behind this is published as research: The Trust Budget. If you work on AI systems where trust and autonomy actually matter, I would like to compare notes.

Request details