Product builder with a decade of PM experience and an engineering foundation.

A few things I've built lately.

I spent a decade in product, and lately I've been getting closer to the build again.

I studied EECS at Berkeley and started out as a software engineer before moving into product roles at places like LinkedIn and Docusign.

Tools like Codex, Claude Code, Lovable, and Magic Patterns have made it practical for me to build directly again. This site is a place to share a few things I've built, along with what they taught me about product judgment, AI workflows, and working with these tools.


Selected work

Each project has a short story about what it is, how it works, and what building it taught me.

Personal Agent OS

A shared workspace for specialized agents, where each one gets only the context it needs.

What it taught me: agents get better when context is scoped, governed, and reviewed instead of shared indiscriminately.

Health Updates Briefing

A system for turning health research, news, and education into a clearer weekly update, with checks in place before anything goes out.

What it taught me: trust comes from retrieval, guardrails, and eval design, not just from a polished summary.

Desk Flex

A stretching companion for desk workers that reduces the friction of deciding what to do, how long to do it, and what the body probably needs next.

What it taught me: recommendation systems feel smarter when they simplify the decision instead of exposing all the logic behind it.

Pocket Caddy

A phone-first decision-support tool that recommends a club from the player's own bag with just enough context to be useful on the course.

What it taught me: transparent decision support can feel more trustworthy than a smarter-looking black box.

What I am learning

A few lessons have stuck with me.

Some of these came from building AI systems. Some came from smaller design choices. The common thread is that building has a way of making the real product questions harder to ignore.


Model changes are system changes.

The same agent workflow can behave very differently under a different model, even when the harness, tools, and prompt are unchanged. That has made me think a lot more about regression testing and operational behavior, not just output quality.
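To make that concrete, here is a minimal sketch of a model-swap regression check. It assumes each model can be wrapped in a function that runs the same workflow and reports both its output and how many tool calls it made. The names `RunResult`, `Runner`, and `find_regressions` are hypothetical, and exact-match comparison of outputs is a simplification that a real check would likely replace with a rubric.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RunResult:
    output: str       # final answer produced by the workflow
    tool_calls: int   # operational signal: how many tools were invoked


# Hypothetical: a runner executes one test case under one specific model.
Runner = Callable[[str], RunResult]


def find_regressions(
    cases: list[str],
    baseline: Runner,
    candidate: Runner,
    max_extra_tool_calls: int = 0,
) -> list[str]:
    """Compare a candidate model against a baseline on the same cases."""
    regressions = []
    for case in cases:
        old, new = baseline(case), candidate(case)
        # Output check (exact match is an assumption made for brevity).
        if new.output != old.output:
            regressions.append(f"{case}: output changed")
        # Operational check: same answer, but noticeably more tool use.
        if new.tool_calls - old.tool_calls > max_extra_tool_calls:
            regressions.append(
                f"{case}: tool calls {old.tool_calls} -> {new.tool_calls}"
            )
    return regressions
```

The point of the second check is that a model swap can pass on output quality while still changing cost, latency, or side effects.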

Long-lived agent state needs progressive disclosure.

One personal productivity workflow started to break once its working state got too large, which pushed me toward a smaller index up front and deeper context only when the task actually needed it.
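A rough sketch of that pattern, assuming the working state is a set of named notes: the agent sees a one-line index first and requests the full text of a note only when a task needs it. `WorkingState`, `index`, and `expand` are hypothetical names for illustration, not the actual workflow.

```python
class WorkingState:
    """Holds long-lived notes but exposes them in two tiers."""

    def __init__(self, notes: dict[str, str]):
        self._notes = notes

    def index(self) -> dict[str, str]:
        # Tier 1: a small index, using the first line of each note
        # as its summary (an assumption made for this sketch).
        return {
            key: (text.splitlines()[0] if text else "")
            for key, text in self._notes.items()
        }

    def expand(self, key: str) -> str:
        # Tier 2: the full note, loaded only on request.
        return self._notes[key]


state = WorkingState({
    "travel": "Trip planning notes\nFlights, hotel options, open questions...",
    "taxes": "Tax filing checklist\nDocuments received, documents missing...",
})

prompt_context = state.index()        # small, always included
deep_context = state.expand("taxes")  # large, included only when relevant
```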

Safer AI products need more than one quality check.

The more I build systems that summarize, recommend, or act on behalf of a user, the less I trust a single pass/fail check. I keep coming back to layered evaluation: hard guardrails, softer rubrics, and a way to inspect what the system actually did.
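One way to picture those layers is the sketch below. It assumes guardrails are boolean checks that fail fast, rubrics are scoring functions averaged against a threshold, and every step is written to a trace for later inspection. All names here (`EvalReport`, `evaluate`, `min_score`) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class EvalReport:
    passed: bool
    rubric_score: float
    trace: list[str] = field(default_factory=list)  # what was actually checked


def evaluate(
    output: str,
    guardrails: dict[str, Callable[[str], bool]],
    rubrics: dict[str, Callable[[str], float]],
    min_score: float = 0.7,
) -> EvalReport:
    trace: list[str] = []

    # Layer 1: hard guardrails. Any failure blocks the output outright.
    for name, check in guardrails.items():
        ok = check(output)
        trace.append(f"guardrail:{name}={'pass' if ok else 'fail'}")
        if not ok:
            return EvalReport(False, 0.0, trace)

    # Layer 2: softer rubrics, scored from 0 to 1 and averaged.
    scores = []
    for name, score in rubrics.items():
        value = score(output)
        trace.append(f"rubric:{name}={value:.2f}")
        scores.append(value)
    average = sum(scores) / len(scores) if scores else 1.0

    # Layer 3: the trace travels with the verdict so it can be inspected.
    return EvalReport(average >= min_score, average, trace)
```

Returning the trace alongside the verdict is the design choice that matters most here: a failed run shows which layer stopped it.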

A skip button can mean three different things.

In Desk Flex, skipping can mean "not now," "I already did this," or "do not suggest it again." Treating those as the same signal would make the product feel dumber over time.
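As an illustration only (this is not Desk Flex's actual code), the three meanings could be modeled as separate signals that update a recommender's state in different ways. `SkipReason`, `StretchState`, and `apply_skip` are hypothetical names.

```python
from dataclasses import dataclass
from enum import Enum


class SkipReason(Enum):
    NOT_NOW = "not_now"          # bad timing, says nothing about preference
    ALREADY_DID = "already_did"  # counts as activity, not as rejection
    NEVER_AGAIN = "never_again"  # an explicit negative preference


@dataclass
class StretchState:
    snoozed: bool = False   # hide for the current session only
    completions: int = 0    # credit toward what the body has already had
    blocked: bool = False   # never recommend this stretch again


def apply_skip(state: StretchState, reason: SkipReason) -> StretchState:
    """Update per-stretch state according to what the skip meant."""
    if reason is SkipReason.NOT_NOW:
        state.snoozed = True
    elif reason is SkipReason.ALREADY_DID:
        state.completions += 1
    elif reason is SkipReason.NEVER_AGAIN:
        state.blocked = True
    return state
```

Collapsing all three into a single "skipped" flag would penalize a stretch the user likes but has already done, which is the failure the lesson describes.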