Case Study
7 min read · Updated May 14, 2026
Steering leading indicators
Targeting, retention, and marketplace mechanics
01 · Context
An early-stage marketplace with serious analytics muscle.
User Interviews was a roughly thirty-person product-led-growth company when I joined as a PM in late 2020. The product was a two-sided SaaS marketplace for user research. Researchers recruited participants, participants got paid for completed sessions, and the unit of value was a successfully-run study. For the company's size, the analytics program was unusually mature: data scientists were embedded in product pods rather than centralized, and most meaningful product decisions were scoped as experiments, launched first to roughly 50% of users, and held or shipped based on the data.
I joined the Core pod (researcher-side flows—email theming, multi-team org accounts, etc.) and later moved over to Matching, the platform pod that owned the core marketplace. My data background unlocked the move. Matching owned two pieces that mattered for everything that follows: the targeting capabilities researchers used to define their audience, and the email-batching algorithm that fed participants into studies on a cadence to fill them.
02 · Opportunity
EQR predicted both sides, so EQR was the metric to move.
The instinctive metrics for a research marketplace (study completion, time-to-fill) are lagging—by the time they move, the participant or researcher is already either delighted or gone. The data science team had identified an upstream signal that predicted both: Early Qualification Rate, or EQR—the percentage of new participants who qualified for their first study within two weeks of signup. Higher EQR meant participants were materially more likely to complete a session, which meant researchers were materially more likely to complete an entire study on their timeline, and both parties were materially more likely to stick around for future studies.
One metric, both sides of the marketplace, leading rather than lagging. EQR was the metric to move.
The diagnostic was straightforward and uncomfortable. Too many new participants were entering screener funnels for studies they weren't a fit for, being screened out, and disengaging. Researchers had only basic demographic targeting available—location, age, gender—but their actual use cases (often B2B research) needed to filter on occupation and skills. So researchers had been doing it indirectly: writing screener questions like “Are you a product manager?” and relying on the post-invitation screener to filter. The problem is that screener filtering happens after the invitation lands. By the time a participant is told they don't qualify, they're already hopeful if not expectant that they do.
03 · Discovery
Researchers wanted to filter on it. Participants wanted to be seen for it.
User Interviews ran continuous discovery as a standing practice, and the same gap surfaced from both sides of the marketplace at once.
Researcher signal
Occupation and skills targeting would unblock B2B research.
Researchers consistently told us that occupational targeting—and skills targeting, separately—would let them reach participants for their studies more efficiently. The screener-question workaround was a known tax, not an invisible one.
Participant signal
Participants wanted attributes that explained why they should be picked.
Participants reported feeling overlooked because the platform only knew them by demographic basics. Adding occupation and skills was, to them, less about targeting than about being seen by the right researchers.
On the quant side, my embedded data scientist had already done the work to establish EQR as the leading indicator; we worked together to scope the experiments that followed, and I built my own SQL dashboards in Mode to watch what we shipped. The picture from the data became clearer with each successive experiment: qualification was concentrated among participants whose profiles contained occupational data; researcher and participant retention were increasing.
04 · Strategy
Pre-qualify participants entering the funnel.
The bet was that moving qualification upstream of the invitation would compound through the funnel. If researchers could target on occupation and skills, and participants could surface those attributes on their profiles, two things should follow. First, invitations would land on participants who were more likely to qualify, which would lift EQR directly. Second, participants would feel selected for rather than filtered out—improving retention even in the cases where they didn't ultimately fit a specific study.
That second mechanism is the part that most marketplace PMs underrate. A screener rejection is a big piece of bad news delivered after a small piece of good news. A targeting attribute mismatch, by contrast, is more or less invisible to the participant entirely. The first costs you retention almost immediately. The second costs you retention over time if you don't implement other solutions (which is why we tended to iterative changes to batch size, cadence, and composition).
05 · Execution
Two quarters, three phases per new data attribute, and two-to-three-week experiments.
The pods' shipping cadence shaped the project as much as the data did. We scoped each effort to a two-to-three week build and a two-to-three week live experiment. Experiments launched to roughly 50% of users, and we decided go or no-go from the data. We then full-shipped the previous experiment while the next one was going live. That rhythm is what let a build that involved two third-party integrations and an algorithm rewrite land in two quarters, from ideation to full scale. Each new data attribute went through three phases.
Phase 01
Participant-side data first to ensure we can provide meaningful targeting.
Before building anything researchers could see, we tested whether participants would actually fill in occupation and skills attributes on their profiles. Stated preference is famously generous; we needed confidence the data would be there before the researcher-side filters depended on it. That phase shipped the participant-side surfaces first and gated everything downstream on its adoption numbers. We enabled occupation and skills data collection in onboarding and profile editing, and ran a product marketing campaign to encourage existing participants to update their profiles.
Phase 02
Researcher-side filters, built on standardized taxonomies.
Job title went live first, anchored on a Lightcast integration of Bureau of Labor Statistics data. Skills went live second, anchored on a LinkedIn dataset. Both leveraged ElasticSearch in the UI, rather than using free text inputs. Build-vs-buy on the data sources was the single hardest decision of the project; even after choosing to buy, both integrations required manual internal cleaning and enrichment before build could start. Buying also allowed us to ensure data taxonomy was standardized on industry data rather than data we generated ourselves.
Phase 03
Tuning the batching algorithm to honor the broader EQR hypothesis.
With the new attributes live, the algorithm that batched and sent invitations to participants needed to be adjusted. The pre-qualification on new attributes could only be fully proven out by adjustments to batch size and cadence. The more confident we were in the matching, the smaller and less frequent the batches needed to be. But we couldn't do this and ignore which participants were still inside their first two-week window. In this phase, the work shifted from feature releases to continuous iteration on the email invitation algorithm, including batch size, cadence, and composition.
06 · Outcomes
EQR +15%. Re-recruitment +135% from a one-month side quest.
The primary outcome was on the metric we set out to move: EQR rose 15%, with downstream lifts on the lagging indicators (study completion and participant retention).
An unrelated solution was also contributing to the rise in EQR during this time. While I was still on Core, we had a fragile hypothesis: if we could increase usage of the “invite past participants” feature, researchers would see better study outcomes. The data showed that researchers who used it had measurably better study outcomes; usage was surprisingly low. We had to figure out why and fix it.
I performed a heuristic analysis that suggested comprehension was the issue, rather than discoverability. A few discovery sessions confirmed it over a couple of weeks. We clarified the help text language for the feature, explaining what it was and how it worked, and tweaked the icon and its color as a secondary fix. The whole thing—diagnosis, discovery, build, ship—took about a month (fast for the pre-Claude-Code era). Re-recruitment usage rose 135%. When we began working on occupation and skills targeting, re-recruitment was already influencing EQR in a positive direction; the new releases compounded the effect of that prior work.
Time to scale
2 qtrs
Ideation to full rollout of the targeting build, across two third-party data integrations and a matching-algorithm rewrite.
EQR
+15%
Early Qualification Rate—the share of new participants who qualified for their first study within two weeks of signup. Lift driven by the targeting build.
Re-recruitment
+135%
Researcher usage of the ‘invite past participants’ feature after a separate, one-month UX writing fix to an existing, but underused surface.
07 · Reflection
Pick the leading indicator, then engineer against it.
The biggest leverage move at User Interviews wasn't a feature in the traditional sense—it was changing where general qualification happened in the supply-side funnel. This holds true across all early-stage marketplaces.
In the case of user research marketplaces, it is about moving general qualification out of screeners and upstream of study invitations. Screener-based filtering should be niche and project-specific. If general qualification is done via the screener, the marketplace pays its cost in churn, because every screener rejection is a participant who had unrealistic expectations set and is now disappointed in the entire platform, not just the single outcome.
Pre-qualifying via attribute targeting costs more upfront—data sourcing, integration work, participant-side adoption—but the benefit compounds in the marketplace's favor over time. Every marketplace has its own version of this tradeoff. The lesson generalizes: be skeptical of downstream supply-side qualification, especially when the price is paid by your users.
The other lesson is about metric selection. The reason this project worked is that the org had picked the right thing to measure before anyone wrote a line of feature code. EQR predicted both sides of the marketplace and was upstream of every product decision I made for two quarters. A weaker org would have anchored on whichever side was loudest in a given week. Picking the right metric is product work, not analytics work, and an embedded data-science function is the easiest way to make that judgment accurately. At People Inc., the same instinct meant stewarding a multi-year identity roadmap one program launch at a time, each launch a proof point for the bet and a vehicle for the next increment of executive buy-in. The re-recruitment side story is the same lesson at smaller scale—a fragile hypothesis, a few discovery sessions, and a month of work outperformed several quarters of people not prioritizing investigation of a solution on the assumption that it would be complex.
Two next steps, if this is the kind of PM work you're hiring for: review my resume .
Or, if you're ready to talk, book a 30-min product chat (opens in new tab).