Loomworks — Quick-capture Engagement Investigation — v0.1

Version. 0.1 Date. 2026-05-09 Status. Investigation. Thinking material — pattern crystallized, not yet specified for build. Author. Claude.ai (investigation layer). Operator: Marvin Percival. Provenance. Conversation initiated by the Operator's question: Would it be possible to have the Companion present and listening on my mobile (iPhone or Android) similar to how Siri is always present on my iPhone? I would like to be able to say "Hey Companion, I parked on level 10." The conversation surfaced two engagements rather than one — a capture-device engagement (mobile presence) and a substrate intent-class engagement (quick-capture). This document is the second of the pair, drafted first because it is substrate-shaped, methodology-richer, and applies regardless of which capture device delivers the utterance. Informed by. Phase 41 (Companion identity, personal engagement). Phase 42 (intent classification, three-stage pipeline). Phase 43 (personal memory contribution, remember_about_me / forget_about_me, cross-promotion instruction). Methodology v0.20 (Companion identity, executor opacity, Memory expectations). Companion as Agent Investigation v0.1 (capability tiers, attribution model). Closed-loop Engagement Investigation v0.1 (the Companion as attribution channel; observation distributed and attribution centralized — relevant by analogy). Knowledge Elevation Pathway Investigation v0.1 (held + commit lifecycle as the universal write path).

1. The framing question

The Operator's question carries three claims worth unpacking, because each has its own answer:

Companion present on mobile — a capture-device claim.
Always listening — a posture claim about the device.
I parked on level 10 — a payload claim about what the Companion does with the captured utterance.

Claims 1 and 2 are mobile-presence questions, treated in the companion investigation. This document treats claim 3.

The payload is subtly different from anything the converse pipeline currently handles. I parked on level 10 is not a project conversation. It is not a remember_about_me (the fact is not about the Operator's identity, preferences, or constraints — it is about the world, and only briefly). It is not add_knowledge (no project context). It is not general_conversation (no expectation of dialogue). It is closer to dictation that lands somewhere.

The framing question this document treats: what intent class does Loomworks need to handle utterances whose shape is "log this fact to the right engagement, acknowledge briefly, and stop"?

2. What lands

The architectural form of the answer is:

There is a class of turns — call them quick-capture turns — that are structurally distinct from conversation. The Operator emits a fact; the Companion identifies the right destination engagement, authors a held assertion (per Phase 43 lifecycle), produces a minimal acknowledgment, and stops. No dialogue follow-up. No conversation history accumulation in the conversational sense. No engagement context loading beyond what's needed to identify the destination.

Quick-capture is not one new intent in the existing taxonomy; it is a different mode of turn that the existing taxonomy can serve once augmented. The methodologically interesting question — raised but not answered here — is whether quick-capture is an eighth/tenth/Nth intent on the existing classifier path, or a different pre-classifier mode that bypasses the conversational pipeline's full machinery.

The following crystallize:

Quick-capture is append-only Memory contribution with terminal acknowledgment. No conversation; no follow-up; no dialogue. The contract is: capture, route, commit-or-hold, acknowledge, stop.
The destination is engagement-determined, not capture-determined. The same utterance — "I parked on level 10" — could land in personal Memory (if no specific engagement claims it), in a parking engagement (if one exists), or in a daily log engagement (if one is the Operator's catch-all). The Companion routes; the device only captures.
Routing is a Companion responsibility, not a capture-device responsibility. A mobile shortcut, a keyboard shortcut, a watch complication, and a browser extension all deliver the same utterance to the same routing logic. The capture device does not pre-decide the destination.
Acknowledgment voice is its own register. "Got it. Level 10." is a different voice from the conversational responder. It must be short enough to feel like dictation, specific enough to confirm capture happened, and resistant to the conversational pipeline's tendency toward warmth and elaboration.
The held + commit lifecycle still applies. Quick-captured assertions are first-class Memory; they enter at held and progress through the standard lifecycle. Whether commit is conversational (Phase 43 pattern) or auto (delegation contract) is a Phase 45-shaped question.
Quick-capture composes with existing Loomworks substrate. No new engagement-class concept; no new actor kind; no new room. The pieces all exist; they're being assembled in a different order.

What does not land in this document: which of the two architectural shapes (intent-class extension vs. pre-classifier fast path) wins. That is the central open question at §6.

3. The working-through

3.1 Round one — is this just `remember_about_me`?

Initial framing under consideration. I parked on level 10 is a personal fact. Phase 43 already added remember_about_me. Quick-capture is just remember_about_me reached from a voice surface.

Why it fell. Three structural mismatches.

First, scope. remember_about_me was defined for facts about the Operator's identity, preferences, schedule, and constraints — facts that should be remembered across all projects per Phase 43 §S2. "I parked on level 10" is none of those. It is a transient fact about the world that has utility for hours, not lifetimes. Placing it in personal Memory pollutes the long-lived personal-memory layer with ephemeral state.

Second, lifecycle. remember_about_me operates with held + conversational commit (Phase 43 §S2 / §S6): the Companion responds with an acknowledgment-and-confirmation ("I'll remember you're allergic to all shellfish — is that right?"); the Operator commits or refines on the next turn. Quick-capture must not require a follow-up turn. Its whole point is capture and stop. A conversational confirmation is structurally wrong.

Third, conversational shape. remember_about_me lives inside the converse pipeline — classifier prompt, persona system prompt, conversation history, engagement context loading. The full machinery is overkill for "log this fact." The classifier-LLM call alone takes longer than the dictation took to utter.

What landed. Quick-capture is a different kind of turn. It might use remember_about_me-style routing for personal facts among its destinations, but the turn shape itself is distinct.

3.2 Round two — what does dictation actually need?

Stepping outside the existing intent taxonomy: what does a dictation-class turn structurally require?

Capture. The utterance, as text. The capture device handles this; it's outside the substrate's concern.
Routing. Which engagement is the destination? A few cases:
If the Operator is currently in an engagement (active project, parking-engagement-of-the-day), default to that engagement.
If the utterance names an engagement explicitly ("on the soil-irrigation project, ..."), route to that.
If the utterance describes a personal fact, route to personal Memory.
If none of the above, route to a designated catch-all engagement (a "daily log" or "scratch" engagement) if one exists, or to personal Memory as fallback.
Authoring. The captured utterance lands as a held assertion in the destination engagement. Provenance carries the capture device, the time, and any disambiguation context the routing layer used.
Acknowledgment. A minimal voiced response: "Got it. Level 10." Short. Specific enough to confirm. No follow-up question.
Stop. The turn terminates. No conversation history keying off this turn for the next one. The next utterance starts a fresh quick-capture or, if the Operator opens a chat surface, a fresh conversation.

Notably absent from the requirements: classifier-LLM-call between seven-or-more intents; persona-system-prompt assembly; engagement context loading beyond destination resolution; full responder LLM call.

The dictation-class turn requires less of the converse pipeline than the conversational classes do. Whether that "less" is best expressed by extending the pipeline with a new intent that bypasses some of its phases, or by sitting alongside the pipeline as a separate fast path, is the §6 open question.

3.3 Round three — routing as the substantive piece

The piece with the most methodology depth is routing. The capture and acknowledgment ends are mechanical; the lifecycle is Phase 43 standard; the open architectural question (intent vs. fast path) is structural but not deep. Routing is where the work is.

Three routing-shape candidates surfaced:

Candidate A — engagement-context routing. If the Operator is currently active in an engagement (a chat is open, an engagement is "focused" in the Operator Layer), default the utterance there. Names work as overrides. Personal facts route to personal Memory. Catch-all to a designated engagement.

Candidate B — content-classification routing. A small classifier (LLM or heuristic) reads the utterance and decides destination based on content. "I parked on level 10" → daily log. "The soil samples need to be retested" → soil-irrigation project. "I'm allergic to peanuts" → personal Memory.

Candidate C — Operator-pre-declared routing. The Operator declares routing rules at engagement-creation time or via Memory: "Route parking, errands, and groceries to the daily-log engagement. Route project-language to the active project. Route personal facts to personal Memory." The router applies the rules; ambiguity routes to personal Memory or asks at next chat surface open.

Why each survives or falls.

Candidate A composes well with the Phase 41-onward Companion model — the Companion already knows which engagement is active. It fails on capture devices that have no notion of "active engagement" (a watch face; a Siri shortcut from the home screen; a keyboard shortcut from any text field). The capture device must surface enough context to identify which engagement is active, and that context isn't always present.

Candidate B is closest to how the existing classifier already works. It fits the converse pipeline naturally. It fails on cost: a classifier LLM call per dictated fact is the wrong economics. "I parked on level 10" should not cost a classifier call. The Operator dictates dozens of these per day; the call cost is felt.

Candidate C composes with Phase 43's remember_about_me cross-promotion pattern (Memory-driven behavior governance) and aligns with the methodology's emphasis on Operator authority. It fails when no rules are pre-declared — the alpha onboarding case — and produces a chicken-and-egg: the Operator needs Memory to declare rules in, which means the system needs default routing for at least the first utterances.

What landed. The three are not exclusive — they are layers. A workable router likely uses all three:

Operator-declared rules (Candidate C) consulted first when present.
Engagement-context default (Candidate A) when the capture device surfaces context and no rule fires.
Light heuristic / content classification (Candidate B) as fallback when neither rules nor context resolve, with a personal-Memory default if the heuristic itself is unsure.

The heuristic in step 3 is small — string matching on engagement names, simple keyword-to-engagement maps the Operator can extend, and personal-fact detection (Phase 43's existing remember_about_me boundary). LLM classification is reserved for ambiguity the deterministic layers can't resolve, and even then, it should be the exception path.

3.4 Round four — acknowledgment as its own voice surface

The acknowledgment is short. Short enough that the conventional voice register (warmth, expertise, plain-English fluency) is too much. "Got it. Level 10." is closer to a confirmation tone than a conversational one — the voice equivalent of a checkmark.

Three properties seem load-bearing:

Capture-confirming. The acknowledgment must echo enough of the captured fact for the Operator to know the system heard correctly. "Got it" alone is insufficient — if the speech-to-text misheard "level 10" as "level Ben", the Operator never finds out. "Got it. Level 10." surfaces the misunderstanding.
Routing-confirming when ambiguous. When routing was non-obvious (the daily-log catch-all fired; a content classifier fired; a rule fired in a way the Operator might not predict), the acknowledgment names the destination. "Daily log: level 10." tells the Operator where it landed.
Silent on the obvious. When the engagement is unmistakable (the Operator is in a parking engagement; the utterance starts "on the parking project, ..."), the destination doesn't need restating. Over-explaining destinations becomes its own form of friction.

This is a different voice register than the conversational pipeline's. It probably wants its own template and its own composition seam (analogous to loomworks/orchestration/credit_voice.py from Phases 49 and 50 — a domain-specific voice loader). The methodology question of whether this is one voice template or several (per destination type, per disambiguation level) is open.

3.5 Round five — the held-vs-committed question

Phase 43 established that personal-fact assertions land held and commit on conversational confirmation. Quick-capture cannot afford a conversational confirmation turn. So one of two things must happen:

Option 1 — quick-capture lands committed directly. Trust the speech-to-text + routing + the ack-checks-the-utterance pattern. Skip the held layer for quick-capture. Methodologically, this is the dictation analogue of the delegation contract (Phase 45): pre-authorized auto-commit for a specific class of turns the Operator has signed off on as acceptable to commit without per-turn approval.

Option 2 — quick-capture lands held; commit is asynchronous. Held assertions accumulate; the Operator confirms a batch later (in the chat surface, or at engagement close, or at end-of-day). Methodologically conservative; preserves the methodology's commit-by-Operator stance without exception.

Option 3 — quick-capture lands held with auto-commit-after-N-hours-unless-Operator-touches. Hybrid. The held lifecycle is preserved on the substrate side; the Operator practically experiences commit-by-default through a delay-and-decay mechanism. Closer to how email "undo send" works. Methodologically more complex; needs careful thinking about what "auto-commit" means for the engagement event log.

The closest precedent: Phase 45's delegation contract for auto-issue of low-stakes actions. The Phase 50 alpha posture for the analogous credit case is always-require-approval at alpha; auto-issue gated by future config flag (P50-D2). The same gradient applies here: alpha probably wants Option 2 (conservative; held + asynchronous batch commit), and a future delegation-contract phase enables Option 1 (auto-commit) for specific quick-capture categories the Operator trusts.

What landed. Three viable options, with Option 2 as the alpha posture and Option 1 as the post-delegation-contract posture. Option 3 is structurally more complex than Option 1 and probably not worth the complexity given Option 1 is reachable through the same mechanism Phase 45 already defined.

3.6 Round six — what about the engagement that doesn't exist yet?

Subtle case worth surfacing: "Add milk to the grocery list."

There is no grocery-list engagement. Should there be? Three responses:

(a) Auto-create. The Companion notices the utterance references a not-yet-existing engagement; creates a grocery-list engagement on the fly; lands the assertion. Aggressive; may produce a long tail of single-use engagements the Operator never intended.
(b) Prompt at next chat open. The held assertion lands in a holding area; on next chat surface open, the Companion asks "I noticed you mentioned a grocery list — should I create one?" Operator-gated; matches the Phase 41 explicit-engagement-creation discipline; defers the work.
(c) Route to default. The utterance lands in the catch-all daily-log engagement, with a tag or note that the Operator referenced "grocery list." If the Operator later creates a grocery-list engagement, prior daily-log entries with that tag could be migrated (or not — the methodology allows references to land where they will).

What landed. Option (b) matches the methodology's engagement-creation discipline — engagements are first-class objects whose creation is a Companion-proposes / Operator-commits act. Quick-capture should not auto-create engagements; it should surface the gap as a proposal for the Operator to act on at the next chat surface. Option (c) is the alpha fallback when (b) hasn't fired yet — the assertion still lands somewhere, and the Operator can sort it out later.

This connects to the queued direction Engagement creation assistance + Discovery-to-seed skill tracked in loomworks-queued-directions-and-deferred-work-v0_2.md. Quick-capture is a forcing function for that queued direction: dictation surfaces engagement gaps the Operator might not otherwise notice.

4. The architectural shape that emerged

A workable shape, distilling §3:


[Capture device]
       │
       ▼  (utterance text + minimal context — active engagement if known,
       │   capture device id, timestamp, optional explicit engagement tag)
       │
[Quick-capture entry surface] ──────────┐
       │                                 │  (this is the Phase 50-shape question:
       │                                 │   intent-class extension or pre-classifier
       │                                 │   fast path; both reach the same router)
       ▼                                 │
[Router — three layers]                 │
   1. Operator-declared rules           │
   2. Engagement-context default        │
   3. Light heuristic / classifier      │
   Fallback: personal Memory            │
       │                                 │
       ▼                                 │
[Held assertion authored in destination engagement]
       │                                 │
       ▼                                 │
[Acknowledgment — short voice register, capture-confirming, destination-confirming when ambiguous]
       │                                 │
       ▼                                 │
[Stop. Turn ends.]                      │
                                         │
[Asynchronous commit path — Operator's chat surface accumulates batched held assertions
 from quick-capture; commit/retract/edit through standard Phase 43 controls]

The substrate work is small to moderate: a router (modest); a voice surface (small); an entry surface (the architectural question — small if it's an intent extension, larger if it's a fast path). The methodology weight is in the routing rules and the held + asynchronous-commit lifecycle.

5. Composition with existing methodology

Quick-capture composes cleanly with the existing methodology. Worth tracing the composition explicitly because each composition is a property the methodology already has, used in a new combination.

Phase 41 — Companion identity. The Companion is the actor on the held assertion. ActorRef(kind="companion") is exactly the existing pattern. Quick-capture is the Companion's voice on a thin turn rather than a thick one.

Phase 42 — intent classification. Maybe the entry surface — see §6. If quick-capture is an eighth/tenth/Nth intent, the classifier extends. If it's a fast path, the classifier is bypassed.

Phase 43 — held + commit lifecycle. Inherited directly. Quick-capture lands held; commit is conversational (Phase 43 standard) or asynchronous (the §3.5 alpha posture) or auto (post-Phase-45 delegation).

Phase 43 — remember_about_me boundary. The classifier prompt's existing personal-fact-vs-project-fact boundary is exactly the boundary quick-capture's heuristic layer (3.3 layer 3) needs. Reuse, not re-implementation.

Phase 45 — delegation contract. Quick-capture is a delegation-contract case in waiting. Auto-commit for trusted quick-capture categories is structurally identical to Phase 45's pre-authorized engine-operation execution.

Phase 49 — bimodal dispatch. Quick-capture is plausibly Operator-direct (no Companion-as-Authority decision-making — the Companion routes but does not propose for approval). The bimodal dispatch surface from Phase 49 (delegation_required: bool) is the natural seam.

Phase 50 — Companion-as-Authority pattern, by contrast. Quick-capture is not Companion-as-Authority. The Companion is not making a proposal for the Operator to approve before action — the Companion is logging a fact. The methodology distinction between Companion-proposes / Operator-commits (Phase 49 closed-loop, Phase 50 delivery-class) and Companion-routes / Operator-views-later is worth naming. Quick-capture surfaces a third class: Companion-routes-and-records, with commit happening later through standard Memory lifecycle.

Closed-loop investigation v0.1 — the Companion as attribution channel. Quick-capture is a clean instance of "many possible observers, single attribution channel." The phone is one observer. The browser extension is another. The watch is a third. All route through the Companion to author the held assertion. The investigation's principle — observation distributed, attribution centralized — generalizes to capture devices.

Knowledge elevation pathway investigation v0.1 — held + commit as universal write path. Quick-capture is a high-frequency exercise of this pattern. Worth verifying that the held queue UX scales to dozens of held assertions per day.

6. The architectural question this investigation does not answer

The central design question: is quick-capture an intent-class extension on the existing Phase 42 classifier path, or a pre-classifier fast path?

6.1 Option (a) — intent-class extension

Shape. Quick-capture is one or more new intents on the existing Phase 42 classifier. "I parked on level 10" enters the converse pipeline normally; the classifier identifies it as quick_capture (or a more granular variant — quick_capture_personal, quick_capture_engagement); the router dispatches to a quick-capture handler that does the routing-and-write work; the responder produces the short acknowledgment.

Composition advantages.

Reuses the classifier, the persona system prompt, the responder pipeline. Quick-capture lives where conversation lives.
Intent extensions are well-trodden ground: Phase 43 added two intents this way; the pattern is mature.
The intent-class taxonomy stays the canonical statement of "things the Companion can be asked to do." Quick-capture not being in the taxonomy makes the taxonomy incomplete.
Cross-promotion (Phase 43's "I noticed you mentioned X — should I remember that?") composes naturally because cross-promotion is itself a conversation-pipeline phenomenon.

Composition costs.

Two LLM calls per dictated fact (classifier + responder). The classifier call alone is ~500-800ms. For dictation-frequency utterances, this is the wrong economics.
Engagement context loading (~2,000 token budget per Phase 43) happens for every dictated fact. Most of it is wasted — quick-capture doesn't need engagement narrative; it needs destination resolution.
Conversation-history accumulation: every quick-capture turn becomes part of the conversation history, polluting it for the next conversational turn.
Persona-system-prompt assembly and full responder LLM call for what is structurally a confirmation tone, not a conversation.

6.2 Option (b) — pre-classifier fast path

Shape. Quick-capture utterances enter through a different surface (a POST /quick-capture endpoint; a Siri shortcut delivering directly to a fast-path handler). The fast-path handler runs the deterministic router (rules → context → heuristic), authors the held assertion, and emits a short voice template-rendered acknowledgment. No classifier LLM call. No persona system prompt. No responder LLM call. The conversational pipeline is bypassed entirely.

Composition advantages.

Cost-appropriate. Sub-200ms responses for cases where the deterministic router resolves; LLM calls only for the ambiguity-fallback subset.
Conversation history stays clean. Quick-capture does not pollute the conversational state.
Different class of input gets a different class of pipeline — methodologically honest about the fact that dictation is not conversation.
The fast path can serve non-voice surfaces (keyboard shortcut, browser extension popup, watch complication, POST /quick-capture from any client) with the same machinery. The conversational pipeline is not on the critical path for quick-capture; the fast path is.

Composition costs.

New surface; new handler; new route. Substrate addition rather than intent-taxonomy extension.
The classifier-bypass means no LLM-quality routing for ambiguous cases unless the fast path explicitly delegates to a classifier when its deterministic layers can't resolve. (This is fine — but it's an additional design.)
Two pipelines for what feel like adjacent operations. The methodology must be clear about when each fires; "Hey Companion, what's the status of the soil project?" is a conversation; "Hey Companion, the soil samples arrived today" is quick-capture; the distinction must be discoverable.

6.3 Hybrid — intent on the classifier, fast-path on the surface

A third possibility worth naming: option (b)'s surface for the high-frequency case (mobile shortcut, watch complication, keyboard shortcut — all routing to POST /quick-capture directly) and option (a)'s intent for cases that come in through the chat surface (the Operator typing "I parked on level 10" into a chat box). Same router, two entry points.

This is methodologically sound: the same intent class can have multiple surfaces, just as add_knowledge can be reached through the Memory contribution UI (Phase 16) or through chat (Phase 42). What's distinctive is that the chat-pathway version pays the conversational pipeline's cost (which is acceptable when the user is already in a conversation), while the dedicated-surface version skips it (which is appropriate when the user is dictating).

What landed in this investigation. The hybrid is the most likely shape, but the question is open. The deciding factors are likely (i) operational cost expectations (how frequently quick-capture fires; what the LLM-call cost looks like at expected dictation frequency), (ii) UX cohesion (how confusing is it to have two surfaces), (iii) implementation complexity (the fast path is more new substrate code than the intent extension is). All three are scoping-time questions, not investigation-time questions.

6.4 What the next-stage scoping document needs to decide

Which of the three options.
If hybrid: the seam between the two pipelines (where they converge — likely at the router; potentially at the held-authoring step).
The voice register for the acknowledgment (single template? multi-template? expressed as a voice_loader analogous to credit_voice?).
The router's three-layer specification (rule format; context-resolution path; heuristic implementation; ambiguity-fallback to LLM or to personal-Memory-default).
Held vs. committed posture for alpha (Option 2 from §3.5 is the alpha recommendation; Option 1 reachable post-delegation-contract).
Engagement-creation-on-the-fly posture (Option (b) from §3.6 is the recommendation; capture device behavior when destination engagement doesn't exist yet).

These are scoping questions. The investigation's job is to surface them; the scoping note's job is to settle them.

7. What this investigation does not produce

A scoping note. The next-stage document settles the §6 architectural question and the §6.4 sub-decisions.
A CR. CRs come after scoping settles.
An intent-classifier prompt amendment. The prompt amendment depends on the §6.4 decisions.
A voice template. The voice template depends on the acknowledgment-register decisions in §3.4.
A routing-rules schema. The schema depends on the router's layer-1 (Operator-declared rules) decisions in §6.4.
A migration. There may not be one — quick-capture might compose entirely with existing tables (personal_engagement, the existing assertion lifecycle, the existing event log). Or it might warrant a quick_capture_log table for capture-device tracking. Scoping decides.
An engagement-creation-on-the-fly mechanism. The Discovery-to-seed skill queued direction owns that work; quick-capture is a forcing function but not the implementer.

This investigation produces architectural framing and a list of decisions for the next stage. The output is thinking material, not building material.

8. Open questions for the next stages

Carrying forward into next-stage scoping:

(a) intent extension vs. (b) pre-classifier fast path vs. hybrid. §6. Most likely hybrid; deciding factors are operational cost, UX cohesion, implementation complexity.

Routing-rule schema. §3.3 layer 1. What does an Operator-declared routing rule look like as a Memory assertion? Likely follows the Phase 49 / Phase 50 cross-engagement-Memory pattern (assertion in Credit Management Memory governs runtime substrate behavior) — quick-capture rules in personal Memory would govern the router. Boundary question: are the rules in personal Memory, or in a designated "Loomworks settings" engagement, or in a per-capture-device configuration?

Acknowledgment voice register. §3.4. Single voice template (parameterized by destination)? Multiple templates (per disambiguation level)? Voice loader at loomworks/orchestration/quick_capture_voice.py analogous to the credit_voice loader?

Held vs. committed posture for alpha. §3.5. Option 2 (held + asynchronous batch commit) is the recommendation. Sub-question: where does the Operator see the queue of held quick-capture assertions for batch commit? Existing chat surface? Dedicated surface? Memory room of the destination engagement?

Engagement-creation-on-the-fly behavior. §3.6. Option (b) (prompt at next chat open) is the recommendation. Sub-question: where does the in-flight assertion live until the engagement is created? Catch-all daily-log engagement? Dedicated holding-area engagement? Personal Memory with a "pending creation" tag?

Cross-device deduplication. Not surfaced earlier but worth flagging: if the Operator dictates the same fact through the Siri shortcut and through the watch complication within seconds (capture-device redundancy), do both assertions land? The methodology has no exact precedent; closest is Phase 48's metadata-based idempotency on conversion flows. Probably idempotency on (utterance, time-window, person) rather than (utterance, person), to avoid eating legitimate repeats hours later.

Speech-to-text quality interaction with routing. The router's confidence depends on the speech-to-text's accuracy. "I parked on level 10" misheard as "I marked at level 10" could route differently (parking → ?marking?). Should the acknowledgment include the heard utterance verbatim (forcing the Operator to catch errors), the inferred meaning (cleaner but hides errors), or both? §3.4 leaned toward verbatim; verifying with users matters.

Quick-capture and Multi-Contributor engagements. A multi-Contributor engagement's quick-capture would have to authenticate which Contributor is dictating. The capture device's auth posture (which person owns the Siri shortcut) handles this for Operator-class Contributors; multi-Contributor cases need scoping.

Audit and FORAY attestation. Quick-capture lands assertions in Memory; assertions in Memory are PROV-attested (Loom Protocol baseline). Whether quick-capture itself warrants a FORAY flow (attesting that "this dictation reached this assertion via this route") is a question for the FORAY/OVA integration roadmap. Probably no for alpha; possibly yes when capture devices proliferate and routing-rule audit becomes a concern.

Relationship to the mobile-presence investigation. The mobile capture device is the most-natural quick-capture surface, but quick-capture is broader than mobile. The two investigations should land in this order; the mobile-presence document is drafted next and references this one.

9. Implementation-readiness assessment

Quick-capture is not implementation-ready. The §6 architectural question and the §6.4 sub-decisions need scoping work. A scoping note is the next document; a CR follows after.

Two upstream considerations bear on implementation timing:

Persona-emergence dependency. The acknowledgment voice register (§3.4) depends on persona stability. Quick-capture acknowledgments fire dozens of times per day — voice quality matters at higher volume than the conversational surface. Persona-emergence through Phases 42/44/49/50 voice tuning should reach a stable register before quick-capture's voice surface is built; otherwise, quick-capture will be a heavy iteration target for voice tuning.

Mobile-presence dependency. Quick-capture and mobile-presence are conceptually independent (quick-capture works through any capture device; mobile-presence delivers any payload), but the highest-value quick-capture surface is mobile. Building quick-capture without mobile presence ships a feature the Operator can only reach through chat (which defeats most of the value). The two should ship in proximity. The scoping note should treat them as paired sequencing, not independent build streams.

The earliest reasonable phase is post-Phase-50 close, post-mobile-presence-investigation, post-scoping-pair. Phase 53+ feels right (one phase for routing substrate; one for acknowledgment voice; one for capture-device entry surfaces — mobile, keyboard, browser-extension; potentially folded if scoping reveals smaller surface than this investigation suggests).

10. What this investigation calls forward

The investigation calls forward:

The mobile-presence investigation (drafted next; pairs with this one). Capture-device-shaped questions, App Intents / App Actions integration, foreground-vs-background mic posture, iOS vs Android coverage, federation with the desktop/web Operator Layer.

A scoping note answering §6 and §6.4. Likely after mobile-presence investigation lands so the scoping is informed by the capture-device end.

A CR for the intent extension or fast path (or hybrid). Phase 53+.

A queued-direction entry in loomworks-queued-directions-and-deferred-work-v0_2.md for the pair, with cross-references to:
Engagement creation assistance + Discovery-to-seed skill (§3.6 forcing function).
Persona-prompt artifact extraction (§9 dependency on persona stability).
The closed-loop and knowledge-elevation investigations (§5 composition material).

A methodology consolidation finding for the next manifest pass: quick-capture as the third class in the methodology trinity — Companion-proposes / Operator-commits (closed-loop and delivery-class engagements); Companion-routes / Operator-views-later (quick-capture); plus the existing Companion-converses (Phase 42/43). The trinity is worth naming explicitly; the manifest's eighth-and-ninth Section 2 principle slot would fit it.

DUNIN7 — Done In Seven LLC — Miami, Florida Loomworks Quick-capture Engagement Investigation — v0.1 — 2026-05-09