A precise workflow tool designed to clean, structure, and optionally analyze raw interview or podcast transcripts into high-fidelity Q&A digests without losing original context.
- State the Plan: Infer mode and target language, present the execution plan, and wait for explicit user confirmation.
- Run Parser: Execute
python3 scripts/parse_transcript.py <file>to generate a timestamp skeleton to prevent missing any segments. Skipped in upgrade mode. - Clean & Format: Clean filler words/false starts while strictly preserving the speaker's original words (no paraphrasing/summarizing).
- Translate (if applicable): Translate Q/A into target language after cleaning. Skipped if output language matches transcript language.
- Analysis (Digest Mode Only): Append hard-hitting commentary for each exchange and a final meta-analysis section.
- Clean Mode (Default): Outputs strict
Q (with timestamp)andAblocks only. No external summary or commentary. - Digest Mode: Adds deep analysis layer (
Commentary) identifying avoided tensions, verifiable claims, and a concludingMeta-analysisof recurring patterns. - Upgrade Mode: Takes an existing clean
.mdfile and adds Commentary and Meta-analysis directly. Skips parser and clean steps entirely; Q and A text is read-only.
Saves results into [interviewee]_[source]_[mode].md and runs scripts/check_digest.py before completion.