6 Commits

Author SHA1 Message Date
charles 01b049cf37 Update main.py 2026-06-05 23:10:39 -07:00
charles da5ab1bb44 Refactor STT pipeline and CLI documentation
Split the STT worker into a collector and a transcription worker
to offload heavy processing to a background thread. Add the
`--whisper-model` flag and implement LLM latency logging. Expand
the README with comprehensive CLI usage instructions.
2026-05-31 15:04:41 -07:00
charles 71ecdb3468 Add LLM configuration and pipeline execution 2026-05-31 14:13:58 -07:00
charles afa8d17f10 Mostly working 2026-05-28 00:08:52 -07:00
charles b25f82cefc Implement RAG summarization and context pipeline
- Add ContextPipeline for async RAG lookups
- Implement RAG result summarization via LLMProcessor
- Add CLI flag for PDF ingestion
- Strip markdown code blocks from LLM responses
- Update TUI context display to use ListItems
2026-05-27 00:17:47 -07:00
charles f4c98fb2b9 Migrate to WhisperX for speaker diarization
Implement a sliding window audio buffer and update the transcriber to
use WhisperX for transcription, alignment, and speaker identification.
Update the pipeline to handle and store speaker-attributed transcripts.

Additionally, update the LLM processor's reasoning parameter to
"enable_thinking".
2026-05-26 21:48:30 -07:00