6 Commits

Author SHA1 Message Date
charles 284c50acd8 refactor(stt): remove speaker identification (diarization) from transcriber
Removes the speaker diarization pipeline and alignment model from the STT module to reduce resource usage and complexity.
The transcription API remains compatible by returning 'Unknown' as the speaker ID for all transcribed segments.

- Removed DiarizationPipeline and align_model from Transcriber
- Simplified transcribe method to return basic transcription segments
- Updated logging and docstrings to reflect changes
2026-06-06 20:52:04 -07:00
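
Note on 284c50acd8: the commit keeps the transcription API compatible by reporting 'Unknown' as the speaker for every segment. A minimal sketch of that idea follows. It assumes segments are plain dicts with start, end, and text keys; the helper name attach_unknown_speaker and the "speaker" key are hypothetical and are not taken from the repository.

```python
from typing import Any, Dict, List

# Placeholder label, matching the 'Unknown' value named in the commit message.
UNKNOWN_SPEAKER = "Unknown"


def attach_unknown_speaker(segments: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return copies of the segments, each tagged with a constant speaker label.

    Hypothetical helper: callers that expect a speaker field on every segment
    keep working even though no diarization is performed.
    """
    # Build new dicts so the caller's input segments are left unmodified.
    return [{**segment, "speaker": UNKNOWN_SPEAKER} for segment in segments]
```

Under these assumptions, downstream code that groups or displays transcripts by speaker needs no changes; it simply sees a single speaker.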
charles f4c98fb2b9 Migrate to WhisperX for speaker diarization
Implement a sliding window audio buffer and update the transcriber to
use WhisperX for transcription, alignment, and speaker identification.
Update the pipeline to handle and store speaker-attributed transcripts.

Additionally, update the LLM processor's reasoning parameter to
"enable_thinking".
2026-05-26 21:48:30 -07:00
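
Note on f4c98fb2b9: the commit mentions a sliding window audio buffer but does not show it. The sketch below illustrates one common way to build such a buffer, assuming audio arrives as chunks of float samples. The class name SlidingWindowBuffer and its methods are hypothetical, and the repository's actual implementation may differ.

```python
from collections import deque
from typing import Deque, Iterable, List


class SlidingWindowBuffer:
    """Keep only the most recent `window_samples` audio samples (hypothetical sketch)."""

    def __init__(self, window_samples: int) -> None:
        if window_samples <= 0:
            raise ValueError("window_samples must be positive")
        # A bounded deque discards the oldest samples automatically when full.
        self._samples: Deque[float] = deque(maxlen=window_samples)

    def push(self, chunk: Iterable[float]) -> None:
        """Append a chunk of samples; the oldest samples fall out of the window."""
        self._samples.extend(chunk)

    def window(self) -> List[float]:
        """Return the current window contents, oldest sample first."""
        return list(self._samples)

    def __len__(self) -> int:
        return len(self._samples)
```

With a window of 4 samples, pushing [1, 2, 3] and then [4, 5, 6] leaves [3, 4, 5, 6] in the window. A transcriber could then be run repeatedly over window() so that each pass sees some overlapping context from the previous one.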
charles d0fcdfab01 Improvements 2026-05-26 21:07:58 -07:00
charles 58bab75bb5 Improve audio capture and LLM integration
- Implement Silero VAD for dynamic audio chunking
- Add support for Ollama and vLLM backends
- Harden extraction prompts for strict JSON output
- Refactor TUI worker to handle proposals asynchronously
2026-05-26 19:51:48 -07:00
charles 60e170e777 Temp 2026-05-25 22:50:09 -07:00
charles 685586318f feat: implement core D&D helpers logic and system architecture 2026-05-25 22:14:58 -07:00