Done:

- [x] Implement monitoring metrics to track improvements

Todo:

- [ ] Parallelize more TTS and database calls (study OTEL traces to confirm the opportunities)
- [ ] Reduce the dependency calls made before sending the request to the LLM, or defer them
- [ ] Compress the prompt (LLMLingua?)
- [ ] Use an LLM with lower latency (Phi 4?)
- [ ] Trace code execution with a local debugger to pinpoint unseen optimizations
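
As a rough illustration of the parallelization item, here is a minimal sketch assuming the TTS and database calls are independent coroutines. The function names (`fetch_tts_audio`, `load_call_state`, `prepare_turn_*`) and the simulated delays are hypothetical placeholders, not part of this project's code; only `asyncio.gather` is a real standard-library call.

```python
import asyncio
import time

DELAY_S = 0.1  # Simulated I/O latency, an assumption for illustration only


async def fetch_tts_audio(text: str) -> bytes:
    """Hypothetical stand-in for a TTS request."""
    await asyncio.sleep(DELAY_S)
    return text.encode("utf-8")


async def load_call_state(call_id: str) -> dict:
    """Hypothetical stand-in for a database read."""
    await asyncio.sleep(DELAY_S)
    return {"call_id": call_id}


async def prepare_turn_sequential(call_id: str, text: str) -> tuple[dict, bytes]:
    # Each await blocks the next one, so the latencies add up.
    state = await load_call_state(call_id)
    audio = await fetch_tts_audio(text)
    return state, audio


async def prepare_turn_parallel(call_id: str, text: str) -> tuple[dict, bytes]:
    # Both calls are started together; total latency is roughly the slower one.
    state, audio = await asyncio.gather(
        load_call_state(call_id),
        fetch_tts_audio(text),
    )
    return state, audio


def timed(coro_fn, *args):
    """Run a coroutine function to completion and return (result, seconds)."""
    start = time.perf_counter()
    result = asyncio.run(coro_fn(*args))
    return result, time.perf_counter() - start


if __name__ == "__main__":
    _, seq_s = timed(prepare_turn_sequential, "call-1", "Hello")
    _, par_s = timed(prepare_turn_parallel, "call-1", "Hello")
    print(f"sequential: {seq_s:.2f}s, parallel: {par_s:.2f}s")
```

This only helps when the calls do not depend on each other's results, which is what the OTEL traces should confirm before any refactoring.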