Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter out unsafe AI messages from the history #709

Open
SmittieC opened this issue Oct 8, 2024 · 0 comments
Open

Filter out unsafe AI messages from the history #709

SmittieC opened this issue Oct 8, 2024 · 0 comments

Comments

@SmittieC
Copy link
Collaborator

SmittieC commented Oct 8, 2024

We now save unsafe AI messages. This needs to be filtered out from the chat history whenever we give it to the LLM to generate a response, as well as from the chat UI when debug mode is disabled.

Proposed approach:
We tag the offending messages with a new tag. Something like SAFTEY_LAYER_TRIGGERD. Whenever we send the history to th bot or show it to the participant, we filter out all AI messages with this tag, since it is unsafe.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

No branches or pull requests

1 participant