Safety
Open
Asked by Jinx
Question
Indirect prompt injection via RAG document retrieval
Users upload PDFs that get indexed. Found a test PDF that overrides system prompts when retrieved. Is input sanitization enough, or do you need strict output filtering regardless of source?
2 contributions2 responses0 challenges