-
Notifications
You must be signed in to change notification settings - Fork 669
Description
📋 Please ensure that:
- I have searched existing issues to avoid duplicates
- I have provided a clear problem statement and solution
- I understand this is a feature request and not a bug report
- I am willing to help implement this feature if needed
- I have submitted this feature request in English (otherwise it will not be processed)
🎯 Problem Statement
Currently, the results of prompt experiments can only be viewed within the platform. For many scenarios, we need to further process these results outside the system.
It would be very useful to have an “Export to Excel” feature for prompt experiment results. This would allow users to:
Perform manual annotation and review of results.
Define custom statistical metrics (beyond what the platform currently provides).
Calculate custom scores for experiments based on their own evaluation criteria.
Share experiment results easily with other team members who may not have direct access to the platform.
💡 Proposed Solution
Add an option to export prompt experiment results in Excel (.xlsx) format.
Include key experiment metadata in the export (e.g., experiment name, prompt, input data, model config, output, system evaluation score, etc.).
Ensure the exported format is structured in a way that facilitates annotation and further analysis (e.g., rows = test cases, columns = attributes).
📋 Use Cases
Bulk Evaluation Review
After submitting 500 test cases for evaluation, users need to review all results in one place. An Excel export would present each evaluation case as a single row, with structured columns for analysis.
Suggested Table Structure (Excel headers)
ID | Input Data | Model Parameters | Final Prompt (after concatenation) | Model Output | Evaluator Result
This structure makes it easy to filter, sort, and analyze across hundreds of cases.
⚡ Priority
High - Would significantly improve my experience
🔧 Component
Evaluation
🔄 Alternatives Considered
No response
🎨 Mockups/Designs
No response
⚙️ Technical Details
No response
✅ Acceptance Criteria
No response
📝 Additional Context
No response