├── src/
│ ├── agent/ # Core agent logic
│ │ ├── graph/ # Langgraph workflow definitions
│ │ ├── prompts/ # Prompt templates
│ │ └── utils/ # Utility functions
│ ├── app/ # Application examples
│ │ └── demo/ # Demo workflows
│ └── mcp_servers/ # Sandbox server implementations
├── tests/ # Test files and sample data
└── .env.example # Environment configuration example
The system can automatically generate detailed data analysis plans based on user-provided data, including data exploration, statistical analysis, and visualization schemes. Through well-designed prompt templates, it guides LLMs to generate structured analysis ideas.
Leveraging E2B Code Interpreter's secure sandbox environment, it can safely execute generated Python code, preventing malicious code execution and system resource abuse. It supports capturing and parsing code execution results.
It supports generating various data visualization charts, including bar charts, line charts, scatter plots, etc., to help users intuitively understand data characteristics and analysis results.
It can export the complete analysis process and results in Notebook format, facilitating users to review, modify, and share later.
- Langchain/Langgraph: For building state graphs and workflow orchestration of intelligent agents
- E2B Code Interpreter: Providing secure code execution sandbox environment
- FastMCP/MCP: Implementing tool-calling mechanism
- LLM models such as DeepSeek: Providing natural language processing and code generation capabilities
- Python: Main development language
- Python 3.10+ environment
- E2B API Key (for sandbox service)
- LLM API Key (supporting models like DeepSeek)
-
Clone the project repository
git clone https://github.com/yourusername/DAAgent.git cd DAAgent -
Install dependencies
pip install -r requirements.txt
-
Set up environment variables
cp .env.example .env
Edit the
.envfile and fill in your API keys and other configuration information. -
Start the MCP server
python -m src.mcp_servers.server
-
Run the agent demo
python -m src.app.demo
- Initialization : Create a secure sandbox environment and pre-install necessary Python libraries
- Data Upload : User uploads data files to be analyzed
- Plan Generation : Generate detailed analysis plans based on data content
- Code Execution : Execute generated analysis code in the sandbox
- Result Processing : Collect code execution results and visualization charts
- Report Generation : Generate complete analysis reports
Tips: README was generated by AI. Please submit issues for any errors or omissions.
