Faraday Web Research Agent 🕵️‍♀️

✨ Overview

Faraday is a comprehensive web research agent designed to investigate queries by autonomously gathering and analyzing information from multiple online sources. It uses a sophisticated agent powered by LLMs and LangGraph to dynamically select tools, conduct research, and synthesize findings, ultimately delivering a structured research report directly within a Streamlit interface.

🚀 Features

🤖 Agentic Workflow: Employs an AI agent orchestrated with LangGraph to manage the entire research process directly within Streamlit.
🛠️ Dynamic Tool Selection: The agent intelligently chooses the best tools (Search Engines, Web Scrapers, APIs) based on the query and intermediate findings.
🔍 Multi-source Evidence Collection: Gathers information from diverse sources using tools like Tavily, Google Search (via Gemini), DuckDuckGo, Wikidata, NewsAPI, and Firecrawl.
🧩 Query Decomposition: Can break down complex queries into simpler, searchable sub-questions using LLMs.
📝 Structured Reporting: Synthesizes findings into a well-organized report with a summary, detailed sections, and source attribution.
🔗 Source Attribution: Transparently lists all sources consulted and the tools used to access them.
🖥️ Modern Dark Mode Interface: Clean, user-friendly Streamlit interface for interaction and result presentation.

🏗️ System Architecture

Faraday leverages an agentic architecture, orchestrated using LangGraph, running directly within the Streamlit application. Instead of a fixed pipeline, a central Web Research Agent dynamically plans and executes tasks using a suite of available tools:

graph TD
    user([User])
    app[Streamlit App]
    agent[LangGraph Agent]
    llm1[Primary LLM: Decision Making]
    llm2[Parser LLM: Gemini Output Parsing]
    
    subgraph tools[Tools]
        t1[Tavily Search]
        t2[DuckDuckGo Search]
        t3[Gemini Search Tool]
        t4[Firecrawl Scrape Tool]
        t5[News API Tool]
        t6[Wikidata Search Tool]
        t7[Query Decomposition Tool]
        t8[FINISH Signal]
    end
    
    subgraph sources[External Sources]
        s1[Websites]
        s2[APIs/Databases]
    end
    
    report[(Research Report Schema)]
    
    user --> |Inputs Query| app
    app --> |Invokes Agent Directly| agent
    agent --> |Reasoning & Tool Selection| llm1
    agent --> |Parses Specific Outputs| llm2
    agent --> |Tool Invocation| tools
    tools --> |Data Retrieval| sources
    sources --> |Returns Data| tools
    tools --> |Observations| agent
    agent --> |Synthesizes Report| report
    agent --> |Returns Report/Error| app
    app --> |Displays Report| user

This diagram represents a high-level overview of the simplified system components and their interactions, driven by the agent's dynamic workflow within Streamlit.

⚙️ How the System Works (Agentic Flow)

The web research process is driven by the agent's autonomous reasoning within the LangGraph framework, executed within the Streamlit app:

User Input: A user submits a research query via the Streamlit UI (app.py).
Agent Execution: The Streamlit app directly invokes the LangGraph agent (run_web_research) with the query:
- The agent analyzes the query and decides the next best action (e.g., decompose query, search the web, scrape a page).
- It invokes the appropriate tool with specific inputs.
- It processes the tool's observation (output) and updates its internal state.
- This cycle continues (Agent -> Tool -> Agent) until the agent determines it has gathered sufficient information.
Final Synthesis: Once the agent decides to finish, it synthesizes all gathered information and structured observations into a ResearchReport or ErrorResponse object.
Presentation: The final report or error message is returned to the Streamlit app and presented directly to the user in the UI.

🚀 Getting Started

Prerequisites

Python 3.8+
Required API keys stored securely (e.g., in a .env file) for the tools you intend to use (e.g., OpenAI, Google AI, Tavily, NewsAPI, Firecrawl).

Installation

Clone the repository:

git clone <repository_url>
cd <repository_directory>

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables by creating a .env file in the root directory with your API keys.

Running the Application

Start the Streamlit application:
```
streamlit run app.py
```
The app will usually be available at http://localhost:8501.

🤔 Example Queries to Try

Challenge the agent with various research queries:

"What are the latest advancements in renewable energy technology?"
"Explain the concept of large language models and their applications."
"Compare and contrast the economic impacts of Brexit on the UK and the EU."
"Provide a brief history of the internet."

🔌 API Usage

(The separate API endpoint is no longer applicable as the logic is integrated into the Streamlit app.)

🤝 Contributing

Contributions are welcome! If you have suggestions, bug reports, or want to add new tools or features, please feel free to:

Open an issue to discuss the change.
Fork the repository.
Create a new branch (git checkout -b feature/YourFeature).
Make your changes.
Commit your changes (git commit -m 'Add some feature').
Push to the branch (git push origin feature/YourFeature).
Open a Pull Request.

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Built with ❤️ using Python, Streamlit, and LangChain/LangGraph.
Leverages powerful APIs and tools from Tavily AI, Google AI (Gemini), DuckDuckGo, Firecrawl, NewsAPI, Wikidata, and others.
Inspired by the need for effective and automated information gathering.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.streamlit		.streamlit
docs		docs
research_system		research_system
tests		tests
.gitignore		.gitignore
Logo.png		Logo.png
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Faraday Web Research Agent 🕵️‍♀️

✨ Overview

🚀 Features

🏗️ System Architecture

⚙️ How the System Works (Agentic Flow)

🚀 Getting Started

Prerequisites

Installation

Running the Application

🤔 Example Queries to Try

🔌 API Usage

🤝 Contributing

📜 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Kaos599/Faraday-Web-Researcher-Agent

Folders and files

Latest commit

History

Repository files navigation

Faraday Web Research Agent 🕵️‍♀️

✨ Overview

🚀 Features

🏗️ System Architecture

⚙️ How the System Works (Agentic Flow)

🚀 Getting Started

Prerequisites

Installation

Running the Application

🤔 Example Queries to Try

🔌 API Usage

🤝 Contributing

📜 License

🙏 Acknowledgments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages