A sophisticated AI research agent that combines Schema-Guided Reasoning (SGR) with OpenAI Function Calls to create a natural, interpretable, and powerful research workflow with persistent context memory.
Traditional agents either use pure function calls (losing reasoning transparency) or structured output with local execution (missing natural LLM behavior). This agent combines the best of both worlds with persistent memory across sessions.
- Reasoning as a Tool - explicit `generate_reasoning` function call
- Controlled via `tool_choice="generate_reasoning"` - Structured Output for explicit reasoning analysis
- Model explains what to do and why
- Pure analytical thinking without tool execution
- Full transparency into the decision-making process
- Native OpenAI Function Calls with `tool_choice="auto"`
- Model naturally chooses appropriate tools based on reasoning
- Preserves the LLM's natural conversation flow
- No disruption to chat template or message structure
- Task Summaries - automatically created after each completed task
- Session History - remembers previous requests and actions across tasks
- File Memory - tracks created and modified files
- Anti-Forgetting - context is preserved between different user queries
- Web Search - Internet research via Tavily API
- Report Generation - Comprehensive reports with citations
- Date/Time Awareness - Gets current date for time-sensitive queries
- Adaptive Planning - Real-time strategy adjustment
- Read Files - Analyze local file content
- Create Files - Generate new files with specified content
- Update Files - Modify existing files (append, prepend, replace)
- File Memory - Remembers all created files across sessions
- List Directories - Browse file structure with tree view
- Create Directories - Build new folder structures
- Recursive Exploration - Deep directory analysis
- Clarification - Asks questions when requests are unclear
- Simple Answers - Quick responses without formal reports
- Multi-language Support - Russian and English
- Context Awareness - References previous conversations
- Task History - "What did I ask before?"
- Action Memory - "What files did we create?"
- Continuous Context - No information loss between tasks
- Smart Summaries - Efficient context compression
- Both phases use native OpenAI function calling
- Phase 1: `tool_choice="generate_reasoning"` - forced reasoning
- Phase 2: `tool_choice="auto"` - natural tool selection
- Maintains proper chat message flow throughout
- Model decides tool usage naturally within OpenAI's framework
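The two `tool_choice` modes above map directly onto the two request shapes the OpenAI Chat Completions API accepts. A minimal sketch of how the two phases could build their request payloads (illustrative only; the real tool list lives in `tool_schemas.py`):

```python
# Illustrative sketch of the two-phase tool_choice setup, not the
# project's exact code. A real TOOLS list would come from tool_schemas.py.
TOOLS = [
    {"type": "function",
     "function": {"name": "generate_reasoning",
                  "parameters": {"type": "object", "properties": {}}}},
]

def phase1_request(messages):
    """Phase 1: force the model to call generate_reasoning."""
    return {
        "model": "gpt-4o",
        "messages": messages,
        "tools": TOOLS,
        # Forcing one specific function uses the object form of tool_choice:
        "tool_choice": {"type": "function",
                        "function": {"name": "generate_reasoning"}},
    }

def phase2_request(messages):
    """Phase 2: let the model pick any tool (or answer directly)."""
    return {
        "model": "gpt-4o",
        "messages": messages,
        "tools": TOOLS,
        "tool_choice": "auto",
    }
```

The only difference between the phases is the `tool_choice` value; messages and tools stay the same, which is what keeps the chat flow intact.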
- Every decision is explicitly reasoned
- Clear explanation of why each action is taken
- Transparent thought process at each step
- Easy debugging and understanding
- Cross-session continuity - remembers previous interactions
- Task summaries - compact history storage
- File tracking - knows what was created/modified
- Context integration - seamlessly uses previous information
- Real-time adaptation based on new information
- Context-aware decision making
- Anti-cycling mechanisms to prevent loops
- Dynamic re-planning when needed
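One simple way to realize the anti-cycling mechanism (a sketch under assumed thresholds, not the project's actual logic) is to keep a sliding window of recent actions and flag a loop when the same action repeats too often:

```python
from collections import deque

class LoopDetector:
    """Flags a reasoning loop when the same action recurs within a
    sliding window. Window size and repeat threshold are illustrative."""

    def __init__(self, window: int = 6, max_repeats: int = 3):
        self.recent = deque(maxlen=window)
        self.max_repeats = max_repeats

    def record(self, action: str) -> bool:
        """Record an action; return True if a loop is detected."""
        self.recent.append(action)
        return self.recent.count(action) >= self.max_repeats
```

When `record` returns `True`, the agent can trigger dynamic re-planning instead of repeating the same tool call.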
```
├── sgr_agent.py            # Main orchestration engine
├── models.py               # Pydantic models for type safety
├── tool_schemas.py         # OpenAI function schemas
├── executors.py            # Tool execution logic
├── prompts.yaml            # System prompts configuration
├── config.yaml.example     # Configuration template
├── requirements.txt        # Python dependencies
├── gui_app.py              # Chainlit web interface
├── api_server.py           # FastAPI OpenAI-compatible server
├── test_openai_client.py   # OpenAI client test script
├── start_api.sh            # API server startup script
└── API_README.md           # API server documentation
```
```mermaid
graph TD
    A[User Query] --> B[Load Previous Context]
    B --> C[Phase 1: SGR Analysis]
    C --> D[Structured Output Call]
    D --> E[ReasoningStep Model]
    E --> F{Validation}
    F -->|Pass| G[Phase 2: Tool Execution]
    F -->|Fail| C
    G --> H[Function Calls Auto]
    H --> I[Local Tool Execution]
    I --> J[Update Context]
    J --> K{Task Complete?}
    K -->|No| C
    K -->|Yes| L[Create Task Summary]
    L --> M[Save to Global Context]
    M --> N[Task Completion]
```
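The control flow in the diagram reduces to a small loop. A runnable sketch with stubbed-in phases (the function names `reason`, `execute`, and `summarize` are placeholders, not the project's real API):

```python
def run_task(query, reason, execute, summarize, max_rounds=8):
    """Minimal control loop mirroring the diagram above.
    `reason` returns a plan (Phase 1), `execute` runs tools and reports
    completion (Phase 2), `summarize` builds the task summary."""
    context = {"query": query, "history": []}   # Load Previous Context
    for _ in range(max_rounds):
        plan = reason(context)                  # Phase 1: SGR analysis
        result, done = execute(plan, context)   # Phase 2: tool execution
        context["history"].append(result)       # Update context
        if done:                                # Task complete?
            return summarize(context)           # Create task summary
    return summarize(context)                   # Stop after max_rounds
```

With trivial stubs the loop terminates on the first round; in the real agent, `max_rounds` bounds re-planning the same way `max_rounds: 8` does in the config.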
- `generate_reasoning` - Analyze the situation and plan next steps
- `clarification` - Ask clarifying questions when the request is unclear
- `simple_answer` - Provide quick, direct answers
- `web_search` - Search the internet for information
- `create_report` - Generate comprehensive reports with citations
- `get_current_datetime` - Get the current date and time
- `read_local_file` - Read content from local files
- `create_local_file` - Create new files with specified content
- `update_local_file` - Modify existing files (append, prepend, replace)
- `list_directory` - Show directory contents (supports tree view)
- `create_directory` - Create new directories (with user confirmation)
- `report_completion` - Mark tasks as completed
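Each tool is declared to the model as a JSON schema in the OpenAI function-calling format. A hypothetical entry for `web_search` might look like this (the parameter names are assumptions for illustration, not the project's exact schema from `tool_schemas.py`):

```python
# Hypothetical schema for the web_search tool in OpenAI's
# function-calling format; parameter names are illustrative.
WEB_SEARCH_SCHEMA = {
    "type": "function",
    "function": {
        "name": "web_search",
        "description": "Search the internet for information via Tavily.",
        "parameters": {
            "type": "object",
            "properties": {
                "query": {"type": "string", "description": "Search query"},
                "max_results": {"type": "integer",
                                "description": "Maximum results to return"},
            },
            "required": ["query"],
        },
    },
}
```

The same shape repeats for every tool in the list above; only `name`, `description`, and `parameters` change.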
```bash
pip install -r requirements.txt
```
or
```bash
pip install uv
uv sync
```
Set your API keys:
```bash
export OPENAI_API_KEY="your-openai-key"
export TAVILY_API_KEY="your-tavily-key"
```
Or create `config.yaml`:
```yaml
openai:
  api_key: "your-openai-key"
  model: "gpt-4o"
  temperature: 0.3
tavily:
  api_key: "your-tavily-key"
execution:
  max_rounds: 8
  max_searches_total: 6
```
Run the console agent:
```bash
python sgr_agent.py
```
Or start the web interface:
```bash
chainlit run gui_app.py -w
```
The web interface will be available at http://localhost:8000
Web Interface Features:
- Beautiful chat interface
- Real-time progress tracking
- Formatted reports and results
- Visual feedback for all operations
- Mobile-friendly design
- Auto-reload during development (with the `-w` flag)
```text
Enter research task: Find current Bitcoin price
Analysis: Need current date for accurate pricing
Getting current date: 2025-08-29
Search: 'Bitcoin price 29 August 2025'
Answer: Bitcoin is trading at $166,912 (projected)
```

```text
Enter research task: Create a Python script for data analysis
Analysis: User wants Python script creation
Creating file: data_analysis.py
File created with data processing functions
```

```text
Enter research task: What did I ask before?
Analysis: Checking previous session history
Previous actions:
- Request: 'Find Bitcoin price' → Actions: web search, simple answer
- Request: 'Create Python script' → Actions: file creation
Answer: You previously asked about Bitcoin price and Python script creation
```
- `OPENAI_API_KEY`: Your OpenAI API key
- `TAVILY_API_KEY`: Your Tavily search API key
- `OPENAI_MODEL`: Model to use (default: gpt-4o)
- `MAX_ROUNDS`: Maximum research rounds (default: 8)
- `MAX_SEARCHES_TOTAL`: Maximum searches per session (default: 6)
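A small loader sketch for these variables and their documented defaults (the helper name and returned dict shape are assumptions for illustration, not the project's actual config code):

```python
import os

def load_settings(env=os.environ):
    """Read agent settings from the environment, falling back to the
    documented defaults. Illustrative helper, not the project's loader."""
    return {
        "model": env.get("OPENAI_MODEL", "gpt-4o"),
        "max_rounds": int(env.get("MAX_ROUNDS", "8")),
        "max_searches_total": int(env.get("MAX_SEARCHES_TOTAL", "6")),
    }
```

Passing `env` explicitly keeps the helper testable without mutating the process environment.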
Edit `prompts.yaml` to customize system prompts:
```yaml
structured_output_reasoning:
  template: |
    You are a reasoning module...
    # Customize reasoning instructions

outer_system:
  template: |
    You are an expert researcher...
    # Customize main system prompt
```
The agent maintains memory across sessions through:
- Task Summaries - Each completed task creates a structured summary
- History Integration - Previous actions are loaded into new conversations
- File Tracking - All created/modified files are remembered
- Smart Context - Relevant history is automatically included
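The persistence layer behind these points can be as simple as serializing compact summaries to disk. A sketch of one possible shape (the `TaskSummary` fields and JSON layout are assumptions, not the project's actual models from `models.py`):

```python
import json
from dataclasses import dataclass, field, asdict

@dataclass
class TaskSummary:
    """Compact record of one finished task (illustrative shape)."""
    request: str
    actions: list = field(default_factory=list)
    files: list = field(default_factory=list)

def save_summaries(summaries, path):
    """Persist all task summaries as a JSON list."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump([asdict(s) for s in summaries], f, ensure_ascii=False)

def load_summaries(path):
    """Reload summaries at the start of a new session."""
    with open(path, encoding="utf-8") as f:
        return [TaskSummary(**d) for d in json.load(f)]
```

Loading this list at session start and prepending it to the conversation is what lets the agent answer questions like "What did I ask before?".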
- Prevents repetitive clarification requests
- Detects and breaks reasoning loops
- Ensures forward progress on tasks
- Automatic language detection from user input
- Consistent language usage throughout responses
- Russian and English support
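Since only Russian and English are supported, language detection can be a one-line heuristic. A naive sketch (a real agent would likely use something more robust; this is only an assumption about how it could work):

```python
def detect_language(text: str) -> str:
    """Naive heuristic: any Cyrillic character means Russian,
    otherwise English. Illustrative, not the project's detector."""
    return "ru" if any("\u0400" <= ch <= "\u04ff" for ch in text) else "en"
```

The detected code can then be threaded through prompts so responses stay in the user's language.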
- Graceful handling of API failures
- Structured output validation with fallbacks
- Context preservation during errors
```text
Session 1:
User: "Research Tesla Model S pricing"
Agent: Creates comprehensive report → Saves to context

Session 2:
User: "What did I research before?"
Agent: "You researched Tesla Model S pricing and created a report"

Session 3:
User: "Now compare with BMW i7"
Agent: References previous Tesla research → Creates comparison
```
The project includes a FastAPI-based API server that provides full OpenAI API compatibility with streaming support and SGR integration.
- OpenAI API Compatibility - Drop-in replacement for the OpenAI API
- Streaming Support - Real-time streaming with `stream: true`
- SGR Integration - Full Schema-Guided Reasoning capabilities
- Auto-documentation - Swagger/ReDoc documentation
```bash
# Install dependencies
pip install -r requirements.txt

# Start API server
python api_server.py
```
The API server will be available at:
- API Endpoint: http://localhost:8000/v1/chat/completions
- Documentation: http://localhost:8000/docs
- Health Check: http://localhost:8000/health
```python
from openai import OpenAI

# Point the client at the local API
client = OpenAI(
    api_key="dummy-key",  # Not used by the local API
    base_url="http://localhost:8000/v1",
)

# Same API calls as with OpenAI
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Research Python"}],
    stream=False,
)
```
The project includes a modern web interface built with Chainlit that provides a superior user experience compared to the console version.
- Interactive Chat - Natural conversation flow with the AI agent
- Beautiful UI - Modern, responsive design that works on all devices
- Real-time Updates - Watch the agent's reasoning and tool execution live
- Rich Content - Formatted reports, code blocks, and structured data display
- Progress Tracking - Visual indicators for search progress and tool execution
- Session Management - Automatic context preservation between conversations
- Error Handling - Graceful error display and recovery options
```bash
# Install chainlit if not already installed
pip install "chainlit>=1.0.0"

# Start the web interface with auto-reload
chainlit run gui_app.py -w
```
The interface will be available at http://localhost:8000

| Feature | Console | Web Interface |
|---|---|---|
| User Experience | Text-based | Rich, interactive |
| Visual Feedback | Limited | Comprehensive |
| Progress Tracking | Basic | Real-time with animations |
| Report Display | Plain text | Formatted with syntax highlighting |
| Mobile Support | No | Yes |
| Multi-session | Manual | Automatic |
| File Operations | Text output | Visual file browser |
The web interface automatically uses the same configuration as the console version:
- API keys from `config.yaml` or environment variables
- Same tool set and capabilities
- Identical reasoning and memory systems
- Fork the repository
- Create a feature branch
- Make your changes with type hints and tests
- Update documentation as needed
- Submit a pull request
MIT License - see LICENSE file for details.
Built with ❤️ for transparent, powerful AI research automation with persistent memory