OGhidra - Ollama-GhidraMCP Bridge

OGhidra bridges the gap between Large Language Models (LLMs) running via Ollama and the Ghidra reverse engineering platform through the GhidraMCP API. It enables using natural language to interact with Ghidra for binary analysis tasks.

OGhidra Architecture

Finding Malware with the 'run-tool analyze_function()' feature

Inspecting function with strange string

Inspecting strange function

Uh oh that doesn't sound good

Ask AI to summarize our findings

Key Features

Dual API Server Architecture: Uses the original GhidraMCP server and an extended Flask-based server for comprehensive API coverage.
Multi-Phase AI Processing: Employs a Planning-Execution-Analysis workflow for structured interaction.
Flexible Model Configuration: Allows using different Ollama models for each processing phase.
Command Normalization: Improves compatibility with various LLMs by correcting command formats.
Session Memory & Caching: Features session history, Retrieval-Augmented Generation (RAG), and Cache-Augmented Generation (CAG) for contextual awareness and knowledge persistence.
Interactive & Scriptable: Can be used interactively or integrated into scripts.

Architecture Overview

OGhidra uses a streamlined three-phase approach:

Planning Phase: An LLM analyzes the user's query and generates a structured plan using Ghidra tools.
Tool Calling Phase (Execution): The plan is deterministically parsed, and the corresponding GhidraMCP client methods are called to interact with the Ghidra instance(s). This phase uses a Python function (_parse_and_execute_plan in src/bridge.py) instead of an LLM.
Analysis Phase: An LLM analyzes the results gathered from Ghidra and provides a comprehensive response.
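
The three phases above can be sketched as follows. This is a minimal illustration, not the code in src/bridge.py: the `EXECUTE: tool(args)` plan format, the `parse_plan` and `run_query` names, and the stub LLM and tool callables are all assumptions made for the example.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tool registry: maps a tool name to a callable that would
# normally be a GhidraMCP client method.
ToolRegistry = Dict[str, Callable[..., str]]


def parse_plan(plan_text: str) -> List[Tuple[str, List[str]]]:
    """Parse lines of the assumed form 'EXECUTE: tool_name(arg1, arg2)'."""
    steps = []
    for line in plan_text.splitlines():
        line = line.strip()
        if not line.startswith("EXECUTE:"):
            continue  # skip free-form reasoning text from the planner
        call = line[len("EXECUTE:"):].strip()
        name, _, rest = call.partition("(")
        args = [a.strip() for a in rest.rstrip(")").split(",") if a.strip()]
        steps.append((name.strip(), args))
    return steps


def run_query(query: str, plan_llm, analysis_llm, tools: ToolRegistry) -> str:
    plan = plan_llm(query)                    # Phase 1: planning (LLM)
    results = []
    for name, args in parse_plan(plan):       # Phase 2: execution (plain Python)
        if name not in tools:
            results.append(f"{name}: unknown tool")
            continue
        results.append(f"{name}: {tools[name](*args)}")
    return analysis_llm(query, results)       # Phase 3: analysis (LLM)


# Stubs standing in for Ollama and Ghidra so the sketch runs offline.
fake_plan = lambda q: "I will decompile main.\nEXECUTE: decompile_function(main)"
fake_analysis = lambda q, results: f"{len(results)} result(s): " + "; ".join(results)
stub_tools = {"decompile_function": lambda name: f"<decompiled {name}>"}

answer = run_query("What does main do?", fake_plan, fake_analysis, stub_tools)
print(answer)
```

Keeping the middle phase as plain parsing code means a malformed plan fails in a predictable way instead of depending on a second model call.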

Dual API Servers

Original GhidraMCP Server: Typically runs on http://localhost:8080. Provides core Ghidra functions.
Extended API Server: A Flask server (src/ghidra_mcp_server.py) running on http://localhost:8081 (default). Implements functions defined in ghidra_knowledge_cache/function_signatures.json.
Client Fallback: The GhidraMCPClient (src/ghidra_mcp_client.py) attempts calls to the original server first and falls back to the extended server if needed.
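
The fallback behaviour described above could look roughly like this. It is a sketch under assumptions: `ServerError`, `call_with_fallback`, and the two stub servers are hypothetical names, and the real GhidraMCPClient may decide when to fall back differently.

```python
from typing import Any, Callable, Dict, Sequence


class ServerError(Exception):
    """Raised by a transport when a server cannot handle a call."""


Transport = Callable[[str, Dict[str, Any]], Any]


def call_with_fallback(method: str, params: Dict[str, Any],
                       servers: Sequence[Transport]) -> Any:
    """Try each server in order and return the first successful result."""
    errors = []
    for send in servers:
        try:
            return send(method, params)
        except ServerError as exc:
            errors.append(str(exc))  # remember why this server failed
    raise ServerError(f"all servers failed for {method}: {errors}")


# Stub transports standing in for the servers on ports 8080 and 8081.
def original_server(method: str, params: Dict[str, Any]) -> Any:
    if method != "list_functions":
        raise ServerError(f"original: unsupported method {method}")
    return ["main", "FUN_00401230"]


def extended_server(method: str, params: Dict[str, Any]) -> Any:
    return {"method": method, "handled_by": "extended"}


servers = [original_server, extended_server]
print(call_with_fallback("list_functions", {}, servers))
print(call_with_fallback("analyze_function", {"name": "main"}, servers))
```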

Key Implementation Classes

Bridge: Main class coordinating the multi-phase processing (src/bridge.py).
OllamaClient: Handles communication with the Ollama API (src/ollama_client.py).
GhidraMCPClient: Communicates with the GhidraMCP servers (src/ghidra_mcp_client.py).
BridgeConfig: Centralizes configuration management (src/config.py).
MemoryManager: Manages session history and RAG (src/memory_manager.py).
CAGManager: Manages Cache-Augmented Generation (src/cag/manager.py).

📹 OGhidra Tutorial Video

Pre-installation

Contact me at [email protected] for setup help.

SETUP-GHIDRAMCP
- Ghidra 11.3.2 (https://github.com/NationalSecurityAgency/ghidra/releases/download/Ghidra_11.3.2_build/ghidra_11.3.2_PUBLIC_20250415.zip)
- GhidraMCP (https://github.com/LaurieWired/GhidraMCP/releases/download/1.3/GhidraMCP-release-1-3.zip)
  - Run Ghidra
  - Select File -> Install Extensions
  - Click the + button
  - Select the downloaded GhidraMCP zip (GhidraMCP-release-1-3.zip for the 1.3 release linked above, or your chosen version)
  - Restart Ghidra
  - Make sure the GhidraMCPPlugin is enabled in File -> Configure -> Developer
  - Optional: Configure the port in Ghidra with Edit -> Tool Options -> GhidraMCP HTTP Server

OLLAMA-SERVER-INSTALLATION
- Install Ollama
- Start the Ollama service (ollama serve)
- Pull Gemma3:27B (ollama pull gemma3:27b)

Setup and Installation

Clone the repository:

git clone <repository-url>
cd OGhidra-main

Set up Ghidra and GhidraMCP: Follow the instructions for Ghidra and the GhidraMCP plugin to have the original server running (usually on port 8080).

Create a Python virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # Linux/macOS
.\venv\Scripts\activate    # Windows

Install dependencies:
```
pip install -r requirements.txt
```
Configure environment variables:
- Copy .envexample to .env.
- Edit .env to set your Ollama endpoint (OLLAMA_API_URL), default model (OLLAMA_MODEL), and GhidraMCP server URLs (GHIDRA_MCP_URL, GHIDRA_MCP_EXTENDED_URL).
- Configure phase-specific models, memory, and CAG settings as needed (see below).
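
As an illustration of how these variables might be consumed, the sketch below reads them into a small settings object. The `Settings` class and `load_settings` function are hypothetical (the project's own handling lives in BridgeConfig), and the fallback values are assumptions: ports 8080 and 8081 come from this README, llama3 from the example further down, and 11434 is Ollama's usual default port.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    ollama_api_url: str
    ollama_model: str
    ghidra_mcp_url: str
    ghidra_mcp_extended_url: str


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Build settings from environment variables, with assumed defaults."""
    env = os.environ if env is None else env
    return Settings(
        ollama_api_url=env.get("OLLAMA_API_URL", "http://localhost:11434"),
        ollama_model=env.get("OLLAMA_MODEL", "llama3"),
        ghidra_mcp_url=env.get("GHIDRA_MCP_URL", "http://localhost:8080"),
        ghidra_mcp_extended_url=env.get(
            "GHIDRA_MCP_EXTENDED_URL", "http://localhost:8081"
        ),
    )


# Passing a plain dict keeps the example independent of the real environment.
settings = load_settings({"OLLAMA_MODEL": "gemma3:27b"})
print(settings.ollama_model, settings.ghidra_mcp_url)
```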

After installation:
- Run 'python main.py --interactive'.
- Check health: type 'health' at the prompt. This shows whether your local Ollama and local Ghidra servers are connected (Ghidra has to be open).

Interactive Mode:

python src/main.py --interactive

Interactive Mode Commands

When running OGhidra in interactive mode (python src/main.py --interactive), you have access to several commands to inspect and interact with the loaded binary:

run-tools analyze_function <function_name_or_address>: Decompiles and provides an analysis of the specified function. For example: analyze_function FUN_00401230 or analyze_function main.
run-tools strings: Lists all discovered strings within the binary. You can then ask follow-up questions about specific strings.
run-tools imports: Displays a list of all imported functions and the libraries they belong to.
run-tools exports: Shows all exported symbols from the binary.
review_session: Allows you to review the commands and AI responses from the current interactive session.
cag: Displays the current status of Cache-Augmented Generation (CAG), including whether it's enabled and information about the knowledge and session caches. Use this to check if CAG is active and what context it's using.
health: Checks the operational status of the Ollama and GhidraMCP bridge connections.
tools: Lists available tools/commands that can be used.
models: Lists the Ollama models available to the bridge.
vector-store: (If RAG/vector embeddings are enabled) Provides information or options related to the vector store.
help: Shows a list of available interactive commands and their descriptions.
exit / quit: Exits the interactive mode.

These commands leverage the underlying GhidraMCP functionalities and the AI's analytical capabilities to provide insights into the binary.
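
For a rough idea of what a check like the health command could do, the sketch below probes each configured URL and reports whether anything answered. It is not the bridge's implementation: `is_reachable` and `health_report` are hypothetical names, and treating any HTTP response (even an error status) as "up" is an assumption.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict


def is_reachable(url: str, timeout: float = 2.0) -> bool:
    """Return True if an HTTP server answers at url, whatever the status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        return True   # the server replied, just not with a 2xx status
    except (urllib.error.URLError, OSError, ValueError):
        return False  # refused, timed out, or malformed URL


def health_report(endpoints: Dict[str, str],
                  probe: Callable[[str], bool] = is_reachable) -> Dict[str, str]:
    """Map each service name to 'up' or 'down' using the given probe."""
    return {name: ("up" if probe(url) else "down")
            for name, url in endpoints.items()}


endpoints = {
    "ollama": "http://localhost:11434",
    "ghidra_mcp": "http://localhost:8080",
    "ghidra_mcp_extended": "http://localhost:8081",
}

# A fake probe keeps the demo offline; drop the second argument to probe for real.
fake_probe = lambda url: url.endswith(":8080")
report = health_report(endpoints, fake_probe)
print(report)
```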

Single Query:

python src/main.py "Your analysis query here"

Example of hardcoded AI features:

'run-tool list_imports()'

'run-tool list_strings()'

Configuration Details

Configuration is primarily managed via the .env file and command-line arguments.

Models and Phases

Set the default model: OLLAMA_MODEL=llama3
Set phase-specific models (optional):
- OLLAMA_MODEL_PLANNING=gemma3:27b
- OLLAMA_MODEL_ANALYSIS=gemma3:27b
- (Note: The Execution phase uses deterministic Python code, not an LLM).
Use --list-models to see available Ollama models.
Phase-specific system prompts can also be set (e.g., OLLAMA_SYSTEM_PROMPT_PLANNING).
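
One way to picture the model selection above: look for a phase-specific variable first and fall back to the default model. The `model_for_phase` helper is hypothetical, and the fallback order is an assumption based on the phase-specific settings being optional.

```python
from typing import Mapping


def model_for_phase(phase: str, env: Mapping[str, str]) -> str:
    """Pick the Ollama model for a phase, falling back to OLLAMA_MODEL."""
    if phase.lower() == "execution":
        # The execution phase is deterministic Python code and uses no model.
        raise ValueError("the execution phase does not use an LLM")
    specific = env.get(f"OLLAMA_MODEL_{phase.upper()}")
    return specific or env["OLLAMA_MODEL"]


example_env = {
    "OLLAMA_MODEL": "llama3",
    "OLLAMA_MODEL_PLANNING": "gemma3:27b",
}
print(model_for_phase("planning", example_env))  # phase-specific value
print(model_for_phase("analysis", example_env))  # falls back to the default
```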

See README-MODELS.md and README-MODEL-SWITCHING.md (now incorporated here) for more details on model selection recommendations.

Command Normalization

The system automatically normalizes command names (e.g., decompileFunction -> decompile_function) and parameters to improve compatibility with LLMs that don't strictly follow the required format. Normalizations are logged to the console.
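
The name conversion in the example above can be sketched with a single regular expression. This is only an illustration of the idea: `normalize_command_name` is a hypothetical helper, it covers command names but not parameters, and the bridge's actual rules may be broader.

```python
import logging
import re

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger("normalizer")


def normalize_command_name(name: str) -> str:
    """Convert camelCase or kebab-case command names to snake_case."""
    # Insert an underscore wherever a lowercase letter or digit is
    # immediately followed by an uppercase letter.
    snake = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", name.strip())
    normalized = snake.replace("-", "_").lower()
    if normalized != name:
        log.info("normalized %r -> %r", name, normalized)
    return normalized


print(normalize_command_name("decompileFunction"))
print(normalize_command_name("list-imports"))
print(normalize_command_name("decompile_function"))
```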

See README-COMMAND-NORMALIZATION.md (now incorporated here) for details.

Session Memory (History & RAG)

Enable/Disable: SESSION_HISTORY_ENABLED=true / false
Storage Path: SESSION_HISTORY_PATH="data/ollama_ghidra_session_history.jsonl"
Max Sessions: SESSION_HISTORY_MAX_SESSIONS=1000
Vector Embeddings (RAG):
- SESSION_HISTORY_USE_VECTOR_EMBEDDINGS=true / false
- SESSION_HISTORY_VECTOR_DB_PATH="data/vector_db"
Command Line:
- python src/main.py --check-memory
- python src/main.py --memory-stats
- python src/main.py --clear-memory
- python src/main.py --enable-vector-embeddings / --disable-vector-embeddings
Interactive Commands: memory-health, memory-stats, memory-clear, memory-vectors-on, memory-vectors-off
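
To make the storage settings concrete, here is a sketch of an append-only JSONL history capped at a maximum number of entries. The record layout, the `append_session` helper, and the "drop the oldest entries" policy are assumptions for illustration; src/memory_manager.py may store and trim sessions differently.

```python
import json
import tempfile
from pathlib import Path


def append_session(path: Path, record: dict, max_sessions: int) -> int:
    """Append one JSON record per line and keep only the newest entries."""
    if max_sessions < 1:
        raise ValueError("max_sessions must be at least 1")
    path.parent.mkdir(parents=True, exist_ok=True)
    lines = path.read_text(encoding="utf-8").splitlines() if path.exists() else []
    lines.append(json.dumps(record))
    lines = lines[-max_sessions:]  # drop the oldest entries beyond the cap
    path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return len(lines)


# Demo in a temporary directory so nothing is written into the project tree.
with tempfile.TemporaryDirectory() as tmp:
    history = Path(tmp) / "data" / "session_history.jsonl"
    for i in range(5):
        count = append_session(history, {"query": f"q{i}"}, max_sessions=3)
    kept = [json.loads(line)["query"]
            for line in history.read_text(encoding="utf-8").splitlines()]

print(count, kept)
```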

See src/README_MEMORY.md (now incorporated here) for implementation details.

Cache-Augmented Generation (CAG)

CAG provides persistent, cached knowledge (Ghidra commands, workflows) and session context (decompiled functions, renames) without real-time retrieval.

Enable/Disable: CAG_ENABLED=true / false
Knowledge Cache: CAG_KNOWLEDGE_CACHE_ENABLED=true / false
Session Cache: CAG_SESSION_CACHE_ENABLED=true / false
Token Limit: CAG_TOKEN_LIMIT=2000
Command Line: python src/main.py --disable-cag
Interactive Command: cag (shows status)
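
The token limit can be pictured as a budget applied while assembling cached context for the prompt. The sketch below is an assumption-heavy illustration: `build_cag_context` is a hypothetical function, and counting whitespace-separated words is a crude stand-in for a real tokenizer.

```python
from typing import Iterable


def build_cag_context(snippets: Iterable[str], token_limit: int = 2000) -> str:
    """Concatenate cached snippets in order until the token budget is spent."""
    chosen, used = [], 0
    for text in snippets:
        cost = len(text.split())  # rough token estimate: one token per word
        if used + cost > token_limit:
            break  # stop at the first snippet that would exceed the budget
        chosen.append(text)
        used += cost
    return "\n".join(chosen)


cached = [
    "decompile_function(name): returns C-like pseudocode",
    "FUN_00401230 was renamed to parse_config",
    "a much longer cached workflow description " * 10,
]
context = build_cag_context(cached, token_limit=15)
print(context)
```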

Knowledge Base Files:

ghidra_knowledge_cache/function_signatures.json
ghidra_knowledge_cache/common_workflows.json (if exists)
ghidra_knowledge_cache/binary_patterns.json (if exists)
ghidra_knowledge_cache/analysis_rules.json (if exists)

See README-CAG.md (now incorporated here) for more details.

Testing

The test commands below were AI-generated and may not work. To check what's available and what's missing, just run 'python main.py --interactive' and type 'health'.
Extended API Server Tests: python -m unittest src/test_extended_api.py (Ensure the extended server is running).
Bridge/Normalization Tests: Check tests/ directory (e.g., test_command_normalization.py, test_bridge.py). Run relevant tests using unittest.
Memory Sample Data: python src/generate_sample_data.py (See memory docs for options).
