Binary Analysis Toolkit (BAT)

Static analysis for suspicious binaries, focused on Windows PE files. BAT inspects a sample without running it, extracts IOCs, highlights suspicious capabilities, and produces a verdict to help triage.

Install Python dependencies with uv sync. Optional external tools (Radare2, Ghidra, ilspycmd, UPX) and rule-based scanners (capa, YARA) enable deeper analysis.

If you are triaging a live alert, start with docs/guide.md.

Disclaimer

Use this toolkit only for defensive, authorized analysis. The authors assume no liability for misuse or damage caused by use of the software.

Quick Start

git clone <repo-url>
cd binary-analysis-toolkit
uv sync

# Basic analysis
uv run binanalysis suspicious.exe

# Enable YARA or capa
uv run binanalysis suspicious.exe --yara
uv run binanalysis suspicious.exe --capa

# Decompile and generate an LLM report
uv run binanalysis suspicious.exe \
  --decompile ghidra \
  --yara \
  --capa \
  --llm-report \
  --llm-url http://ollama:11434 \
  --llm-model qwen3.5 \
  --llm-timeout 600 \
  --debug

BAT prints a verdict to the terminal and saves <filename>_analysis.json and <filename>_analysis.html next to the sample. --llm-report adds a natural-language report. --yara and --capa download rules on first use.

Static analysis works best on unpacked binaries. If upx is installed, BAT automatically unpacks UPX-packed samples.
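Packed or encrypted data tends to have byte entropy close to the 8 bits-per-byte maximum, which is why entropy is a common packing hint. The following is a minimal sketch of that measurement using only the standard library. It is not BAT's implementation, and the function name is hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(data: bytes) -> float:
    """Return Shannon entropy in bits per byte (0.0 to 8.0)."""
    if not data:
        return 0.0
    total = len(data)
    counts = Counter(data)
    # Sum p * log2(1/p) over every byte value that occurs.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# Repetitive data scores low; uniformly distributed bytes score the maximum.
print(shannon_entropy(b"A" * 1024))        # 0.0
print(shannon_entropy(bytes(range(256))))  # 8.0
```

Computing this per section rather than per file is usually more informative, because a small packed section can be hidden by a large low-entropy one.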

Why BAT

BAT automates the first-pass checks analysts usually do by hand:

Compute hashes and imphash
Inspect PE metadata, sections, entropy, imports, exports, and resources
Extract strings and IOCs
Match suspicious behaviors to ATT&CK-style techniques
Optionally run YARA, capa, and decompilation
Produce a final verdict for triage
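As an illustration of the first item, here is a minimal sketch of file hashing with Python's standard library. The choice of MD5, SHA-1, and SHA-256 is an assumption, since this README does not say which digests BAT computes, and the function name is hypothetical. The imphash is omitted because it requires parsing the PE import table.

```python
import hashlib
from pathlib import Path


def file_hashes(path: Path, chunk_size: int = 1 << 20) -> dict[str, str]:
    """Hash a file in chunks so large samples are not read into memory at once."""
    # Assumed digest set; BAT's actual selection may differ.
    digests = {name: hashlib.new(name) for name in ("md5", "sha1", "sha256")}
    with path.open("rb") as handle:
        while chunk := handle.read(chunk_size):
            for digest in digests.values():
                digest.update(chunk)
    return {name: digest.hexdigest() for name, digest in digests.items()}
```

Calling file_hashes(Path("suspicious.exe")) returns a dictionary keyed by algorithm name, which is convenient for lookups against threat-intelligence sources.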

Key Features

PE header, section, entropy, Rich header, import, export, resource, TLS, overlay, and version-info analysis
.NET metadata inspection and optional ilspycmd decompilation
String extraction with threat-oriented pattern matching
Behavioral rules for common malware techniques such as injection, credential theft, persistence, exfiltration, and ransomware activity
IOC extraction for URLs, domains, file paths, registry keys, tokens, user agents, UUIDs, and environment variables
Optional YARA and capa integration
Optional Radare2 or Ghidra decompilation
Optional LLM-generated analyst report from a local or remote Ollama-compatible endpoint
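To make the IOC extraction feature concrete, the sketch below pulls three of the listed IOC types out of a block of extracted strings. The patterns and names are illustrative assumptions, deliberately simple, and not BAT's actual rules.

```python
import re

# Hypothetical patterns for illustration; real samples need stricter handling.
IOC_PATTERNS = {
    "url": re.compile(r"https?://[^\s\"'<>]+", re.IGNORECASE),
    "registry_key": re.compile(r"\b(?:HKLM|HKCU|HKEY_[A-Z_]+)\\[^\s\"']+"),
    "uuid": re.compile(
        r"\b[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}\b",
        re.IGNORECASE,
    ),
}


def extract_iocs(text: str) -> dict[str, list[str]]:
    """Return sorted, de-duplicated matches for each IOC type."""
    return {
        name: sorted(set(pattern.findall(text)))
        for name, pattern in IOC_PATTERNS.items()
    }


sample = (
    "GET http://example.test/payload.bin "
    r"HKCU\Software\Microsoft\Windows\CurrentVersion\Run "
    "id=123e4567-e89b-12d3-a456-426614174000"
)
for kind, values in extract_iocs(sample).items():
    print(kind, values)
```

De-duplicating before reporting keeps the output short when the same indicator appears many times in a binary.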

Installation

Base install

git clone <repo-url>
cd binary-analysis-toolkit
uv sync

Optional external tools

| Tool | Purpose |
| --- | --- |
| `upx` | Unpack UPX-packed binaries |
| `radare2` | Native pseudocode decompilation |
| `ghidra` | Headless decompilation with suspicious-function filtering |
| `ilspycmd` | .NET IL decompilation |

Install commands:

# upx
sudo apt-get install upx          # Debian/Ubuntu
brew install upx                  # macOS

# radare2 + r2pipe Python binding
sudo apt-get install radare2      # Debian/Ubuntu
brew install radare2              # macOS
uv add r2pipe                     # Python binding (required for --decompile r2)

# Ghidra (headless decompilation — biggest impact on analysis quality)
# Download the latest release zip from:
#   https://github.com/NationalSecurityAgency/ghidra/releases
# Then install:
unzip ghidra_*.zip -d ~/tools/
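# Globs are not expanded in variable assignments, so resolve the path explicitly
# (picks the first match if several versions are unpacked)
export GHIDRA_HEADLESS="$(ls -d ~/tools/ghidra_*/support/analyzeHeadless | head -n 1)"
# Add to ~/.bashrc or ~/.zshrc to persist

# Verify Ghidra is found:
"$GHIDRA_HEADLESS" --help 2>&1 | head -n 1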

# ilspycmd (.NET decompilation)
dotnet tool install -g ilspycmd
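Before running an analysis, it can help to confirm which optional tools are actually reachable. The sketch below checks PATH and the GHIDRA_HEADLESS variable. The executable names are assumptions and may differ on your system, and this is not how BAT itself necessarily detects tools.

```python
import os
import shutil
from typing import Optional

# Executable names are assumptions; adjust them to match your install.
OPTIONAL_TOOLS = {
    "upx": ["upx"],
    "radare2": ["r2", "radare2"],
    "ilspycmd": ["ilspycmd"],
}


def first_on_path(names: list[str]) -> Optional[str]:
    """Return the resolved path of the first name found on PATH, if any."""
    for name in names:
        path = shutil.which(name)
        if path:
            return path
    return None


def find_optional_tools() -> dict[str, Optional[str]]:
    """Map each optional tool to its resolved path, or None if not found."""
    found = {tool: first_on_path(names) for tool, names in OPTIONAL_TOOLS.items()}
    # Ghidra is checked through the GHIDRA_HEADLESS variable exported above.
    ghidra = os.environ.get("GHIDRA_HEADLESS")
    found["ghidra"] = ghidra if ghidra and os.path.isfile(ghidra) else None
    return found


for tool, path in find_optional_tools().items():
    print(f"{tool:10} {path or 'not found'}")
```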

Rules

--capa downloads capa rules to ~/.local/share/binanalysis/capa-rules
--yara downloads community YARA repos to ~/.local/share/binanalysis/yara-rules

Refresh them with:

uv run binanalysis file.exe --update-capa
uv run binanalysis file.exe --update-yara

Usage

# Basic analysis
uv run binanalysis suspicious.exe

# YARA and capa
uv run binanalysis suspicious.exe --yara --capa

# Decompile
uv run binanalysis suspicious.exe --decompile r2
uv run binanalysis suspicious.exe --decompile ghidra
uv run binanalysis suspicious.exe --decompile both

# Custom rule directories
uv run binanalysis suspicious.exe --yara --yara-rules /path/to/custom-rules
uv run binanalysis suspicious.exe --capa --capa-rules /opt/capa-rules
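For more than a handful of samples, the same commands can be driven from a script. This is a minimal sketch that uses only flags shown in this README. The helper names and the *.exe filter are assumptions, and because exit-code semantics are not documented here, the code only prints them.

```python
import subprocess
from pathlib import Path


def build_command(sample: Path, yara: bool = False, capa: bool = False) -> list[str]:
    """Build a binanalysis invocation for one sample."""
    command = ["uv", "run", "binanalysis", str(sample)]
    if yara:
        command.append("--yara")
    if capa:
        command.append("--capa")
    return command


def analyze_directory(directory: Path, yara: bool = False, capa: bool = False) -> None:
    """Run BAT on each .exe in a directory, continuing past failures."""
    for sample in sorted(directory.glob("*.exe")):
        result = subprocess.run(build_command(sample, yara, capa), check=False)
        print(f"{sample.name}: exit code {result.returncode}")


print(build_command(Path("suspicious.exe"), yara=True, capa=True))
# ['uv', 'run', 'binanalysis', 'suspicious.exe', '--yara', '--capa']
```

Passing the command as a list rather than a shell string avoids quoting problems with unusual sample file names.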

CLI Reference

binanalysis [-h] [--decompile {r2,ghidra,both}]
            [--capa] [--yara] [--update-capa] [--update-yara]
            [--capa-rules CAPA_RULES] [--yara-rules YARA_RULES [YARA_RULES ...]]
            [--llm-report] [--llm-url URL] [--llm-model MODEL] [--llm-timeout SECONDS]
            [--config CONFIG] [--debug]
            file

| Argument | Description |
| --- | --- |
| `file` | Binary to analyze |
| `--decompile {r2,ghidra,both}` | Run native decompilation |
| `--capa` | Enable capa capability detection |
| `--yara` | Enable YARA scanning |
| `--update-capa` | Refresh capa rules before analysis |
| `--update-yara` | Refresh YARA repos before analysis |
| `--capa-rules` | Override capa rules directory |
| `--yara-rules` | Add one or more extra YARA rule directories |
| `--llm-report` | Generate an LLM analyst report |
| `--llm-url` | LLM API base URL |
| `--llm-model` | LLM model name |
| `--llm-timeout` | LLM timeout in seconds |
| `--config` | YAML config path |
| `--debug` | Write the LLM prompt to `<filename>_llm_prompt.md` |

Configuration

BAT loads config in this order:

The file passed with --config
binanalysis.yaml in the current directory
~/.config/binanalysis/config.yaml
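The lookup order above can be sketched as follows. This assumes the first existing file wins and the rest are ignored, which this README does not state explicitly. The function name is hypothetical, and the real tool may treat a missing --config path as an error rather than falling through.

```python
from pathlib import Path
from typing import Optional


def find_config(cli_path: Optional[str], cwd: Path, home: Path) -> Optional[Path]:
    """Return the first existing config file in the documented order."""
    candidates = []
    if cli_path:
        candidates.append(Path(cli_path))
    candidates.append(cwd / "binanalysis.yaml")
    candidates.append(home / ".config" / "binanalysis" / "config.yaml")
    for candidate in candidates:
        # Assumption: the first match wins; later files are not merged in.
        if candidate.is_file():
            return candidate
    return None


print(find_config(None, Path.cwd(), Path.home()))
```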

A default config is created on first run. Common settings:

paths:
  capa_rules: ~/.local/share/binanalysis/capa-rules
  yara_community_dir: ~/.local/share/binanalysis/yara-rules

features:
  capa: false
  yara: false

llm:
  url: http://ollama:11434
  model: qwen3.5
  timeout: 600
  report: false

CLI flags override config values.
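One common way to implement this kind of precedence is to treat unset CLI options as None and layer the rest over the config values. The sketch below illustrates that pattern for the llm section. It is an assumption about the approach, not BAT's code, and the function name is hypothetical.

```python
from typing import Any, Optional


def apply_cli_overrides(
    config: dict[str, Any], cli: dict[str, Optional[Any]]
) -> dict[str, Any]:
    """Return config values with any explicitly set CLI values layered on top."""
    merged = dict(config)
    # None means the flag was not given, so the config value is kept.
    merged.update({key: value for key, value in cli.items() if value is not None})
    return merged


config_llm = {"url": "http://ollama:11434", "model": "qwen3.5", "timeout": 600, "report": False}
cli_llm = {"url": None, "model": None, "timeout": 120, "report": True}
print(apply_cli_overrides(config_llm, cli_llm))
# {'url': 'http://ollama:11434', 'model': 'qwen3.5', 'timeout': 120, 'report': True}
```

Using None as the "not given" marker matters for boolean flags: a default of False would otherwise silently override a config value of true.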

Output

BAT combines several signals into a final verdict:

| Verdict | Meaning | Analyst action |
| --- | --- | --- |
| `MALICIOUS` | Strong direct indicators | Quarantine, isolate affected hosts, open an incident, extract IOCs |
| `LIKELY MALICIOUS` | Multiple high-confidence signals | Escalate for deeper review, validate scope, check execution evidence |
| `SUSPICIOUS` | Context-dependent findings | Verify source, signer, delivery path, and business justification |
| `No strong indicators` | Nothing conclusive from static analysis | Check for packing; move to sandboxing if entropy is high or context is poor |
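When several samples are analyzed together, the verdicts above give a natural ordering for a triage queue. The sketch below sorts (sample, verdict) pairs from most to least severe. It assumes the verdict strings appear exactly as in the table and takes the severity order from the table's row order. How verdicts are read out of the reports is not shown here, and the names are hypothetical.

```python
# Most severe first, following the order of the verdict table.
SEVERITY = ["MALICIOUS", "LIKELY MALICIOUS", "SUSPICIOUS", "No strong indicators"]


def sort_by_severity(results: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Order (sample, verdict) pairs from most to least severe."""
    # list.index raises ValueError on an unknown verdict, which surfaces typos early.
    return sorted(results, key=lambda item: SEVERITY.index(item[1]))


queue = [
    ("a.exe", "SUSPICIOUS"),
    ("b.dll", "MALICIOUS"),
    ("c.exe", "No strong indicators"),
    ("d.exe", "LIKELY MALICIOUS"),
]
for sample, verdict in sort_by_severity(queue):
    print(f"{verdict:22} {sample}")
```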

What BAT Does Not Do

BAT helps with first-pass triage. It does not replace:

Dynamic analysis or sandbox detonation
Endpoint telemetry, process trees, or network logs
Full reverse engineering
Attribution to a specific actor or malware family
Proof that a capability was executed on a host

Use BAT to narrow the question, not to close an incident on its own.

Reports

JSON: best for SIEM ingestion and automation
HTML: best for sharing and manual review
LLM report: optional narrative summary for analysts

Docs

Live triage and analyst workflow: docs/guide.md
Enrichment pipeline: pipeline/README.md

Contributing

Contributions are welcome. Prefer focused changes with clear rationale and tests where practical.

License

See LICENSE.
