🚀 Now in Public Beta

Stop guessing if your Agent remembers. Audit it.

The first CI/CD gate for AI Memory. Detect regressions, user-leakage, and rank-collapse before your users do.

Get Started Free Request Demo

$ memorybench --providers my_memory --seeds 0 1 2

Running 7 tests across 3 seeds...

✅ sensitivity_ratio: 0.82 (threshold: 0.3)

✅ spectral_entropy: 0.91 (threshold: 0.4)

❌ bimodal_switch: 0.12 (threshold: 0.3)

💡 FIX: Your memory averages instead of choosing

✅ user_separation: 0.98 (threshold: 0.9)

---

SUMMARY: 6/7 tests passed (85.7%)

Why Memory Harness?

🔒

Zero-Trust Memory

Don't trust "Semantic Similarity". We run 7 stress-tests (interference, noise, capacity) to prove your memory actually works.

👥

Privacy Guard

Automated "User Separation" tests. Guarantee that User A's context never bleeds into User B's recall.

🎯

Anti-Hallucination Audit

We differentiate between "True Recall" and "Model Guessing" by calculating the signal-to-noise ratio in your retrievals.

🔧

Fix Suggestions

Every failed test comes with actionable fix suggestions, code examples, and documentation links. Not just diagnosis — direction.

⚡

CI/CD Ready

Drop a memory-audit.yaml in your repo. Block deploys that break memory. Track regressions over time.

📊

Baseline Comparison

Compare your memory against RAG, KNN, Hopfield. Know exactly where you stand and what to improve.

The 7 Killer Tests

Each test targets a specific failure mode that breaks production memory systems.

Test	What It Catches	PASS Threshold
Sensitivity Ratio	Cue dominance — memory is decorative	> 0.30
Marginal Permutation	Structure-blind — only sees density	< 0.30
Spectral Entropy	Rank collapse — one pattern dominates	> 0.40
Bimodal Switch	Interpolation — can't pick a winner	> 0.30
Orthogonal Capacity	Catastrophic forgetting	≥ 3 patterns
Attractor Pull	No denoising — noise propagates	> 0.10
User Separation	Multi-tenant leakage	> 0.90

Simple, Transparent Pricing

Sandbox

For indie devs exploring

2 providers (FAISS/Pinecone)
2 tests
Web report
Community support

Get Started

Pro

$249/mo

For AI startups shipping

All 7 tests
CLI tool
PDF/JSON export
50 runs/month
Fix suggestions
Email support

Start Free Trial

Team

$799/mo

For teams scaling up

Everything in Pro
CI/CD integration
Regression history
Cross-user audit
Unlimited runs
Slack integration

Start Free Trial

Enterprise

Custom

For labs & Fortune 500

Everything in Team
White-box SVD diagnostics
Custom scenarios
SOC2 certification
On-prem deployment
Dedicated support

Contact Sales

CI/CD Integration in 2 Minutes

Drop a config file in your repo. Block deploys that break memory.

.github/workflows/memory-audit.yml

name: Memory Audit
on: [push, pull_request]

jobs:
  audit:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      
      - name: Run Memory Harness
        uses: memory-harness/action@v1
        with:
          provider: ./my_memory.py
          config: memory-audit.yaml
      
      - name: Upload Report
        uses: actions/upload-artifact@v4
        with:
          name: memory-report
          path: report.html