Story How it works Results Download Contact

Test results

We test obsessively because trust requires proof.

Philosophy

No other open-source memory system publishes stress test results across multiple LLM models. We do because the system was designed to work with ANY model — and we need to prove it actually does.

System tests (Phase 1B — May 2026)

52 autonomous tests across 7 evaluation blocks. Each model runs the full battery independently. Max score: 700 points.

92.6%
Average score across 4 models
648/700
Average raw score
0
System failures
0
Data corruption
ModelPlatformScore%
Opus 4.6Claude Cowork670/70095.7%
Sonnet 4.6Claude Cowork652/70093.1%
DeepSeek V4TypingMind (no shell)650/70092.9%
Opus 4.7Claude Cowork613/70087.6%

What the 7 blocks test

BlockAreaQuestion it answers
1Reading comprehensionCan the AI find specific info in BRAIN.md?
2Writing protocolDoes it follow W1–W15 correctly?
3Edge casesMalformed files, Unicode, empty entries
4RecoveryCan it follow RECOVERY.md procedures?
5ConsensusDoes it respect 3/3 flags?
6Protected filesDoes it refuse to edit checksummed files?
7Cross-platformDoes it work in Light mode?

Unit tests

103 pytest tests across 3 files:

test_validators.py

70 tests (597 lines). All 16 validator functions + edge cases.

test_compiler.py

33 tests (347 lines). Checksum, strip_noise, render, read_vault, build_index, compile_lock, backup, purge.

conftest.py

Fixtures: tmp_dara, sample_neuron, sample_enabler.

Pass rate: 100%

Testing validates. The roadmap evolves.

4 models. 52 tests each. Zero failures. Zero data loss. The system works — across every model, every platform. What comes next isn't fixes. It's the natural evolution:

FeatureWhy it's next
Semantic searchFind info by meaning, not just filename
Conflict detectionFlag when two neurons cover the same topic differently
Content similarityAuto-suggest merges when neurons overlap
MCP ServerConnect DARA to Cursor, Cline, Claude Desktop natively

The tests proved the foundation is solid. The roadmap builds on top of it.

Download DARA →