Field-Theoretic Memory for AI Agents
Continuous Dynamics for Context Preservation
Rotalabs studies how AI agents fail when they act in the world — and builds the benchmarks, probes, monitors, and protocols to measure it. Every result is published, installable, and reproducible from the raw trajectories up.
A surface-clean paired-twin benchmark for irreversible banking actions, testing whether residual-stream probes can separate substantively wrong actions from legitimate twins under cross-pattern transfer.
Across three evaluable open-weight families, the linear probe showed no cross-pattern edge over a default-prompted judge. We publish the whole audit — including where our own method fails: paper, benchmark, trajectories, trained probes, and pipeline. The point is the protocol.
A benchmark, a protocol, and a reusable audit framework for testing monitors that gate irreversible agentic actions. It is designed to be re-run, challenged, and extended — a concrete research surface, not just a position statement.
Interpretability probes, adversarial testing, verification, evaluation science, steering, and trust propagation — released as papers and installable methods through 2026.
Rotalabs publishes installable packages so methods can be inspected, reproduced, and embedded into new experiments — all 12 on PyPI and npm, AGPL-3.0.
Context intelligence for AI agents.
Sandbagging detection via activation probes.
Steering vectors for runtime behavior control.
Neuro-symbolic verified code synthesis.
Trust-based decision routing.
Continuous Dynamics for Context Preservation
Semantic MAP-Elites red-teaming across frontier LLMs
CE2P translates formal-verification failures into structured LLM feedback. The benefit is inversely correlated with model capability — weaker...
Red-teaming that evolves its own attacks. rotalabs-redqueen uses quality-diversity search to discover diverse jailbreaks across single-turn, multi-turn, and agentic/MCP surfaces — reproducibly, and as...
We treat agent memory as continuous fields governed by partial differential equations instead of discrete database entries. The result: +116% F1 on multi-session reasoning...
AI agents that can't share what they know make the same mistakes independently. We're releasing rotalabs-context - a context intelligence engine for ingesting, searching,...
Credibility is not claimed. It is released — and open to be broken.Rotalabs operating principle
Rotalabs is our open research lab. The methods behind this work are available to enterprises as products and consulting through Rotascale →
Type to search across all pages and posts
Press ↑ ↓ to navigate, Enter to select