Memory Systems for AI Agents
arXiv Q1 2026Novel memory architectures for long-horizon agent tasks, enabling smooth retrieval and natural temporal dynamics.
Preprints and technical disclosures. Papers releasing Q1-Q2 2026.
Research papers releasing Q1-Q2 2026.
Novel memory architectures for long-horizon agent tasks, enabling smooth retrieval and natural temporal dynamics.
CE2P (Counterexample-to-Prompt) translates formal verification failures into structured LLM feedback. Key finding: CE2P benefit is inversely correlated with model capability—weaker models gain 34-39pp while GPT-4o needs no structured feedback.
Evolutionary approaches to adversarial testing where attack and defense strategies improve through competitive pressure.
Formal framework for ROI-based decision routing in multi-tier verification systems.
Comprehensive threat model for AI agents covering input attacks, state attacks, tool attacks, planning attacks, and coordination attacks. Empirical evaluation across agent architectures.
Using interpretability methods to detect when agentic systems are under adversarial attack. Extends metacognitive probing to security applications.
Methods for uncertainty quantification that track actual accuracy. Activation-based estimation, propagation through multi-step reasoning, and calibration without ground truth.
Defensive publications via TD Commons. CC BY 4.0.
| Title | Date | Link |
|---|---|---|
| UPIR: Universal Plan Intermediate Representation | Nov 2025 | TD Commons |
| ROI-Based Cascade Routing | Oct 2025 | TD Commons |
| ARTEMIS: Multi-Agent Debate Framework | Jan 2025 | TD Commons |
| Context System for AI Applications | Apr 2025 | TD Commons |
| ETLC: Context-First Data Processing | Apr 2025 | Google Cloud |
Work that informs Rotalabs' direction:
Type to search across all pages and posts
Press ↑ ↓ to navigate, Enter to select