What Your Agent Remembers, and Who Controls It

On a recent episode of Cloudy with a Chance of Insights, David spent some time on MemPalace, an open source memory project that caught his attention partly for technical reasons and partly because it was released by Mila Jovovich. The Fifth Element connection was not entirely coincidental. The film's premise is that the four classical elements are necessary but insufficient, and that the fifth, life itself, is what binds the others together and gives them meaning. David's argument, which I found hard to disagree with, is that agents sit in a similar position. You can give them all the reasoning capability, all the retrieval tooling, all the carefully engineered prompts and tool calls you like, and without continuity of identity across time, something fundamental is still missing. Memory, in this analogy, is the fifth element, and I think it is where a lot of agent systems stop being interesting and start being useful.

I've been thinking about this both before and after the podcast went out, partly because I've been working through the same questions on internal projects, and partly because the conversation has continued in other places since the episode went live, including a thread on Bluesky that threw up some genuinely interesting perspectives on the inspectability problem specifically. I think it is worth pulling out of the podcast context and into a broader frame, because the memory question does not stay a technical question for very long. It starts there and ends somewhere considerably more complicated.

The default in most agent implementations today is statelessness. Each session begins from nothing, and whatever continuity the user experiences is a function of context being injected at the start of the interaction, either manually by the builder or through retrieval systems that pull in relevant history on demand. This works well enough for short-horizon tasks, where the agent is essentially a sophisticated form-filler with some reasoning attached. It works considerably less well for anything longer. When the work spans multiple sessions, when the agent is operating on behalf of the same person or organisation over weeks or months, when decisions being made today depend on context established earlier, statelessness stops being a minor inconvenience and becomes a structural constraint on what the agent can actually do.

The architectural responses to this are varied, and the variation matters, because each approach embeds a different set of assumptions about what memory is for.

In-context memory is the simplest answer: keep a running summary or partial transcript in the model's context window and feed it back in on each turn. It is cheap, easy to inspect, and tightly coupled to the model's actual reasoning. The limitation is that it does not survive contact with any real volume. Context windows have grown but remain finite, and the workaround is summarisation, which is essentially a polite word for loss. The agent's memory of what happened two weeks ago is a compressed version of events rather than the events themselves, and compression has a tendency to drop the specific details that turn out to matter in retrospect.

Vector stores trade that problem for a different one. You embed memory into a searchable index and retrieve what looks relevant at query time. This scales considerably better than in-context memory and allows for much richer historical context, but relevance by semantic similarity is a proxy rather than the thing itself. The agent gets the memories that resemble the current query, which may or may not be the ones that actually bear on it. In routine use this is fine. In the edge cases, which is where memory matters most, the gap between similar and relevant can be what shapes whether a decision is sound. There is a defence of vector stores worth flagging, because it came up in a Bluesky thread I had after the episode went out. The efficiency case for semantic search at scale is real. Several people working in this space reported meaningful reductions in token consumption when they moved from naïve retrieval to properly indexed semantic search, and at any serious volume the cost and latency arguments are decisive. I think the question worth asking, though, is whether the approach that has become the default for reasons of engineering convenience is the right default when accuracy and accountability matter, or whether the industry has quietly settled on a trade-off it has not examined very carefully.

Structured knowledge graphs take a different direction, organising memory into explicit relationships rather than dense vectors. The advantage is precision, and with it, interpretability. You can see what the agent knows, how different pieces of information relate, and why a given memory is being surfaced. The overhead is considerable: someone has to decide what the schema looks like, maintain it as the world changes, and handle the drift when the graph stops matching reality. Knowledge graphs have been around long enough that practitioners who have worked with them at scale know exactly how much effort this takes, and the implementations that survive tend to be the ones that treat graph maintenance as a first-class engineering concern rather than something to get to later.

Then there is the approach MemPalace takes, which I want to flag as a design philosophy rather than a product recommendation. Verbatim, append-only storage with a fast index layered on top. Content is preserved exactly as recorded, without summarisation or discard. The cost is storage volume and the need for a sophisticated indexing layer to make retrieval tractable. The argument in favour is that fidelity is preserved completely, and fidelity is the foundation that trust actually sits on. Once you start summarising, you start making irreversible decisions about what matters, and those decisions are being made by a system that cannot be held accountable for having made them.

This is where I want to spend some proper time, because I think inspectability is the question that matters most and the one that gets the least attention.

If an agent remembers something incorrectly, who finds out? If it has stored something it should not have, who can see that, and who can remove it? If its memory of a customer interaction is subtly wrong in a way that shapes every subsequent interaction with that customer, how do you even detect the problem, let alone correct it? A significant proportion of current implementations do not have a good answer to any of those questions. Memory goes into the system, memory informs behaviour, and the mechanism connecting the two is largely opaque. That is a tolerable state of affairs for a chatbot that helps you draft emails. It is far less tolerable for an agent operating in a regulated environment, making decisions about credit, healthcare, compliance, or anything else where the audit trail is part of the product rather than an afterthought.

One of the more interesting contributions to that Bluesky conversation described a memory architecture built on top of a git repository, where every memory update is a commit, the history is append-only, and the edit trail is as owned and visible as the current state. The detail I found most useful was the distinction between builder-facing and vendor-facing ownership of memory. Most current implementations, whether commercial platforms or open source libraries, treat memory as something the system manages on the builder's behalf. Input goes into the store, retrieval happens somewhere downstream, behaviour emerges at the end, and the mechanics are largely out of the builder's hands. The git-backed approach inverts that relationship, making the builder the direct owner of memory state using tools they already understand. Whether that specific implementation holds up at production scale is a separate question, and I have not tested it. The underlying principle is worth taking seriously regardless. The thing that makes memory trustworthy is the ability to see what is in the store, the ability to edit it when editing is warranted, and the audit trail that proves what was known at any given point. Retrieval sophistication is a secondary concern when set against any of those.

The compliance dimension follows directly from this, and I think it is harder than it first appears. If your agent has memory, your agent has a data store, and that data store is subject to whatever obligations apply to data stores in your jurisdiction. Under GDPR, you have obligations around the accuracy of personal data, the right to erasure, and the right to rectification. If your agent is remembering things about individuals, those individuals have rights over those memories that you are now responsible for honouring. The EU AI Act adds another layer for high-risk applications, and sector-specific regulation adds another layer on top of that.

The reason this is harder than it sounds is that agent memory does not behave like a traditional database. Consider a worked example. A customer spoke to an agent a year ago about a financial difficulty. The agent stored that context, and because the context influenced how the agent behaved in subsequent interactions, the memory has continued to shape outcomes since. The customer's circumstances have changed, they want the memory corrected or removed, and under GDPR they have the right to make that request. Your obligation is, in principle, straightforward: find the memory, remove it, and confirm that the removal has taken effect. In practice, this becomes complicated fairly quickly. The memory may exist in several forms, including the original log entry, a derived summary, a vector embedding, and an index that points to any of the above. Removing the source record may or may not remove the memory in any meaningful sense, depending on whether stale derivatives continue to shape the agent's behaviour. When the agent made decisions last month that were influenced by that memory, those decisions may themselves have left traces that constitute retained data in their own right. The honest answer for a significant proportion of current implementations is that the removal operation is a cascade through several layers, and the architecture has rarely been designed with that cascade in mind.

The multi-agent and multi-tenant case extends the problem from governance into something closer to security. When you move from a single agent with its own memory to a fleet of agents that may share state, or to agents operating across organisational boundaries, the questions multiply faster than the answers. If Agent A learns something from an interaction and that learning informs Agent B's behaviour in a different context, the appropriateness of that transfer is a design question that nobody has necessarily made explicitly. If two tenants share an underlying memory layer, the enforcement of the separation that both tenants reasonably expect becomes non-trivial. If an agent is built by one organisation and deployed on behalf of another, the questions of where the memory lives, who has visibility into it, and what happens to it at the end of the commercial relationship become contractual concerns as well as architectural ones.

There is also a less comfortable dimension, which is that memory is an attack surface. Memory poisoning, where an adversary deliberately feeds malicious content into the memory layer to shape the agent's future behaviour, is already listed in the OWASP agentic AI top ten for good reason. If the memory store is not inspectable and not auditable, it is also not defensible. You cannot detect poisoning in a store you cannot examine, and recovering from it without the audit trail to identify what was poisoned and when is a problem nobody wants to be solving under pressure. The inspectability and governance questions turn out to be security questions as well, which is part of why I keep coming back to them.

None of this is intractable, I should say. These are recognisably engineering and governance problems, and the field is moving on them. The conversation about agent memory in most of the organisations I speak to happens at the capability level, about what you can do if agents remember more, rather than at the accountability level, about who is responsible for what the agent remembers and what the agent does with that memory. Capability is the interesting conversation for engineering teams, but accountability is the conversation that will determine whether the system survives its first serious incident, whether it can satisfy a regulator, and whether the organisation can deploy agents at any scale that actually matters.

For architects, I think the implication is that memory architecture belongs at the top of the design conversation rather than somewhere near the end of it. Memory architecture shapes the governance model, the compliance posture, the data residency story, the audit capability, and the security surface of the entire system. Getting it wrong at the design stage is considerably more expensive than getting it right, and the failure mode is fairly predictable: a retrofit programme eighteen months to three years in, triggered by a regulator or an incident, at considerably greater cost than designing for the concern from the start. The organisations that treat memory as a first-class architectural concern now will be at a meaningful advantage, partly because they will have developed the operational discipline that comes with taking memory seriously before it becomes a crisis, and partly because the evidence they will need to satisfy governance questions will already exist in a form that can be produced on demand.

For the C-suite, I think the question is simpler. Your agent knows things. It knows things about your customers, your employees, your decisions, and your operations. The questions worth asking are these. Who controls that knowledge? Who can see it, correct it, or remove it? When something goes wrong, as it eventually will, what does the audit trail look like? If the answers are unclear, the memory question is not a technical problem that has been delegated to engineering. It is a governance gap that has not yet been noticed, and the gap will not close on its own.

The reason I think this is worth attention now, rather than something that can wait until the technology settles, is that the decisions being made today about memory architecture are the ones that will prove hardest to reverse later. Data models accrete, audit requirements tend to harden rather than soften, and regulators have not historically granted grace periods for architectural mistakes made while the field was still young. The organisations that build without thinking about memory now are the ones that will be having uncomfortable conversations with their DPOs and their regulators in eighteen months.

The Fifth Element analogy holds up better than it has any right to. Memory is not the most glamorous component of an agent system, and it has never been the part that gets funded first. Without the right thinking about how it works and who owns it, the other components do not add up to a trustworthy system. The fifth element is what binds the other four together and gives them meaning. In the agent case, memory plays the same structural role, and I think the organisations that recognise this early will spend the next few years ahead of everyone else.

Source: Cloudy with a Chance of Insights — MemPalace and AI Agent Memory

What Your Agent Remembers, and Who Controls It

Tags

Comments

You might also like

Why Your Existing Tools Probably Can't Solve the AI Governance Problem

When Agents Go Operational, Governance Has to Go With Them

The Productivity Mirage: Why AI Is Changing Work Without Necessarily Improving It