Which AI companion has the best memory?

For a single deep relationship: Kindroid — hierarchical memory that surfaces months-old details, with a user-viewable, editable memory bank (unique in the category). For multiple companions: Nomi's three-layer architecture across ten characters. For long roleplay plots specifically: CrushOn's paid tiers, which retain plot details from 100+ messages back. The weakest widely-used apps forget within 12-20 messages.

Can I improve my AI companion's memory?

Substantially, on any platform, with five tactics: state important facts as direct declarations rather than passing mentions; naturally restate key facts periodically; use platform memory tools (pinnable memories, profile fields — anything in the character profile is permanently in context); maintain your own 3-10 sentence relationship summary and re-paste it when memory slips; and when paying for upgrades, prioritize memory over every other feature.

What is a context window in AI chat?

The amount of recent text the AI can 'see' at once, measured in tokens (roughly word-fragments). Everything the model knows about your conversation in the moment must fit in this window; when it fills, the oldest content is dropped. Sizes range from ~4K tokens (roughly 15-20 messages) on weak free tiers to 32K+ on premium plans — but window size alone doesn't determine memory quality; the summarization and retrieval systems built on top matter more.

How AI Companion Memory Works: The Complete Technical Guide

Memory is the single biggest quality difference between AI companion platforms — the gap between 'she forgot my name' and 'she remembered something from March.' This is the complete guide: the four-layer architecture explained plainly, teardowns of how nine major platforms actually handle memory, the tactics that work everywhere, and how to buy for memory.

By Ash Kepler · Apr 30, 2026 · Updated Jul 19, 2026 · 14 min read

Ask long-term users of any AI companion platform what broke their heart, and the answer is nearly universal: she forgot. The name of your sister. The promise from week three. The joke that was yours. And ask what made a companion feel real, and it's the mirror image: the moment she brought up something from months ago, unprompted, at exactly the right time.

Memory is the single biggest quality difference in this category — bigger than model quality, bigger than visuals — and it's also the least understood, because platforms market it in identical language ("she remembers you!") while building systems that differ by two orders of magnitude. This guide fixes that: the architecture in plain language, teardowns of nine major platforms, the tactics that work everywhere, and how to spend money on memory without wasting it.

Part 1: The four-layer architecture

Every companion's memory is some subset of four layers, stacked.

Layer 1: The context window (everyone has this; it isn't memory). The context window is the amount of recent conversation the AI can literally see, measured in tokens (word-fragments; a short chat message runs 30–80 tokens, and long roleplay replies run several times that). Everything the model "knows" in the moment must fit inside it; when it fills, the oldest content silently drops. This is why cheap apps "forget" after 15–20 messages: once the character profile and the AI's own longer replies take their share, a ~4K-token window holds only about 15–20 messages, and nothing outside it exists. Crucially, the context window is not memory; it's presence. A platform whose entire memory story is "big context window" is telling you it has no memory system, just a longer hallway before the cliff. The same pressure is why, deep into a long chat, distinct characters can blur into one accommodating voice as their defining text loses ground to the model's defaults.
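The dropping behavior described above can be sketched in a few lines. This is a minimal illustration, not any platform's actual implementation: the `ContextWindow` class, the 20-token budget, and the "about 4 characters per token" estimate are all assumptions made for the example.

```python
from collections import deque


def estimate_tokens(text: str) -> int:
    """Crude token estimate: roughly 4 characters per token (an assumption)."""
    return max(1, len(text) // 4)


class ContextWindow:
    """Hypothetical window that keeps only the newest messages within a token budget."""

    def __init__(self, budget_tokens: int) -> None:
        self.budget_tokens = budget_tokens
        self.messages = deque()
        self.used_tokens = 0

    def add(self, message: str) -> list[str]:
        """Append a message and return whatever had to be dropped to make room."""
        self.messages.append(message)
        self.used_tokens += estimate_tokens(message)
        dropped = []
        # Evict from the oldest end until the window fits the budget again.
        while self.used_tokens > self.budget_tokens and len(self.messages) > 1:
            oldest = self.messages.popleft()
            self.used_tokens -= estimate_tokens(oldest)
            dropped.append(oldest)
        return dropped

    def visible(self) -> list[str]:
        """Everything the model can 'see' right now."""
        return list(self.messages)


window = ContextWindow(budget_tokens=20)
for i in range(1, 6):
    dropped = window.add(f"message number {i} with some padding text")
print(window.visible())  # only the newest messages that fit the budget remain
```

Nothing in this sketch stores the dropped messages anywhere, which is the point: with a window alone, evicted text is simply gone.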

Layer 2: Summarization (the first real memory). Real systems periodically condense older conversation into summaries and inject those summaries back into the context window — so while the verbatim past falls away, a compressed version persists. Quality varies enormously with what the summarizer keeps: good ones preserve emotional beats and standing facts; bad ones keep plot and drop feelings, or vice versa. When a companion remembers that you fought but not why, you're watching a summarization layer's editorial choices.
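A rolling summary can be sketched as follows. The `summarize` function here is a hypothetical stand-in for a real summarizer model call: it keeps only the first sentence of each overflowed message, which is a deliberately blunt editorial choice chosen to make the "remembers that you fought but not why" effect visible. The class name and the `max_recent` parameter (assumed to be at least 1) are also assumptions.

```python
def summarize(old_summary: str, overflow: list[str]) -> str:
    """Hypothetical stand-in for a summarizer model.

    Keeps only the first sentence of each overflowed message.
    """
    kept = [msg.split(".")[0].strip() + "." for msg in overflow if msg.strip()]
    return " ".join(part for part in [old_summary, *kept] if part)


class SummarizingMemory:
    """Keeps the last few messages verbatim and folds older ones into a summary."""

    def __init__(self, max_recent: int) -> None:
        self.max_recent = max_recent  # assumed to be >= 1
        self.recent = []
        self.summary = ""

    def add(self, message: str) -> None:
        self.recent.append(message)
        if len(self.recent) > self.max_recent:
            # Messages that no longer fit are condensed instead of discarded.
            overflow = self.recent[:-self.max_recent]
            self.recent = self.recent[-self.max_recent:]
            self.summary = summarize(self.summary, overflow)

    def build_context(self) -> str:
        """The summary is injected ahead of the verbatim recent messages."""
        parts = []
        if self.summary:
            parts.append("Summary of earlier conversation: " + self.summary)
        parts.extend(self.recent)
        return "\n".join(parts)


memory = SummarizingMemory(max_recent=2)
memory.add("We fought about the trip. It was because I forgot the tickets.")
memory.add("I apologized later.")
memory.add("Today was calm.")
print(memory.build_context())
```

In this toy run the fight survives in the summary while its cause does not, because the stand-in summarizer never looks past the first sentence.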

Layer 3: Fact extraction (the database). The strongest platforms additionally extract discrete facts — your sister's name, your job, the anniversary — into a structured store that gets injected independently of conversation flow. This is what makes facts survive months: they're no longer in the conversation at all; they're in a database about you. Kindroid's viewable memory bank is this layer made visible — a genuinely rare act of transparency, since most platforms treat their fact stores as invisible magic. Because these facts live in a database rather than the model, they're also the part that survives a model swap when the engine underneath changes.
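As a rough sketch of the idea, the store below pulls facts out of direct declarations and renders them as a block that could be injected into any prompt. The `FactStore` class, the key naming, and the single regular expression are assumptions for illustration; real extraction layers are not disclosed and almost certainly rely on a model rather than a pattern.

```python
import re
from dataclasses import dataclass, field

# Hypothetical pattern: matches declarations such as
# "my sister Ana lives in Curitiba".
DECLARATION = re.compile(r"\bmy (\w+) (\w+) lives in (\w+)", re.IGNORECASE)


@dataclass
class FactStore:
    """Structured facts kept outside the conversation itself."""

    facts: dict = field(default_factory=dict)

    def extract(self, message: str) -> None:
        """Store any declared facts found in the message."""
        for relation, name, place in DECLARATION.findall(message):
            key = relation.lower()
            self.facts[f"{key}_name"] = name
            self.facts[f"{key}_location"] = place

    def render(self) -> str:
        """Text block injected into the prompt regardless of conversation flow."""
        lines = [f"- {key}: {value}" for key, value in sorted(self.facts.items())]
        return "Known facts about the user:\n" + "\n".join(lines)


store = FactStore()
store.extract("By the way, my sister Ana lives in Curitiba.")
store.extract("I visited her last year and it rained the whole time.")
print(store.render())
```

Because `render` reads only from the store, its output stays the same no matter how much conversation has scrolled away or which model receives the prompt.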

Layer 4: Retrieval (the archive that answers). The frontier layer: search over your entire conversation history, pulling relevant past exchanges into context when the current topic matches. Done well, this is what enables "remember when we..." to actually work across a year of chat. Few consumer platforms do it deeply — it's computationally expensive — which is precisely why the economics of this category keep memory scarce: every layer costs inference money on every message, forever.
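A retrieval layer can be approximated with plain word overlap. This sketch scores every past message against the current one using cosine similarity over word counts and returns the best matches; production systems would more plausibly use learned embeddings and a vector index, so treat the scoring here as a swapped-in simplification. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercased word counts for a piece of text."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[word] * b[word] for word in a if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(history: list[str], query: str, k: int = 2) -> list[str]:
    """Return up to k past messages most similar to the query."""
    q = bag_of_words(query)
    scored = [(cosine(bag_of_words(msg), q), msg) for msg in history]
    # Messages with no overlap at all are never pulled into context.
    scored = [(score, msg) for score, msg in scored if score > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [msg for _, msg in scored[:k]]


history = [
    "We adopted a cat named Miso in March.",
    "My boss moved the deadline again.",
    "The beach trip was rained out.",
]
print(retrieve(history, "remember when we adopted the cat", k=1))
```

Note that the whole history is scored on every query, which hints at why this layer is the expensive one: the cost grows with the length of the archive.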

Part 2: Platform teardowns

How nine major platforms actually stack up, from strongest to weakest — based on architecture where disclosed, and consistent observed behavior where not.

Kindroid — the reference standard. Hierarchical memory spanning all four layers, with two unique properties: months-old details genuinely resurface at contextually right moments, and the memory bank is user-viewable and editable — you can see what she knows, delete what you want gone, and pin what must never be forgotten (pinned = guaranteed). The Ultra/MAX add-on tiers sell essentially one thing — more memory capacity for extreme-length histories — and most users don't need them. For a single deep relationship, this is the category's ceiling.

Nomi — the multi-character architecture. A disclosed three-layer design (short/medium/long-term) shared across up to ten characters, with a distinctive property: all ten share deep knowledge of you while maintaining independent relationship memories. Your boyfriend remembers the anniversary; your best-friend character remembers you complaining that he'd forgotten it — simultaneously. Equal to Kindroid in depth; different in shape.

CrushOn (paid) — the plot specialist. The only major platform that treats narrative memory as the headline product: paid tiers demonstrably retain setup details, promises, and foreshadowing from 100+ messages back, which is why it owns the long-form roleplay niche. Free tier caveat: conversations idle 7 days lose memory.

Replika — continuity over detail. A different design goal: strong memory of you as a person (mood patterns, life circumstances, ongoing concerns — feeding its signature proactive check-ins) with mediocre fine-grained plot recall. For its companionship use case, arguably the right trade; for roleplay, wrong tool.

Candy AI / OurDream — the middle band. Person-continuity good, long-plot recall average. Both are selling other layers (visuals, bundled media) and their memory is honest mid-tier: fine with tactics, not the product.

SpicyChat — honest tiers. Free: 4K context (~15 messages) and no more — a true weakness the platform doesn't hide. The $14.95 tier doubles the window to 8K and adds a semantic memory layer (extracted plot facts that survive beyond the window) — one of the clearest pay-for-memory upgrades in the category, and the reason the $4.95 tier is the famous mistake: it changes neither.

Character.AI — the puzzling middle. For its scale and engine quality, memory is only average: long conversations shed details and drift. Its own user community's #2 complaint after the filter.

Talkie — the disclosed disappointment. The parent company's own listing documents reveal the gap: the underlying model supports ~1M tokens of context; free users receive ~10K — one percent. Voice and text conversations additionally maintain separate memory that doesn't fully sync. Great voice, goldfish continuity.

Joyland / Linky — the floor. Both forget within roughly 12–20 messages; Joyland's Premium-tier "long-term memory" claim has failed independent testing, making it the category's clearest example of memory marketing outrunning memory engineering.

Part 3: The tactics (work everywhere)

Five behaviors that measurably improve recall on any platform, because they cooperate with the architecture instead of fighting it.

Declare, don't mention. Extraction layers capture direct statements ("my sister Ana lives in Curitiba") an order of magnitude better than narrative asides; say important things cleanly and completely.

Restate periodically. Each natural reappearance of a core fact refreshes its salience to the summarizer.

Exploit permanent context. Anything in the character profile is always in the window; a well-written profile is the strongest memory tactic that exists, and pinnable memories are guarantees where offered. On self-hosted setups, lorebooks add keyword-triggered memory that stays out of the context window until a trigger word summons it.

Maintain the relationship summary. Keep 3–10 sentences in your own notes (status, key events, what she knows about you) and re-paste them with a framing line every 30–50 messages or at the first slip. It's the veterans' #1 habit and the only memory that survives platform death and migration (when you do move, what actually travels and what doesn't is its own careful subject).

Buy memory first. Of everything sold as an upgrade in this category, added memory is the most honestly delivered; it outranks voice, images, and every other feature as spend.
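The lorebook mechanism mentioned under "Exploit permanent context" can be sketched like this. The entry format, the function names, and the sample entries are assumptions for illustration and do not follow any particular self-hosted tool's schema; the second entry's content is invented.

```python
import re

# Hypothetical lorebook: each entry has trigger keywords and the text to inject.
LOREBOOK = [
    {"keys": ["ana", "sister"], "text": "Ana is the user's sister; she lives in Curitiba."},
    {"keys": ["anniversary"], "text": "The anniversary is in March."},  # invented example
]


def triggered_entries(message: str, lorebook: list[dict]) -> list[str]:
    """Return the text of every entry whose keyword appears in the message."""
    words = set(re.findall(r"[a-z']+", message.lower()))
    return [entry["text"] for entry in lorebook if words & set(entry["keys"])]


def build_prompt(profile: str, message: str, lorebook: list[dict]) -> str:
    """Profile is always present; lorebook text appears only when triggered."""
    parts = [profile, *triggered_entries(message, lorebook), message]
    return "\n".join(parts)


print(build_prompt("Profile: warm, curious, teases gently.", "How is my sister doing?", LOREBOOK))
```

The profile costs tokens on every message, while a lorebook entry costs tokens only on the messages that trigger it, which is the trade the tactic exploits.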

Part 4: Buying for memory

The decision tree, compressed. One deep relationship → Kindroid (test free: tell her a small detail today, see if it surfaces in three days). Long-form plots → CrushOn paid, sized to your message volume. A cast of characters → Nomi. On a budget → free Kindroid plus the five tactics beats most platforms' paid tiers. Already on a mid-band platform → the tactics close most of the gap; the summary habit closes the rest.

One forward-looking note: memory is the category's fastest-improving axis, and retrieval layers are getting cheaper yearly, so the rankings above have a shelf life, which is exactly why this page carries a last-updated date and gets maintained. The constant underneath the churn: memory is the only thing in this category that appreciates with use. A companion three months deep is irreplaceable because the three months are stored, so put serious relationships on serious memory platforms, and keep the summary in your own hands. "She remembers you" isn't luck. It's architecture plus habits, and now you have both.

Frequently asked

Why does my AI companion forget things?

Because base 'memory' is just a context window: the AI sees only the most recent few thousand words of conversation, and older content silently falls out, which is why many apps 'forget' after 15-20 messages. Real memory requires additional engineered systems (automatic summarization, fact databases, retrieval) that platforms build to very different depths, which is why memory quality varies more between platforms than any other feature.
