ELIZA and the Illusion of Memory (1966)

Joseph Weizenbaum's ELIZA, created at MIT in 1966, used simple pattern matching and scripted responses to simulate a psychotherapist. ELIZA had no memory: each response was generated purely from the current input, with no reference to previous turns. Yet users frequently reported feeling that ELIZA truly understood and remembered them, a phenomenon Weizenbaum called the "ELIZA effect." The illusion of memory proved powerful even before memory existed.
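The stateless pattern-matching approach can be sketched in a few lines. This is an illustrative toy, not Weizenbaum's original script; the rules and response templates here are invented for the example. Note that nothing persists between calls: each reply depends only on the current input.

```python
import re

# Hypothetical ELIZA-style rules: a regex pattern plus a response template.
RULES = [
    (re.compile(r"\bI need (.+)", re.IGNORECASE), "Why do you need {0}?"),
    (re.compile(r"\bI am (.+)", re.IGNORECASE), "How long have you been {0}?"),
    (re.compile(r"\bmy (\w+)", re.IGNORECASE), "Tell me more about your {0}."),
]
DEFAULT = "Please go on."

def respond(utterance: str) -> str:
    """Return a scripted response by matching the first applicable pattern."""
    for pattern, template in RULES:
        match = pattern.search(utterance)
        if match:
            return template.format(*match.groups())
    return DEFAULT

print(respond("I need a vacation"))  # Why do you need a vacation?
```

Because `respond` keeps no state, asking a follow-up question about something said earlier simply falls through to the default reply; the apparent understanding is entirely the user's projection.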

Early Chatbot Systems and Session State

Through the 1970s and 1980s, conversational systems began maintaining session state: a temporary record of what had been said within a single conversation. PARRY (1972), which simulated a patient with paranoid schizophrenia, tracked assertions made earlier in the conversation to maintain consistency. These systems maintained within-session coherence but had no cross-session persistence.

The Rise of Rule-Based Dialogue

Commercial chatbots of the 1990s, deployed in customer service applications, introduced more sophisticated dialogue state machines. Bots like SmarterChild (launched on AIM in 2001 by ActiveBuddy) went further, allowing users to set preferences that persisted across sessions: rudimentary but groundbreaking personalization. This era established the principle that conversation history has ongoing commercial value.

Neural Networks and Context Windows

The deep learning era introduced the concept of the context window as the mechanism for conversation memory. Early transformer models processed the full conversation history within a single forward pass, limited to a few thousand tokens. The context window became the fundamental constraint: everything inside it was "memory," everything outside was gone. GPT-3's launch in 2020 brought this architecture to mainstream awareness with a 2,048-token context window.
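The hard boundary of the context window can be sketched with a simple truncation function. This is a toy, assuming a whitespace "tokenizer" for readability (real models use subword tokenizers such as BPE), and a deliberately tiny window; the function name and constant are invented for the example.

```python
CONTEXT_WINDOW = 6  # toy size; real windows ranged from 2,048 (GPT-3) upward

def build_prompt(turns: list[str], window: int = CONTEXT_WINDOW) -> list[str]:
    """Keep only the most recent tokens that fit in the window.
    Older tokens are simply dropped; the model never sees them again."""
    tokens: list[str] = []
    for turn in turns:
        tokens.extend(turn.split())
    return tokens[-window:]

history = ["my name is Ada", "I like chess", "what is my name"]
print(build_prompt(history))
# ['like', 'chess', 'what', 'is', 'my', 'name']
```

Everything inside the returned slice is "memory"; the earlier tokens, including the user's name, have fallen off the front of the window and are irrecoverable from within the conversation.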

Modern LLMs and the Memory Problem

GPT-4 and Claude expanded context windows to 128k and 200k tokens respectively, but the fundamental reset problem remained. Every new conversation starts from zero. The AI has no memory of what you discussed yesterday, last week, or last year, unless you explicitly re-provide that context. This limitation is not an oversight; it's an architectural consequence of stateless inference. Solving it at scale is the defining engineering challenge of the AI memory category, and the market opportunity that ChatHistory.com represents.