AI Agent Memory: From Manual Implementation to Mem0 to AWS AgentCORE

Sudarshan Gouda

Introduction

AI agents need memory to remember past conversations, user preferences, and learned information. Just like humans have different types of memory (short-term, long-term, episodic), AI agents use different memory systems to function effectively.

This guide explains memory in simple terms and shows you how to implement it both without external tools (using pure Python) and with external tools (using specialized services). We'll end with complete end-to-end solutions: first Mem0, which combines all memory types behind one API, and then AWS AgentCORE Memory as an AWS-native alternative.


Understanding Memory Types (Simple Explanation)

Think of AI agent memory like human memory:

| Memory Type | What It Does | Simple Example |
| --- | --- | --- |
| Short-term Memory | Remembers current conversation | "What did the user just say?" |
| Long-term Memory | Remembers across sessions | "User prefers dark mode" (even after days) |
| Episodic Memory | Remembers specific past events | "Last week, user asked about Python" |
| Semantic Memory | Remembers facts and knowledge | "User is a software developer" |

Part 1: Memory Without External Tools

When you don't want to use external databases or services, you can implement memory using pure Python. This is great for:

  • Learning and prototyping
  • Small applications
  • Full control over your data

1.1 Simple Short-Term Memory (Current Conversation)

What it does: Keeps track of the current conversation.

```python
class SimpleShortTermMemory:
    """Remembers the current conversation"""

    def __init__(self, max_messages=10):
        self.messages = []
        self.max_messages = max_messages

    def add_message(self, role, content):
        """Add a message (user or assistant)"""
        self.messages.append({"role": role, "content": content})
        # Keep only recent messages
        if len(self.messages) > self.max_messages:
            self.messages.pop(0)  # Remove oldest

    def get_conversation(self):
        """Get all messages for the LLM"""
        return self.messages


# Usage
memory = SimpleShortTermMemory(max_messages=5)
memory.add_message("user", "Hi, I'm Alice")
memory.add_message("assistant", "Hello Alice! How can I help?")
memory.add_message("user", "What's my name?")

# Get conversation context
context = memory.get_conversation()
# The LLM can now see: user said "Hi, I'm Alice" and the assistant responded
```

1.2 Simple Long-Term Memory (User Preferences)

What it does: Remembers user preferences across sessions.

```python
import json
import os


class SimpleLongTermMemory:
    """Remembers user preferences and facts"""

    def __init__(self, storage_file="memory.json"):
        self.storage_file = storage_file
        self.data = self._load()

    def _load(self):
        """Load from file"""
        if os.path.exists(self.storage_file):
            with open(self.storage_file, 'r') as f:
                return json.load(f)
        # Both stores are keyed by user_id
        return {"preferences": {}, "facts": {}}

    def _save(self):
        """Save to file"""
        with open(self.storage_file, 'w') as f:
            json.dump(self.data, f, indent=2)

    def remember_preference(self, user_id, key, value):
        """Remember a user preference"""
        if user_id not in self.data["preferences"]:
            self.data["preferences"][user_id] = {}
        self.data["preferences"][user_id][key] = value
        self._save()

    def get_preference(self, user_id, key):
        """Get a user preference"""
        return self.data["preferences"].get(user_id, {}).get(key)

    def remember_fact(self, user_id, fact):
        """Remember a fact about the user"""
        if user_id not in self.data["facts"]:
            self.data["facts"][user_id] = []
        self.data["facts"][user_id].append(fact)
        self._save()

    def get_facts(self, user_id):
        """Get all facts about a user"""
        return self.data["facts"].get(user_id, [])


# Usage
ltm = SimpleLongTermMemory()

# Remember preferences
ltm.remember_preference("alice_123", "theme", "dark")
ltm.remember_preference("alice_123", "language", "Python")

# Remember facts
ltm.remember_fact("alice_123", "User is a software developer")
ltm.remember_fact("alice_123", "User works at TechCorp")

# Later, retrieve memories
theme = ltm.get_preference("alice_123", "theme")  # Returns "dark"
facts = ltm.get_facts("alice_123")                # Returns list of facts
```

1.3 Simple Episodic Memory (Past Interactions)

What it does: Remembers specific past conversations to learn from them.

```python
class SimpleEpisodicMemory:
    """Remembers past interactions"""

    def __init__(self, max_episodes=100):
        self.episodes = []
        self.max_episodes = max_episodes

    def add_episode(self, user_query, assistant_response, outcome="success"):
        """Store a past interaction"""
        episode = {
            "query": user_query,
            "response": assistant_response,
            "outcome": outcome
        }
        self.episodes.append(episode)
        # Keep only recent episodes
        if len(self.episodes) > self.max_episodes:
            self.episodes.pop(0)

    def find_similar(self, query, top_k=3):
        """Find similar past interactions"""
        # Simple keyword matching
        query_words = set(query.lower().split())
        scored = []
        for episode in self.episodes:
            episode_words = set(episode["query"].lower().split())
            # Count matching words
            matches = len(query_words.intersection(episode_words))
            if matches > 0:
                scored.append((matches, episode))
        # Sort by match count (the key avoids comparing dicts on ties)
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [ep for _, ep in scored[:top_k]]


# Usage
episodic = SimpleEpisodicMemory()

# Store past successful interactions
episodic.add_episode(
    "How do I create a Python virtual environment?",
    "Use: python -m venv myenv, then activate with: source myenv/bin/activate",
    outcome="success"
)
episodic.add_episode(
    "What's the best way to handle Python dependencies?",
    "Use requirements.txt or pyproject.toml with pip or poetry",
    outcome="success"
)

# Find similar past interactions
similar = episodic.find_similar("How do I set up a Python project?")
# Returns similar past episodes that can be used as examples
```

1.4 Simple Semantic Memory (Knowledge Base)

What it does: Stores facts and knowledge that can be searched.

```python
class SimpleSemanticMemory:
    """Stores and searches knowledge"""

    def __init__(self):
        self.knowledge = []

    def add_knowledge(self, content, category="general"):
        """Add a piece of knowledge"""
        self.knowledge.append({
            "content": content,
            "category": category
        })

    def search(self, query, top_k=3):
        """Search for relevant knowledge"""
        query_words = set(query.lower().split())
        scored = []
        for item in self.knowledge:
            content_words = set(item["content"].lower().split())
            matches = len(query_words.intersection(content_words))
            if matches > 0:
                scored.append((matches, item))
        # Sort by match count (the key avoids comparing dicts on ties)
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [item for _, item in scored[:top_k]]


# Usage
semantic = SimpleSemanticMemory()

# Add knowledge
semantic.add_knowledge("Alice is a data scientist at TechCorp", "user_profile")
semantic.add_knowledge("Alice prefers detailed technical explanations", "preferences")
semantic.add_knowledge("Alice uses Python and scikit-learn", "tools")

# Search for relevant knowledge
results = semantic.search("What tools does Alice use?")
# Top result: the "Alice uses Python and scikit-learn" entry
```

1.5 Complete Example: All Memory Types Together

```python
class SimpleMemoryAgent:
    """Agent with all memory types (no external tools)"""

    def __init__(self):
        self.short_term = SimpleShortTermMemory(max_messages=10)
        self.long_term = SimpleLongTermMemory()
        self.episodic = SimpleEpisodicMemory()
        self.semantic = SimpleSemanticMemory()

    def process_query(self, user_id, user_query):
        """Process a user query using all memory types"""
        # 1. Get long-term memories (preferences, facts)
        preferences = self.long_term.data.get("preferences", {}).get(user_id, {})
        facts = self.long_term.get_facts(user_id)

        # 2. Get similar past episodes (few-shot examples)
        similar_episodes = self.episodic.find_similar(user_query, top_k=2)

        # 3. Get relevant knowledge
        relevant_knowledge = self.semantic.search(user_query, top_k=2)

        # 4. Build context for the LLM
        episode_lines = "\n".join(
            f"Q: {e['query']}\nA: {e['response']}" for e in similar_episodes
        )
        knowledge_lines = "\n".join(k["content"] for k in relevant_knowledge)
        context = f"""User Preferences: {preferences}
Known Facts: {facts}

Similar Past Interactions:
{episode_lines}

Relevant Knowledge:
{knowledge_lines}

Current Conversation:
{self.short_term.get_conversation()}
"""

        # 5. Add to short-term memory
        self.short_term.add_message("user", user_query)

        # 6. Generate response (placeholder - replace with an actual
        #    LLM call that receives `context`)
        response = f"Response to: {user_query}"

        # 7. Store in episodic memory
        self.episodic.add_episode(user_query, response, outcome="success")

        # 8. Add response to short-term memory
        self.short_term.add_message("assistant", response)

        return response


# Usage
agent = SimpleMemoryAgent()

# Set up some memories
agent.long_term.remember_preference("alice_123", "theme", "dark")
agent.semantic.add_knowledge("Alice is a Python developer", "profile")

# Process queries
response1 = agent.process_query("alice_123", "Hi, I'm Alice")
response2 = agent.process_query("alice_123", "What's my favorite theme?")
# Agent remembers from long-term memory: "dark"
```

Part 2: Memory With External Tools

External tools provide better scalability, persistence, and advanced features like semantic search. They are a better fit for:

  • Production applications
  • Large-scale systems
  • Multiple users
  • Advanced search capabilities

2.1 LangGraph Checkpointer (Short-Term + Persistence)

What it does: Manages conversation state with automatic persistence.

```python
from typing import Annotated, TypedDict

from langchain_core.messages import HumanMessage
from langchain_openai import ChatOpenAI
from langgraph.checkpoint.memory import InMemorySaver
from langgraph.graph import StateGraph, START, END
from langgraph.graph.message import add_messages


# Define state
class ConversationState(TypedDict):
    messages: Annotated[list, add_messages]


# Initialize LLM
llm = ChatOpenAI(model="gpt-4")


# Create graph
def chat_node(state: ConversationState):
    response = llm.invoke(state["messages"])
    return {"messages": [response]}


workflow = StateGraph(ConversationState)
workflow.add_node("chat", chat_node)
workflow.add_edge(START, "chat")
workflow.add_edge("chat", END)

# Add checkpointer for persistence
checkpointer = InMemorySaver()
graph = workflow.compile(checkpointer=checkpointer)

# Usage - each thread_id maintains a separate conversation
config = {"configurable": {"thread_id": "user_alice"}}

# First message
result = graph.invoke(
    {"messages": [HumanMessage(content="Hi, I'm Alice!")]},
    config
)

# Second message - remembers the previous conversation
result = graph.invoke(
    {"messages": [HumanMessage(content="What's my name?")]},
    config
)
# LLM remembers: "Alice"
```

2.2 ChromaDB (Semantic Memory)

What it does: Vector database for semantic search over knowledge.

```python
import os
import uuid

import chromadb
from chromadb.utils import embedding_functions


class ChromaSemanticMemory:
    """Semantic memory using ChromaDB"""

    def __init__(self, collection_name="knowledge"):
        self.client = chromadb.PersistentClient(path="./chroma_db")
        # Use OpenAI embeddings
        self.embedding_fn = embedding_functions.OpenAIEmbeddingFunction(
            api_key=os.environ["OPENAI_API_KEY"],
            model_name="text-embedding-3-small"
        )
        self.collection = self.client.get_or_create_collection(
            name=collection_name,
            embedding_function=self.embedding_fn
        )

    def add_knowledge(self, content, metadata=None):
        """Add knowledge to the database"""
        self.collection.add(
            ids=[str(uuid.uuid4())],  # ChromaDB requires an ID per document
            documents=[content],
            metadatas=[metadata] if metadata else None
        )

    def search(self, query, n_results=3):
        """Search for semantically similar knowledge"""
        results = self.collection.query(
            query_texts=[query],
            n_results=n_results
        )
        return results['documents'][0]  # Returns list of relevant content


# Usage
memory = ChromaSemanticMemory()

# Add knowledge
memory.add_knowledge("Alice prefers Python over JavaScript")
memory.add_knowledge("Alice is building a recommendation system")

# Search semantically
results = memory.search("What programming language does Alice like?")
# Returns: ["Alice prefers Python over JavaScript"]
# Even though the query doesn't match exactly, semantic search finds it
```

2.3 Pinecone (Episodic Memory at Scale)

What it does: Cloud vector database for storing millions of past interactions.

```python
from openai import OpenAI
from pinecone import Pinecone, ServerlessSpec


class PineconeEpisodicMemory:
    """Episodic memory using Pinecone"""

    def __init__(self, index_name="episodes"):
        self.pc = Pinecone()  # Reads PINECONE_API_KEY from the environment
        self.openai = OpenAI()

        # Create index if needed
        if index_name not in [idx.name for idx in self.pc.list_indexes()]:
            self.pc.create_index(
                name=index_name,
                dimension=1536,  # OpenAI embedding dimension
                metric="cosine",
                spec=ServerlessSpec(cloud="aws", region="us-east-1")
            )
        self.index = self.pc.Index(index_name)

    def _embed(self, text):
        """Create an embedding for the given text"""
        return self.openai.embeddings.create(
            model="text-embedding-3-small",
            input=text
        ).data[0].embedding

    def store_episode(self, episode_id, query, response, user_id):
        """Store a past interaction"""
        # Create embedding
        embedding = self._embed(f"Query: {query}\nResponse: {response}")

        # Store in Pinecone
        self.index.upsert(vectors=[{
            "id": episode_id,
            "values": embedding,
            "metadata": {
                "query": query,
                "response": response,
                "user_id": user_id
            }
        }])

    def find_similar(self, query, user_id=None, top_k=3):
        """Find similar past interactions"""
        # Create query embedding
        embedding = self._embed(query)

        # Search, optionally filtering by user
        results = self.index.query(
            vector=embedding,
            top_k=top_k,
            include_metadata=True,
            filter={"user_id": user_id} if user_id else None
        )
        return [match.metadata for match in results.matches]


# Usage
episodic = PineconeEpisodicMemory()

# Store episodes
episodic.store_episode(
    "ep_001",
    "How do I optimize a database query?",
    "Add indexes, use EXPLAIN, and optimize WHERE clauses",
    user_id="alice_123"
)

# Find similar
similar = episodic.find_similar(
    "My database is slow, what should I do?",
    user_id="alice_123"
)
# Returns similar past interactions
```

Part 3: End-to-End Solution with Mem0 (All Memory Types)

Mem0 is a specialized service that handles all memory types automatically. It extracts, stores, and retrieves memories intelligently.

Complete Mem0 Implementation

```python
from typing import Dict, List

from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_openai import ChatOpenAI
from mem0 import MemoryClient


class Mem0MemoryAgent:
    """Complete agent using Mem0 for all memory types"""

    def __init__(self):
        # Initialize Mem0 (requires the MEM0_API_KEY environment variable)
        self.mem0 = MemoryClient()

        # Initialize LLM
        self.llm = ChatOpenAI(model="gpt-4")

        # Create prompt template with memory context
        self.prompt = ChatPromptTemplate.from_messages([
            ("system", """You are a helpful personal assistant with memory.
Use the provided memories to personalize your responses.

Relevant Memories:
{memories}

Use these memories to provide personalized, context-aware responses."""),
            MessagesPlaceholder(variable_name="history"),
            ("user", "{input}")
        ])

    def get_memories(self, query: str, user_id: str) -> str:
        """Retrieve relevant memories for the current query"""
        try:
            results = self.mem0.search(query, user_id=user_id)
            if results.get("results"):
                memories = [f"- {mem['memory']}" for mem in results["results"]]
                return "\n".join(memories)
            return "No relevant memories found."
        except Exception as e:
            print(f"Memory retrieval error: {e}")
            return "No relevant memories found."

    def save_interaction(self, user_id: str, user_input: str,
                         assistant_response: str):
        """Save interaction to Mem0 - it automatically extracts memories"""
        try:
            self.mem0.add(
                messages=[
                    {"role": "user", "content": user_input},
                    {"role": "assistant", "content": assistant_response}
                ],
                user_id=user_id
            )
        except Exception as e:
            print(f"Memory save error: {e}")

    def chat(self, user_input: str, user_id: str,
             history: List[Dict] = None) -> str:
        """Main chat function with full memory integration"""
        history = history or []

        # 1. Retrieve relevant memories (Mem0 handles all memory types)
        memories = self.get_memories(user_input, user_id)

        # 2. Generate response with memory context
        chain = self.prompt | self.llm
        response = chain.invoke({
            "memories": memories,
            "history": history,
            "input": user_input
        })

        # 3. Save interaction (Mem0 automatically extracts and stores memories)
        self.save_interaction(user_id, user_input, response.content)

        return response.content

    def get_all_memories(self, user_id: str) -> List[Dict]:
        """Get all memories for a user"""
        try:
            results = self.mem0.get_all(user_id=user_id)
            return results.get("results", [])
        except Exception as e:
            print(f"Error retrieving memories: {e}")
            return []

    def delete_memory(self, memory_id: str):
        """Delete a specific memory"""
        try:
            self.mem0.delete(memory_id=memory_id)
        except Exception as e:
            print(f"Error deleting memory: {e}")


# Complete Usage Example
def main():
    """End-to-end example using Mem0"""
    # Initialize agent
    agent = Mem0MemoryAgent()
    user_id = "alice_123"
    conversation_history = []

    print("=== Conversation 1 ===")
    # First interaction
    user_input1 = ("Hi! I'm Alice and I love hiking in the mountains. "
                   "I'm a Python developer at TechCorp.")
    response1 = agent.chat(user_input1, user_id, conversation_history)
    print(f"User: {user_input1}")
    print(f"Assistant: {response1}\n")

    # Update history
    conversation_history.append({"role": "user", "content": user_input1})
    conversation_history.append({"role": "assistant", "content": response1})

    print("=== Conversation 2 (Same Session) ===")
    # Second interaction - Mem0 remembers from the first conversation
    user_input2 = "What outdoor activities would you recommend for this weekend?"
    response2 = agent.chat(user_input2, user_id, conversation_history)
    print(f"User: {user_input2}")
    print(f"Assistant: {response2}\n")
    # Mem0 recalls: Alice loves hiking → recommends hiking activities

    print("=== Conversation 3 (New Session - Days Later) ===")
    # New session - Mem0 still remembers!
    user_input3 = "What programming language should I use for my new project?"
    response3 = agent.chat(user_input3, user_id, [])  # Empty history, but Mem0 remembers
    print(f"User: {user_input3}")
    print(f"Assistant: {response3}\n")
    # Mem0 recalls: Alice is a Python developer → recommends Python

    print("=== All Memories for User ===")
    # View all stored memories
    all_memories = agent.get_all_memories(user_id)
    for i, mem in enumerate(all_memories, 1):
        print(f"{i}. {mem.get('memory', 'N/A')}")

    print("\n=== Memory Types Handled by Mem0 ===")
    print("""
Mem0 automatically handles:
- Short-term Memory: Current conversation context
- Long-term Memory: User preferences and facts (persisted)
- Episodic Memory: Past interactions and experiences
- Semantic Memory: Knowledge about the user and domain

All extracted automatically from conversations!
""")


if __name__ == "__main__":
    main()
```

How Mem0 Handles All Memory Types

Mem0 automatically extracts and manages different memory types:

  1. Short-term Memory: Maintains conversation context during the session
  2. Long-term Memory: Extracts user preferences and facts, stores them persistently
  3. Episodic Memory: Remembers specific past interactions and their outcomes
  4. Semantic Memory: Builds a knowledge base about users and topics

Key Benefits of Mem0:

  • ✅ Automatic memory extraction (no manual coding)
  • ✅ Intelligent retrieval (finds relevant memories)
  • ✅ Handles all memory types automatically
  • ✅ Production-ready and scalable
  • ✅ Simple API

Part 4: AWS AgentCORE Memory (Alternative to Mem0)

AWS Bedrock AgentCORE Memory is a fully managed AWS service that provides similar capabilities to Mem0. It's designed for applications already using AWS services and offers enterprise-grade features.

Can AWS AgentCORE Memory be Used Like Mem0?

Yes! AWS AgentCORE Memory can be used similarly to Mem0. Both provide:

  • Short-term and long-term memory
  • Automatic memory extraction
  • Context-aware retrieval
  • Multi-session persistence

Key Differences

| Feature | Mem0 | AWS AgentCORE Memory |
| --- | --- | --- |
| Deployment | Open-source + managed | Fully managed AWS service |
| Integration | Works with any LLM | Optimized for AWS Bedrock |
| Setup | Simple API key | AWS account + IAM setup |
| Cost | Usage-based pricing | AWS pricing model |
| Customization | Open-source option available | AWS-managed (less customization) |
| Best For | Multi-cloud, flexibility | AWS-native applications |

AWS AgentCORE Memory Implementation

Note: This requires the bedrock-agentcore Python SDK. Install with:

```bash
pip install bedrock-agentcore
```
```python
from datetime import datetime
from typing import Dict, List

from bedrock_agentcore.memory import MemoryClient
from bedrock_agentcore.memory.session import MemorySessionManager
from bedrock_agentcore.memory.constants import ConversationalMessage, MessageRole


class AWSAgentCOREMemory:
    """Agent using AWS Bedrock AgentCORE Memory"""

    def __init__(self, region_name="us-east-1", memory_name="AgentMemory"):
        # Initialize Memory Client
        self.memory_client = MemoryClient(region_name=region_name)

        # Create or get memory resource
        self.memory = self._get_or_create_memory(memory_name)
        self.memory_id = self.memory['id']

        # Initialize session manager
        self.session_manager = MemorySessionManager(
            memory_id=self.memory_id,
            region_name=region_name
        )

    def _get_or_create_memory(self, name: str) -> Dict:
        """Create or retrieve memory resource"""
        try:
            # Try to get existing memory
            memories = self.memory_client.list_memories()
            for mem in memories.get('memories', []):
                if mem.get('name') == name:
                    return mem

            # Create new memory if not found
            memory = self.memory_client.create_memory(
                name=name,
                description="Memory store for AI agent",
                eventExpiryDuration=30,  # Store events for 30 days
                memoryStrategies=[
                    {
                        "userPreferenceMemoryStrategy": {
                            "name": "UserPreferences",
                            "namespaces": ["agent/{actorId}/preferences"]
                        }
                    },
                    {
                        "semanticMemoryStrategy": {
                            "name": "SemanticKnowledge",
                            "namespaces": ["agent/{actorId}/knowledge"]
                        }
                    }
                ]
            )
            return memory
        except Exception as e:
            print(f"Error creating memory: {e}")
            raise

    def store_interaction(self, user_id: str, session_id: str,
                          user_message: str, assistant_message: str):
        """Store interaction in short-term memory"""
        try:
            # Create or get session
            session = self.session_manager.create_memory_session(
                actor_id=user_id,
                session_id=session_id
            )
            # Add conversation turns
            session.add_turns(
                messages=[
                    ConversationalMessage(user_message, MessageRole.USER),
                    ConversationalMessage(assistant_message, MessageRole.ASSISTANT)
                ]
            )
        except Exception as e:
            print(f"Error storing interaction: {e}")

    def get_recent_events(self, user_id: str, session_id: str,
                          max_results: int = 10) -> List[Dict]:
        """Get recent events from short-term memory"""
        try:
            events = self.memory_client.list_events(
                memory_id=self.memory_id,
                actor_id=user_id,
                session_id=session_id,
                max_results=max_results
            )
            return events.get('events', [])
        except Exception as e:
            print(f"Error retrieving events: {e}")
            return []

    def retrieve_long_term_memories(self, user_id: str, query: str,
                                    top_k: int = 5) -> List[Dict]:
        """Retrieve long-term memories (preferences, facts)"""
        try:
            # Search in the preferences namespace
            preferences = self.memory_client.retrieve_memory_records(
                memory_id=self.memory_id,
                namespace=f"agent/{user_id}/preferences",
                searchCriteria={
                    "searchQuery": query,
                    "topK": top_k
                }
            )
            # Search in the knowledge namespace
            knowledge = self.memory_client.retrieve_memory_records(
                memory_id=self.memory_id,
                namespace=f"agent/{user_id}/knowledge",
                searchCriteria={
                    "searchQuery": query,
                    "topK": top_k
                }
            )
            # Combine results
            all_memories = (preferences.get('memoryRecords', []) +
                            knowledge.get('memoryRecords', []))
            return all_memories[:top_k]
        except Exception as e:
            print(f"Error retrieving long-term memories: {e}")
            return []

    def get_all_long_term_memories(self, user_id: str) -> List[Dict]:
        """Get all long-term memories for a user"""
        try:
            session = self.session_manager.create_memory_session(
                actor_id=user_id,
                session_id="retrieval_session"
            )
            # List all memory records
            memory_records = session.list_long_term_memory_records(
                namespace_prefix=f"agent/{user_id}/"
            )
            return list(memory_records)
        except Exception as e:
            print(f"Error getting all memories: {e}")
            return []

    def chat(self, user_input: str, user_id: str, session_id: str,
             llm_callback=None) -> str:
        """
        Main chat function with AgentCORE Memory integration

        Args:
            user_input: User's message
            user_id: Unique user identifier
            session_id: Session identifier
            llm_callback: Function to call the LLM (you provide this)

        Returns:
            Assistant response
        """
        # 1. Get recent events (short-term memory)
        recent_events = self.get_recent_events(user_id, session_id, max_results=5)

        # 2. Get long-term memories (preferences, facts)
        long_term_memories = self.retrieve_long_term_memories(
            user_id, user_input, top_k=3
        )

        # 3. Build context from memories
        context = self._build_context(recent_events, long_term_memories)

        # 4. Generate response using the LLM (you provide this function)
        if llm_callback:
            assistant_response = llm_callback(user_input, context)
        else:
            # Placeholder response
            assistant_response = f"Response to: {user_input}"

        # 5. Store interaction in memory
        self.store_interaction(user_id, session_id, user_input, assistant_response)

        return assistant_response

    def _build_context(self, recent_events: List[Dict],
                       long_term_memories: List[Dict]) -> str:
        """Build context string from memories"""
        context_parts = []

        # Add recent conversation context
        if recent_events:
            context_parts.append("Recent Conversation:")
            for event in recent_events[-5:]:  # Last 5 events
                for msg in event.get('messages', []):
                    role = msg.get('role', '')
                    content = msg.get('content', '')
                    context_parts.append(f"{role}: {content}")

        # Add long-term memories
        if long_term_memories:
            context_parts.append("\nRelevant Memories:")
            for mem in long_term_memories:
                content = mem.get('content', {}).get('text', '')
                if content:
                    context_parts.append(f"- {content}")

        return "\n".join(context_parts)


# Usage Example
def llm_generate(user_input: str, context: str) -> str:
    """
    Example LLM callback function

    In production, replace with an actual LLM call (Bedrock, OpenAI, etc.)
    """
    # This is a placeholder - replace with your LLM
    return f"Based on context: {context[:50]}... Response to: {user_input}"


def main_aws():
    """Example using AWS AgentCORE Memory"""
    # Initialize AgentCORE Memory
    agent = AWSAgentCOREMemory(region_name="us-east-1",
                               memory_name="MyAgentMemory")
    user_id = "alice_123"
    session_id = f"session_{datetime.now().timestamp()}"

    print("=== AWS AgentCORE Memory Example ===\n")

    # First interaction
    print("--- Conversation 1 ---")
    user_input1 = ("Hi! I'm Alice and I love hiking in the mountains. "
                   "I'm a Python developer at TechCorp.")
    response1 = agent.chat(user_input1, user_id, session_id,
                           llm_callback=llm_generate)
    print(f"User: {user_input1}")
    print(f"Assistant: {response1}\n")
    # AgentCORE stores this in short-term memory and extracts long-term memories

    # Second interaction - AgentCORE remembers from short-term memory
    print("--- Conversation 2 (Same Session) ---")
    user_input2 = "What outdoor activities would you recommend for this weekend?"
    response2 = agent.chat(user_input2, user_id, session_id,
                           llm_callback=llm_generate)
    print(f"User: {user_input2}")
    print(f"Assistant: {response2}\n")
    # AgentCORE recalls: Alice loves hiking (from short-term memory)

    # New session - long-term memory persists
    print("--- Conversation 3 (New Session - Days Later) ---")
    new_session_id = f"session_{datetime.now().timestamp()}"
    user_input3 = "What programming language should I use for my new project?"
    response3 = agent.chat(user_input3, user_id, new_session_id,
                           llm_callback=llm_generate)
    print(f"User: {user_input3}")
    print(f"Assistant: {response3}\n")
    # AgentCORE recalls from long-term memory: Alice is a Python developer

    # View all stored memories
    print("--- All Long-Term Memories for User ---")
    all_memories = agent.get_all_long_term_memories(user_id)
    for i, mem in enumerate(all_memories, 1):
        content = mem.get('content', {}).get('text', 'N/A')
        print(f"{i}. {content}")


if __name__ == "__main__":
    # Note: Requires:
    # 1. AWS credentials configured (aws configure)
    # 2. Bedrock AgentCORE access enabled
    # 3. Install: pip install bedrock-agentcore
    #
    # Uncomment to run:
    # main_aws()
    pass
```

How AWS AgentCORE Memory Works

AWS AgentCORE Memory provides:

  1. Short-Term Memory:

    • Stores raw interaction events using create_event() or add_turns() (see the sketch after this list)
    • Events organized by actor (user) and session
    • Maintains chronological order for conversation flow
    • Configurable retention (up to 365 days)
  2. Long-Term Memory:

    • Uses Memory Strategies to extract insights from events
    • Built-in strategies: userPreferenceMemoryStrategy, semanticMemoryStrategy
    • Stores extracted memories in hierarchical namespaces
    • Persists across sessions automatically
    • Retrieved using retrieve_memory_records() with search queries
  3. Memory Strategies:

    • User Preference Strategy: Extracts user preferences and settings
    • Semantic Strategy: Extracts facts and knowledge
    • Custom strategies can be defined for specific needs
    • Strategies process events and create long-term memory records
  4. Security:

    • Data encrypted at rest and in transit
    • AWS-managed or customer-managed KMS keys
    • Fine-grained access control via namespaces
    • IAM-based authentication
  5. Scalability:

    • Fully managed service - no infrastructure to manage
    • Handles large volumes efficiently
    • Low latency retrieval
    • Built for production workloads
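
For reference, here is a minimal sketch of the raw create_event() path mentioned in point 1, modeled on AWS's published bedrock-agentcore samples. Treat the argument names and the (text, role) message-tuple format as assumptions that may vary across SDK versions:

```python
from bedrock_agentcore.memory import MemoryClient

client = MemoryClient(region_name="us-east-1")

# Store one conversational turn as a raw short-term event.
# memory_id is the ID returned by create_memory(); messages are
# (text, role) pairs, following the AWS sample code.
client.create_event(
    memory_id="your-memory-id",   # placeholder - use your real memory ID
    actor_id="alice_123",         # the user this event belongs to
    session_id="session_001",     # groups events into one conversation
    messages=[
        ("Hi, I'm Alice", "USER"),
        ("Hello Alice! How can I help?", "ASSISTANT"),
    ],
)
```

Any memory strategies attached to the memory resource then process these events asynchronously into long-term records.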

When to Use AWS AgentCORE Memory vs Mem0

Choose AWS AgentCORE Memory if:

  • ✅ You're already using AWS services
  • ✅ You need enterprise-grade security and compliance
  • ✅ You want fully managed infrastructure
  • ✅ You're building AWS-native applications

Choose Mem0 if:

  • ✅ You want open-source flexibility
  • ✅ You're using multiple cloud providers
  • ✅ You need more customization
  • ✅ You want simpler setup (just API key)

Setup Requirements for AWS AgentCORE Memory

  1. AWS Account: Active AWS account with Bedrock AgentCORE access
  2. Install SDK: pip install bedrock-agentcore
  3. AWS Credentials: Configure using aws configure or IAM roles
  4. IAM Permissions: Required permissions for Bedrock AgentCORE Memory
  5. Region: Available in specific AWS regions (e.g., us-east-1, us-west-2)
```python
# Prerequisites setup (one-time)
"""
1. Install the SDK:
   pip install bedrock-agentcore

2. Configure AWS credentials:
   aws configure
   # Or use IAM roles if running on EC2/Lambda

3. Required IAM permissions:
   - bedrock:CreateMemory
   - bedrock:GetMemory
   - bedrock:ListMemories
   - bedrock:UpdateMemory
   - bedrock:DeleteMemory
   - bedrock:CreateEvent
   - bedrock:ListEvents
   - bedrock:RetrieveMemoryRecords
   - bedrock:ListMemoryRecords

4. Enable Bedrock AgentCORE in the AWS Console:
   - Go to the AWS Bedrock Console
   - Request access to AgentCORE features
   - Wait for approval (if required)

5. Create a memory resource:
   - The code above creates a memory resource automatically
   - Or create one manually via the AWS Console/CLI
"""
```

Comparison: Memory Solutions

Quick Comparison Table

| Aspect | Manual (No Tools) | Mem0 | AWS AgentCORE Memory |
| --- | --- | --- | --- |
| Setup Complexity | Very simple | Simple (API key) | Moderate (AWS setup) |
| Scalability | Single machine | High | Enterprise-scale |
| Search Quality | Keyword matching | Semantic search | Semantic search |
| Memory Extraction | Manual coding | Automatic | Automatic |
| Persistence | File-based | Database-backed | AWS-managed |
| Cost | Free | Usage-based | AWS pricing |
| Best For | Learning, prototyping | Production (flexible) | AWS-native apps |
| Open Source | Yes | Yes (option) | No (AWS-managed) |
| Multi-Cloud | N/A | Yes | No (AWS only) |

Detailed Comparison

Manual Memory (No External Tools)

  • Pros: Free, full control, simple setup, no dependencies
  • Cons: Limited scalability, manual extraction, basic search
  • Use When: Learning, prototyping, small applications

Mem0

  • Pros: Automatic extraction, simple API, open-source option, multi-cloud
  • Cons: Requires API key, usage costs for managed version
  • Use When: Production apps needing flexibility, multi-cloud deployments

AWS AgentCORE Memory

  • Pros: Enterprise-grade, AWS integration, fully managed, high security
  • Cons: AWS-only, more complex setup, AWS account required
  • Use When: AWS-native applications, enterprise requirements, need AWS integration

Best Practices

1. Choose the Right Approach

  • Start Simple: Use manual memory for learning and prototyping
  • Scale Up: Move to external tools when you need production features
  • Consider Mem0: If you want automatic memory management with flexibility
  • Consider AWS AgentCORE Memory: If you're building AWS-native applications and need enterprise features

2. Memory Hygiene

  • Regular Cleanup: Remove old or irrelevant memories
  • Deduplication: Avoid storing duplicate information (see the sketch after this list)
  • Validation: Check memory quality before storing
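
To make the deduplication point concrete, here is a minimal sketch that fingerprints normalized memory text before storing, so trivially re-worded duplicates are written only once. The normalization rule is an illustrative assumption; production systems often use embedding similarity instead:

```python
import hashlib


class DedupingMemoryStore:
    """Wraps a memory list and skips exact duplicates after normalization."""

    def __init__(self):
        self.seen_hashes = set()
        self.memories = []

    def _fingerprint(self, text: str) -> str:
        # Normalize whitespace and case so trivial variants collide
        normalized = " ".join(text.lower().split())
        return hashlib.sha256(normalized.encode()).hexdigest()

    def add(self, text: str) -> bool:
        """Store a memory; returns False if it was a duplicate."""
        fp = self._fingerprint(text)
        if fp in self.seen_hashes:
            return False
        self.seen_hashes.add(fp)
        self.memories.append(text)
        return True


store = DedupingMemoryStore()
store.add("Alice prefers dark mode")    # stored
store.add("alice prefers  DARK mode")   # skipped as a duplicate
```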

3. Privacy and Security

  • Encrypt Sensitive Data: Protect user information
  • User Consent: Get permission before storing memories
  • User Control: Let users view and delete their memories

4. Performance

  • Batch Operations: Store multiple memories at once when possible
  • Caching: Cache frequently accessed memories (see the sketch after this list)
  • Indexing: Use proper indexes for fast retrieval
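
As a sketch of the caching idea, here is a small TTL cache wrapped around a memory lookup. The fetch_fn callback is a stand-in for whatever search call your backend exposes (Mem0, ChromaDB, etc.), so the interface here is an assumption:

```python
import time


class CachedMemoryReader:
    """Caches memory lookups for a short TTL to cut repeated retrievals."""

    def __init__(self, fetch_fn, ttl_seconds=60):
        self.fetch_fn = fetch_fn  # e.g. a Mem0 or ChromaDB search call
        self.ttl = ttl_seconds
        self.cache = {}           # (user_id, query) -> (timestamp, result)

    def get(self, user_id, query):
        key = (user_id, query)
        entry = self.cache.get(key)
        if entry and time.time() - entry[0] < self.ttl:
            return entry[1]       # fresh cache hit
        result = self.fetch_fn(user_id, query)
        self.cache[key] = (time.time(), result)
        return result


# Usage with a hypothetical fetch function
reader = CachedMemoryReader(lambda uid, q: [f"memories for {uid}: {q}"])
reader.get("alice_123", "theme")   # hits the backend
reader.get("alice_123", "theme")   # served from the cache
```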

5. Memory Selection

  • Relevance First: Prioritize memories relevant to current context (a combined scoring sketch follows this list)
  • Recency Matters: Give more weight to recent memories
  • Success Filtering: Prefer memories from successful interactions
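
Here is a minimal sketch that combines all three selection signals into one score. The weights and half-life are illustrative assumptions; the relevance value would come from your keyword or vector search:

```python
import math
import time


def rank_memories(memories, now=None, half_life_days=30.0):
    """Score memories by relevance, recency, and outcome, highest first.

    Each memory is a dict with keys: 'relevance' (0..1 from search),
    'timestamp' (unix seconds), and 'outcome' ('success' or other).
    The 0.6/0.3/0.1 weights below are illustrative, not canonical.
    """
    now = now or time.time()
    scored = []
    for mem in memories:
        age_days = (now - mem["timestamp"]) / 86400
        # Exponential recency decay: half weight every half_life_days
        recency = math.exp(-math.log(2) * age_days / half_life_days)
        success = 1.0 if mem.get("outcome") == "success" else 0.5
        score = 0.6 * mem["relevance"] + 0.3 * recency + 0.1 * success
        scored.append((score, mem))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [mem for _, mem in scored]


# Usage (illustrative data): the recent, successful memory ranks first
mems = [
    {"relevance": 0.9, "timestamp": time.time() - 5 * 86400, "outcome": "success"},
    {"relevance": 0.9, "timestamp": time.time() - 90 * 86400, "outcome": "failure"},
]
best = rank_memories(mems)[0]
```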

6. Testing

  • Test Memory Retrieval: Ensure relevant memories are found
  • Test Memory Persistence: Verify memories survive restarts (example test after this list)
  • Test Memory Extraction: Confirm automatic extraction works correctly
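
As an example of the persistence check, here is a minimal pytest sketch against the SimpleLongTermMemory class from Part 1. The module name in the import is hypothetical; the tmp_path fixture keeps the test isolated from your real memory file:

```python
# test_memory.py -- run with: pytest test_memory.py
from my_agent_memory import SimpleLongTermMemory  # hypothetical module name


def test_memories_survive_restart(tmp_path):
    storage = str(tmp_path / "memory.json")

    # First "process": store a preference, then discard the object
    ltm = SimpleLongTermMemory(storage_file=storage)
    ltm.remember_preference("alice_123", "theme", "dark")
    del ltm

    # Simulated restart: a fresh instance must reload from disk
    reloaded = SimpleLongTermMemory(storage_file=storage)
    assert reloaded.get_preference("alice_123", "theme") == "dark"
```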

Conclusion

Memory is essential for building intelligent AI agents. Whether you start with simple Python implementations or use advanced tools like Mem0 or AWS AgentCORE Memory, the key is understanding what each memory type does and when to use it.

Quick Decision Guide:

  • Learning/Prototyping: Use manual memory (Part 1)
  • Production App (Flexible): Use Mem0 (Part 3) - works with any cloud
  • Production App (AWS): Use AWS AgentCORE Memory (Part 4) - AWS-native
  • Custom Needs: Use individual tools like ChromaDB, Pinecone (Part 2)

Start simple, understand the concepts, then scale up as needed. The examples in this guide provide working code you can adapt to your needs.

Key Takeaway: Both Mem0 and AWS AgentCORE Memory can be used similarly - they both provide automatic memory extraction and management. Choose based on your infrastructure preferences (multi-cloud vs AWS-only).

