𝐀𝐈 𝐚𝐠𝐞𝐧𝐭𝐬 𝐝𝐨𝐧’𝐭 𝐧𝐞𝐞𝐝 𝐦𝐨𝐫𝐞 𝐝𝐚𝐭𝐚; 𝐭𝐡𝐞𝐲 𝐧𝐞𝐞𝐝 𝐛𝐞𝐭𝐭𝐞𝐫 𝐦𝐞𝐦𝐨𝐫𝐲 𝐚𝐫𝐜𝐡𝐢𝐭𝐞𝐜𝐭𝐮𝐫𝐞.

Most agents fail not from ignorance, but from memory blindness. Design memory first, and agents become informed, consistent, and trustworthy. Five memories turn static models into adaptive, accountable digital coworkers:

↳ 𝐖𝐨𝐫𝐤𝐢𝐧𝐠 𝐦𝐞𝐦𝐨𝐫𝐲 holds current goals, constraints, and the dialogue turns in play.
↳ 𝐒𝐞𝐦𝐚𝐧𝐭𝐢𝐜 𝐦𝐞𝐦𝐨𝐫𝐲 stores facts, schemas, and domain knowledge beyond single tasks.
↳ 𝐏𝐫𝐨𝐜𝐞𝐝𝐮𝐫𝐚𝐥 𝐦𝐞𝐦𝐨𝐫𝐲 captures tools, steps, and policies for repeatable execution.
↳ 𝐄𝐩𝐢𝐬𝐨𝐝𝐢𝐜 𝐦𝐞𝐦𝐨𝐫𝐲 logs situations, outcomes, and lessons from past work.
↳ 𝐏𝐫𝐞𝐟𝐞𝐫𝐞𝐧𝐜𝐞 𝐦𝐞𝐦𝐨𝐫𝐲 tracks users, roles, thresholds, and exceptions that personalize actions.

Insight: Separation prevents overwrites and hallucinations when contexts suddenly shift.
Insight: Retrieval gates control which memories are relevant, reducing noise.
Insight: Freshness scores prioritize recent episodes without erasing durable knowledge.
Insight: Audit trails from episodic memory create governance and regulatory defensibility.

A manufacturing support agent kept forgetting entitlements and unnecessarily escalated routine tickets. Adding procedural, episodic, and preference memories behind retrieval gates turned it around: resolution accuracy rose, first-contact resolutions jumped, and escalations dropped dramatically. Leaders finally trusted the agent because its decisions referenced verifiable, auditable memories.

If you deploy agents, design memory before prompts, models, or dashboards.

♻️ Repost to empower your network, and follow Timothy Goebel for expert insights.

#AIAgents #Manufacturing #Construction #Healthcare #SmallBusiness
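For readers who want to see the shape of this design, here is a minimal Python sketch of five separated stores behind a retrieval gate, with a simple freshness score. Everything in it (class names, the keyword-overlap scoring, the half-life decay) is an illustrative assumption, not code from the post or any particular framework.

```python
# Minimal sketch: separated memory stores, a retrieval gate, and freshness
# scoring. All names are illustrative; swap in embeddings and a real store.
import time
from dataclasses import dataclass, field

@dataclass
class MemoryItem:
    text: str
    created_at: float = field(default_factory=time.time)

    def freshness(self, half_life_s: float = 7 * 24 * 3600) -> float:
        """Decay from 1.0 toward 0.0 as the item ages (assumed one-week half-life)."""
        return 0.5 ** ((time.time() - self.created_at) / half_life_s)

class MemoryStore:
    def __init__(self, name: str):
        self.name = name
        self.items: list[MemoryItem] = []

    def add(self, text: str) -> None:
        self.items.append(MemoryItem(text))

    def retrieve(self, query: str, k: int = 3) -> list[str]:
        # Toy relevance: keyword overlap weighted by freshness.
        q = set(query.lower().split())
        scored = [
            (len(q & set(m.text.lower().split())) * m.freshness(), m.text)
            for m in self.items
        ]
        return [t for s, t in sorted(scored, reverse=True)[:k] if s > 0]

class AgentMemory:
    """Five separated stores; the gate decides which stores a task may read."""
    def __init__(self):
        self.stores = {n: MemoryStore(n) for n in
                       ("working", "semantic", "procedural", "episodic", "preference")}

    def gated_retrieve(self, query: str, allowed: set[str]) -> dict[str, list[str]]:
        # Retrieval gate: only consult stores relevant to this task type,
        # so unrelated memories never enter the prompt.
        return {n: s.retrieve(query) for n, s in self.stores.items() if n in allowed}

memory = AgentMemory()
memory.stores["procedural"].add("Routine tickets: check the entitlement table before escalating")
memory.stores["preference"].add("Customer prefers email follow-up within 24h")
print(memory.gated_retrieve("escalate routine ticket", {"procedural", "episodic"}))
```

Keeping the stores separate is what makes the gate possible: a ticket-triage task can be restricted to procedural and episodic memory, so preference data never leaks into a context where it could cause an overwrite or a hallucinated justification.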
How to Use Memory Innovation in AI Hardware
Explore top LinkedIn content from expert professionals.
-
The biggest limitation in today’s AI agents is not their fluency. It is memory. Most LLM-based systems forget what happened in the last session, cannot improve over time, and fail to reason across multiple steps. This makes them unreliable in real workflows. They respond well in the moment but do not build lasting context, retain task history, or learn from repeated use.

A recent paper, “Rethinking Memory in AI,” introduces four categories of memory, each tied to specific operations AI agents need to perform reliably:

𝗟𝗼𝗻𝗴-𝘁𝗲𝗿𝗺 𝗺𝗲𝗺𝗼𝗿𝘆 focuses on building persistent knowledge. This includes consolidating recent interactions into summaries, indexing for efficient access, updating older content when facts change, and forgetting irrelevant or outdated data. These operations allow agents to evolve with users, retain institutional knowledge, and maintain coherence across long timelines.

𝗟𝗼𝗻𝗴-𝗰𝗼𝗻𝘁𝗲𝘅𝘁 𝗺𝗲𝗺𝗼𝗿𝘆 refers to techniques that help models manage large context windows during inference. These include pruning attention key-value caches, selecting which past tokens to retain, and compressing history so that models can focus on what matters. These strategies are essential for agents handling extended documents or multi-turn dialogues.

𝗣𝗮𝗿𝗮𝗺𝗲𝘁𝗿𝗶𝗰 𝗺𝗼𝗱𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻 addresses how knowledge inside a model’s weights can be edited, updated, or removed. This includes fine-grained editing methods, adapter tuning, meta-learning, and unlearning. These capabilities allow models to adapt quickly without full retraining or versioning; in continual learning, agents must integrate new knowledge without forgetting old capabilities.

𝗠𝘂𝗹𝘁𝗶-𝘀𝗼𝘂𝗿𝗰𝗲 𝗺𝗲𝗺𝗼𝗿𝘆 focuses on how agents coordinate knowledge across formats and systems. It includes reasoning over multiple documents, merging structured and unstructured data, and aligning information across modalities like text and images. This is especially relevant in enterprise settings, where context is fragmented across tools and sources.

Looking ahead, the future of memory in AI will focus on:

• 𝗦𝗽𝗮𝘁𝗶𝗼-𝘁𝗲𝗺𝗽𝗼𝗿𝗮𝗹 𝗺𝗲𝗺𝗼𝗿𝘆: Agents will track when and where information was learned to reason more accurately and manage relevance over time.
• 𝗨𝗻𝗶𝗳𝗶𝗲𝗱 𝗺𝗲𝗺𝗼𝗿𝘆: Parametric (in-model) and non-parametric (external) memory will be integrated, allowing agents to fluidly switch between what they “know” and what they retrieve.
• 𝗟𝗶𝗳𝗲𝗹𝗼𝗻𝗴 𝗹𝗲𝗮𝗿𝗻𝗶𝗻𝗴: Agents will be expected to learn continuously from interaction without retraining, while avoiding catastrophic forgetting.
• 𝗠𝘂𝗹𝘁𝗶-𝗮𝗴𝗲𝗻𝘁 𝗺𝗲𝗺𝗼𝗿𝘆: In environments with multiple agents, memory will need to be sharable, consistent, and dynamically synchronized across agents.

Memory is not just infrastructure. It defines how your agents reason, adapt, and persist!
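To make the first category concrete, here is a hedged sketch of the four long-term memory operations named above (consolidate, index, update, forget). The summarizer is a stub, and every class and method name is an illustrative assumption, not the paper's implementation.

```python
# Illustrative sketch of the four long-term memory operations: consolidate,
# index, update, forget. The summarizer is a stub; real systems would call
# an LLM or extractive summarizer instead.
from datetime import datetime, timedelta

class LongTermMemory:
    def __init__(self):
        self.records = {}  # key -> (summary, last_updated)
        self.index = {}    # token -> set of record keys, for cheap lookup

    def consolidate(self, key: str, interactions: list[str]) -> None:
        """Compress recent interactions into one durable, indexed summary."""
        summary = " | ".join(interactions)[:500]  # stub for a real summarizer
        self.records[key] = (summary, datetime.now())
        for token in summary.lower().split():
            self.index.setdefault(token, set()).add(key)

    def update(self, key: str, correction: str) -> None:
        """Overwrite stale content when facts change, keeping the key stable."""
        self.consolidate(key, [correction])

    def lookup(self, query: str) -> list[str]:
        """Indexed access: return summaries sharing tokens with the query."""
        keys = set()
        for token in query.lower().split():
            keys |= self.index.get(token, set())
        return [self.records[k][0] for k in keys if k in self.records]

    def forget(self, max_age: timedelta = timedelta(days=90)) -> None:
        """Drop records that have not been touched within max_age."""
        cutoff = datetime.now() - max_age
        for k in [k for k, (_, ts) in self.records.items() if ts < cutoff]:
            del self.records[k]
```

The point of the sketch is the division of labor: consolidation decides what becomes durable, the index decides what is cheap to reach, updates keep facts current under a stable key, and forgetting bounds growth so stale content cannot crowd out fresh knowledge.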
-
😵 Woah, there’s a full-blown paper on how you could build a memory OS for LLMs.

Memory in AI systems has only started getting serious attention recently, mainly because people realized that LLM context lengths are limited and passing everything every time for complex tasks just doesn’t scale. This is a forward-looking paper that treats memory as a first-class citizen, almost like an operating system layer for LLMs. It’s a long and dense read, but here are some highlights:

⛳ The authors define three types of memory in AI systems:
- Parametric: knowledge baked into the model weights
- Activation: temporary, runtime memory (like the KV cache)
- Plaintext: external, editable memory (docs, notes, examples)
The idea is to orchestrate and evolve these memory types together, not treat them as isolated hacks.

⛳ MemOS introduces a unified system to manage memory: representation, organization, access, and governance.

⛳ At the heart of it is MemCube, a core abstraction that enables tracking, fusion, versioning, and migration of memory across tasks. It makes memory reusable and traceable, even across agents.

The vision here isn’t just “memory”; it’s to let agents adapt over time, personalize responses, and coordinate memory across platforms and workflows.

I definitely think memory is one of the biggest blockers to building more human-like agents. This looks super well thought out: it gives you an abstraction to actually build with. Not totally sure the same abstractions will work across all use cases, but I’m very excited to see more work in this direction!

Link: https://lnkd.in/gtxC7kXj
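To give a feel for what such an abstraction could look like, here is a speculative sketch of a MemCube-style object, based only on the capabilities the post names (tracking, fusion, versioning, migration). This is not the paper's API; every field and method here is hypothetical.

```python
# Speculative MemCube-style sketch. NOT the paper's actual API: all names
# are hypothetical, inferred from "tracking, fusion, versioning, migration".
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class MemCube:
    content: str                       # plaintext payload
    source: str                        # provenance: which agent/task produced it
    kind: str = "plaintext"            # "parametric" | "activation" | "plaintext"
    version: int = 1
    history: list[str] = field(default_factory=list)
    created_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

    def revise(self, new_content: str) -> None:
        """Versioning: keep prior content so memory stays traceable."""
        self.history.append(self.content)
        self.content = new_content
        self.version += 1

    def fuse(self, other: "MemCube") -> "MemCube":
        """Fusion: merge two cubes, recording both sources."""
        return MemCube(
            content=f"{self.content}\n{other.content}",
            source=f"{self.source}+{other.source}",
        )

    def migrate(self, target_agent: str) -> "MemCube":
        """Migration: hand a copy to another agent, preserving provenance."""
        return MemCube(content=self.content, source=f"{self.source}->{target_agent}")
```

Even in this toy form, the appeal is visible: because every unit of memory carries its own provenance and version history, an agent can cite where a fact came from and an operator can audit or roll back what the agent "knows".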
-
Unlock the Next Evolution of Agents with Human-like Memory (n8n + zep)

Most agents are set up to retain conversation history as a context window of the past 5 or 10 messages. If we want truly human-like agents, we need to give them long-term memory.

→ Memory that persists across sessions, understands relationships between entities, and evolves over time.

I just dropped a 16-minute video where I show how to integrate Zep with n8n to give your agents long-term, relational memory.

But here’s the catch: this kind of memory can quickly balloon your token usage, especially as you scale. So I break down:

→ The difference between short-term and long-term memory
→ How relational memory makes agents more intelligent
→ Why blindly loading memory is expensive and risky
→ Two methods I use to reduce token count and retrieve only the most relevant memories

This is the next step in building smarter, more scalable AI systems.

📺 Watch the full video here: https://lnkd.in/g4i3mzr5
👥 Join the #1 community to learn & master no-code AI automation: https://lnkd.in/dqVsX4Ab
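The post doesn't show its two token-reduction methods in text, but here is a hedged sketch of one common pattern for the problem it describes: score stored memories against the current query and pack only the best ones into a token budget, rather than loading everything. This is a generic illustration, not Zep's API; the scoring and token-estimate functions are stand-in assumptions.

```python
# Hedged sketch of relevance-gated retrieval under a token budget. Generic
# pattern, not Zep's actual API; scoring and token estimates are stand-ins.

def estimate_tokens(text: str) -> int:
    # Rough heuristic (~4 chars per token); use a real tokenizer in practice.
    return max(1, len(text) // 4)

def score(query: str, memory: str) -> float:
    # Toy keyword-overlap relevance; swap in embedding cosine similarity.
    q, m = set(query.lower().split()), set(memory.lower().split())
    return len(q & m) / (len(q) or 1)

def select_memories(query: str, memories: list[str], token_budget: int = 500) -> list[str]:
    """Return the most relevant memories that fit within token_budget."""
    ranked = sorted(memories, key=lambda mem: score(query, mem), reverse=True)
    chosen, used = [], 0
    for mem in ranked:
        cost = estimate_tokens(mem)
        if score(query, mem) == 0 or used + cost > token_budget:
            continue  # skip irrelevant items and anything that busts the budget
        chosen.append(mem)
        used += cost
    return chosen

memories = [
    "User prefers concise answers with bullet points",
    "User's company ships industrial sensors to Europe",
    "User asked about CRM integrations last week",
]
print(select_memories("which CRM integrations did we discuss?", memories))
```

The budget cap is what keeps costs flat as the memory store grows: retrieval quality improves with more stored memories, but the number of tokens sent to the model stays bounded.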