FastMemory

FastMemory is an ontological clustering engine that transforms flat, unstructured text embeddings into a structured, agent-navigable functional memory graph using the Topology (Component, Block, Function, Data, Access, Event) taxonomy.

Developed by FastBuilder.AI, FastMemory bridges the gap between shallow vector retrieval (RAG) and deterministic computational memory.

🏆 State-of-the-Art (SOTA) Performance

FastMemory reports state-of-the-art results on 13 benchmarks (including FinanceBench, FRAMES, LongBench, GraphRAG-Bench, and HaluEval), outperforming standard vector RAG architectures on multi-hop reasoning, logic extraction, and deterministic pathfinding.

Explore the full benchmark matrix and transparent execution traces on our official Hugging Face Model Card.

📊 Production Memory Efficiency (Verified)

These numbers are from our live production instance serving 445,289 PubChem compounds:

| Metric | Vector Index (TurboVec) | FastMemory (Production) |
| --- | --- | --- |
| Source data | 31 GB (10M float32 vectors) | 40 GB (PubChem Compounds) |
| Stored size | ~4 GB (4-bit quantized) | 1.15 GB (topology DB) |
| Compression ratio | 8x | 35x |
| Runtime RAM | ~4 GB (index must live in RAM) | 129 MiB (full backend) |
| Structure preserved | ❌ Approximate similarity only | ✅ Full CBFDAE topology |
| Multi-hop reasoning | ❌ No | ✅ Wormhole traversal |
| Anti-hallucination | ❌ No | ✅ Fabrication scrubber |

35x compression. 129 MiB to serve 445K compounds. Full structural fidelity — no information lost, information gained.
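The ratios quoted above follow from the listed sizes. As a quick arithmetic check, here is a minimal sketch that uses only the figures from the table; decimal gigabytes are an assumption, and the variable names are illustrative.

```python
# Sizes quoted in the table above, in GB (decimal units are an assumption).
vector_source_gb, vector_stored_gb = 31.0, 4.0
fm_source_gb, fm_stored_gb = 40.0, 1.15
compounds = 445_289

# float32 -> 4-bit quantization is 32 / 4 = 8x by construction.
vector_ratio = vector_source_gb / vector_stored_gb
fm_ratio = fm_source_gb / fm_stored_gb
bytes_per_compound = fm_stored_gb * 1e9 / compounds

print(f"vector index ratio: {vector_ratio:.2f}x")
print(f"FastMemory ratio:   {fm_ratio:.1f}x")
print(f"stored bytes per compound: {bytes_per_compound:.0f}")
```

The results come to about 7.75x and 34.8x, which the table rounds to 8x and 35x, and to roughly 2.6 kB of stored topology per compound. Note that the two columns start from different source datasets, so the ratios describe each system on its own data.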

⚡ Quickstart: Try the RAG-Replacement

Run the FastMemory Demonstration Notebook: Basic Topology Architecture.
Run the FinanceBench SOTA Notebook: Advanced multi-hop financial reasoning.

🤬 Developer Pain Points & The FastMemory Solution

Building reliable AI agents on top of massive codebases and datasets is incredibly hard. FastMemory directly solves the three biggest pain points developers face today:

RAG Hallucinations: Standard vector similarity retrieves unrelated text chunks just because they share keywords (e.g., retrieving the "Login Code" when the user asked about the "Login Bug Ticket"). FastMemory provides Deterministic Pathfinding through isolated functional clusters.
Context Fragmentation: Naive text chunking destroys logical boundaries, losing the surrounding context of a function. FastMemory parses semantic topologies into grouped Cognitive Blocks, providing the AI with sibling functions and deterministic access restrictions.
Graph DB Sync Overhead: Piping hierarchical data into Neo4J normally requires complex, fragile NLP and ETL pipelines. The FastMemory Rust engine does this natively in milliseconds using structural Louvain clustering.

🗺️ The Google Maps Analogy

Imagine opening Google Maps, but all you can see are roads and paths. There are no building names, no entry gates, no transaction information for the buildings, and no communication routing.

If you asked a humanoid robot to navigate to a hospital using this map, it would only see a "road to a doctor", a "road to a bed", a "road to a nurse", and a "road to a pharmacy." It would struggle to know how to behave once it arrived: how to act at each destination and how to pursue each target differently depending on context.

That is exactly what happens when you use standard RAG, semantic ontologies, or flat vector graphs.

[Figure: Standard Ontology / RAG vs. FastMemory Topology Map]

You simply have node-to-node semantic edges. You possess the "roads" (cosine similarity), but you lack the "buildings" (Functional Components) and the "rules of entry/engagement" (Access and Events).

FastMemory addresses this gap. It runs high-speed community detection (Louvain clustering) over the network of data to derive the functional groupings, turning raw text into cognitive blocks that an AI agent can act on directly.
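Louvain clustering works by greedily maximizing modularity, a score that rewards partitions whose groups have many internal edges and few external ones. The sketch below does not implement Louvain itself and is not FastMemory's Rust engine; it only computes that modularity score for a hypothetical toy graph, to show why a clustered grouping scores higher than a flat one. All node and cluster names are made up for illustration.

```python
from collections import defaultdict

def modularity(edges, community_of):
    """Newman modularity Q for a partition of an undirected, unweighted graph."""
    m = len(edges)
    internal = defaultdict(int)  # edges with both endpoints in one community
    degree = defaultdict(int)    # summed node degrees per community
    for u, v in edges:
        degree[community_of[u]] += 1
        degree[community_of[v]] += 1
        if community_of[u] == community_of[v]:
            internal[community_of[u]] += 1
    return sum(internal[c] / m - (degree[c] / (2 * m)) ** 2 for c in degree)

# Hypothetical toy graph: two tightly linked groups joined by a single edge.
edges = [
    ("login", "validate_token"), ("validate_token", "session"), ("login", "session"),
    ("invoice", "refund"), ("refund", "ledger"), ("invoice", "ledger"),
    ("session", "invoice"),
]
clustered = {
    "login": "auth", "validate_token": "auth", "session": "auth",
    "invoice": "billing", "refund": "billing", "ledger": "billing",
}
flat = {node: "all" for node in clustered}

q_clustered = modularity(edges, clustered)
q_flat = modularity(edges, flat)
print(f"clustered: {q_clustered:.3f}, flat: {q_flat:.3f}")
```

Here the two-cluster partition scores 5/14 (about 0.357) while the single flat group scores 0, which is the kind of gain a Louvain pass searches for when it moves nodes between communities.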

🔍 Features & Benefits

Topology Ontology: Information isn't just stored; it is classified into Components, Blocks, Functions, Data, Access restrictions, and Events.
Deterministic Pathfinding: Eliminates RAG hallucinations. An AI doesn't "guess" the answer based on semantic proximity; it traverses a rigorous, rule-based logic graph.
The Agentic Query Engine: Deep recursive subtree targeting. When you query FastMemory, it doesn't just return a matching string—it returns the deepest logical encompassing Block, providing the AI with sibling functions and contextual boundaries.
Production Ready: Designed to scale from local development to large-scale production deployments with graph databases like Neo4J.
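To make the "deepest encompassing Block" idea concrete, here is a minimal sketch over a hypothetical in-memory tree. The node layout (`type`, `name`, `children`) and the substring matching are assumptions made for illustration, not FastMemory's actual data model or query logic.

```python
def find_enclosing_block(node, term, parent_block=None):
    """Depth-first search: return the deepest Block that contains a matching Function."""
    if node["type"] == "Block":
        parent_block = node  # remember the innermost Block seen on this path
    for child in node.get("children", []):
        found = find_enclosing_block(child, term, parent_block)
        if found is not None:
            return found
    if node["type"] == "Function" and term.lower() in node["name"].lower():
        return parent_block
    return None

# Hypothetical Component -> Block -> Function hierarchy.
tree = {
    "type": "Component", "name": "finance", "children": [
        {"type": "Block", "name": "expenses", "children": [
            {"type": "Function", "name": "submit_reimbursement"},
            {"type": "Function", "name": "approve_reimbursement"},
        ]},
        {"type": "Block", "name": "payroll", "children": [
            {"type": "Function", "name": "run_payroll"},
        ]},
    ],
}

block = find_enclosing_block(tree, "reimbursement")
siblings = [child["name"] for child in block["children"]]
print(block["name"], siblings)
```

Returning the whole Block rather than the single matching node is what gives the caller the sibling functions and the boundary of the surrounding context.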

📊 Before & After FastMemory

Standard vector RAG databases index chunks individually, often losing the multi-hop reasoning capability required to trace dependencies. FastMemory restructures these into event-driven, hierarchical memory blocks.

(You can open the interactive D3.js visualizations directly in your browser from the example/ directories!)

🏥 Health Science

[Figure: Before (flat semantic vectors) vs. After (clustered functional memory graph)]

🤖 Robotics

[Figure: Before (flat semantic vectors) vs. After (clustered functional memory graph)]

🚗 Driverless Cars

[Figure: Before (flat semantic vectors) vs. After (clustered functional memory graph)]

📈 Business Analytics

[Figure: Before (flat semantic vectors) vs. After (clustered functional memory graph)]

✉️ Email Analysis

[Figure: Before (flat semantic vectors) vs. After (clustered functional memory graph)]

📋 Audit Operations

[Figure: Before (flat semantic vectors) vs. After (clustered functional memory graph)]

🌍 World Events

[Figure: Before (flat semantic vectors) vs. After (clustered functional memory graph)]

📦 Installation

Rust (Cargo) - CLI Utility: To install the standalone fastmemory CLI tool for terminal, server, or MCP usage:

cargo install fastmemory

Python (PyPI) - Native Import: To install the high-speed Python module (built natively via PyO3) for direct integration into your Python AI applications:

pip install fastmemory

🚀 Usage Guide

FastMemory can be utilized natively from the command line, spun up as a REST server, or imported directly into your Python scripts.

1. Terminal CLI (via Cargo)

# Build the memory graph from an ATF Markdown file
$ fastmemory build data/input.md

# Instantly query the hierarchical graph
$ fastmemory query data/input.md "reimbursement"

2. Python (Direct Import)

With the pip module, Python code can pass Markdown strings directly to the compiled Rust engine, without the overhead of shelling out to the CLI or serializing the input as JSON. The engine builds the graph via Louvain community detection and returns it as JSON.

import fastmemory

# 1. Define or fetch your raw Action-Topology Format (ATF) text
markdown_text = """
## [ID: auth_module]
**Action:** Validate_Token
**Data_Connections:** session_uuid
**Access:** Role_Admin
**Events:** User_Login
"""

# 2. Pass strings synchronously into the Rust engine
topology_json_graph = fastmemory.process_markdown(markdown_text)

print(topology_json_graph)
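To show what the ATF fields in the example above carry, here is a minimal pure-Python sketch that parses that text into a dictionary. It is not FastMemory's parser, and the function name `parse_atf` and the output layout are hypothetical; it only assumes the `## [ID: ...]` header and `**Field:** value` line shapes visible in the sample.

```python
import re

HEADER = re.compile(r"^##\s*\[ID:\s*([^\]]+)\]\s*$")
FIELD = re.compile(r"^\*\*([^:*]+):\*\*\s*(.*)$")

def parse_atf(markdown_text):
    """Parse ATF-style Markdown into {block_id: {field_name: value}}."""
    blocks, current = {}, None
    for raw_line in markdown_text.splitlines():
        line = raw_line.strip()
        header = HEADER.match(line)
        field = FIELD.match(line)
        if header:
            # A new "## [ID: ...]" header opens a new block.
            current = blocks.setdefault(header.group(1).strip(), {})
        elif field and current is not None:
            current[field.group(1)] = field.group(2).strip()
    return blocks

sample = """
## [ID: auth_module]
**Action:** Validate_Token
**Data_Connections:** session_uuid
**Access:** Role_Admin
**Events:** User_Login
"""

parsed = parse_atf(sample)
print(parsed)
```

Each block ends up with one entry per topology field (Action, Data_Connections, Access, Events), which is the information the engine has available when it clusters blocks.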

3. Running as a Service

FastMemory ships with a highly optimized embedded Axum web server and MCP (Model Context Protocol) integration for AI agents:

# Boot the REST API
$ fastmemory serve data/input.md --port 16743
# Query (quote the URL so the shell does not interpret "?"):
#   curl "http://localhost:16743/query?q=reimbursement"

# Boot standard Stdio MCP (for Claude / Gemini IDE integration)
$ fastmemory mcp data/input.md
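The REST endpoint can also be called from Python. The sketch below uses only the standard library and the `/query?q=` route and port shown above. The helper names are hypothetical, and because the response schema is not documented here, the body is returned as raw text rather than parsed.

```python
import urllib.parse
import urllib.request

def build_query_url(term, host="localhost", port=16743):
    """Build the /query URL, percent-encoding the search term."""
    return f"http://{host}:{port}/query?" + urllib.parse.urlencode({"q": term})

def query_fastmemory(term, timeout=5.0, **kwargs):
    """Return the raw response body (requires a running `fastmemory serve`)."""
    url = build_query_url(term, **kwargs)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")

# Building the URL needs no server; calling query_fastmemory() does.
print(build_query_url("travel reimbursement"))
```

Encoding the term through `urlencode` keeps multi-word or punctuated queries from breaking the URL, which a hand-concatenated string would not guarantee.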

4. Data Ingestion

In addition to positional file arguments, FastMemory supports flexible ingestion for dynamic pipelines:

Local Directories (--data): Pass a local directory to parse and cluster multiple ATF Markdown files continuously.
```
$ fastmemory build --data /var/lib/fastmemory/data
```
Remote Pipelines (--datahost): Bind directly to Data Warehouses (Snowflake, BigQuery), Data Lakes (Databricks, Fabric), or S3 by passing a connection URI. FastMemory will securely intercept the URI and dynamically ingest the remote structures.
```
$ fastmemory serve --port 16743 --datahost postgres://db_user:secret@localhost:5432/app
$ fastmemory mcp --datahost s3://corporate-bucket/atfs/
```
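Connection URIs like the one above can embed credentials, so it is worth masking them before they reach logs or shell history dumps. A minimal sketch using the standard library follows; the helper name `redact_uri` is hypothetical and this says nothing about how FastMemory itself handles the URI.

```python
from urllib.parse import urlsplit, urlunsplit

def redact_uri(uri):
    """Mask the password component of a connection URI, if one is present."""
    parts = urlsplit(uri)
    if parts.password is None:
        return uri  # nothing to hide, e.g. s3://bucket/prefix/
    host = parts.hostname or ""
    if parts.port is not None:
        host += f":{parts.port}"
    return urlunsplit(parts._replace(netloc=f"{parts.username}:***@{host}"))

print(redact_uri("postgres://db_user:secret@localhost:5432/app"))
print(redact_uri("s3://corporate-bucket/atfs/"))
```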

Tip (Large Scale Graph DB Memory): When scaling FastMemory beyond local processing, the clustered JSON output explicitly maps into systems like Neo4J. FastMemory naturally supports partial topology updates. Please read our Production Scaling & Graph DB Ingestion Guide for detailed Python and Cypher deployment patterns.
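As a rough illustration of mapping clustered output into a graph database, the sketch below turns a hypothetical JSON shape into parameterized Cypher statements. The `blocks`/`functions` layout, the `Block` and `Function` labels, and the `CONTAINS` relationship are all assumptions made for this example; consult the ingestion guide for the real schema.

```python
def to_cypher(graph):
    """Yield (query, params) pairs; MERGE makes repeated or partial loads idempotent."""
    for block in graph["blocks"]:
        yield "MERGE (b:Block {id: $id})", {"id": block["id"]}
        for function_name in block["functions"]:
            yield (
                "MATCH (b:Block {id: $block_id}) "
                "MERGE (f:Function {name: $name}) "
                "MERGE (b)-[:CONTAINS]->(f)",
                {"block_id": block["id"], "name": function_name},
            )

# Hypothetical clustered output, not the engine's actual JSON schema.
graph = {"blocks": [{"id": "auth_module", "functions": ["Validate_Token", "Refresh_Token"]}]}

statements = list(to_cypher(graph))
for query, params in statements:
    print(query, params)
```

Using `MERGE` with query parameters, rather than `CREATE` with string interpolation, means the same statements can be re-run when only part of the topology changes, and values are never spliced into the query text.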

5. Advanced Security & Federated Auth

Data access within FastMemory is rigorously secured at the graph layer. Utilizing the A_ (Access) node topology, you can map federated IAM rules (like AWS IAM or Azure AD) directly onto specific memory blocks.

Wrapper Implementation: Place an API Gateway ahead of fastmemory serve to enforce standard OAuth/SAML.
Code-Level Auth: AI agents parsing the memory graph will inherently see the A_Role_Admin nodes attached to functions, allowing the agent to deterministically self-regulate access before taking action.
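The self-regulation step described above can be sketched as a simple role check. The node layout and the policy are assumptions for illustration: a function node lists its access nodes with the `A_` prefix, an agent needs at least one of the listed roles, and a function with no access nodes is treated as unrestricted.

```python
def required_roles(function_node):
    """Strip the A_ prefix from access nodes, e.g. A_Role_Admin -> Role_Admin."""
    prefix = "A_"
    return {
        name[len(prefix):]
        for name in function_node.get("access", [])
        if name.startswith(prefix)
    }

def can_invoke(function_node, agent_roles):
    """True if the function is unrestricted or the agent holds a required role."""
    required = required_roles(function_node)
    return not required or bool(required & set(agent_roles))

# Hypothetical function node carrying the access node from the ATF example.
validate_token = {"name": "Validate_Token", "access": ["A_Role_Admin"]}

print(can_invoke(validate_token, ["Role_Admin"]))
print(can_invoke(validate_token, ["Role_Viewer"]))
```

A check like this is advisory on the agent side; the gateway in front of fastmemory serve remains the place where access is actually enforced.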

6. Interactive Notebooks (Jupyter)

For a hands-on technical demonstration of how FastMemory replaces Vector RAG with deterministic Topology grounding, explore our interactive Jupyter Notebooks:

Basic Global Topology Notebook: Learn the core LangChain grounding loop and ATF extraction.
FinanceBench SOTA Notebook: Advanced demonstration of multi-hop financial reasoning using the Boeing 10-K dataset.

🏗️ Architecture & Integration Patterns

FastMemory is designed to integrate into complex, high-throughput data ecosystems. It natively clusters Markdown-based Atomic Text Functions (ATFs) via rust-louvain. In production environments where data is distributed across Data Warehouses, Data Lakes, and specialized analytics platforms, FastMemory acts as an ontological orchestrator and agentic query engine that bridges structured pipelines and autonomous AI logic.

For detailed integration patterns with Snowflake, BigQuery, Databricks, AWS Glue, Microsoft Fabric, and Neo4J, see the Architecture Guide.

🧠 Applications

[Figure: Standard RAG Robot Brain vs. FastMemory Topology Robot Brain]

Agentic Apps & SaaS: Integrate fastmemory mcp directly into your proprietary AI loops. Instead of sending agents to vector DBs, send them into a FastMemory graph where they can extract isolated, functional context blocks to execute workflows.
Fast Software Engineering: In FastBuilder.AI, FastMemory acts as the structural brain for rapid feature development. By indexing the entire application architecture into an ontological graph, coding agents can query precisely how a proposed change will impact distant, decoupled components.
The Possibilities are Endless: Medical diagnostics routing, autonomous drone navigation logic, compliance auditing, etc.
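The change-impact query mentioned above amounts to walking a dependency graph in reverse. Here is a minimal sketch using breadth-first search over a hypothetical `depends_on` mapping; the component names and the mapping format are illustrative and not taken from FastMemory's output.

```python
from collections import deque

def impacted_by(changed, depends_on):
    """Return every component that directly or transitively depends on `changed`."""
    # Invert the edges: for each dependency, record who depends on it.
    dependents = {}
    for node, dependencies in depends_on.items():
        for dependency in dependencies:
            dependents.setdefault(dependency, set()).add(node)

    seen, queue = set(), deque([changed])
    while queue:
        for node in dependents.get(queue.popleft(), ()):
            if node not in seen:
                seen.add(node)
                queue.append(node)
    return seen

# Hypothetical application architecture.
depends_on = {
    "checkout": ["auth", "pricing"],
    "pricing": ["catalog"],
    "reports": ["checkout"],
    "auth": [],
    "catalog": [],
}

impacted = impacted_by("catalog", depends_on)
print(sorted(impacted))
```

In this toy graph a change to `catalog` reaches `pricing`, then `checkout`, then `reports`, even though `reports` never references `catalog` directly.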

🏢 Commercial Support

FastMemory is and will always be free and open-source under the MIT License.

For teams and organizations that need managed deployments, compliance tooling, security monitoring, and dedicated support, we offer FastStudio — a packaged platform built on top of FastMemory that provides:

🔒 Compliance & Governance (BuildRight) — automated ontological compliance rules
🛡️ Security Monitoring (SafeSemantics) — real-time semantic threat detection
📊 Data Memory Management — topology visualization, audit trails, and RBAC
🤝 Dedicated Support — SLA-backed assistance from the FastMemory core team

Note: You do not need FastStudio to use FastMemory. Every feature in this repository is fully functional and unrestricted. FastStudio is for teams that want a managed, production-hardened experience with compliance and security baked in.

👉 Learn more about FastStudio →

📄 License

This project is licensed under the MIT License — free for personal, commercial, and organizational use without restriction.

🤝 Contributing

We welcome contributions! Whether it's bug fixes, new integration patterns, additional domain examples, or performance improvements — open a PR or start a discussion.

Built with 🛡️💻🧠 by FastBuilder.AI

