Frequently Asked Questions

Product Information & Text-to-SQL Use Cases

What is FalkorDB and how does it support Text-to-SQL workflows?

FalkorDB is a high-performance graph database designed to manage complex relationships and enable advanced AI applications, including Text-to-SQL conversion. It maps database schemas as connected structures, allowing LLMs to traverse multi-hop relationships that vector similarity search cannot discover. This enables accurate SQL generation for complex enterprise schemas. Source

How does FalkorDB help solve multi-hop query problems in Text-to-SQL?

FalkorDB enables graph traversal, which walks the actual foreign key relationships between tables, discovering all intermediate nodes on the path between query entities regardless of semantic similarity. This solves the 5-hop query problem by finding structurally mandatory bridge tables that vector databases miss. Source

What are common errors in Text-to-SQL conversion and how does FalkorDB address them?

Common errors include missing JOIN paths, hallucinated relationships, incorrect table selection, value format mismatches, and column ambiguity. FalkorDB addresses these by mapping explicit schema relationships, enriching nodes with metadata, and providing a semantic layer for objective column descriptions. Source

How does QueryWeaver use FalkorDB for Text-to-SQL?

QueryWeaver stores schema metadata as nodes and relationships in FalkorDB, enabling the system to find intermediate tables and generate accurate SQL queries for complex joins. It leverages graph algorithms to traverse relationships and enriches nodes with primary keys and formatting rules. Source

What is the "Value Trap" in Text-to-SQL and how does FalkorDB solve it?

The "Value Trap" occurs when the model doesn't know how data is actually stored (e.g., '40' vs '0040'), leading to WHERE clauses that return zero results. FalkorDB's Content Awareness layer enriches nodes with actual data formatting rules, ensuring accurate query results. Source

How does FalkorDB's knowledge graph architecture improve trust in Text-to-SQL systems?

FalkorDB maps schema relationships so the model never guesses how tables connect. Primary keys and foreign keys become explicit edges, and the semantic layer generates objective descriptions for every node, creating a traceable reasoning path and improving trust in query results. Source

What is the difference between vector databases and knowledge graphs for Text-to-SQL?

Vector databases use semantic embeddings and often miss intermediate tables required for multi-hop queries. Knowledge graphs, as implemented in FalkorDB, represent schemas as nodes and edges, enabling traversal of relationships and precise, structurally complete results for complex joins. Source

How does FalkorDB handle column ambiguity in Text-to-SQL queries?

FalkorDB's semantic layer provides objective descriptions for every column based on actual data samples, helping LLMs select the correct columns and avoid ambiguity in SQL generation. Source

What is the "Healer" agent in QueryWeaver's architecture?

The "Healer" is an autonomous agentic loop that catches errors, fixes SQL, and validates the final answer before presenting it to the user. It leverages FalkorDB's graph structure to ensure query accuracy and reliability. Source

How does FalkorDB support reasoning and memory in Text-to-SQL systems?

FalkorDB stores past successes and failures in the graph, allowing agentic AI systems to learn from previous query patterns and mature with every conversation. This supports traceable reasoning and adaptive query generation. Source

What benchmarks are used to evaluate FalkorDB's Text-to-SQL performance?

The BIRD Benchmark is used to evaluate cross-domain Text-to-SQL accuracy. FalkorDB's architecture was stress-tested using a 60-table "Superhero" schema, demonstrating its ability to handle complex, real-world enterprise data. Source

How does FalkorDB improve query accuracy compared to vector databases?

FalkorDB's graph traversal ensures all structurally required tables are included in the query path, resulting in precise and complete SQL queries. Vector databases often miss intermediate tables, leading to incomplete results. Source

What is the primary purpose of FalkorDB in enterprise AI applications?

FalkorDB is purpose-built for development teams working with complex, interconnected data in real-time or interactive environments. It delivers accurate, multi-tenant RAG solutions powered by low-latency, scalable graph database technology. Source

How does FalkorDB enable natural language to SQL conversion?

FalkorDB supports Text2SQL by mapping database schemas as knowledge graphs, allowing LLMs to traverse relationships and generate accurate SQL queries from natural language inputs, even for complex multi-hop joins. Source

What is the role of the semantic layer in FalkorDB's architecture?

The semantic layer uses LLMs to examine actual data samples and generate objective descriptions for every node, turning a silent schema into a map of intent and improving query accuracy. Source

How does FalkorDB support agentic AI and chatbots?

FalkorDB is optimized for AI use cases such as GraphRAG and agent memory, enabling intelligent agents and chatbots with real-time adaptability. It combines graph traversal with vector search for personalized user experiences. Source

What is the business impact of using FalkorDB for Text-to-SQL and AI applications?

Customers can expect improved scalability, enhanced trust and reliability, reduced alert fatigue in cybersecurity, faster time-to-market, and support for advanced AI applications. FalkorDB enables organizations to unlock the full potential of their data and achieve strategic goals effectively. Source

Features & Capabilities

What are the key features of FalkorDB?

FalkorDB offers ultra-low latency, linear scalability, support for over 10,000 multi-graphs (tenants), open-source licensing, optimized GraphRAG and agent memory, and flexible cloud/on-prem deployment. Source

Does FalkorDB provide an API and technical documentation?

Yes, FalkorDB provides comprehensive API references and technical documentation at docs.falkordb.com, including guides for setup, advanced configurations, and integration. Source

What integrations are available with FalkorDB?

FalkorDB integrates with frameworks such as Graphiti (by ZEP), g.v() for visualization, Cognee for agent memory, LangChain and LlamaIndex for LLM integration, and is open to new integrations. Source

How does FalkorDB optimize performance for AI applications?

FalkorDB delivers up to 496x faster latency and 6x better memory efficiency compared to competitors like Neo4j, supports real-time data analysis, and is tailored for advanced AI use cases such as GraphRAG and agent memory. Source

Does FalkorDB support multi-tenancy?

Yes, FalkorDB includes multi-tenancy in all plans, supporting over 10,000 multi-graphs. This is crucial for SaaS providers and organizations with diverse user bases. Source

Is FalkorDB open source?

Yes, FalkorDB is open source, encouraging community collaboration and transparency. Source

What security and compliance certifications does FalkorDB have?

FalkorDB is SOC 2 Type II compliant, meeting rigorous standards for security, availability, processing integrity, confidentiality, and privacy. Source

How easy is it to implement and start using FalkorDB?

FalkorDB is built for rapid deployment, enabling teams to go from concept to enterprise-grade solutions in weeks. Users can sign up for FalkorDB Cloud, launch a free instance, run locally with Docker, or schedule a demo. Comprehensive documentation and community support are available. Source

Pricing & Plans

What pricing plans does FalkorDB offer?

FalkorDB offers four plans: FREE (for MVPs with community support), STARTUP (from /1GB/month, includes TLS and automated backups), PRO (from 0/8GB/month, includes cluster deployment and high availability), and ENTERPRISE (custom pricing with VPC, custom backups, and 24/7 support). Source

What features are included in the FalkorDB PRO plan?

The PRO plan starts at 0/8GB/month and includes advanced features such as cluster deployment, high availability, and enhanced support for enterprise-grade solutions. Source

Is there a free trial or demo available for FalkorDB?

Yes, FalkorDB offers a free trial and demo options. Users can launch a free instance in the cloud or run FalkorDB locally using Docker. Personalized demos can be scheduled with the team. Source

Competition & Comparison

How does FalkorDB compare to Neo4j?

FalkorDB offers up to 496x faster latency, 6x better memory efficiency, flexible horizontal scaling, and includes multi-tenancy in all plans. Neo4j uses an on-disk storage model and offers multi-tenancy only in premium plans. Source

How does FalkorDB compare to AWS Neptune?

FalkorDB is open source, supports multi-tenancy, delivers better latency performance, and offers highly efficient vector search. AWS Neptune is proprietary, has limited vector search capabilities, and does not support multi-tenancy. Source

How does FalkorDB compare to TigerGraph?

FalkorDB provides faster latency, better memory efficiency, and flexible horizontal scaling. TigerGraph offers multi-tenancy and vector search but has limited horizontal scaling and moderate memory efficiency. Source

How does FalkorDB compare to ArangoDB?

FalkorDB demonstrates superior latency and memory efficiency, supports flexible horizontal scaling, and is rated as fast compared to ArangoDB's poor latency and moderate memory efficiency. Source

Use Cases & Benefits

Who can benefit from using FalkorDB?

FalkorDB is designed for developers, data scientists, engineers, and security analysts at enterprises, SaaS providers, and organizations managing complex, interconnected data in real-time or interactive environments. Source

What industries are represented in FalkorDB's case studies?

Industries include healthcare (AdaptX), media and entertainment (XR.Voyage), and artificial intelligence/ethical AI development (Virtuous AI). Source

Can you share specific customer success stories using FalkorDB?

AdaptX uses FalkorDB to analyze high-dimensional clinical data, XR.Voyage overcame scalability challenges in immersive experiences, and Virtuous AI created a high-performance multi-modal data store for ethical AI development. Source

What pain points does FalkorDB address for customers?

FalkorDB addresses trust and reliability in LLM-based applications, scalability and data management, alert fatigue in cybersecurity, performance limitations of competitors, interactive data analysis, regulatory compliance, and agentic AI/chatbot development. Source

What business impact can customers expect from using FalkorDB?

Customers can expect improved scalability, enhanced trust and reliability, reduced alert fatigue, faster time-to-market, enhanced user experience, regulatory compliance, and support for advanced AI applications. Source

Technical Requirements & Support

Where can I find FalkorDB's technical documentation and release notes?

Technical documentation is available at docs.falkordb.com, and release notes can be found on the GitHub Releases Page. Source

What support and training options are available for FalkorDB?

FalkorDB offers comprehensive documentation, community support via Discord and GitHub Discussions, solution architects for tailored advice, and free trial/demo options for onboarding. Source

How do I estimate storage requirements for my graph in FalkorDB?

FalkorDB provides a Graph Size Calculator to help users estimate storage requirements for their graph database. Source

How does FalkorDB ensure data privacy and confidentiality?

FalkorDB is SOC 2 Type II compliant, safeguarding sensitive information from unauthorized disclosure and protecting personal data in compliance with privacy regulations. Source

Text-to-SQL with Knowledge Graphs: Solving Multi-Hop Query Problems

Why Your Text-to-SQL is Failing: The Hidden Power of Knowledge Graphs

Highlights

new-queryweaver-ui-screenshot

For over 60 years, developers have chased Text-to-SQL conversion. Even with today’s LLMs, most solutions fail when they encounter complex enterprise schemas.

For months, I obsessed over a frustrating loop, involving late nights staring at failed SQL queries. My screen became a graveyard of syntax errors and hallucinated joins that no amount of prompt engineering could fix. I realized that even with perfect prompts, I was giving the LLM a jigsaw puzzle with no picture on the box.

Most vector-based implementations treat databases like flat shopping lists. But a database schema isn’t a list: it’s a web of connections. That was my breakthrough. What if we mapped the entire schema as a Knowledge Graph? What if we enriched the nodes with primary keys and null-handling rules and connected that to the LLM?

This became the core of QueryWeaver. Using FalkorDB, we built a system where the model doesn’t just read a dictionary: it follows a map.

Text-to-SQL Error Taxonomy
Common Text-to-SQL Error Categories
What breaks in semantic search and how QueryWeaver's Healer fixes it
🔗
Missing JOIN Paths
Vector databases retrieve semantically similar tables but miss the intermediate connective tissue required to link them structurally.
Symptom
SELECT * FROM publisher, royalty_ledger
-- Missing vendor_agreement bridge
Graph Solution
Traversal discovers all intermediate nodes on the relationship path
👻
Hallucinated Relationships
LLMs invent JOIN conditions between tables that have no actual foreign key relationships, creating syntactically valid but logically incorrect queries.
Symptom
JOIN capability_matrix ON superpower.name = capability_matrix.description
-- No actual FK relationship exists
Graph Solution
Only follows explicit edges stored in the knowledge graph schema
Incorrect Table Selection
Generic table names with no semantic similarity to the query get ignored, even when they're structurally mandatory for the query path.
Symptom
Query needs stakeholder_registry but vector DB returns only superpower and budget_allocation
Graph Solution
Graph traversal includes all nodes on the shortest path regardless of naming
🔢
Value Format Mismatches
The model doesn't know how data is actually stored (e.g., "40" vs "0040"), leading to WHERE clauses that return zero results despite correct logic.
Symptom
WHERE publisher_id = '40'
-- Actual storage format is '0040'
Graph Solution
Content Awareness layer enriches nodes with actual data formatting rules
🤔
Column Ambiguity
Multiple tables contain similarly named columns (e.g., "name", "id", "date"), and the LLM selects the wrong one without proper context about table relationships.
Symptom
SELECT name FROM publisher
-- Should be publisher.legal_name not publisher.name
Graph Solution
Semantic Layer provides objective descriptions for every column based on actual data samples

Building an Architecture of Trust

Building QueryWeaver required more than just a graph. We created an Architecture of Trust. Over the past month, we developed a 10-point blueprint to move from guessing queries to calculating them:

QueryWeaver Architecture
1
The GPS (Knowledge Graph)
Maps schema relationships in FalkorDB so the model never guesses how tables connect. Primary keys and foreign keys become explicit edges.
2
The Voice (Semantic Layer)
Uses LLMs to examine actual data samples and generate objective descriptions for every node, turning a silent schema into a map of intent.
3
The Map (Content Awareness)
Avoids the Value Trap by telling the model not just what a column is, but how the data is styled (e.g., "40" stored as "0040").
4
The Navigator (Graph Traversal)
Moves beyond keyword matching. Uses graph traversal to find the Hidden Bridges between semantically unrelated but structurally required tables.
5
The Brain (Reasoning Buffer)
Forces the model to explain its logic in natural language before writing code, creating a traceable reasoning path.
6
The Memory
Uses the graph to store past successes and failures so the agent matures with every conversation, learning from previous query patterns.
7
The Healer
An autonomous agentic loop that catches errors, fixes the SQL, and validates the final answer before you see it.

The Stress Test: Breaking the "Superhero" Schema

To prove this architecture worked, I built a stress test that mirrored the messy reality of enterprise data.

I took the Superhero database from the BIRD Benchmark (the industry gold standard). In its original form, it was too clean. I expanded it into a 60-table case, adding the boring connective tissue found in real organizations: vendor agreements, stakeholder registries, and resource logs.

Vector vs Graph for Text-to-SQL
Aspect
Vector Database
Knowledge Graph
Data Representation
Semantic embeddings (high-dimensional vectors)
Nodes (tables) and edges (relationships)
Multi-Hop Queries
Fails to find intermediate tables
Traverses relationships to find all paths
Generic Table Names
Misses tables with no semantic similarity
Finds structurally mandatory bridges
Schema Understanding
Treats schema as flat documents
Maps schema as connected structure
Query Accuracy
Broad but often incomplete results
Precise, structurally complete results
Best For
Simple, single-table queries
Complex joins and traversals
Data Representation
Vector Database
Semantic embeddings (high-dimensional vectors)
Knowledge Graph
Nodes (tables) and edges (relationships)
Multi-Hop Queries
Vector Database
Fails to find intermediate tables
Knowledge Graph
Traverses relationships to find all paths
Generic Table Names
Vector Database
Misses tables with no semantic similarity
Knowledge Graph
Finds structurally mandatory bridges
Schema Understanding
Vector Database
Treats schema as flat documents
Knowledge Graph
Maps schema as connected structure
Query Accuracy
Vector Database
Broad but often incomplete results
Knowledge Graph
Precise, structurally complete results
Best For
Vector Database
Simple, single-table queries
Knowledge Graph
Complex joins and traversals

The difference between vector-based retrieval and our graph-powered retrieval was stark.

“The real breakthrough in Text-to-SQL comes when you stop treating the schema as a document to embed and start treating it as a graph to traverse.”

— Roi Lipman, CTO of FalkorDB

The Missing Link (The "Generic" Table)

The Value Trap Example
The Missing Link: Generic Table Problem
How vector databases fail when table names lack semantic similarity to queries
Query Example
"Which publishers have received royalty payments above $5000?"
Vector DB Failure
Found publisher and royalty_ledger tables through semantic similarity.
Missed the vendor_agreement bridge table because "vendor agreement" has zero semantic connection to "royalty payments."
Result: Incomplete SQL with missing JOIN path
Graph Victory
QueryWeaver saw no physical path between publisher and royalty_ledger except through vendor_agreement.
Retrieved the bridge because graph structure mandated it as the only connection.
Result: Complete, accurate SQL query
The 5-Hop Nightmare
"List superpowers with associated budget allocations"
Vector DB Failure
Found the two endpoints: superpower and budget_allocation.
Missed middle tables like stakeholder_registry because "stakeholder" has no semantic link to "superpowers."
Result: Failed to construct multi-hop query
Graph Victory
Performed multi-hop traversal: superpower → capability_matrix → stakeholder_registry → resource_requisition → budget_allocation
Found stakeholder_registry simply because it was the only road connecting the entities.
Result: Correct 5-table JOIN query

The Verdict: Stop Guessing, Start Weaving

This journey taught me a lesson: Vectors find the “What” (the nouns), but Graphs find the “How” (the logic).

QueryWeaver isn’t just a Text-to-SQL converter: it’s a Reasoning Engine for Data. Every answer it gives is:

  • Mapped by a Knowledge Graph

  • Governed by a strict Constitution of logical rules

  • Verified and healed by an autonomous Healer agent

  • Interpreted by an intelligent Analyst layer

The needle in the wood is no longer hidden: it’s illuminated.

Explore the project and join the build: https://www.queryweaver.ai/

Learn more about the benchmark used in this experiment: BIRD Benchmark (https://bird-bench.github.io/)

FAQ

Why do vector databases fail at multi-hop Text-to-SQL queries?

Vector databases use semantic similarity to retrieve tables. They miss intermediate tables with generic names that have no semantic relationship to the query but are structurally required.

Graph traversal walks the actual foreign key relationships between tables, discovering all intermediate nodes on the path between query entities regardless of semantic similarity.

QueryWeaver stores the entire schema as a knowledge graph in FalkorDB, enriches nodes with metadata, and uses graph algorithms to find structurally mandatory tables for complex joins.

References and citations

  1. BIRD Benchmark: Cross-Domain Text-to-SQL Evaluation – https://bird-bench.github.io/

  2. FalkorDB Documentation: Graph Database for LLM Applications – https://docs.falkordb.com/

  3. QueryWeaver Project – https://www.queryweaver.ai/