✦ Free Tool

Does AI Reflection actually improve SQL?

Run a scientific experiment on your own data. Single-pass SQL vs 2-pass Reflection. An LLM Judge scores both. Measure the difference.

Try it free → See OverFlow ↗

No signup required·Bring your own database·Powered by OverFlow

What it does

Three things, done well

🅐

Control — single pass

One LLM call generates SQL from your schema and question. Fast, simple, the baseline every team uses today.

🅑

Reflection — two pass

Generate SQL, execute it, let an LLM critique the result, rewrite if needed, execute again. The Reflection agentic pattern applied to SQL.

⚖️

LLM Judge scores both

A third LLM scores accuracy and completeness 1–10 for each approach. See which wins and why — in structured JSON you can analyse.

How it works

Up and running in 60 seconds

1

Connect your database

Link your Snowflake or BigQuery database. The tool reads your real schema so the generated SQL targets your actual tables and columns.

2

Enter a natural language question

Type a business question: "show me the top 10 customers by revenue in the last 30 days". The experiment runs both tracks automatically.

3

Read the verdict

Control SQL, Reflection SQL, execution results, and a scored verdict appear on canvas. Export to BigQuery for further analysis.

Stop guessing if Reflection helps

Run 10 questions through both tracks, store results in BigQuery, and have data to back up your architecture decisions.

Try free — no signup →

Free forever for personal use·Powered by OverFlow canvas