Over ons 🤖

Laten we elkaar leren kennen

Vertel me de missie en visie

Leg het verhaal achter Mach8 uit

Hallo daar 👋

Hoe kunnen we je helpen?

Mijn gegevens mogen worden gebruikt om me op de hoogte te houden van relevant nieuws van Mach8

Data & Analytics with AI·6 min·4 May 2025

Natural language querying: asking questions about data in plain language

Natural language querying lets you query data without knowing SQL. You ask a question in plain language and the system retrieves the answer. It sounds simple, but the reality is more nuanced.

"What was our best-selling product category last month?" Asked in plain language, answered in seconds. Natural language querying makes that possible, but reliability depends heavily on how the system is set up.

How natural language querying works

Natural language querying (NLQ) is an interface that translates plain language into database queries. In most modern implementations this happens via a large language model:

  1. User asks a question in plain language
  2. The system makes the database structure available to the model (schema, table names, column names, relationships)
  3. The model generates a query (SQL, API call, etc.)
  4. The query is executed against the database
  5. The result is returned, optionally translated into an understandable summary

The quality of step 2 (making the schema available) largely determines the quality of the final result.

Strengths of NLQ

NLQ has clear advantages for non-technical users:

  • Speed: direct answers to simple questions without having to ask an analyst
  • Accessibility: no SQL knowledge required
  • Iterative: you can ask follow-up questions if the first answer is incomplete
  • Democratisation: more employees can retrieve data insights without IT or analytics involvement

For use cases like management dashboards, customer service with data needs, or field sales with real-time product information, this is a significant improvement.

Limitations and pitfalls

NLQ also has clear limitations that should not be underestimated:

Ambiguity in plain language: Words like "recent", "large", or "top" are ambiguous. What is "recent" in your context: the last week, the last month? The system makes an assumption. If that assumption is wrong, the system gives a plausible but incorrect answer.

Complex business logic: NLQ works well for straightforward data questions. Complex calculations that depend on multiple steps, exceptions, or internal definitions are harder to process automatically.

Trust in the system: Users who trust NLQ without seeing the underlying query can be misled by incorrect answers. Transparency about the generated query matters.

Schema management: Poorly documented databases produce worse NLQ results. Column names like "column1" or "flag_x" give the model no context.

Which tools are available?

There are several options:

  • Built-in BI functionality: Power BI Q&A, Tableau Ask Data, Looker Explore — all have NLQ built into existing platforms
  • Dedicated NLQ tools: ThoughtSpot, Sisense, Atlan — focused on NLQ as a core function
  • Custom implementations: language model connected to a documented database schema, as Mach8 builds for clients with specific data sources

The choice depends on your existing infrastructure and the complexity of your database.

Good documentation as a prerequisite

An NLQ system is only as good as the description of the data it has access to. Invest in documenting your tables and columns in understandable language.

That means: every table has a description ("This is the table with all customer orders, including cancelled orders"), every column has a definition ("status: O = open, C = closed, A = cancelled"), and relationships are described.

That documentation not only improves NLQ, but also the functioning of every other AI system that works with your data.

NLQ as part of a broader data strategy

NLQ solves a specific problem: making data accessible to non-technical users. It does not replace data infrastructure, analytical expertise, or good data governance.

Mach8 implements NLQ as part of broader data accessibility projects, where we also take care of the documentation, security, and error handling needed for reliable operation.

Conclusion

Natural language querying is a useful interface for data democracy, provided it is set up well. The technology works; the challenge lies in database documentation, user expectations, and maintaining the reliability of the answers.

Want to implement NLQ for your team? Get in touch with Mach8 for an initial analysis.

Ready to apply AI?

We help you go from strategy to implementation. Schedule a no-obligation call.

Schedule a call