Skip to main navigation Skip to search Skip to main content

Structure-Guided Large Language Models for Text-to-SQL Generation

  • Qinggang Zhang
  • , Hao Chen
  • , Junnan Dong
  • , Shengyuan Chen
  • , Feiran Huang
  • , Xiao Huang

Research output: Journal article publicationConference articleAcademic researchpeer-review

Abstract

Recent advancements in large language models (LLMs) have shown promise in bridging the gap between natural language queries and database management systems, enabling users to interact with databases without the background of SQL. However, LLMs often struggle to comprehend complex database structures and accurately interpret user intentions. Decomposition-based methods have been proposed to enhance the performance of LLMs on complex tasks, but decomposing SQL generation into subtasks is non-trivial due to the declarative structure of SQL syntax and the intricate connections between query concepts and database elements. In this paper, we propose a novel Structure GUided text-to-SQL framework (SGU-SQL) that incorporates syntax-based prompting to enhance the SQL generation capabilities of LLMs. Specifically, SGU-SQL establishes structure-aware links between user queries and database schema and decomposes the complex generation task using syntax-based prompting to enable more accurate LLM-based SQL generation. Extensive experiments on two benchmark datasets demonstrate that SGU-SQL consistently outperforms state-of-the-art text-to-SQL models.

Original languageEnglish
Pages (from-to)74671-74691
Number of pages21
JournalProceedings of Machine Learning Research
Volume267
Publication statusPublished - Jul 2025
Event42nd International Conference on Machine Learning, ICML 2025 - Vancouver, Canada
Duration: 13 Jul 202519 Jul 2025

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Structure-Guided Large Language Models for Text-to-SQL Generation'. Together they form a unique fingerprint.

Cite this