Skip to main navigation Skip to search Skip to main content

Branching Out: Exploration of Chinese Dependency Parsing with Fine-tuned Large Language Models

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

In this paper, we investigate the effectiveness of large language models (LLMs) for Chinese dependency parsing through fine-tuning. We explore how different dependency representations impact parsing performance when fine-tuning the Chinese Llama-3 model. Our results demonstrate that while the Stanford typed dependency tuple representation yields the highest number of valid dependency trees, converting dependency structure into a lexical centered tree produces parses of significantly higher quality despite generating fewer valid structures. The results further show that fine-tuning enhances LLMs’ capability to handle longer dependencies to some extent, though challenges remain. Additionally, we evaluate the effectiveness of DeepSeek in correcting LLM-generated dependency structures, finding that it is effective for fixing index errors and cyclicity issues but still suffers from tokenization mismatches. Our analysis across dependency distances and relations reveals that fine-tuned LLMs outperform traditional parsers in specific syntactic structures while struggling with others. These findings contribute to the research on leveraging LLMs for syntactic analysis tasks.

Original languageEnglish
Title of host publicationProceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era, RANLP 2025
EditorsGalia Angelova, Maria Kunilovskaya, Marie Escribe, Ruslan Mitkov
PublisherIncoma Ltd
Pages1437-1445
Number of pages9
ISBN (Electronic)9789544520984
DOIs
Publication statusPublished - Sept 2025
Event15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era, RANLP 2025 - Varna, Bulgaria
Duration: 8 Sept 202510 Sept 2025

Publication series

NameInternational Conference Recent Advances in Natural Language Processing, RANLP
ISSN (Print)1313-8502

Conference

Conference15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era, RANLP 2025
Country/TerritoryBulgaria
CityVarna
Period8/09/2510/09/25

ASJC Scopus subject areas

  • Software
  • Computer Science Applications
  • Artificial Intelligence
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Branching Out: Exploration of Chinese Dependency Parsing with Fine-tuned Large Language Models'. Together they form a unique fingerprint.

Cite this