A brain-to-text framework for decoding natural tonal sentences

  • Daohan Zhang
  • , Zhenjie Wang
  • , Youkun Qian
  • , Zehao Zhao
  • , Yan Liu
  • , Xiaotao Hao
  • , Wanxin Li
  • , Shuo Lu
  • , Honglin Zhu
  • , Luyao Chen
  • , Kunyu Xu
  • , Yuanning Li (Corresponding Author)
  • , Junfeng Lu (Corresponding Author)

Research output: Journal article publicationJournal articleAcademic researchpeer-review

5 Citations (Scopus)

Abstract

Speech brain-computer interfaces (BCIs) directly translate brain activity into speech sound and text. Despite successful applications in non-tonal languages, the distinct syllabic structures and pivotal lexical information conveyed through tonal nuances present challenges in BCI decoding for tonal languages like Mandarin Chinese. Here, we designed a brain-to-text framework to decode Mandarin sentences from invasive neural recordings. Our framework dissects speech onset, base syllables, and lexical tones, integrating them with contextual information through Bayesian likelihood and a Viterbi decoder. The results demonstrate accurate tone and syllable decoding during naturalistic speech production. The overall word error rate (WER) for 10 offline-decoded tonal sentences with a vocabulary of 40 high-frequency Chinese characters is 21% (chance: 95.3%) averaged across five participants, and tone decoding accuracy reaches 93% (chance: 25%), surpassing previous intracranial Mandarin tonal syllable decoders. This study provides a robust and generalizable approach for brain-to-text decoding of continuous tonal speech sentences.
Original languageEnglish
Article number114624
JournalCell Reports
Volume43
Issue number11
DOIs
Publication statusPublished - 26 Nov 2024
Externally publishedYes

Keywords

  • electrocorticography
  • ECoG
  • brain-computer interface
  • BCI
  • tonal language
  • natural speech
  • deep neural networks
  • neural decoding
  • artificial intelligence
  • Mandarin

Fingerprint

Dive into the research topics of 'A brain-to-text framework for decoding natural tonal sentences'. Together they form a unique fingerprint.

Cite this