A Type-theoretical approach to register classification

Renkui Hou, Chu-Ren Huang

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

We propose to differentiate different registers based on the distribution of different Parts of Speeches. Based on a type-theoretical approach, grammatical categories are defined by their combinatory and mapping functions. With noun as the basic category representing entities, verbs are functions taking them as arguments; and adverbs are functions taking verbs as arguments. Based on this different functional mapping relations, we hypothesis that their ratio, like unit-constituency ratios, can differentiate different types of texts, and especially registers. We calculated the ratios between grammatical categories based on their function mapping relations. For example the ratio between verbs and nouns, and adverbs and verbs. The boxplots was used to show the distribution of the ratios between these parts of speeches in each register. The linear regression was used to verify the differences of these ratios in different registers. The text clustering result showed that these ratios can differ conversational and written registers.

Original languageEnglish
Title of host publicationProceedings of the 33rd Pacific Asia Conference on Language, Information and Computation
EditorsRyo Otoguro, Mamoru Komachi, Tomoko Ohkuma
Pages57-67
Number of pages11
Publication statusPublished - 2019
Event33rd Pacific Asia Conference on Language, Information and Computation, PACLIC 2019 - Hakodate, Japan
Duration: 13 Sept 201915 Sept 2019

Conference

Conference33rd Pacific Asia Conference on Language, Information and Computation, PACLIC 2019
Country/TerritoryJapan
CityHakodate
Period13/09/1915/09/19

Keywords

  • Chinese register
  • Linear regression
  • Parts of speeches
  • Text clustering

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science (miscellaneous)

Fingerprint

Dive into the research topics of 'A Type-theoretical approach to register classification'. Together they form a unique fingerprint.

Cite this