TY - GEN
T1 - Building a Chinese shallow parsed treebank for collocation extraction
AU - Li, Baoli
AU - Qin, Lu
AU - Yin, Li
PY - 2003/1/1
Y1 - 2003/1/1
AB - To automatically extract Chinese collocations and build a large-scale collocation bank, we are developing a one-million-word Chinese shallow parsed treebank. The treebank can be used not only as a training set for our shallow parser, but also as processed data from which collocations are extracted. This paper presents several issues related to this ongoing project, such as our definition of shallow parsing used in Chinese collocation extraction, guideline preparation, and quality control.
UR - http://www.scopus.com/inward/record.url?scp=78651536562&partnerID=8YFLogxK
U2 - 10.1007/3-540-36456-0_41
DO - 10.1007/3-540-36456-0_41
M3 - Conference article published in proceeding or book
SN - 3540005323
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 402
EP - 405
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PB - Springer Verlag
T2 - 4th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2003
Y2 - 16 February 2003 through 22 February 2003
ER -