Chinese Computing Lab
 Site Map 
About CCL
Site News
Projects
PolyU TreeBank
Chunk Bank
Collocation Extraction
ASAB
CERG
Hong Kong Character Glyphs
Jyutping
Dash Line
Publications
Download Area
Contact Information
Useful Links


Warning: A non-numeric value encountered in /webhome/cclab/public_html/menu.php on line 69

PolyU Treebank

 

中文

 

 

 



IX. Acknowledgement

This project is partially supported by the Hong Kong Polytechnic University (Project Code A-P203) and CERG Grant (Project code 5087/01E)

 

X. References

Baoli Li, Qin Lu and Yin Li. 2003. Building a Chinese Shallow Parsed Treebank for Collocation Extraction, Proceedings of CICLing 2003: 402-405

Fei Xia, et al. 2000. Developing Guidelines and Ensuring Consistency for Chinese Text Annotation Proceedings of LREC-2000, Greece

Feng-yi Chen, et al. 1999. Sinica Treebank, Computational Linguistics and Chinese Language Processing, 4(2):183-204

G. N. Leech, R.Garside. 1996. Running a grammar factory: the production of syntactically analyzed corpora or “treebanks”, Johansson and Stenstron.

Honglin Sun, 2001. A Content Chunk Parser for Unrestricted Chinese Text, Ph.D Thesis, Peking University, 2001

Keh-jiann Chen, et al. Sincica Treebank – Design Criteria, Representational Issues and Implementation, in Building and Using Parsed Corpora (Anne Abeillé ed. s) KLUWER, Dordrecht, 2003, pp.231-248

Kenneth Church, and Patrick Hanks. 1990. Word association norms, mutual information, and lexicography, Computational Linguistics, 16(1): 22-29

Marcus, M. et al. 1993. Building a Large Annotated Corpus of English: The Penn Treebank, Computational Linguistics, 19(1): 313-330.

Nianwen Xue, et al. 2002. Building a Large-Scale Annotated Chinese Corpus, Proceedings of COLING 2002, Taipei, Taiwan

Qin Lu, Jing Zhou and Ruifeng Xu, Machine Learning Approaches for Chinese Shallow Parsing, In Proceedings of IEEE ICMLC2003, pp.2309-2314

Ruifeng Xu, Qin Lu, and Yin Li, An Automatic Chinese Collocation Extraction Algorithm based on Lexical Statistics, In Proceedings of IEEE NLPKE 2003, pp.321-326

Sean Wallis, Completing Parsed Corpora: from Correction to Evolution, in Building and Using Parsed Corpora. (Anne Abeillé eds) KLUWER, Dordrecht, 2003, pp.61-71

Shiwen Yu, et al. 1998. The Grammatical Knowledge- base of contemporary Chinese: a complete specification. Tsinghua University Press, Beijing, China

Shiwen Yu, et al. 2001. Guideline of People’s Daily Corpus Annotation, Technical report, Beijing University

Shoukang Zhang and Xingguang Lin, 1992. Collocation Dictionary of Modern Chinese Lexical Words, Business Publisher, China

Walter Dalemans, et al. Memory-based Shallow Parsing, In Proceedings of ECAL99, CoNLL, 1999

 

Appendix 1: POS tag set

Appendix 2: Syntactic Phrase Label Set

Appendix 3: Semantic Phrase Label Set

Appendix 4: Example of an Annotated Article

Sharing PolyU Treebank

 

<< Publications Arising From This Project          

 

Last modified on Thu, 11 May 2006 11:54:26 +0800
THE HONG KONG POLYTECHNIC UNIVERSITY