TY - GEN
T1 - A high-performance semi-supervised learning method for text chunking
AU - Ando, Rie Kubota
AU - Zhang, Tong
PY - 2005
Y1 - 2005
AB - In machine learning, whether one can build a more accurate classifier by using unlabeled data (semi-supervised learning) is an important issue. Although a number of semi-supervised methods have been proposed, their effectiveness on NLP tasks is not always clear. This paper presents a novel semi-supervised method that employs a learning paradigm which we call structural learning. The idea is to find "what good classifiers are like" by learning from thousands of automatically generated auxiliary classification problems on unlabeled data. By doing so, the common predictive structure shared by the multiple classification problems can be discovered, which can then be used to improve performance on the target problem. The method produces performance higher than the previous best results on CoNLL'00 syntactic chunking and CoNLL'03 named entity chunking (English and German).
UR - http://www.scopus.com/inward/record.url?scp=84859921107&partnerID=8YFLogxK
M3 - Conference Paper published in a book
AN - SCOPUS:84859921107
SN - 1932432515
SN - 9781932432510
T3 - ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
SP - 1
EP - 9
BT - ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 43rd Annual Meeting of the Association for Computational Linguistics, ACL-05
Y2 - 25 June 2005 through 30 June 2005
ER -