TY - GEN
T1 - Learning word meta-embeddings
AU - Yin, Wenpeng
AU - Schütze, Hinrich
N1 - Funding Information:
We gratefully acknowledge the support of Deutsche Forschungsgemeinschaft (DFG): grant SCHU 2246/8-2.
PY - 2016
Y1 - 2016
N2 - Word embeddings - distributed representations of words - in deep learning are beneficial for many tasks in NLP. However, different embedding sets vary greatly in quality and characteristics of the captured information. Instead of relying on a more advanced algorithm for embedding learning, this paper proposes an ensemble approach of combining different public embedding sets with the aim of learning metaembeddings. Experiments on word similarity and analogy tasks and on part-of-speech tagging show better performance of metaembeddings compared to individual embedding sets. One advantage of metaembeddings is the increased vocabulary coverage. We release our metaembeddings publicly at http://cistern.eis.lmu.de/meta-emb.
AB - Word embeddings - distributed representations of words - in deep learning are beneficial for many tasks in NLP. However, different embedding sets vary greatly in quality and characteristics of the captured information. Instead of relying on a more advanced algorithm for embedding learning, this paper proposes an ensemble approach of combining different public embedding sets with the aim of learning metaembeddings. Experiments on word similarity and analogy tasks and on part-of-speech tagging show better performance of metaembeddings compared to individual embedding sets. One advantage of metaembeddings is the increased vocabulary coverage. We release our metaembeddings publicly at http://cistern.eis.lmu.de/meta-emb.
UR - http://www.scopus.com/inward/record.url?scp=85011842404&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85011842404&partnerID=8YFLogxK
U2 - 10.18653/v1/p16-1128
DO - 10.18653/v1/p16-1128
M3 - Conference contribution
AN - SCOPUS:85011842404
T3 - 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers
SP - 1351
EP - 1360
BT - 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers
PB - Association for Computational Linguistics (ACL)
T2 - 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
Y2 - 7 August 2016 through 12 August 2016
ER -