Indonesian Language Term Extraction using Multi-Task Neural Network

Joan Santoso; Esther Irawati Setiawan; Fransiskus Xaverius Ferdinandus; Gunawan Gunawan; Leonel Hernandez

doi:10.17977/um018v5i22022p160-167

Abstract

The rapid growth of data makes it difficult to extract information and store it as machine-readable knowledge. Relation extraction and term extraction play a crucial role in resolving this issue. Automatically finding hidden relationships between terms that appear in text can help people build computer-based knowledge more quickly. Term extraction is a required component because identifying the terms that play a significant role in the text is an essential step before determining the relationships between them. To address this issue for the Indonesian language, we propose an end-to-end system capable of extracting terms from text. Our method combines two multilayer perceptron neural networks that perform Part-of-Speech (PoS) labeling and noun phrase chunking, and the two networks are trained as a joint model. With an F-score of 86.80%, the proposed method can be considered a state-of-the-art algorithm for term extraction in the Indonesian language using noun phrase chunking.
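As a rough illustration of how noun phrase chunk labels can be turned into terms and scored, the sketch below decodes chunk tags into phrases and computes a chunk-level F-score. It is not the authors' implementation. The B-NP/I-NP/O tag scheme, the function names (`np_spans`, `extract_terms`, `chunk_f_score`), the example sentence, and the hand-assigned tags are all assumptions made for this example; the abstract does not specify them.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # (start, end) token indices, end exclusive


def np_spans(tags: List[str]) -> List[Span]:
    """Collect noun-phrase spans from B-NP / I-NP / O tags (assumed scheme)."""
    spans: List[Span] = []
    start = None
    for i, tag in enumerate(tags):
        if tag == "B-NP":
            if start is not None:      # close the phrase that was open
                spans.append((start, i))
            start = i
        elif tag == "I-NP":
            if start is None:          # tolerate I-NP without a preceding B-NP
                start = i
        else:                          # any other tag ends an open phrase
            if start is not None:
                spans.append((start, i))
                start = None
    if start is not None:              # phrase running to the end of the sentence
        spans.append((start, len(tags)))
    return spans


def extract_terms(tokens: List[str], tags: List[str]) -> List[str]:
    """Join the tokens of each noun-phrase span into a candidate term."""
    return [" ".join(tokens[s:e]) for s, e in np_spans(tags)]


def chunk_f_score(gold: List[str], pred: List[str]) -> float:
    """Harmonic mean of precision and recall over exactly matching spans."""
    gold_set, pred_set = set(np_spans(gold)), set(np_spans(pred))
    correct = len(gold_set & pred_set)
    if correct == 0:
        return 0.0
    precision = correct / len(pred_set)
    recall = correct / len(gold_set)
    return 2 * precision * recall / (precision + recall)


# Illustrative sentence and tags, not taken from the paper's data.
tokens = ["Jaringan", "saraf", "tiruan", "mengekstraksi", "istilah", "penting"]
gold = ["B-NP", "I-NP", "I-NP", "O", "B-NP", "I-NP"]
pred = ["B-NP", "I-NP", "I-NP", "O", "B-NP", "O"]

print(extract_terms(tokens, gold))
print(chunk_f_score(gold, pred))
```

In this toy case the predicted tags recover one of the two gold phrases exactly and truncate the other, so only one span counts as correct. Exact-span matching is one common convention for chunking evaluation; whether the reported 86.80% was computed this way is not stated in the abstract.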

DOI: http://dx.doi.org/10.17977/um018v5i22022p160-167