TOWARD AN ARABIC QUESTION ANSWERING SYSTEM OVER LINKED DATA


(Received: 2017-12-31, Revised: 21-Mar.-2018 and 09-Apr.-2018, , Accepted: 2018-04-15)
The increasing interest in Arabic natural language processing and semantic Web research involves an emerging need to the development of new Question Answering Systems (QAS). These systems allow users to ask a question in Arabic natural language and get the relevant answer. However, most existing QA systems focused on English and Latin-based languages. Less effort has been concentrated on the Arabic language, which belongs to "Semitic Languages". This work is an early version of a new domain-independent Arabic question answering system over linked data, which aims to particularly help Arab users to explore the Arabic Semantic Web based on Arabic ontology. We describe with sufficient details the different modules of our proposed system, which uses language parser, finite state automaton and semantic Web techniques to linguistically process and answer Arabic natural language question. Experiments have been carried out to evaluate and show efficiency of the proposed system.

[1] M. M. Goup, Internet World Users By Language: Top 10 Languages,[Online], Available: http://www.internetworldstats.com/stats7.htm,[Accessed October 2017].

[2] B. v. d. Beld, State of Digital, The Arabic Web: Numbers and Facts, General Statistics,[Online], Available: http://www.stateofdigital.com/the-arabic-web/,[Accessed October 2017].

[3] S. Albagli, R. Ben-Eliyahu-Zohary and S. E. Shimony, "Markov Network-based Ontology Matching," Journal of Computer and System Sciences, vol. 78, pp. 105-118, 2012.

[4] P. Hitzler, M. Krotzsch and S. Rudolph, Foundations of Semantic Web Technologies: CRC Press, 2009.

[5] W. Zaghouani, "Critical Survey of the Freely Available Arabic Corpora," arXiv Preprint arXiv:1702.07835, 2017.

[6] A. Bouziane, D. Bouchiha, N. Doumi and M. Malki, "Question Answering Systems: The Story Till the Arabic Linked Data," International Journal of Artificial Intelligence and Soft Computing (IJAISC), vol. 6, pp. 24-42, 2017.

[7] A. M. Ezzeldin and M. Shaheen, "A Survey of Arabic Question Answering: Challenges, Tasks, Approaches, Tools and Future Trends," Proceedings of The 13th International Arab Conference on Information Technology (ACIT 2012), pp. 1-8, 2012.

[8] A. Pasha, M. Al-Badrashiny, M. Diab, A. E. Kholy, R. Eskander, N. Habash, M. Pooleery, O. Rambow and R. M. Roth, "MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic," Ed. LREC2014, 2014.

[9] N. Habash, Introduction to Arabic Natural Language Processing: Morgan & Claypool, 2010.

[10] R. Guo and F. Ren, "Towards the Relationship Between Semantic Web and NLP," International Conference on Natural Language Processing and Knowledge Engineering, Dalian, 2009.

[11] M. M. Boudabous, L. H. Belguith and F. Sadat, "Exploiting the Arabic Wikipedia for Semi-automatic Construction of a Lexical Ontology," International Journal of Metadata, Semantics and Ontologies, vol. 8, pp. 245-253, 2013.

[12] H. Al-Feel, "The Roadmap for the Arabic Chapter of DBpedia," Mathematical and Computational Methods in Electrical Engineering, Proceedings of the 14th International Conference on Telecom. and Informatics (TELE-INFO '15), Sliema, Malta, pp. 115-125, 2015.

[13] M. Beseiso, A. R. Ahmad and R. Ismail, "A Survey of Arabic Language Support in Semantic Web," International Journal of Computer Applications, vol. 9, pp. 35-40, 2010.

[14] R. F. Simmons, "Answering English Questions by Computer: A Survey," Communications of the ACM, vol. 8, pp. 53-70, 1965.

[15] B. F. Green, A. K. Wolf, C. Chomsky and K. Laughery, "BASEBALL: An Automatic Question Answering," Proceedings of Western Joint Computer Conference, pp. 207-216, 1961.

[16] K. M. Colby, "Artificial Paranoia," Artificial Intelligence, vol. Vol. 2, 1971.

[17] W. Woods, R. Kaplan and B. Webber, "The Lunar Sciences Natural Language Information System," Cambridge, Massachusetts Final Report, 1972.

[18] A. Rukshan, R. Prashanthi and M. Sinnathamby, "Natural Language Web Interface for Database (NLWIDB)," Proceedings of the 3rd International Symposium, SEUSL, Oluvil, Sri Lanka, 2013.

[19] E. M. Voorhees and D. M. Tice, "The TREC-8 Question Answering Track Evaluation," NIST Special Publication 500-246: The 8th Text REtrieval Conference (TREC 8), 1999.

[20] W. Youzheng, H. Chiori, K. Hideki and K. Hisashi, "Leveraging Social Q&A Collections for Improving Complex Question Answering," Elsevier, Computer Speech and Language, vol. 29, pp. 1-19, 2015.

[21] C. Kwok, O. Etzioni and D. Weld, "Scaling Question Answering to the Web," Proceedings of the 10th International Conference on World Wide Web, Hong Kong, China, pp. 150-161, 2001.

[22] S. Harabagiu Dan, A. Harabagiu, D. Moldovan, C. Clark, M. Bowden, A. Hickl and P. Wang, "Employing Two Question Answering Systems in TREC 2005," Proceedings of the 14th Text REtrieval Conference, 2005.

[23] O. Ferret, B. Grau, G. Illouz, C. Jacquemin and N. Masson, "QALC - The Question-Answering Program of the Language and Cognition Group at LIMSI-CNRS," TREC-8, Columbia, 1999.

[24] J. M. G. Soriano, M. M. Y. Gómez, E. S. Arnal and P. Rosso, "A Passage Retrieval System for Multilingual Question Answering," International Conference on Text, Speech and Dialogue, pp. 443-450, 2005.

[25] P.-M. Ryu, M.-G. Jang and H.-K. Kim, "Open Domain Question Answering Using Wikipedia-based Knowledge Model," Information Processing & Management, vol. 50, pp. 683-692, 2014.

[26] R. Sutcliffe, A. Peñas, E. Hovy, P. Forner, Á. Rodrigo, C. Forascu, Y. Benajiba and P. Osenova, "Overview of QA4MRE Main Task at CLEF 2013," Working Notes CLEF, 2013.

[27] S. K. Ray and K. Shaalan, "A Review and Future Perspectives of Arabic Question Answering Systems," IEEE Transactions on Knowledge and Data Engineering, vol. 28, pp. 3169-3190, 2016.

[28] V. Lopez, V. Uren, M. Sabou and E. Motta, "Is Question Answering Fit for the Semantic Web? A Survey, " Semantic Web, vol. 2, pp. 125–155, 2011.

[29] C. Pradel, O. Haemmerlé and N. Hernandez, "Swip: A Natural Language to SPARQL Interface Implemented with SPARQL," 21st International Conference on Conceptual Structures (ICCS 2014), Iaşi, Romania, 2014.

[30] K. Xu, S. Zhang, Y. Feng and D. Zhao, "Answering Natural Language Questions via Phrasal Semantic Parsing," The Natural Language Processing and Chinese Computing, Third CCF Conference (NLPCC 2014), Shenzhen, China, 2014.

[31] A. Kalyanpur, B. K. Boguraev, S. Patwardhan, J. W. Murdock, A. Lally, C. Welty, J. M. Prager, B. Coppola, A. Fokoue-Nkoutche, L. Zhang, Y. Pan and Z. M. Qiu, "Structured Data and Inference in DeepQA," IBM Journal of Research and Development, vol. 56, pp. 10:1 - 10:14, May-June 2012.

[32] S. Linckels and C. Meinel, "A Simple Solution for an Intelligent Librarian System," Proceedings of the IADIS International Conference of Applied Computing (IADIS AC2005), Lisbon, Portugal, pp. 495-503, 2005.

[33] W. Ahmed and A. P. Babu, "Question Analysis for Arabic Question Answering Systems," International Journal on Natural Language Computing (IJNLC), vol. 5, December 2016.

[34] H. Abdelnasser, R. Mohamed, M. Ragab, A. Mohamed, B. Farouk, N. El-Makky and M. Torki, "Al-Bayan: An Arabic Question Answering System for the Holy Quran," Proceedings of the EMNLP 2014 Workshop on Arabic Natural Langauge Processing (ANLP), Doha, Qatar, pp. 57–64, 2014.

[35] S. Bekhti and M. Al-Harbi, "AQuASys: A Question-Answering System for Arabic," Proceedings of the 13th International Conference on Applied Computer Science (ACS '13), Proceedings of the 2nd International Conference on Digital Services, Internet and Applications (DSIA'13), Morioka City, Iwate, Japan, pp. 130-139, 2013.

[36] W. Ahmed, A. Pv and A. P. Babu, "Web-based Arabic Question Answering System using Machine Learning Approach," International Journal of Advanced Research in Computer Science, vol. 8, pp. 40-45, Jan./Feb. 2017.

[37] I. Al-Agha and A. Abu-Taha, "AR2SPARQL: An Arabic Natural Language Interface for the Semantic Web," International Journal of Computer Applications, vol. 125, 2015.

[38] F. A. Mohammed, K. Nasser and H. M. Harb, "A Knowledge-based Arabic Question Answering System (AQAS)," ACM SIGART Bulletin, vol. 4, pp. 21-30, Oct. 1993.

[39] B. Hammo, H. Abu-Salem and S. Lytinen, "QARAB: A Question Answering System to Support the Arabic Language," Proceedings of the ACL-02 Workshop on Computational Approaches to Semitic Languages (SEMITIC'02), Philadelphia, Pennsylvania, pp. 1-11, 2002.

[40] E. Al-Shawakfa, "A Rule-based Approach to Understand Questions in Arabic Question Answering," Jordanian Journal of Computers and Information Technology, vol. 2, pp. 210-231, 2016.

[41] Y. Benajiba, P. Rosso and A. Lyhyaoui, "Implementation of the ArabiQA Question Answering System's Components," Proc. Of Workshop on Arabic Natural Language Processing, 2nd Information Communication Technologies Int. Symposium (ICTIS-2007), Fez, Morroco, pp. 3-5, April, 2007.

[42] B. A. Shawar, "A Chatbot As a Natural Web Interface to Arabic Web QA," International Journal of Emerging Technologies in Learning (iJET), vol. 6, pp. 37-43, 2011.

[43] O. Trigui, L. H. Belguith and P. Rosso, "Arabic Cooperative Answer Generation via Wikipedia Article Infoboxes," Research in Computing Science, vol. 132, pp. 129-153, 2017.

[44] N. Y. Habash, A. Soudi and T. Buckwalter, "On Arabic Transliteration," Arabic Computational Morphology: Knowledge-based and Empirical Methods, vol. 38, A. Soudi, A. v. d. Bosch and G. Neumann, Eds., Springer, pp. 15-22, 2007.

[45] Y. Benajiba, M. Diab and P. Rosso, "Using Language -Independent and Language- Specific Features to Enhance Arabic Named Entity Recognition," The International Arab Journal of Information Technology, vol. 6, 2009.

[46] M. C. De Marneffe and C. D. Manning, Stanford Typed Dependencies Manual, Stanford University, Ed. 2008, pp. 338-345, Sep. 2008.

[47] R. Al-Shalabi, G. Kanaan, J. M. Jaam, A. Hasnah and E. Hilat, "Stop-word Removal Algorithm for Arabic Language," Proceedings of International Conference on Information and Communication Technologies: From Theory to Applications, Damascus, Syria, 2004.

[48] H. Rodríguez, D. Farwell, J. Farreres, M. Bertran, M. Alkhalifa, M. A. Martí, W. Black, S. Elkateb, J. Kirk, P. Vossen and C. Fellbaum, "Arabic WordNet: Current State and Future Extensions," Proceedings of the 4th Global WordNet Conference (GWC 2008), Szeged, Hungary, 2008.

[49] Y. Regragui, L. Abouenour, F. Krieche, K. Bouzoubaa and P. Rosso, "Arabic WordNet: New Content and New Applications," Proceedings of the 8th Global WordNet Conference, pp. 330-338, 2016.

[50] Y. Benajiba, "Test-Bed for Passage Retrieval (PR) and Question Answering (QUA) Tasks," Y. Benajiba, Ed.,[Accessed October 2017].

[51] B. Hammo, S. Abuleil, S. Lytinen and M. Evens: "Experimenting with a Question Answering System for the Arabic Language," Computers and the Humanities, vol. 38, no. 4, pp. 397-415, 2004.