A HYBRID APPROACH FOR INDEXING AND SEARCHING THE HOLY QURAN

(Received: 2015-10-27, Revised: 2015-11-19, Accepted: 2015-12-21)
Searching and retrieving information from the Holy Quran has recently received special attention from Muslim and Arab researchers. Many applications and websites offer search over the Quran using keywords, semantic search, or both. Arabic, the language of the Quran, has a complex structure, so searching Arabic script is more difficult than searching English text. In this paper, we propose a hybrid approach that combines syntactic (keyword) search with word semantics to search the Quran text. The Quran text was pre-processed and represented in an XML format. A system was implemented in Python, using regular expressions to test the approach and a Graphical User Interface to handle user requests. The approach produced promising results.
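As a rough illustration of the idea, the following minimal sketch combines a keyword match with a small table of related words over verses stored in XML, using Python's standard `re` and `xml.etree.ElementTree` modules. The XML layout (`sura` and `aya` elements with `index` and `text` attributes), the `RELATED_WORDS` table, the function names, and the placeholder English verse texts are all assumptions made for this example; they are not the schema, semantic resource, or data used in the paper.

```python
import re
import xml.etree.ElementTree as ET

# Placeholder corpus: element names, attributes and verse texts are
# illustrative assumptions, not the paper's XML schema or Quran text.
SAMPLE_XML = """
<quran>
  <sura index="1">
    <aya index="1" text="placeholder verse about mercy and guidance"/>
    <aya index="2" text="placeholder verse about patience"/>
  </sura>
  <sura index="2">
    <aya index="1" text="placeholder verse about compassion"/>
  </sura>
</quran>
"""

# Hypothetical semantic table: query word -> semantically related words.
RELATED_WORDS = {"mercy": ["compassion", "forgiveness"]}


def load_verses(xml_text):
    """Return a list of (sura, aya, text) tuples parsed from the XML string."""
    root = ET.fromstring(xml_text)
    return [
        (int(sura.get("index")), int(aya.get("index")), aya.get("text"))
        for sura in root.iter("sura")
        for aya in sura.iter("aya")
    ]


def hybrid_search(verses, query):
    """Return (sura, aya) pairs matching the keyword or any related word."""
    terms = [query] + RELATED_WORDS.get(query, [])
    # One alternation pattern covers the keyword and its related words.
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    return [(sura, aya) for sura, aya, text in verses if pattern.search(text)]


verses = load_verses(SAMPLE_XML)
mercy_hits = hybrid_search(verses, "mercy")        # keyword plus related words
patience_hits = hybrid_search(verses, "patience")  # keyword only
print(mercy_hits)
print(patience_hits)
```

In this sketch a query with no entry in the related-words table falls back to plain keyword matching. A real system for Arabic text would also need to handle diacritics and morphological variants, which this example does not attempt.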