Wednesday, July 8, 2020

Open Access Quranic Arabic Corpus

 [First posted in AWOL 11 August 2010, updated 8 July 2020]

Quranic Arabic Corpus
http://corpus.quran.com/images/logo.png
Welcome to the Quranic Arabic Corpus, an annotated linguistic resource which shows the Arabic grammar, syntax and morphology for each word in the Holy Quran. The corpus provides three levels of analysis: morphological annotation, a syntactic treebank and a semantic ontology.
The Quran is a significant religious text written in Quranic Arabic, and is followed by believers of the Islamic faith. The Quran contains 6,236 numbered verses (ayāt) and is divided into 114 chapters.

An example verse from the Quran:
(21:30) Have those who disbelieved not considered that the heavens and the earth were a joined entity, and We separated them and made from water every living thing? Then will they not believe?
  • Version 0.4 Released - new and updated linguistic features in this version of the corpus
  • Word by Word Quran - maps out the syntax of the entire Quran, with analysis and translation
  • Quranic Grammar - traditional Arabic grammar (إعراب) illustrated using dependency graphs

How you can get involved

This project contributes to the research of the Quran by applying natural language computing technology to analyze the Arabic text of each verse. The word by word grammar is very accurate, but ensuring complete accuracy is not possible without your help. If you come across a word and you feel that a better analysis could be provided, you can suggest a correction online by clicking on an Arabic word.

No comments:

Post a Comment