Quran

Quranic Arabic Corpus

Annotated linguistic dataset from the University of Leeds providing word-by-word Arabic grammar, morphology, and syntax for every word in the Quran. Includes part-of-speech tags, roots, lemmas, and syntactic treebank data. The gold standard for Quranic NLP.

Free Scholar-reviewed
Open resource ↗ Documentation