Loading...
Please wait while we load the page
MACULA includes rich datasets (accessible via GitHub and via an API). These datasets provide many layers of linguistic description for every sentence of the Hebrew Bible and Greek New Testament, including syntactic structure, morphology, semantic domains, relationships, referents, glosses, and quotations, and other aspects.
MACULA can be used by resource creators or researchers with extensive knowledge of biblical languages and linguistics, or by programmers and NLP practitioners who do not know Hebrew and Greek. MACULA’s datasets are freely licensed and available on GitHub.
https://github.com/Clear-Bible/macula-hebrew
https://github.com/Clear-Bible/macula-greek
Using these datasets as a basis, Clear is in the process of creating user environments for both scholars of biblical languages and serious Bible students who do not know Greek or Hebrew.






"As a Data Scientist wanting to process the biblical texts, I have found the MACULA dataset to be extremely useful. The data is very high quality, and it's very straightforward for me to import and use it in developing applications!"
Mark Woodward, Data Scientist, SIL International
"The MACULA data sets help translation consultants quickly recognize features, including participant reference and word sense, of the biblical texts. We appreciate the way it already integrates UBS and other data sets and look forward to even richer data in the future."
Milt Jones, VP Bible Translation at Seed Company
"MACULA data has been a game changer for our team as we try to estimate the qualities of Bible translation drafts. The semantic domain information and glosses help us find missing, added, or modified information during review."
Daniel Whitenack, Data Scientist, SIL Internationa