ARBML is a group of researchers working on democratizing Arabic NLP research and development:

  • 🙋‍♀️ All about Arabic NLP and ML, open source for the win!
  • 🏵️ Contribution guidelines: open an issue and, once given the go-ahead, submit a PR.
  • 👩‍💻 Some repos have specific contribution guidelines.
  • 📝 Remember to cite if you use one of our resources.

Pinned

  1. ARBML Public

    Implementations of many Arabic NLP and CV projects, providing a real-time experience through interfaces such as web, command line, and notebooks.

    JavaScript, 280 stars, 36 forks

  2. klaam Public

    Arabic speech recognition, classification and text-to-speech.

    Jupyter Notebook, 168 stars, 38 forks

  3. masader Public

    The largest public catalogue of Arabic NLP and speech datasets. It lists more than 500 datasets, each annotated with more than 25 attributes.

    JavaScript, 91 stars, 20 forks

  4. Calliar Public

    A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.

    Jupyter Notebook, 111 stars, 11 forks

  5. tkseem Public

    Arabic Tokenization Library. It provides many tokenization algorithms.

    Jupyter Notebook, 39 stars, 6 forks

  6. nmatheg Public

    A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.

    Jupyter Notebook, 18 stars, 5 forks

Repositories

  • Python, 0 stars, 0 forks, 0 issues, 0 pull requests, updated Nov 4, 2022
  • adawat Public
    Jupyter Notebook, 0 stars, 0 forks, 0 issues, 0 pull requests, updated Oct 26, 2022
  • nmatheg Public

    A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.

    Jupyter Notebook, 18 stars, 5 forks, 0 issues, 0 pull requests, updated Oct 20, 2022
  • tnqeeb Public

    Arabic diverse corpus datasets

    Python, 9 stars, MIT license, 0 forks, 18 issues, 2 pull requests, updated Oct 10, 2022
  • .github Public
    1 star, 0 forks, 0 issues, 0 pull requests, updated Sep 17, 2022
  • tkseem Public

    Arabic Tokenization Library. It provides many tokenization algorithms.

    Jupyter Notebook, 39 stars, MIT license, 6 forks, 1 issue, 1 pull request, updated Sep 17, 2022
  • masader Public

    The largest public catalogue of Arabic NLP and speech datasets. It lists more than 500 datasets, each annotated with more than 25 attributes.

    JavaScript, 91 stars, GPL-3.0 license, 20 forks, 6 issues (1 issue needs help), 0 pull requests, updated Sep 14, 2022
  • qawafi Public

    Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.

    Jupyter Notebook, 11 stars, MIT license, 2 forks, 3 issues, 0 pull requests, updated Sep 14, 2022
  • website Public
    HTML, 0 stars, 1 fork, 2 issues, 0 pull requests, updated Sep 3, 2022
  • Calliar Public

    A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.

    Jupyter Notebook, 111 stars, 11 forks, 1 issue, 0 pull requests, updated Aug 9, 2022
