Stars
An AI Digital Assistant Extension for Helping Customer Support Agents
Example code for "Real-World Natural Language Processing"
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive data leaving your network.
Arabic speech recognition, classification and text-to-speech.
Extract Keywords from sentence or Replace keywords in sentences.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Access Google Colab compute from your local VSCode
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Benchmark Arabic text diacritization dataset
Open-source search and retrieval database for AI applications.
Llama from scratch, or How to implement a paper without crying
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch implementation of other Hierarchical Methods (Mean Pooling a…
📋 A list of open LLMs available for commercial use.
The Quranic Arabic Corpus, an invaluable linguistic resource, is due for a revamp. We're calling on Linguistics, AI, and Tech volunteers to join us in this exciting journey. 🚀
مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (ANLP).
🔥Highlighting the top ML papers every week.
iFeature is a comprehensive Python-based toolkit for generating various numerical feature representation schemes from protein or peptide sequences. iFeature is capable of calculating and extracting…
An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.