Babar Ali MrBabarAli

"Aspiring AI researcher and developer, passionate about Computer Vision and NLP. I enjoy building ML models, exploring multimodal data, and solving real-world p

2 followers · 14 following

Stars

xcjthu / TopTextClassification

text classifier

Python 4 1 Updated Jul 10, 2019

mttk / rnn-classifier

Minimal RNN classifier with self-attention in PyTorch

Python 152 30 Updated Dec 21, 2021

castorini / anserini

Anserini is a Lucene toolkit for reproducible information retrieval research

Java 1,084 541 Updated Nov 11, 2025

castorini / hedwig

PyTorch deep learning models for document classification

Python 596 126 Updated Jul 21, 2023

NTMC-Community / MatchZoo-py

Facilitating the design, comparison and sharing of deep text matching models.

Python 502 107 Updated May 3, 2024

thunlp / OpenCLaP

Open Chinese Language Pre-trained Model Zoo

988 147 Updated Mar 18, 2020

jessevig / bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 7,750 848 Updated Jun 1, 2025

fighting41love / funNLP

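Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English simulation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, comprehensive company name list, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …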

Python 77,136 15,062 Updated May 10, 2024

THUwangcy / ReChorus

“Chorus” of recommendation models: a light and flexible PyTorch framework for Top-K recommendation.

Python 619 94 Updated Mar 8, 2025

THUwangcy / DirectAU

KDD'2022: Towards Representation Alignment and Uniformity in Collaborative Filtering

Python 70 5 Updated Oct 27, 2022

catboost / catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…

C++ 8,655 1,247 Updated Nov 11, 2025

dbiir / UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,097 526 Updated May 9, 2024

castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,978 463 Updated Nov 8, 2025

jingtaozhan / disentangled-retriever

An easy-to-use Python toolkit for flexibly adapting various neural ranking models to a target domain.

Python 60 5 Updated May 17, 2023

THUIR / THUIR-website

THUIR website

HTML 10 19 Updated Nov 2, 2025

ChuXiaokai / baidu_ultr_dataset

An unbiased learning-to-rank dataset from Baidu

Python 65 7 Updated Aug 3, 2024

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,975 2,217 Updated Jul 29, 2024

xuanyuan14 / THUIR_WSDM_Cup

Our code for WSDM Cup 2023 Task 1 and 2

Python 10 Updated Jan 16, 2023

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,235 4,538 Updated Nov 11, 2025

zai-org / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,160 5,214 Updated Jun 27, 2024

THUIR / LeCaRDv2

A Large-Scale Chinese Legal Case Retrieval Dataset

79 6 Updated Dec 29, 2024

CSHaitao / SAILER

The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval

Python 93 10 Updated May 9, 2023

CSHaitao / JTR

The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval

Python 28 2 Updated Jun 7, 2023

THUIR / T2Ranking

T2Ranking: A large-scale Chinese benchmark for passage ranking.

Python 162 10 Updated Jul 3, 2023

CSHaitao / THUIR-COLIEE2023

Code to reproduce THUIR's submissions for COLIEE 2023 Task1 and Task2

Python 27 2 Updated May 12, 2023

CSHaitao / ChatGLM_mutli_gpu_tuning

Simple and efficient multi-GPU fine-tuning of large models with DeepSpeed + Trainer

Python 129 10 Updated May 27, 2023

zeno-ml / zeno-build

Build, evaluate, understand, and fix LLM-based apps

Jupyter Notebook 491 32 Updated Jan 16, 2024

HFO4 / gameboy.live

🕹️ A basic gameboy emulator with terminal "Cloud Gaming" support

Go 4,844 239 Updated Apr 13, 2025

CSHaitao / LexiLaw

LexiLaw - a Chinese legal large language model

Python 942 140 Updated Mar 7, 2025

baichuan-inc / Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Python 4,122 294 Updated Nov 8, 2024