Stars
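Llama Chinese community: aggregates the latest Llama learning resources in real time and builds the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use.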
A 13B large language model developed by Baichuan Intelligent Technology
ModelScope: bring the notion of Model-as-a-Service to life.
An open source implementation of CLIP.
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
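Friendly reminder: for lawful, constructive sensitive-word filtering only. Use in any activity that violates laws and regulations, endangers the state or the people, or is unethical is strictly prohibited!
Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and key-phrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional Chinese conversion, natural language processing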
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
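ChineseSemanticKB, a Chinese semantic knowledge base: 12 categories of million-scale dictionaries of common semantic terms for Chinese processing, including an abstract-semantics lexicon of 340,000 entries, an antonym lexicon of 340,000 entries, and a synonym lexicon of 430,000 entries. Supports applications such as sentence expansion, rewriting, and event abstraction and generalization.
Chinese and English sensitive words, language detection, location and carrier lookup for Chinese and foreign mobile/landline numbers, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English string segmentation, various Chinese word vectors, company name list, classical Chinese poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …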
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
all kinds of text classification models and more with deep learning
Convolutional Neural Network for Text Classification in TensorFlow
Interactive tools and developer experiences for Big Data on Google Cloud Platform.
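A sensitive-word filter adapted from word-segmentation principles. It can be made dynamic and supports operations such as returning, highlighting, and replacing sensitive words. The word list collects more than 50,000 illegal, sensitive, and prohibited words (deduplicated), with nearly 10,000 new words recently added, plus dozens of corrected and variant words.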
⏰ Collaboratively track worldwide conference deadlines (Website, Python CLI, WeChat Applet) / If you find it useful, please star this project, thanks~
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
RUCAIBox / LightSANs
Forked from BELIEVEfxy/LightSANs
Code for our SIGIR 2021 short paper "Lighter and Better: Low-Rank Decomposed Self-Attention Networks for Next-Item Recommendation."
SASRec: Self-Attentive Sequential Recommendation
Blade is a powerful build system from Tencent that supports many mainstream programming languages, such as C/C++, Java, Scala, Python, protobuf...
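Combines word-similarity calculation methods based on the extended Tongyici Cilin (synonym thesaurus) and HowNet, giving broader vocabulary coverage and more accurate results.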