- Japan
-
13:13
(UTC +09:00) - https://secon.dev/
- @hotchpotch
- https://kaggle.com/hotchpotch
- https://huggingface.co/hotchpotch
-
open_provence Public
✂️ OpenProvence: Open-Source, Efficient, and Robust Context Pruning for Retrieval-Augmented Generation
-
llama_index Public
Forked from run-llama/llama_indexLlamaIndex is the leading framework for building LLM-powered agents over your data.
Python MIT License UpdatedOct 31, 2025 -
fast-bunkai Public
⚡Japanese sentence splitting(日本語文境界判定器), 40–250× faster via a Rust-accelerated Python library with near-perfect API compatibility with megagonlabs/bunkai.
-
JMTEB Public
Forked from sbintuitions/JMTEBThe evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
Python Creative Commons Attribution Share Alike 4.0 International UpdatedSep 9, 2025 -
JaCWIR Public
JaCWIR: Japanese Casual Web IR - 日本語情報検索評価のための小規模でカジュアルなWebタイトルと概要のデータセット
-
JQaRA Public
JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット
-
sentence-transformers Public
Forked from huggingface/sentence-transformersState-of-the-Art Text Embeddings
Python Apache License 2.0 UpdatedAug 2, 2025 -
yast Public
YAST - Yet Another SPLADE or Sparse Trainer
-
yasem Public
YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings
-
記事タイトルがないものを、自動タイトル
Python MIT License UpdatedApr 21, 2025 -
Educational content scoring and evaluation code using fineweb-2 (Japanese). Includes training and assessment implementations for content rating tasks.
-
sd-16 Public
Forked from mahm/sd-16LangGraph sample code for Software Design article vol.16
Python UpdatedNov 21, 2024 -
JapaneseEmbeddingEval Public
Forked from oshizo/JapaneseEmbeddingEvalJupyter Notebook UpdatedOct 7, 2024 -
FlagEmbedding Public
Forked from FlagOpen/FlagEmbeddingRetrieval and Retrieval-augmented LLMs
Python MIT License UpdatedAug 29, 2024 -
text-embeddings-inference Public
Forked from huggingface/text-embeddings-inferenceA blazing fast inference solution for text embeddings models
Rust Apache License 2.0 UpdatedJun 12, 2024 -
vespa-kuromoji-linguistics Public
Forked from yahoojapan/vespa-kuromoji-linguisticsJava Apache License 2.0 UpdatedApr 3, 2024 -
wikipedia 日本語の文を、各種日本語の embeddings や faiss index へと変換するスクリプト等。
-
youri-7b を SFT で Q&A + RAG形式に特化したフォーマットで学習
-
ranx Public
Forked from AmenRa/ranx⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
Python MIT License UpdatedFeb 21, 2024 -
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedAug 29, 2023 -
ncd_classifier Public
NCD Classifier is a Python library that implements the method proposed in the paper "Low-Resource" Text Classification: A Parameter-Free Classification Method with Compressors".
-
peft Public
Forked from huggingface/peft🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python Apache License 2.0 UpdatedMay 29, 2023 -
langchain Public
Forked from langchain-ai/langchain⚡ Building applications with LLMs through composability ⚡
Python MIT License UpdatedMay 2, 2023 -
-
-
-
improved-aesthetic-predictor Public
Forked from christophschuhmann/improved-aesthetic-predictorCLIP+MLP Aesthetic Score Predictor
Python Apache License 2.0 UpdatedSep 20, 2022 -
lab_sample_pipelines Public
Forked from AseiSugiyama/lab_sample_pipelinesPython MIT License UpdatedOct 30, 2021 -
Hatena::Group Static Site Generator