Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      4113811Updated Oct 14, 2025Oct 14, 2025
    • A lightweight framework for evaluating visual-language models.
      Python
      438191Updated Oct 14, 2025Oct 14, 2025
    • 4005Updated Oct 14, 2025Oct 14, 2025
    • 日本語LLMまとめ - Overview of Japanese LLMs
      TypeScript
      381.2k20Updated Oct 13, 2025Oct 13, 2025
    • Python
      1401Updated Oct 13, 2025Oct 13, 2025
    • ccaudio

      Public
      Tools for downloading and preprocessing audio data from Common Crawl
      Python
      0000Updated Oct 3, 2025Oct 3, 2025
    • 生成自動評価を行うためのPythonツール
      Python
      13121Updated Oct 1, 2025Oct 1, 2025
    • scripts

      Public
      Shell
      59519Updated Oct 1, 2025Oct 1, 2025
    • Python
      1600Updated Sep 23, 2025Sep 23, 2025
    • Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequences of interleaved semantic and acoustic tokens.
      Python
      12100Updated Sep 20, 2025Sep 20, 2025
    • Python
      2700Updated Sep 17, 2025Sep 17, 2025
    • Roff
      54210Updated Sep 6, 2025Sep 6, 2025
    • FastChat2

      Public
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      4.8k000Updated Sep 5, 2025Sep 5, 2025
    • jgpqa

      Public
      Japanese translation of the GPQA dataset
      0200Updated Sep 1, 2025Sep 1, 2025
    • Python
      1900Updated Aug 26, 2025Aug 26, 2025
    • Python
      0000Updated Jul 28, 2025Jul 28, 2025
    • Internal modification version of llm-jp-eval
      Python
      41000Updated Jul 27, 2025Jul 27, 2025
    • Python
      1000Updated Jun 29, 2025Jun 29, 2025
    • This repository contains the training and evaluation code for llm-jp-modernbert-base.
      Python
      11100Updated Jun 17, 2025Jun 17, 2025
    • clip-eval

      Public
      clip-eval is a tool for evaluating CLIP models on various image classification and image-text retrieval tasks in Japanese.
      Python
      0000Updated Apr 30, 2025Apr 30, 2025
    • instruct3

      Public
      Python
      0600Updated Apr 14, 2025Apr 14, 2025
    • Shell
      0300Updated Mar 28, 2025Mar 28, 2025
    • 0100Updated Mar 11, 2025Mar 11, 2025
    • NLP2025ワークショップ「大規模言語モデルのファインチューニング技術と評価」
      Python
      0000Updated Feb 7, 2025Feb 7, 2025
    • Easily turn large English text datasets into Japanese text datasets using open LLMs.
      Python
      12260Updated Jan 20, 2025Jan 20, 2025
    • Ongoing research training transformer models at scale
      Python
      3.2k000Updated Jan 19, 2025Jan 19, 2025
    • This is the repository for llm jp membership inference attack.
      Python
      1500Updated Jan 18, 2025Jan 18, 2025
    • A framework for few-shot evaluation of language models.
      Python
      2.8k000Updated Jan 4, 2025Jan 4, 2025
    • Python
      1100Updated Dec 16, 2024Dec 16, 2024
    • A fastText-based classifier for removing toxic texts from corpora, trained on automatically generated labeled data.
      Python
      0000Updated Nov 23, 2024Nov 23, 2024