Release ｜ LLM-jp

Models, Corpus, and Tools

We have released the models and tools developed by LLM-jp.
Please cite or reference the resources on this page if you use them in your research or software development.

Open platforms

Models: https://huggingface.co/llm-jp
Corpora: https://gitlab.llm-jp.nii.ac.jp/datasets
Tools: https://github.com/llm-jp

Major Models

Fine-tuned Models

Pre-trained Models

LLM-jp-3-8x13b
LLM-jp-3-172B (Access requires approval. Redistribution and certain uses are restricted.)
LLM-jp-3-13B

Multi-model Models

LLM-jp-3-VILA-14B

Corpora for Pre-training

Evaluation and fine-tuning datasets

Other data is based on publicly available data, and details can be found in the “Evaluation Tools” and “Tuning Scripts” below, respectively.

Tools

Pre-training Corpus Building Scripts v2.0
Pre-training Corpus Building Scripts v1.0
Tokenizer
Evaluation Tools
- llm-jp-eval
- llm-jp-judge
Fine-tuning Script
- trl-based
  - SFT
  - DPO
- Nemo-Aligner-based (Supports both SFT and DPO)

Leaderboards (Weights & Biases)

All Models

Fine-tuned Models

Release

Open platforms

Major Models

Fine-tuned Models

Pre-trained Models

Multi-model Models

Corpora for Pre-training

Evaluation and fine-tuning datasets

Tools

Leaderboards (Weights & Biases)

All Models

Fine-tuned Models

Pre-trained Models

Multi-model Models

Others