Resources
Documents / Slides
We also publish an “Overview of Japanese LLMs” on GitHub. Click below to learn more.
Open on GitHub
Working Group / Talks & Reports
Academic Domain WG
Corpus Construction WG
Dialogue WG
Evaluation and Tuning WG
Mechanistic Understanding WG
Model Building WG
Multi-modal WG
Principle Elucidation WG
Real-world Interaction WG
Safety WG
Talks and Reports
All
Session
28th (2026-4-21)
27th (2026-3-17)
26th (2026-2-24)
25th (2026-1-13)
24th (2025-10-28)
23rd (2025-9-30)
22nd (2025-8-26)
21st (2025-7-22)
20th (2025-6-24)
19th (2025-5-27)
18th (2025-4-22)
17th (2025-3-25)
16th (2025-2-25)
15th (2025-1-14)
14th (2024-11-26)
13th (2024-10-29)
12th (2024-8-27)
11th (2024-7-30)
10th (2024-6-25)
9th (2024-5-28)
8th (2024-3-26)
7th (2024-1-22)
6th (2023-11-29)
5th (2023-10-18)
4th (2023-9-4)
3rd (2023-7-20)
2nd (2023-6-19)
1st (2023-5-15)
All
List of Public Resources
LLM-jp Status Report
Kurohashi
Research and Development of an Open Japanese Medical LLM
Kobayashi
Japanese LLM-as-a-Judge Evaluation Tool ‘llm-jp-judge’ Update and Analysis
Nakayama
Corpus Construction WG
Kawahara
Real-world Interaction WG
Ogata
Safety WG
Sekine
LLM-jp Status Report
Kurohashi
Crawling the Japanese Web
Tamura/ROIS
Stockmark-2-VL-100B: A Japanese-Specialized Visual Language Model with Chain-of-Thought Reasoning for Document Reading Comprehension
Shi Chen/Stockmark
The Insight We Gained from the Development of Llama 3.1 Future Code Ja
Ryo Fujii/Future
Dialogue WG
Higashinaka
Multi-modal WG
Okazaki
Evaluation and Tuning WG
Miyao
Corpus Construction WG
Kawahara
Safety WG
Sekine
Safety WG
Sekine
Safety WG
Sekine
Model Building WG
Suzuki
Real-world Interaction WG
Ogata
LLM-jp Status Report
Kurohashi
Evaluating the Faithfulness and Readability of RL-Tuned LLMs
Chaoran Liu
The First Workshop on Fine-Tuning and Evaluating LLMs: A Report
Namgi Han/UT|Katsumata/Retrieva|Miyao|Kiyomaru
SoftMatcha: A Soft and Fast Pattern Matcher for Billion-Scale Corpus Searches
Deguchi/NTT
Model Building WG
Suzuki
Evaluation and Tuning WG
Miyao
Dialogue WG
Higashinaka
Real-world Interaction WG
Ogata
Safety WG
Sekine
Corpus Construction WG
Kawahara
Multi-modal WG
Okazaki
Real-world Interaction WG
Ogata
Evaluation and Tuning WG
Miyao
The Role of LLM in Juris-informatics and a Survey of Legal Control over LLM
Ken Sato
Mid-training for LLM
Kodama
Dialogue WG
Higashinaka
Model Building WG
Suzuki
Multi-modal WG
Okazaki
Safety WG
Sekine
Corpus Construction WG
Kawahara
LLM-jp Status Report
Kurohashi
Harnessing AI Agents for Real-World Applications
Kwasi Ankomah / SambaNova
llm-jp-eval-mm: An Evaluation Framework for Evaluating Japanese-centric Vision and Language Model
Sugiura
Developing Japanese CLIP Models Leveraging an Open-weight LLM for Large-scale Dataset Translation
Sugiura
A Study on Fine-tuning Methods for Balancing Usefulness and Safety in Japanese Large Language Models
Katsumata
Large-Scale Human Evaluation of LLMs for Japanese
Inoue
Comparative Analysis of the Geospatial Representations in Large Language Models across Models and Languages
Otake
A Comprehensive Study on Supervised Fine-tuning in Large Language Models
Harada
How LLMs Learn: Tracing Internal Representations with Sparse Autoencoders
Inaba
Integrated Framework for LLM Domain Adaptation Based on Synthetic Data
Ogawa
Understanding the Role of Persona and Internal Mechanisms in Large Language Models
Ozaki
Detection of Sensitive Personal Information in the Pre-training Corpus for Large Language Models
Minamoto
llm-jp-judge: Japanese LLM-as-a-Judge Evaluation Tool
Kodama
A Comprehensive Analysis of Memorization in Large Language Models
Kiyomaru
Analyzing the Pretraining of Japanese Large Language Models
Nishida
Development of Prompt Attack Data Collection Application for LLMs and Analysis of Collected Data Characteristics
Hayashi
Introduction of Open Japanese LLM Leaderboard and Statistical Analysis on Evaluation Results
Namgi Han
Developing a Dataset of Misinformation from Social Media and an Accuracy Benchmark for Large Language Models
Nakazato
Are Checklists Really Useful for Automatic Evaluation of Generative Tasks?
Furuhashi
AnswerCarefully: A Dataset for Promoting Safety of Japanese LLMs
Suzuki
LLM-jp Status Report (Oral Report Only)
Kurohashi
Large-Scale Human Evaluation of LLM Safety
Takahashi
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
Nakamura
LLM-jp-3 VILA: Construction of Japanese Multimodal Data and Powerful Japanese Multimodal Model
Sasagawa
LLM-jp Status Report (Oral Report Only)
Kurohashi
Real-world Interaction WG
Ogata
Safety WG
Sekine
Safety WG
Sekine
Model Building WG
Suzuki
Multi-modal WG
Maeda|Okazaki
Corpus Construction WG
Kawahara
Evaluation and Tuning WG
Miyao
Dataflow Architecture Achieving 198 Tokens per Second with DeepSeek R1 671B
Kenichi Hayashi/SambaNova Systems
PLaMo 2 Tokenizer: Keys to Token Efficiency
Kentaro Imajo/Preferred Networks
Training Progress of LLM-jp-3 Models: Analysis on Downstream Performance
Oda|Nishida
LLM-jp Status Report
Kurohashi
Model Building WG
Suzuki
Multi-modal WG
Okazaki
Evaluation and Tuning WG
Miyao
Real-world Interaction WG
Ogata
Corpus Construction WG
Kawahara
Development of Large-Scale Multimodal Models in Cardiovascular Medicine (ECG/X-ray)
Junichiro Takahashi/UT
Development of X-ray Reading Report Generation Model
Kaito Baba/UT
Efforts to Efficiently Create High-Quality LLM Datasets
Yujiro Terazawa/APTO
LLM Training Using Synthetic Data
Kiyomaru
LLM-jp Status Report (Oral Report Only)
Kurohashi
Development and Evaluation of Tanuki
Kan Hatakeyama/Institute of Science Tokyo
EMNLP2024 Report
Takagi
EMNLP2024 Report
Kodama
EMNLP2024 Report
Liu
Safety WG
Sekine
Corpus Construction WG
Kawahara
Evaluation and Tuning WG
Miyao
Real-world Interaction WG
Ogata
Model Building WG
Suzuki
Multi-modal WG
Okazaki
Discussion on the Training Progress of LLM-jp-3 172B
Oda
LLM-jp Status Report
Kurohashi
BritLLM: Organising, producing, and publishing the first British Large Language Model
Pontus Stenetorp/NII
Pre-training and Post-training of PLaMo100B
Hiroaki Mikami|Kosuke Nakago/Preferred Elements
Multi-modal WG
Maeda|Okazaki|Sugiura|Sasagawa