LLM-jp
NEWS
Release
Resources
Blog
Members
Meeting
CONTACT
Resources
Documents / Slides
We also publish an “Overview of Japanese LLMs” on GitHub. Click below to learn more.
Open on GitHub
Working Group / Talks & Reports
Academic Domain WG
Corpus Construction WG
Dialogue WG
Evaluation and Tuning WG
Mechanistic Understanding WG
Model Building WG
Multi-modal WG
Principle Elucidation WG
Real-world Interaction WG
Safety WG
Talks and Reports
All
Session
27th (2026-3-17)
26th (2026-2-24)
25th (2026-1-13)
24th (2025-10-28)
23rd (2025-9-30)
22nd (2025-8-26)
21st (2025-7-22)
20th (2025-6-24)
19th (2025-5-27)
18th (2025-4-22)
17th (2025-3-25)
16th (2025-2-25)
15th (2025-1-14)
14th (2024-11-26)
13th (2024-10-29)
12th (2024-8-27)
11th (2024-7-30)
10th (2024-6-25)
9th (2024-5-28)
8th (2024-3-26)
7th (2024-1-22)
6th (2023-11-29)
5th (2023-10-18)
4th (2023-9-4)
3rd (2023-7-20)
2nd (2023-6-19)
1st (2023-5-15)
All
List of Public Resources
LLM-jp Status Report
Kurohashi
Generation of Instruction and Preference Dataset for Improving Japanese Instruction Following in LLMs
Moriyama
Construction of JMultiWOZ-TC Evaluation Data for Tool Invocation by AI Agents
Shimizu
Which Feedback Works for Whom? Differential Effects of LLM-Generated Feedback Elements Across Learner Profiles
Furuhashi
Typology of Perceived “Oddness” in LLM-Generated Stories for Elementary School Kanji Learning
Takami
Analyzing Training Data Contributions in LLM Pretraining via Parameter-Space Distance
Nishida
Bottom-Up Interpretation of Language Model Training Dynamics via Loss Curve Clustering
Aoki
Exclusive Unlearning
Sasaki
Investigating Internal Operations for Long Distance Dependencies in Language Models
Kimura
JAMMEval: Improving the Reliability of Japanese VQA Evaluation Datasets through Re-annotation
Sugiura
Omni-JDocVQA: A Japanese Benchmark for Visual Document Understanding across Diverse Document Types
Kajikawa
Verification of Japanese Pre-training for LayoutLMv3
Yanagisawa
ABMamba: Multimodal Large Language Model with Aligned Hierarchical Bidirectional Scan for Efficient Video Captioning
Yashima
JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding
Maeda
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Nakamura
Fact-Checking of LLM-Generated Texts
Kiyomaru, Masano
Scaling Data-Constrained Language Models with Synthetic Data
Kiyomaru
Improving the Accuracy of Sensitive Personal Information Detection in Large-Scale Corpora
Minamoto
Demystifying Mixed Outcomes of Self-Training: Pre-training Analyses on Non-Toy LLMs
Nakamura
Enhancing Multi-turn Safety in Japanese and English LLMs using GRPO
Sata
Comparing Human and Automated Red Teaming for Multi-Turn Conversational Safety Evaluation
Semitsu
Tracing Multilingual Knowledge Acquisition Dynamics in Domain Adaptation: A Case Study of Biomedical Adaptation
Zhao
Effects of Dialogue Corpora Properties on Fine-Tuning a Moshi-Based Spoken Dialogue Model
Abe
Construction of a Large-Scale Audio Acoustic Dataset Using Common Crawl
Asai
LLM-jp Status Report
Kurohashi
Release of Japanese-Specialized Diffusion Language Model ‘ELYZA-LLM-Diffusion’
Trisitichoke Tasavat/ELYZA
Building Japanese-English Reasoning Large Language Models with Swallow
Mizuki/Institute of Science Tokyo
Evaluation and Tuning WG
Miyao
Corpus Construction WG
Kawahara
Safety WG
Suzuki
Safety WG
Sekine
Model Building WG
Suzuki
Academic Domain WG
Aizawa
Dialogue WG
Higashinaka
Real-world Interaction WG
Kurita
Multi-modal WG
Okazaki
Model Building WG
Suzuki
Multi-modal WG
Okazaki
Safety WG
Sekine
Corpus Construction WG
Kawahara
Real-world Interaction WG
Ogata
Evaluation and Tuning WG
Miyao
LLM-jp Status Report
Kurohashi
Harnessing AI Agents for Real-World Applications
Kwasi Ankomah/SambaNova
The Role of LLM in Juris-informatics and a Survey of Legal Control over LLM
Ken Sato
Mid-training for LLM
Kodama
Dialogue WG
Higashinaka
Real-world Interaction WG
Ogata
Safety WG
Sekine
Corpus Construction WG
Kawahara
Multi-modal WG
Okazaki
LLM-jp Status Report
Kurohashi
Evaluating the Faithfulness and Readability of RL-Tuned LLMs
Chaoran Liu
The First Workshop on Fine-Tuning and Evaluating LLMs: A Report
Namgi Han/UT, Katsumata/Retrieva, Miyao, Kiyomaru
SoftMatcha: A Soft and Fast Pattern Matcher for Billion-Scale Corpus Searches
Deguchi/NTT
Model Building WG
Suzuki
Evaluation and Tuning WG
Miyao
Dialogue WG
Higashinaka
Multi-modal WG
Okazaki
Evaluation and Tuning WG
Miyao
Corpus Construction WG
Kawahara
Safety WG
Sekine
Safety WG
Sekine
Safety WG
Sekine
Model Building WG
Suzuki
Real-world Interaction WG
Ogata
Dialogue WG
Higashinaka
LLM-jp Status Report
Kurohashi
Crawling the Japanese Web
Tamura/ROIS
Stockmark-2-VL-100B: A Japanese-Specialized Visual Language Model with Chain-of-Thought Reasoning for Document Reading Comprehension
Shi Chen/Stockmark
The Insight We Gained from the Development of Llama 3.1 Future Code Ja
Ryo Fujii/Future
LLM-jp Status Report
Kurohashi
Research and Development of an Open Japanese Medical LLM
Kobayashi
Japanese LLM-as-a-Judge Evaluation Tool ‘llm-jp-judge’ Update and Analysis
Nakayama
Corpus Construction WG
Kawahara
Real-world Interaction WG
Ogata
Safety WG
Sekine
Model Building WG
Suzuki
Multi-modal WG
Okazaki
Evaluation and Tuning WG
Miyao
Dialogue WG
Higashinaka
LLM-jp Status Report
Kurohashi
Historical Texts and LLMs: Corpus Construction and Utilization
Kitamoto/NII
JPharmatron & JPharmaBench: A Japanese Language Model and Evaluation Benchmarks for Pharmaceutical NLP
Ono/UT EQUES
Evaluation and Tuning WG
Miyao
Real-world Interaction WG
Ogata
Dialogue WG
Higashinaka
Corpus Construction WG
Kawahara
Safety WG
Sekine
Model Building WG
Suzuki
Academic Domain WG
Aizawa
Multi-modal WG
Okazaki
LLM-jp Status Report
Kurohashi
NTT’s Large Language Model: tsuzumi 2
Nishida/NTT
Papers Accepted at EMNLP2025
Inaba
Papers Accepted at EMNLP2025
Nishida
Papers Accepted at EMNLP2025
Harada
Papers Accepted at EMNLP2025
Furuhashi
Academic Domain WG
Aizawa
Corpus Construction WG
Kawahara