InfoCapability

rinna Japanese Stable Diffusion: Japanese-language image generation from Stable Diffusion

AI Impact Summary

Japanese Stable Diffusion is a language-specific diffusion model fine-tuned from Stable Diffusion to interpret Japanese prompts and generate imagery reflecting Japanese culture. It replaces the base English text encoder with a Japanese-specific one and uses a tokenizer tailored for Japanese to improve token efficiency and fidelity. The model, developed by rinna Co., Ltd., is available on Hugging Face and GitHub and relies on Diffusers and CLIP as the text encoder, trained with approximately 100 million Japanese-captioned images from LAION-5B and a Japanese-cloob-vit-b-16 preprocessing step. This capability broadens AI-generated content to Japanese-speaking users but requires engineering work to integrate the tokenizer/model into existing inference pipelines and to review licensing and data provenance.

Affected Systems

Stable Diffusion

Date: Date not specified
Change type: capability
Severity: info

rinna Japanese Stable Diffusion: Japanese-language image generation from Stable Diffusion

More from Hugging Face

Get alerts for Hugging Face