microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 313k • 1.58k
Generate captions and chat responses from your images
Generate optimized prompts for Stable Diffusion
Set up and customize Stable Diffusion WebUI