vision · Microsoft
Phi-4 Multimodal
Microsoft's compact multimodal model handling text, image and audio understanding.
microsoft/phi-4-multimodal-instruct
Live playground
Phi-4 Multimodal in-browser demo
Test Phi-4 Multimodal live
Send a message and watch the model respond. The chat is multi-turn, so the model keeps earlier messages in the conversation as context.