Unum’s Post

UForm is going Generative! The UForm family of tiny multimodal AI models just got broader! In addition to the existing CLIP-like embedding models, we now have a generative model useful for image captioning, visual question answering, and multimodal chats. All of that is #opensource and takes around a billion parameters, small enough to fit even on mobile devices 🎉

Repository: https://lnkd.in/dTrZ5Q2d
Generative model: https://lnkd.in/gZ9y4KEW
Chat model: https://lnkd.in/gpaRVvKm
Discord: https://lnkd.in/gGj-rRGW

Check out the quality of image captions in the comments ⬇️

Ash Vardanian

Founder at Unum | Exascale Search | On 100M+ Devices

7mo

Check out how the caption quality of our model compares with that of the 5x larger InstructBLIP and LLaVA
