On January 26, Baichuan Intelligence announced the official launch of Baichuan-Omni-1.5, an open-source omni-modal model. The model supports omni-modal understanding of text, images, audio, and video, and can generate both text and audio. According to the company, Baichuan-Omni-1.5 outperforms GPT-4o mini in vision, speech, and multimodal streaming processing.
Baichuan Intelligence launches open-source omni-modal model Baichuan-Omni-1.5, claiming many capabilities surpass GPT-4o mini
2025-01-26 15:09:23