Gemini 1.5 Pro Now Listens to Audio and Is Available to All

Apr. 10, 2024



I tried accessing the Gemini 1.5 Pro model from a new Google account, and the model was readily available without any wait. All of this is available for free.

Keep in mind that Gemini 1.5 Pro is a mid-tier model built on the MoE (Mixture of Experts) architecture, yet it easily beats the largest Gemini 1.0 Ultra model. In our comparison with the GPT-4 model, Gemini 1.5 Pro showed remarkable capabilities in several tests. When Gemini 1.5 Pro debuts on the Gemini portal, expect it to perform better than GPT-4 and Claude 3’s Opus model.

Gemini 1.5 Pro could already process videos and images, and now audio files are supported too, which makes it a powerful multimodal model with a context length of 1 million tokens. We tested the audio processing capability of the Gemini 1.5 Pro model. Here is how it went.

How to Process Audio Files on Gemini 1.5 Pro

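If you would rather script the upload than click through the web interface, the same upload-and-prompt flow can be roughly sketched in Python. This is only a minimal sketch under several assumptions that are not from our testing: it assumes the `google-generativeai` package is installed, that an API key is stored in a `GOOGLE_API_KEY` environment variable, that the model is exposed under the name `gemini-1.5-pro-latest`, and that the extension-to-MIME table below matches the audio formats Google currently accepts. The helper names `audio_mime_type` and `summarize_audio` are made up for illustration.

```python
import os

# Assumed mapping of audio extensions to MIME types; check Google's
# documentation for the formats that are actually accepted.
AUDIO_MIME_BY_EXT = {
    ".wav": "audio/wav",
    ".mp3": "audio/mp3",
    ".aiff": "audio/aiff",
    ".aac": "audio/aac",
    ".ogg": "audio/ogg",
    ".flac": "audio/flac",
}


def audio_mime_type(path: str) -> str:
    """Return the MIME type for an audio file, based only on its extension."""
    ext = os.path.splitext(path)[1].lower()
    if ext not in AUDIO_MIME_BY_EXT:
        raise ValueError(f"Unsupported audio extension: {ext or '(none)'}")
    return AUDIO_MIME_BY_EXT[ext]


def summarize_audio(
    path: str,
    prompt: str = "Summarize this audio clip.",
    model_name: str = "gemini-1.5-pro-latest",  # assumed model name
) -> str:
    """Upload an audio file and ask the model about it (hypothetical helper)."""
    # Validate the file type locally before touching the network.
    mime_type = audio_mime_type(path)

    # Imported lazily so the helper above works without the package installed.
    import google.generativeai as genai

    genai.configure(api_key=os.environ["GOOGLE_API_KEY"])

    # Upload the file first, then pass the returned handle next to the prompt.
    audio_file = genai.upload_file(path=path, mime_type=mime_type)
    model = genai.GenerativeModel(model_name)
    response = model.generate_content([prompt, audio_file])
    return response.text
```

With a key set, calling `summarize_audio("interview.mp3")` (a hypothetical file name) should return the model's text reply. The extension check runs first so that an unsupported file fails before anything is uploaded.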

So this is how you can upload and process audio files on Gemini 1.5 Pro. It’s really a powerful model from the Google DeepMind team and I am excited that it’s now available to the public at large without any cost. Go ahead and try it and let us know your thoughts in the comment section below.

Passionate about Windows, ChromeOS, Android, security and privacy issues, with a penchant for solving everyday computing problems.