During Google I/O this Tuesday, the company revealed Gemini 1.5 Flash, the latest member of the Gemini model series.
Google is leveraging its annual developer conference to showcase what the company dubs as its lightest and most efficient artificial intelligence models.
During Google I/O on Tuesday, the company introduced Gemini 1.5 Flash, the latest addition to the Gemini model series.
“We heard from developers that they wanted something faster and even more cost-effective,” said Demis Hassabis, CEO of Google DeepMind, in a press briefing.
The unveiling coincides with tech companies increasingly centering their product development and rollouts around generative AI, which holds particular significance for Google. These new tools offer consumers more advanced and creative ways to access online information compared to traditional web search.
OpenAI also launched a new AI model and desktop version of ChatGPT on Monday, along with a revamped user interface. The new model, GPT-4o, boasts twice the speed of GPT-4 Turbo and is half the cost, according to the company.
Additionally, Google announced an enhanced Gemini 1.5 Pro model, which can decipher multiple large documents—totaling 1,500 pages—or summarize 100 emails, according to a vice president working on Gemini.
Gemini 1.5 Pro will soon have the capacity to handle an hour of video content or codebases with over 30,000 lines, said Hsiao.
“You can quickly get answers and insights about dense documents, like figuring out the details of the pet policy in your rental agreement or comparing key arguments of multiple long research papers,” Hsiao explained.
OpenAI’s latest upgrade, unveiled this week, brings improved quality and speed of ChatGPT for 50 different languages. It will also be accessible via OpenAI’s application programming interface (API), allowing developers to start building applications using the new model immediately, executives said.
With support for 35 languages, Google states that Gemini 1.5 Pro features a 2 million token window, which measures context and indicates the model’s information processing capability at once. The new model boasts enhanced local reasoning, planning, and image understanding, according to company executives.
“It offers the longest context window of any foundational model yet,” noted Alphabet CEO Sundar Pichai in the press briefing. During the event, he provided an example of a parent asking Gemini to summarize all recent emails from their child’s school.
Gemini 1.5 Pro will initially undergo testing in Workspace Labs. Meanwhile, Gemini 1.5 Flash will be available for testing and in Vertex AI, Google’s machine learning platform that enables developers to train and deploy AI applications.