Recently, Google announced a series of new features and products at the Google I/O conference, which are highly anticipated. Although most of them have not been released yet, we can get a glimpse of some ongoing development work. Google plans to release five Gemini products on July 15th and July 18th. Let's dive in and find out!

image.png

According to reverse engineering of the front-end code and related leak information, we can initially understand that the upcoming Gemini products may include: a new version of Imagen3, custom Gemini GPT, personalized response features, scheduled prompts, voice recording, and integration with Google Photos. Additionally, there is information about the development of real-time enhancement features and the Gemini Chrome extension program.

Imagen3

Imagen3 is expected to be opened to alpha testers at AI Labs, and may eventually also be available to Gemini Advanced users. While there is little chance of its release next week, considering the quality of Imagen2 and the lack of image generation capabilities in the EU, this is a highly anticipated version. There are rumors that the first invitations will be sent to members of AI Labs Discord and premium subscribers.

GEMs

The custom GPT for Gemini, called GEM (previously known as "Bot"), has already started development before the I/O announcement. Users will be able to view, edit, and copy GEMs, which can be accessed through the GEMs Manager tab. Given its long development time, GEMs may be an important version, but it may also be delayed.

image.png

Memory/Personalized Response Feature

This feature is displayed as a separate section in the side menu, located behind the Gemini response icon. The tooltip indicates that this button will allow users to schedule prompts. In a dedicated tab, users might see a list of scheduled tasks. This unique feature will allow users to request Gemini to send them daily news every morning, which works well with GEMs.

image.png

Preset Prompts

This feature has been in the code for some time and is expected to function similarly to the memory feature on ChatGPT. Users will have a dedicated section in settings to access the personalized part. However, there may be some adjustments due to the name of this section being "Personalized Response".

image.png

Voice Recording and Google Photos Integration

The attachment options indicate two new additions:

  • Voice Recording: Allows users to record messages and send them as .wav files. Although the voice recording feature seems feasible, it seems to be a long way from release.
  • Google Photos Integration: Appears to be almost complete, allowing users to directly select photos from the Photos app on the web. However, it still cannot solve the problem of uploading multiple images at once.

image.png

Instant Prompt Enhancement Feature

New hidden buttons may serve as prompt enhancement based on their appearance and name.

image.png

Aside from this, more features for Android Gemini were discovered earlier, and Google is also recruiting Beta testers for the iOS version of the Google app, indicating that an iOS update for Gemini may be coming soon. Future updates may also include the option to disable real-time response features.

image.png

Gemini, as Google's latest and most advanced AI model, represents a major leap in AI capabilities, with its functions and application scenarios constantly expanding.

Key Points:

🔍 Google Gemini is set to launch new features, including Imagen3, custom Gemini GPT, and more

🔍 It is expected that Gemini will also launch features such as personalized responses, scheduled prompts, voice recording, and integration with Google Photos

🔍 Google is actively recruiting Beta testers for the iOS version of Gemini, indicating that an iOS update may be released soon