Setting up Google AI Studio with Gemini 2.0 for screen interaction | Alpha | PandaiTech

Setting up Google AI Studio with Gemini 2.0 for screen interaction

A guide on configuring Google AI Studio using the Gemini 2.0 Flash Experimental model, enabling the AI to see your computer screen and chat in real-time to assist with your tasks.

Learning Timeline
Key Insights

Advantages of Gemini 2.0 Flash Experimental

The 2.0 Flash model is significantly faster and smarter at understanding real-time visual screen context compared to previous versions.

Multimodal Interaction Tips

You aren't limited to just voice; you can combine visual input (screen), voice (microphone), and text (typing prompts) simultaneously for more accurate results.

Free Usage Status

Currently, access to Gemini 2.0 Flash Experimental within Google AI Studio is free, making it a fantastic alternative to other paid chatbots.
Prompts

Real-Time Visual Assistance

Target: Gemini 2.0 Flash Experimental
Look at my screen and give me tips on how to improve my workflow in this video editing software.

Setting Up Email Filters

Target: Gemini 2.0 Flash Experimental
I'm looking at my Gmail right now. Can you guide me on how to set up filters to manage these spam emails effectively?
Step by Step

How to Set Up Gemini 2.0 Flash Real-Time Screen Interaction

  1. Visit the Google AI Studio website and Sign in using your Google account.
  2. Look at the sidebar on the left side of the screen and click on the 'Stream real-time' menu.
  3. Click the model selection dropdown and make sure you select 'Gemini 2.0 Flash Experimental'.
  4. Find the 'Output format' section on the right and change the selection from 'Text' to 'Audio' to enable the AI to speak.
  5. Choose your preferred Voice from the dropdown menu (e.g., 'Puck').
  6. Click the microphone icon at the bottom of the screen to activate voice input (you can 'Mute' it if you prefer to use text first).
  7. Click the screen-sharing icon next to the microphone icon.
  8. Choose whether you want to 'Share your screen', use 'FaceTime camera', or both.
  9. Click the 'Share' button to start a real-time interaction session where the AI can see your screen activity.
  10. Start a verbal conversation or type your prompt in the 'Type something' box to get visual assistance.

More from Boost Productivity & Research with AI

View All