Unlocking bonus worlds with Gemini for the Google I/O puzzle
The Google I/O 2025 puzzle used the Gemini API to generate dynamic riddles for bonus worlds, enhancing player engagement and scalability. Here's what our developers learned on using the Gemini API effectively, including creativity, design, and implemen...
Experiment with Gemini 2.0 Flash native image generation
The experimental native image generation feature of Gemini 2.0 Flash – allowing for the combination of text and images, conversational image editing, and leveraging real-world knowledge for contextual visuals – is now available for developers to te...
Safer and Multimodal: Responsible AI with Gemma
ShieldGemma 2, built on Gemma 3, is a 4 billion parameter model that can be used as an input filter for vision language models or an output filter for image generation systems, and is designed to respond to a wide range of diverse and nuanced imagery.
Introducing Gemma 3: The Developer Guide
Gemma 3 is a new, advanced version of the Gemma open-model family featuring multimodality, longer context windows, and improved language capabilities, with various sizes and deployment options for developers to experiment.
Gemma 3 on mobile and web with Google AI Edge
Gemma 3 1B, a new small language model for mobile and web applications via Google AI Edge, is now available, with increased efficiency, improved performance, and offline availability.
State-of-the-art text embedding via the Gemini API
A new experimental Gemini Embedding text model, now available in the Gemini API, achieves top rankings on the Massive Text Embedding Benchmark (MTEB) leaderboard and offers expanded language support and high-dimensional embeddings.
Gemini 2.0 Deep Dive: Code Execution
This blog post introduces Gemini's code execution feature, which allows the AI model to generate and run Python code for tasks like solving equations, data analysis, and creating visualizations.
CalCam: Transforming Food Tracking with the Gemini API
CalCam, a calorie-tracking app, uses the Gemini API to analyze meal photos, providing users with fast and accurate nutritional information. Polyverse, CalCam's creator, highlights Gemini API's speed, accuracy, and structured JSON output are crucial for...
Start building with Gemini 2.0 Flash and Flash-Lite
Gemini 2.0 Flash-Lite is now generally available in the Gemini API for production use in Google AI Studio and for enterprise customers on Vertex AI. 2.0 Flash-Lite offers improved performance over 1.5 Flash across reasoning, multimodal, math and factua...
Build Scalable AI Agents: Langbase and the Gemini API
Langbase empowers developers to build and deploy powerful, scalable AI agents by leveraging the Google Gemini API, particularly Gemini 1.5 Flash, unlocking a new era of intelligent applications and streamlined workflows.
Beyond the Chatbot: Agentic AI with Gemma
A practical guide to constructing a Gemma 2-based Agentic AI system – a type of AI that can make its own decisions and use external tools to achieve goals – that can generate dynamic content for a fictional game world.
Get ready for Google I/O May 20-21
Google I/O returns May 20-21. Watch the livestreams for updates on Android, AI, web, and cloud. Registration is open on the Google I/O website.
Imagen 3 arrives in the Gemini API
Imagen 3 – now available in Google AI Studio and the Gemini API – offers developers state-of-the-art image generation with brighter, better-composed images in diverse styles, and simplified image generation through text prompts.
Gemini 2.0: Flash, Flash-Lite and Pro
The Gemini 2.0 model family is seeing significant updates, including the release of Gemini 2.0 Flash, which is now production-ready and boasts higher rate limits, enhanced performance, and simplified pricing. Developers can also start testing an update...
Unlocking the Potential of Quantum Computing
A free Coursera course on quantum error correction, developed by Google Quantum AI, explains the importance of error correction in quantum computing and provides an overview of quantum errors.