Gemini

Streaming Gemini 3.1's expressive new TTS model in Java

📅 April 16, 2026 — by Guillaume Laforge

Google just released Gemini 3.1 Flash Text-to-Speech (TTS), a new expressive TTS model that you can steer with audio tags and scene descriptions.

I wanted to see how it worked with the Gemini Interactions SDK for Java.

Expressive control

The model sounds natural out of the box, but the real benefit is the control you have over expressiveness. By defining “Audio Profiles”, “Scene Details”, and “Director’s Notes” in your prompt, you can control the character’s pacing, tone, and environment.

A Simple Coding Agent in a Loop with LangChain4j, Jbang, and Gemini

📅 April 11, 2026 — by Guillaume Laforge

java ai-agents generative-ai langchain4j gemini gemini-cli

A few days ago, Max Rydahl Andersen published a fascinating article about nanocode: a minimalist Claude Code alternative implemented in just 260 lines of Java (inspired from a 250-line Python equivalent). It was a masterclass in “leanness,” using raw HTTP calls and Jackson JSON parsing, an OpenRouter or Anthropic LLM endpoint, to create an autonomous coding loop.

I loved the concept, but I had a very practical motivation to take it in a different direction: I don’t have a Claude subscription. 😃

Creating a Wikipedia MCP Server in Java in a Few Prompts with Skills

📅 April 2, 2026 — by Guillaume Laforge

java jbang model-context-protocol ai-agents langchain4j gemini-cli generative-ai gemini

Since I started using Model Context Protocol (MCP) to equip my AI agents with useful tools, I’ve been looking for ways to quickly build and iterate on local servers. A few weeks ago, I shared how to easily build a local MCP server in Java with a custom skill in Gemini CLI. Today, I wanted to put that skill to the test by creating a Wikipedia MCP server.

What’s impressive is that I didn’t even have to leave my terminal or read documentation. The entire process was a conversation with Gemini CLI, leveraging its ability to search the web, find libraries, and even check migration guides!

Decoded: How Google AI Studio Securely Proxies Gemini API Requests

📅 February 9, 2026 — by Guillaume Laforge

gemini google-ai-studio architecture

If you’ve recently vibe-coded and exported a Gemini-powered app from Google AI Studio to host it online on Google Cloud Run, you might have noticed a server/ directory containing a Node.js application. This isn’t just a simple file server; it’s a clever “transparent proxy” designed to solve a classic problem in frontend AI development:

How do I use my API key without leaking it to the browser?

In this post (although vibe-coding is supposed to be all about not looking at the code at all) we’ll dissect exactly how this architecture works, why it’s safer than a client-side key, and where its security limits lie.

Latest Gemini and Nano Banana Enhancements in LangChain4j

📅 February 6, 2026 — by Guillaume Laforge

gemini langchain4j nano-banana generative-ai large-language-models

A few days ago, LangChain4j 1.11.0 was released, and with this version, a few notable enhancements to the support of the Gemini model family have landed. Let’s dive in!

New Image Generation Models (Gemini 2.5 & 3.0 Preview, aka 🍌 Nano Banana)

Note

Before showing some snippets of code, let me give you the link to the full documentation on the new image model: docs.langchain4j.dev/integrations/image-models/gemini

Researching Topics in the Age of AI — Rock-Solid Webhooks Case Study

📅 February 4, 2026 — by Guillaume Laforge

generative-ai large-language-models gemini

Back in 2019, I spent significant time researching Webhooks. In particular, I was interested in best practices, pitfalls, design patterns, and approaches for implementing Webhooks in a reliable, resilient, and effective way.

Everything is distilled in that article: Implementing Webhooks, not as trivial as it may seem

It likely took me a full week to dive deep into this subject, finding sources and experimenting with design patterns myself. But nowadays, AI makes it easier to dive deeper into topics, explore unfamiliar aspects, and share findings with your team.

How to Integrate Gemini CLI with Intellij Idea Using ACP

📅 February 1, 2026 — by Guillaume Laforge

gemini gemini-cli intellij-idea agent-client-protocol

The Agent Client Protocol (ACP) allows you to connect external AI agents directly into IDEs and text editors that support that protocol (like JetBrains’ IntelliJ IDEA, PyCharm, or WebStorm, as well as Zed). This means you can bring the power of the Gemini CLI directly into your editor, allowing it to interact with your code, run terminal commands, and use Model Context Protocol (MCP) servers right from the AI Assistant chat window.

Building a Research Assistant with the Interactions API in Java

📅 January 3, 2026 — by Guillaume Laforge

java generative-ai large-language-models gemini gemini-interactions-api

First of all, dear readers, let me wish you a happy new year! This is my first post on this blog for 2026. I’m looking forward to continuing sharing interesting content with you.

During my holiday break, I wanted to put my recent Java implementation of the Gemini Interactions API to the test. I implemented and released it with the help of Antigravity. My colleague Shubham Saboo and Gargi Gupta wrote a tutorial on how to build an AI research agent with Google Interactions API & Gemini 3. I thought this was a great opportunity to replicate this example in Java using my Interactions API Java SDK.

Implementing the Interactions API with Antigravity

📅 December 15, 2025 — by Guillaume Laforge

ai-agents generative-ai large-language-models java gemini gemini-interactions-api

Google and DeepMind have announced the Interactions API, a new way to interact with Gemini models and agents.

Here are some useful links to learn more about this new API:

An announcement is available on Google’s Keywords blog:
Interactions API: A unified foundation for models and agents
A more detailed article is available on Google’s developers blog:
Building agents with the ADK and the new Interactions API
The newly released Gemini Deep Research agent is now available via the Interactions API as well:
Build with Gemini Deep Research
The official documentation of the Interactions API.

About the Interactions API

The Rationale and Motivation

The Interactions API was introduced to address a shift in AI development, moving from simple, stateless text generation to more complex, multi-turn agentic workflows. It serves as a dedicated interface for systems that require memory, reasoning, and tool use. It provides a unified interface for both simple LLM calls and more complex agent calls.

Gemini Is Cooking Bananas Under Antigravity

📅 November 21, 2025 — by Guillaume Laforge

generative-ai large-language-models ai-agents gemini nano-banana antigravity

What a wild title, isn’t it? It’s a catchy one, not generated by AI, to illustrate this crazy week of announcements by Google. Of course, there are big highlights like Gemini 3 Pro, Antigravity, or Nano Banana Pro, but not only, and this is the purpose of the article to share with you everything, including links to all the interesting materials about those news.

Gemini 3 Pro

The community was eagerly anticipating the release of Gemini 3. Gemini 3 Pro is a state-of-the-art model, with excellent multimodal capabilities, advanced reasoning, excellent at coding, and other agentic activities.

1 of 3 >> >|