Large-Language-Models

Advanced RAG — Hypothetical Question Embedding

📅 July 6, 2025 — by Guillaume Laforge

In the first article of this Advanced RAG series, I talked about an approach I called sentence window retrieval, where we calculate vector embeddings per sentence, but the chunk of text returned (and added in the context of the LLM) actually contains also surrounding sentences to add more context to that embedded sentence. This tends to give a better vector similarity than the whole surrounding context. It is one of the techniques I’m covering in my talk on advanced RAG techniques.

Expanding ADK AI agent capabilities with tools

📅 June 15, 2025 — by Guillaume Laforge

java agent-development-kit ai-agents model-context-protocol large-language-models

In a nutshell, the AI agent equation is the following:

AI Agent = LLM + Memory + Planning + Tool Use

AI agents are nothing without tools! And they are actually more than mere Large Language Model calls. They require some memory management to handle the context of the interactions (short term, long term, or contextual information like in the Retrieval Augmented Generation approach. Planning is important (with variations around the Chain-of-Thought prompting approach, and LLM with reasoning or thinking capabilities) for an agent to realize its tasks.

Expanding ADK Java LLM coverage with LangChain4j

📅 June 5, 2025 — by Guillaume Laforge

java agent-development-kit large-language-models ai-agents langchain4j

Recently on these pages, I’ve covered ADK (Agent Development Kit) for Java, launched at Google I/O 2025. I showed how to get started writing your first Java agent, and I shared a Github template that you can use to kick start your development.

But you also know that I’m a big fan of, and a contributor to the LangChain4j project, where I’ve worked on the Gemini support, embedding models, GCS document loaders, Imagen generation, etc.

An ADK Java GitHub template for your first Java AI agent

📅 May 27, 2025 — by Guillaume Laforge

java agent-development-kit large-language-models ai-agents

With the unveiling of the Java version of Agent Development Kit (ADK) which lets you build AI agents in Java, I recently covered how to get started developing your first agent.

The installation and quickstart documentation also helps for the first steps, but I realized that it would be handy to provide a template project, to further accelarate your time-to-first-conversation with your Java agents! This led me to play with GitHub’s template project feature, which allows you to create a copy of the template project on your own account or organization. It comes with a ready-made project structure, a configured pom.xml file, and a first Java agent you can customize at will, and run from both the command-line or the ADK Dev UI.

Things you never dared to ask about LLMs — Take 2

📅 May 26, 2025 — by Guillaume Laforge

generative-ai large-language-models

Recently, I had the chance to deliver this talk on the mysteries of LLMs, at Devoxx France, with my good friend Didier Girard, It was fun to uncover the oddities of LLMs, and better understand where they thrive or fail, and why.

In this post, I’d like to share an update of the presentation deck, with a few additional slides here and there, to cover for example

the difficulty of LLMs to work with acronyms, scientific molecule names, plant names, special uncommon vocabulary, which require more tokens and weakens attention,
the difference between deterministic and probabilistic problems, and why predictive models are still important,
some limits of LLMs with regards to understanding dates, data ownership, or the fact they can’t easily forget what they learned.

This was fun delivering the talk with Didier, as a friendly dialogue makes things more entertaining! We were lucky that this talk was recorded (however, in French 🇫🇷) and you can watch the video below:

Beyond the chatbot or AI sparkle: a seamless AI integration

📅 May 23, 2025 — by Guillaume Laforge

generative-ai machine-learning large-language-models

When I talk about Generative AI, whether it’s with developers at conferences or with customers, I often find myself saying the same thing: chatbots are just one way to use Large Language Models (LLMs).

Unfortunately, I see many articles or presentations that just focus on demonstrating LLMs at work within the context of chatbots. I feel guilty of showing the traditional chat interfaces too. But there’s so much more to it!

Vibe coding an MCP server with Micronaut, LangChain4j, and Gemini

📅 May 2, 2025 — by Guillaume Laforge

java micronaut langchain4j large-language-models model-context-protocol

Unlike Quarkus and Spring Boot, Micronaut doesn’t (yet?) provide a module to facilitate the implementation of MCP servers (Model Context Protocol). But being my favorite framework, I decided to see what it takes to build a quick implementation, by vibe coding it, with the help of Gemini!

In a recent article, I explored how to use the MCP reference implementation for Java to implement an MCP server, served as a servlet via Jetty, and to call that server from LangChain4j’s great MCP support. One approach with Micronaut may have been to somehow integrate the servlet I had built via Micronaut’s servlet support, but that didn’t really feel like a genuine and native way to implement a server, so I decided to do it from scratch.

LLMs.txt to help LLMs grok your content

📅 March 3, 2025 — by Guillaume Laforge

large-language-models generative-ai

Since I started my career, I’ve been sharing what I’ve learned along the way in this blog. It makes me happy when developers find solutions to their problems, or discover new things, thanks to articles I’ve written here. So it’s important for me that readers are able to find those posts. Of course, my blog is indexed by search engines, and people usually find about it from Google or other engines, or they discover it via the links I share on social media. But with LLM powered tools (like Gemini, ChatGPT, Claude, etc.) you can make your content more easily grokkable by such tools.

Advanced RAG — Sentence Window Retrieval

📅 February 25, 2025 — by Guillaume Laforge

generative-ai large-language-models machine-learning langchain4j java retrieval-augmented-generation

Retrieval Augmented Generation (RAG) is a great way to expand the knowledge of Large Language Models to let them know about your own data and documents. With RAG, LLMs can ground their answers on the information your provide, which reduces the chances of hallucinations.

Implementing RAG is fairly trivial with a framework like LangChain4j. However, the results may not be on-par with your quality expectations. Often, you’ll need to further tweak different aspects of the RAG pipeline, like the document preparation phase (in particular docs chunking), or the retrieval phase to find the best information in your vector database.

The power of large context windows for your documentation efforts

📅 February 15, 2025 — by Guillaume Laforge

generative-ai large-language-models machine-learning langchain4j

My colleague Jaana Dogan was pointing at the Anthropic’s MCP (Model Context Protocol) documentation pages which were describing how to build MCP servers and clients. The interesting twist was about preparing the documentation in order to have Claude assist you in building those MCP servers & clients, rather than clearly documenting how to do so.

MCP tutorials are great. There are no tutorials really.

"Copy these resources to Claude, and start asking some questions like..." pic.twitter.com/GG50DMWNLW
Read more...

1 of 4 >> >|