Analyzing video, audio and PDF files with Gemini and LangChain4j
Certain models like Gemini are multimodal. This means that they accept more than just text as input. Some models support text and images, but Gemini goes further and also supports audio, video, and PDF files. So you can mix and match text prompts and different multimedia files or PDF documents.
Until LangChain4j 0.32, the models could only support text and images, but since my PR got merged into the newly released 0.
Read more...