Talk | 
MLCon Munich 2025
 | 25.06.2025

Beyond LLMs – Using Embedding Models for Input Guarding, Semantic Routing, and Tool Decisions

Retrieval Augmented Generation (RAG) leverages retrievers like vector databases to fetch relevant data for answering queries. In advanced RAG setups involving multiple data sources, selecting the best retriever is critical. Traditionally, in LangChain this is handled by a MultiRoute Chain, where a Large Language Model (LLM) dynamically chooses the optimal data source based on semantic fit. However, this approach can be slow, costly, and unpredictable.
Enter the Open Source library Semantic Router—a faster, cheaper, and deterministic alternative that uses an embedding model for retriever selection without compromising quality.
In this talk, I’ll showcase the Semantic Router’s broader capabilities, including input guarding for AI applications and efficient tool selection for function calling.
Through live coding, we’ll first build a traditional MultiRoute Chain and then optimize it with Semantic Router, illustrating how this transformation can dramatically improve efficiency in RAG workflows.

Principal Consultant for Generative AI at Thinktecture with a focus on effective integration of AI into business applications and processes.

Event

MLCon Munich 2025
23.06.25  
- 26.06.25 
@ Munich
 (DE)
MLCon by devmio

Slidedeck

More articles about AI, Embeddings, Generative AI, LangChain, LLM

AI
sg
One of the more pragmatic ways to get going on the current AI hype, and to get some value out of it, is by leveraging semantic search. This is, in itself, a relatively simple concept: You have a bunch of documents and want to find the correct one based on a given query. The semantic part now allows you to find the correct document based on the meaning of its contents, in contrast to simply finding words or parts of words in it like we usually do with lexical search. In our last projects, we gathered some experience with search bots, and with this article, I'd love to share our insights with you.
17.05.2024
AI
favicon
With the rise of powerful AI models and services, questions come up on how to integrate those into our applications and make reasonable use of them. While other languages like Python already have popular and feature-rich libraries like LangChain, we are missing these in .NET and C#. But there is a new kid on the block that might change this situation. Welcome Semantic Kernel by Microsoft!
03.05.2023

Our webinars

Our articles

More about us