Beyond LLMs – Using Embedding Models for Input Guarding, Semantic Routing, and Tool Decisions

Retrieval Augmented Generation (RAG) leverages retrievers like vector databases to fetch relevant data for answering queries. In advanced RAG setups involving multiple data sources, selecting the best retriever is critical. Traditionally, in LangChain this is handled by a MultiRoute Chain, where a Large Language Model (LLM) dynamically chooses the optimal data source based on semantic fit. However, this approach can be slow, costly, and unpredictable.
Enter the Open Source library Semantic Router—a faster, cheaper, and deterministic alternative that uses an embedding model for retriever selection without compromising quality.
In this talk, I’ll showcase the Semantic Router’s broader capabilities, including input guarding for AI applications and efficient tool selection for function calling.
Through live coding, we’ll first build a traditional MultiRoute Chain and then optimize it with Semantic Router, illustrating how this transformation can dramatically improve efficiency in RAG workflows.

Marco Frodl

Principal Consultant for Generative AI at Thinktecture with a focus on effective integration of AI into business applications and processes.

Event

MLCon Munich 2025

23.06.25

- 26.06.25

@ Munich

(DE)

Links & additional Content

Slidedeck

Notebook Semantic Router Part 1

Notebook Semantic Router Part 2

Notebook Semantic Router Part 3

More articles about AI, Embeddings, Generative AI, LangChain, LLM

Slidedeck

More articles about AI, Embeddings, Generative AI, LangChain, LLM

Improved RAG: More effective Semantic Search with content transformations

One of the more pragmatic ways to get going on the current AI hype, and to get some value out of it, is by leveraging semantic search. This is, in itself, a relatively simple concept: You have a bunch of documents and want to find the correct one based on a given query. The semantic part now allows you to find the correct document based on the meaning of its contents, in contrast to simply finding words or parts of words in it like we usually do with lexical search. In our last projects, we gathered some experience with search bots, and with this article, I'd love to share our insights with you.

read article >

17.05.2024

| Sebastian Gingter

Integrating AI Power into Your .NET Applications with the Semantic Kernel Toolkit – an Early View

With the rise of powerful AI models and services, questions come up on how to integrate those into our applications and make reasonable use of them. While other languages like Python already have popular and feature-rich libraries like LangChain, we are missing these in .NET and C#. But there is a new kid on the block that might change this situation. Welcome Semantic Kernel by Microsoft!

read article >

03.05.2023

| Thinktecture Team

Beyond LLMs – Using Embedding Models for Input Guarding, Semantic Routing, and Tool Decisions

Marco Frodl

Event

Links & additional Content

Slidedeck

More articles about AI, Embeddings, Generative AI, LangChain, LLM

Improved RAG: More effective Semantic Search with content transformations

Integrating AI Power into Your .NET Applications with the Semantic Kernel Toolkit – an Early View

Our webinars

Our articles

More about us