I am Zahiruddin Tavargere (Zahere). A social-learner, here to learn, share and grow with the tech community.

Mastering Chunking for RAG: Semantic vs Recursive vs Fixed Size

Advanced RAG Series: Part 2

PublishedSeptember 16, 2024

Mastering Chunking for RAG: Semantic vs Recursive vs Fixed Size

I am a Journalist-turned-Software Engineer. I love coding and the associated grind of learning every day. A firm believer in social learning, I owe my dev career to all the tech content creators I have learned from. This is my contribution back to the community.

Note: The read-time of this article was going beyond 4 minutes, so I am sharing the video instead.

This is part of the Advanced RAG Series: Part 1

When working with Retrieval Augmented Generation (RAG) models, selecting the right chunking method can make a huge difference in performance.

In my latest YouTube video, I dive deep into the three main chunking approaches—Semantic, Recursive, and Fixed Size—and evaluate their performance based on four critical metrics: context precision, faithfulness, answer relevancy, and context recall.

The chunking method you choose can impact how accurate and relevant the AI-generated answers are. So, which method strikes the perfect balance between retaining enough context and providing highly relevant, faithful responses?

In the video, I break down:

How Semantic Chunking performed in capturing context but struggled with relevancy.
Why Recursive Chunking emerged as a strong contender with high accuracy and relevancy.
The surprising strengths of Fixed Size Chunking, especially in context retention.

If you're interested in fine-tuning your RAG models or curious about which chunking method works best, this video is packed with insights that will help you make the right choice. Check out the full breakdown in the embedded video below!

Watch the full analysis and find out which chunking method is best for your use case:

https://www.youtube.com/watch?v=jEzh4IuTWtc

#rag #generative-ai #advanced-rag #ai

Comments

Join the discussion

No comments yet. Be the first to comment.

More from this blog

📡 FastAPI MCP SSE Server with JWT Auth & Custom Client

📖 Introduction In modern AI applications, communication between clients and tools isn’t always as simple as calling an API. The Model Context Protocol (MCP) provides a standardized way for clients to exchange information, invoke tools, and maintain ...

May 18, 20255 min read

📡 FastAPI MCP SSE Server with JWT Auth & Custom Client

Build an MCP Client and Server from Scratch Using Python

If you’re curious about how to build an intelligent agent using Model Context Protocol (MCP), you’re in the right place. In this post, I’ll walk you through how to: Create an MCP Server using FastMCP Expose a tool that calculates BMI Build a Clien...

Apr 7, 20255 min read

Build an MCP Client and Server from Scratch Using Python

My Favorite OpenAI Agents SDK Feature (And The Most Understated!)

In our previous tutorial, we built a restaurant customer support chatbot using OpenAI's Agents SDK. In this follow-up, we’ll explore guardrails—a critical feature that enhances AI chatbot safety and reliability. What Are Guardrails in AI Agents? Guar...

Mar 24, 20253 min read

My Favorite OpenAI Agents SDK Feature (And The Most Understated!)

How Uber Saved 140,000 Hours Monthly Using Generative AI Agents

Video https://www.youtube.com/watch?v=UPBMkFSJdBI The Problem at Hand Uber's data platform processes approximately 1.2 million interactive queries monthly, with 36% of these coming from the operations organization. This group—comprising engineers...

Jan 14, 20253 min read

How Uber Saved 140,000 Hours Monthly Using Generative AI Agents

A Deep Dive into Google's "Agents" White Paper: Hype or Revolution?

Video https://www.youtube.com/watch?v=FgRGwnpd2HY Google's recent white paper on "Agents" has created quite a buzz. The paper explores the concept of AI agents and delves into their architecture and potential. Let's break down what this white paper...

Jan 10, 20254 min read

A Deep Dive into Google's "Agents" White Paper: Hype or Revolution?

I am Zahiruddin Tavargere (Zahere). A social-learner, here to learn, share and grow with the tech community.

74 posts

I am Zahiruddin Tavargere (Zahere). A firm believer in social learning, I owe my dev career to all the tech content creators I have learned from - this is my contribution back to the community.

Command Palette

Comments

More from this blog