🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library
- Updated Oct 29, 2025
- Python
🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library
FastCDC implementation in Python https://pypi.org/project/fastcdc/
SmartChunk is a lightweight, structure-aware semantic chunking toolkit designed to supercharge RAG (Retrieval-Augmented Generation) and LLM pipelines. Unlike naive splitters that break text arbitrarily, SmartChunk respects document structure (headings, lists, tables, code blocks) and semantic flow, ensuring cleaner, more coherent chunks.
Go implementation of the AE (Asymmetric Extremum) chunking algorithm.
Android Resumable Uploads SDK from Fastpix
A Node.js chunking system
Explore and benchmark the world of data chunking algorithms in 'ChunkingChampions' - a competitive arena to determine the most efficient and effective chunking strategies for varied data sizes.
[2024-2] "Clerker", a meeting support platform service built with the Mermaid model
Implementation of an interactive chatbot for summarizing legal and policy documents. Includes data preprocessing (cleaning, tokenization, chunking), extractive summarization baselines, and fine-tuned abstractive models (PEGASUS and LED). Integrates a retrieval layer for document relevance and uses ROUGE, BLEU, and cosine similarity for evaluation.
A smol Go package for splitting text into chunks while preserving semantic meaning.
MS PowerPoint extension created using the Yeoman generator and React
A simple utility that splits a given array into chunks of a specified size, with an option to reverse the array
🧩 Enhance RAG processes with SmartChunk, a Python package that creates quality text chunks while preserving structure and meaning for better retrieval.
RAG, AI, ML, Chunking, Groq, LangChain, ChromaDB, Embeddings, Retrieval-Augmented Generation