All categories
Top GitHub Category

RAG & Vector

Retrieval, embeddings and vector databases.

100Repos
2.8mStars
Ranked by stars
Showing 48 of 100
01
langgenius/difyTypeScript

Production-ready platform for agentic workflow development.

146.2k
02

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

142.7k
03

The agent engineering platform.

139.9k
04

100+ AI Agent & RAG apps you can actually run — clone, customize, ship.

115.4k
05

Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with Claude Code, OpenClaw, Codex, Gemini, Hermes, Copilot, OpenCode + More

83.8k
06

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

83.4k
07

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

83.3k
08

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

75.9k
09

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

70.8k
10

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

61.9k
11

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

61.0k
12
pathwaycom/llm-appJupyter Notebook

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

59.3k
13

Universal memory layer for AI Agents

59.2k
14

Build AI Agents, Visually

53.9k
15

LlamaIndex is the leading document agent and OCR platform

50.3k
16

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

47.2k
17

AI 低代码平台「低代码 + 零代码」双驱动!低代码可一键生成前后端代码;零代码可 5 分钟搭建系统;AI Skills 一句话画流程、设计表单、生成整套系统。内置 AI聊天、知识库、流程编排、MCP插件等,兼容主流大模型。引领「AI 生成 → 在线配置 → 代码生成 → 手工合并->AI修改」开发模式,消除 Java 项目 80% 的重复工作,提效而不失灵活。

46.8k
18

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

44.9k
19

General-purpose AI designed for knowledge workers — creators, strategists, and operators — and individuals seeking AI systems they can truly control to help them get work done, with full flexibility to extend and deploy anywhere (VPC, on-prem, or cloud).

39.3k
20

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.

39.2k
21

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

38.2k
22

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

36.9k
23

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

35.9k
24

Build resilient agents.

35.5k
25

Vane is an AI-powered answering engine.

35.4k
26

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

35.3k
27

A modular graph-based Retrieval-Augmented Generation (RAG) system

33.9k
28

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

33.3k
29
datawhalechina/happy-llmJupyter Notebook

📚 从零开始构建大模型

31.5k
30

Open Source AI Platform - AI Chat with advanced features that works with every LLM

30.5k
31
simstudioai/simTypeScript

Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.

28.8k
32
labring/FastGPTTypeScript

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

28.6k
33

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.

28.1k
34

Build Real-Time Knowledge Graphs for AI Agents

27.7k
35

Python scraper based on AI

27.4k
36

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolving.

25.9k
37

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

25.8k
38

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

25.6k
39

An open-source RAG-based tool for chatting with your documents.

25.5k
40

DeepTutor: Agent-native Personalized Tutoring. https://deeptutor.info/.

24.9k
41

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

23.7k
42

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

23.5k
43
NirDiamant/GenAI_AgentsJupyter Notebook

50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.

22.8k
44

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.

22.5k
45
AccumulateMore/CVJupyter Notebook

✅(已完结)超级全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】

22.1k
46

🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。

21.4k
47

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

21.0k
48

End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.

20.8k
Other categories