Build software better, together

yuniko-software / bge-m3-onnx

ONNX implementation of the BGE-M3 multilingual embedding model and tokenizer with native C#, Java, and Python implementations. Generates all three embedding types: dense, sparse, and ColBERT vectors.

python java machine-learning csharp dotnet tokenizer inference pytorch embedding-models onnx huggingface vector-database bge-m3

Updated May 18, 2026
Jupyter Notebook

labazhou2024 / memexa

Star

Self-hosted Chinese personal memory graph. Six sources, two LLMs, one graph.

cli postgresql self-hosted knowledge-graph chinese-nlp pgvector retrieval-augmented-generation llm-pipeline bge-m3 personal-memory

Updated May 17, 2026
Python

yuniko-software / bge-m3-qdrant-sample

Star

A demonstration of hybrid search with reranking using Qdrant and BGE-M3 model. A showcase of dense and sparse retrieval combined with ColBERT reranking for optimal search results

sparse-vectors semantic-search rag dense-vectors vector-search vector-database colbert hybrid-search qdrant retrieval-augmented-generation bge-m3

Updated Apr 4, 2025
Jupyter Notebook

Peakstone-Labs / sembr

Star

Self-hosted intent radar — Reverse RAG for any input stream.

rss self-hosted embeddings semantic-search news-monitoring fastapi qdrant llm-tools bge-m3 reverse-rag

Updated May 18, 2026
Python

Vensus137 / Coreness-Flow

Star

Event-driven desktop AI agent: YAML scenarios, plugin system with UI contributions, local RAG (BGE-M3 + Qdrant), LLM routing. Electron + React + Python.

electron react desktop-app python yaml automation ai sqlite event-driven plugin-system rag openai-api qdrant llm bge-m3

Updated Feb 28, 2026
Python

Local-first semantic cache for AI agents. A small C daemon + CLI that remembers what your agent learned across sessions. Plugs into Claude Code, Codex, Gemini CLI, and Claude Desktop / ChatGPT via MCP. No LLM calls, no SaaS, no API key.

c cli caching daemon sqlite mcp embeddings knowledge-graph command-line-tool semantic-search ai-agents local-first llama-cpp semantic-cache llm-agents bge-m3 sqlite-vec agent-memory claude-code

Updated May 20, 2026
C

yucl80 / local-openai-api-service

Star

LLM API Server , OpenAI 同时支持 ChatGLM3 ，Llama, Llama-3, Firefunction, Openfunctions ，BAAI/bge-m3 ,bge-large-zh-v1.5

function firefunctions openai-api llamacpp functionary chatglm3 chatglm3-6b llama3 bge-m3

Updated Sep 15, 2024
Python

MauroCE / m3serve

Star

Optimised BAAI/bge-m3 serving with dense + sparse + ColBERT embeddings, async dynamic batching and pipeline GPU inference

inference embeddings text-embedding rag colbert hybrid-search dense-retrieval sparse-retrieval multilingual-embeddings bge-m3 baai-bge-m3

Updated May 4, 2026
Python

rogers0602 / llamaindex_rag

Star

🚀 企业级私有化 RAG 知识库系统 | 💯 纯离线部署、无需联网 | 🔐 支持 LDAP/AD 统一认证与 RBAC 部门级权限隔离 | ⚡️ 基于 LlamaIndex + Ollama + BGE-M3/Rerank | 📝 支持多格式文档解析、上下文对话记忆及精准的 PDF 原文溯源高亮 | 📊 内置数据可视化仪表盘（🚀 Enterprise RAG Knowledge Base. 💯 Fully offline/air-gapped deployment. 🔐 Features LDAP/AD integration, RBAC & Department Isolation. ⚡️ Powered by LlamaIndex, Ollama, and Local R

docker ldap rbac knowledge-base rag fastapi pdf-highlight llamaindex private-gpt ollama offline-llm rerank bge-m3 enterpise-kb

Updated Dec 8, 2025
Python

NikitaHerndlhofer / superwhisper-rag

Star

Local SQL archive for your Super Whisper dictation history. Thin sqlite3 wrapper + multilingual semantic search via bge-m3 / Ollama. macOS, distributed via Homebrew.

macos homebrew sqlite ollama bge-m3 sqlite-vec super-whisper

Updated May 20, 2026
TypeScript

yastman / rag

Star

AI real-estate automation platform: Telegram bot, RAG, apartment search, CRM workflows, voice agent, Langfuse observability, and Dockerized AI runtime.

Updated May 23, 2026
Python

YomnaWaleed / egyptian-rag-translator

Star

Hybrid RAG system for translating Ancient Egyptian transliterations using the TLA dataset.

nlp translation embeddings semantic-search bm25 tla hieroglyphics rag huggingface ancient-egyptian vector-database marianmt dense-retrieval qdrant llm retrieval-augmented-generation bge-m3 hybrid-rag ollama-cloud

Updated Feb 8, 2026
Jupyter Notebook

luoboask / evo-agents

Star

Complete Workspace Template for OpenClaw - Full agent lifecycle with unified memory system (Markdown + SQLite), self-evolution, RAG. Not for SubAgent/Skill use.

agent markdown sqlite chinese-nlp semantic-search fts5 rag workspace-template local-ai ollama ai-memory bge-m3 memory-system self-evolution openclaw

Updated Apr 19, 2026
Python

EliasK93 / BGE-M3-and-Gemma-2-for-retrieval-augmented-generation

Star

Example application for using the BGE-M3 embedding model and Google's Gemma-2-9B-Instruct generation model in a LangChain-based RAG pipeline to answer Lord of the Rings trivia questions

nlp rag langchain langchain-python bge-m3 gemma-2

Updated Sep 12, 2024
Python

bbulb / trawl

Star

Selective web content extraction for AI agents — URL + query returns only the chunks that matter (Python library + MCP server)

python mcp embeddings web-scraping content-extraction ai-agents rag playwright llm trafilatura bge-m3 mcp-server

Updated May 21, 2026
Python

pgmnemo / pgmnemo

Star

Multi-agent memory substrate for PostgreSQL — provenance-gated, vector-hybrid recall

memory postgresql provenance magma postgresql-extension ai-agents zep locomo vector-search llm pgvector memgpt bge-m3 mem0 agent-memory letta longmemeval dragon-encoder

Updated May 23, 2026
PLpgSQL

kasssandr / archilles

Star

RAG for researchers: page-level citations from your personal library, LLM access via MCP. Ask your entire archive, get answers with the intelligence of leading AI.

python privacy sqlite mcp embeddings calibre digital-humanities semantic-search historical-research rag academic-research research-tools retrieval-augmented-generation lancedb local-ai bge-m3 model-context-protocol

Updated May 22, 2026
Python

CarterPerez-dev / vuemantics

Sponsor

Star

Semantic media search with Qwen2.5-VL + bge-m3 embeddings and pgvector

react python docker nginx typescript scss pubsub semantic-search pnpm uv justfile fastapi vector-search vector-database pgvector bge-m3 qwen2-vl

Updated May 14, 2026
Python

shaneliuyx / agent-prep

Star

12-week lab-driven curriculum: cloud/infra engineer → AI Agent/LLM engineer. Local-first MLX stack, measured engineering, every claim grounded in a runnable RESULTS.md.

agent neo4j knowledge-graph interview-prep mlx rag apple-silicon qdrant llm react-agent langgraph ragas bge-m3 phoenix-observability

Updated May 20, 2026
Python

Fulton-Engineering-Services / bge-m3-embedding-server

Star

Axum HTTP server serving BGE-M3 dense and sparse embeddings via ONNX Runtime

nlp docker rust embeddings onnxruntime axum bge-m3

Updated May 22, 2026
Rust

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bge-m3

Here are 67 public repositories matching this topic...

yuniko-software / bge-m3-onnx

labazhou2024 / memexa

yuniko-software / bge-m3-qdrant-sample

Peakstone-Labs / sembr

Vensus137 / Coreness-Flow

AEndrix03 / Graft

yucl80 / local-openai-api-service

MauroCE / m3serve

rogers0602 / llamaindex_rag

NikitaHerndlhofer / superwhisper-rag

yastman / rag

YomnaWaleed / egyptian-rag-translator

luoboask / evo-agents

EliasK93 / BGE-M3-and-Gemma-2-for-retrieval-augmented-generation

bbulb / trawl

pgmnemo / pgmnemo

kasssandr / archilles

CarterPerez-dev / vuemantics

shaneliuyx / agent-prep

Fulton-Engineering-Services / bge-m3-embedding-server

Improve this page

Add this topic to your repo