You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ONNX implementation of the BGE-M3 multilingual embedding model and tokenizer with native C#, Java, and Python implementations. Generates all three embedding types: dense, sparse, and ColBERT vectors.
A demonstration of hybrid search with reranking using Qdrant and BGE-M3 model. A showcase of dense and sparse retrieval combined with ColBERT reranking for optimal search results
Event-driven desktop AI agent: YAML scenarios, plugin system with UI contributions, local RAG (BGE-M3 + Qdrant), LLM routing. Electron + React + Python.
Local-first semantic cache for AI agents. A small C daemon + CLI that remembers what your agent learned across sessions. Plugs into Claude Code, Codex, Gemini CLI, and Claude Desktop / ChatGPT via MCP. No LLM calls, no SaaS, no API key.
Local SQL archive for your Super Whisper dictation history. Thin sqlite3 wrapper + multilingual semantic search via bge-m3 / Ollama. macOS, distributed via Homebrew.
Complete Workspace Template for OpenClaw - Full agent lifecycle with unified memory system (Markdown + SQLite), self-evolution, RAG. Not for SubAgent/Skill use.
Example application for using the BGE-M3 embedding model and Google's Gemma-2-9B-Instruct generation model in a LangChain-based RAG pipeline to answer Lord of the Rings trivia questions
RAG for researchers: page-level citations from your personal library, LLM access via MCP. Ask your entire archive, get answers with the intelligence of leading AI.