Real-time stream editing pipeline powered by the FLUX.2-klein-4B model, optimized for consumer GPUs
-
Updated
May 16, 2026 - Python
Real-time stream editing pipeline powered by the FLUX.2-klein-4B model, optimized for consumer GPUs
graph based stream processing framework
Real-time Face & emotion recognition system using a lightweight CNN, with OpenCV webcam inference, GroqCloud for chatbot & FastAPI + frontend web deployment.
An open source risk-management tool built for stock and security risk analysis
Real-time behavioral intelligence for call centers. Transcribes support calls, redacts PII, extracts emotional tone, classifies issues, and delivers insight-rich dashboards — powered by GPT-3.5 (cheap tokens), Whisper, DuckDB, and a polished React+TypeScript frontend. No Azure. No Power BI. No vendor lock-in. Just full-stack AI that runs local.
A production-ready voice agent template that feels like a real phone call
Production Android AI with ExecuTorch 1.0 - Deploy PyTorch models to mobile with NPU acceleration and 50KB footprint
An AI Email Intelligence Platform Real-time email intelligence with multi-provider AI fallback, semantic search, OAuth integration. Handles incremental sync and streaming with 70% cold start reduction.
Simulation-first safety middleware for BCI-inspired intent-to-action systems, with confidence gating, prohibited-action blocking, replayable sessions, assurance receipts, offline EEG-style analysis, and sequential fallback policy.
A unified benchmarking framework for evaluating Voice AI agents across conversational quality, audio realism, latency metrics, and safety guardrails with scalable multi-language stress testing.
Real time AI hand pose detection and styling it with the infinity stone gauntlet style build by using ReactJS and react webcam
The WYRD Protocol Architecture. The goal is to move the "World Model" out of the LLM's fleeting memory and into a structured Entity-Component-System (ECS). World-class foundation for deterministic AI world-modeling. Light-weight agnostic world-modeling system designed for the next generation of AGI, AI gaming, AI RPGs, and AI charactor systems.
A functionally operational, mathematically unhinged system for achieving 10× effective memory amplification on Apple Silicon using quantized fractal compression, complex-plane KV decomposition, and Euler-aligned swap geometry.
Visiona AI is a real-time assistive system for visually impaired users that detects objects, estimates distance, tracks motion, and predicts collisions using multiple cameras. It provides voice alerts, answers queries, remembers objects (15 min), and supports goal-based assistance.
A real-time facial analysis platform built with Flask, OpenCV, TensorFlow, PyTorch, and Next.js, featuring live face detection, age & gender estimation, and emotion recognition. Designed for robotics club events, tech fairs, and interactive AI demos, with a futuristic cyberpunk UI powered by Arwes.
Local-first real-time voice gateway for Ollama, OpenAI-compatible local LLMs, MCP, and local AI agents: browser mic, STT, TTS, barge-in.
Real time AI face landmark detection and positioning app build by using ReactJS and react webcam
Real-time facial emotion, age, and gender detection using CNNs, OpenCV, and Keras. Powered by CK+ and UTKFace datasets.
Real-time AI assistant powered by Gemini 2.5 Flash Native Audio — sees your camera, speaks back. Built with FastAPI + React.
LiveVision AI: Real-Time Visual Conversations with Gemini
Add a description, image, and links to the real-time-ai topic page so that developers can more easily learn about it.
To associate your repository with the real-time-ai topic, visit your repo's landing page and select "manage topics."