Voice Agents
Platforms for building AI agents that converse in real-time voice for customer service and telephony
40 tools
litellm
freePython SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
unsloth
open-sourceUnsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
text-generation-webui
freeThe original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.
composio
open-sourceComposio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
whisperX
freeWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
buzz
open-sourceBuzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
pipecat
freeOpen Source framework for voice and multimodal conversational AI
autoresearch
freeAI agents running research on single-GPU nanochat training automatically
langchain4j
open-sourceLangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme
instructor
open-sourcestructured outputs for llms
langroid
open-sourceHarness LLMs with Multi-Agent Programming
SWE-agent
open-sourceSWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
happy
open-sourceMobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured
index-tts
freeAn Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
gorilla
open-sourceGorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
TTS-WebUI
open-sourceA single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, Mus
ChatTTS
freeA generative speech model for daily dialogue.
NadirClaw
open-sourceOpen-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium β automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenCl
jarvis-ai-assistant
open-sourceJarvis AI Assistant - Voice-powered AI assistant for Mac
seamless_communication
freeFoundational Models for State-of-the-Art Speech and Text Translation
insanely-fast-whisper
open-sourcego-openai
open-sourceOpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
AudioGPT
freeAudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
arcade-mcp
open-sourceThe best way to create, deploy, and share MCP Servers
agents
open-sourceA framework for building realtime voice AI agents π€ποΈπΉ
EmotiVoice
open-sourceEmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
LLocalSearch
open-sourceLLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress o
AppAgent
open-sourceAppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
RealChar
open-sourceποΈπ€Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI G
ultravox
open-sourceA fast multimodal LLM for real-time voice
TaskingAI
open-sourceThe open source platform for AI-native application development.
llama-cpp-agent
freeThe llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured ou
turbopilot
open-sourceTurbopilot is an open source large-language-model based code completion engine that runs locally on CPU
core
open-sourceAI agent microservice
WhisperS2T
open-sourceAn Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
AgentPilot
freeA versatile workflow automation platform to create, organize, and execute AI workflows, from a single LLM to complex AI-driven workflows.
openlm
open-sourceOpenAI-compatible Python client that can call any LLM
open-assistant-api
open-sourceThe Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It
npi
open-sourceAction library for AI Agent
llm.ts
open-sourceCall any LLM with a single API. Zero dependencies.