Voice Agents
Platforms for building AI agents that converse in real-time voice for customer service and telephony
40 tools
litellm
freePython SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
unsloth
open-sourceUnsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
pipecat
freeOpen Source framework for voice and multimodal conversational AI
composio
open-sourceComposio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
whisperX
freeWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
langchain4j
open-sourceLangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme
buzz
open-sourceBuzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
autoresearch
freeAI agents running research on single-GPU nanochat training automatically
happy
open-sourceMobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured
text-generation-webui
freeThe original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.
instructor
open-sourcestructured outputs for llms
NadirClaw
open-sourceOpen-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium — automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenCl
TTS-WebUI
open-sourceA single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, Mus
index-tts
freeAn Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
langroid
open-sourceHarness LLMs with Multi-Agent Programming
SWE-agent
open-sourceSWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
arcade-mcp
open-sourceThe best way to create, deploy, and share MCP Servers
insanely-fast-whisper
open-sourcegorilla
open-sourceGorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
jarvis-ai-assistant
open-sourceJarvis AI Assistant - Voice-powered AI assistant for Mac
llama-cpp-agent
freeThe llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured ou
ChatTTS
freeA generative speech model for daily dialogue.
AppAgent
open-sourceAppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
agents
open-sourceA framework for building realtime voice AI agents 🤖🎙️📹
ultravox
open-sourceA fast multimodal LLM for real-time voice
core
open-sourceAI agent microservice
RealChar
open-source🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI G
AgentPilot
freeA versatile workflow automation platform to create, organize, and execute AI workflows, from a single LLM to complex AI-driven workflows.
LLocalSearch
open-sourceLLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress o
npi
open-sourceAction library for AI Agent
open-assistant-api
open-sourceThe Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It
TaskingAI
open-sourceThe open source platform for AI-native application development.
WhisperS2T
open-sourceAn Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
EmotiVoice
open-sourceEmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
turbopilot
open-sourceTurbopilot is an open source large-language-model based code completion engine that runs locally on CPU
go-openai
open-sourceOpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
seamless_communication
freeFoundational Models for State-of-the-Art Speech and Text Translation
llm.ts
open-sourceCall any LLM with a single API. Zero dependencies.
openlm
open-sourceOpenAI-compatible Python client that can call any LLM
AudioGPT
freeAudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head