🎙️

Voice Agents

Platforms for building AI agents that converse in real-time voice for customer service and telephony

40 tools

litellm

free

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

41.6k 3435/movoice-agents

unsloth

open-source

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

58.7k 2325/movoice-agents

pipecat

free

Open Source framework for voice and multimodal conversational AI

10.9k 368/movoice-agents

composio

open-source

Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.

27.6k 353/movoice-agents

whisperX

free

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

21.0k 413/movoice-agents

langchain4j

open-source

LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme

11.4k 420/movoice-agents

buzz

open-source

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

18.5k 300/movoice-agents

autoresearch

free

AI agents running research on single-GPU nanochat training automatically

62.2k 29588/movoice-agents

happy

open-source

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

16.8k 2595/movoice-agents

text-generation-webui

free

The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.

46.4k 98/movoice-agents

instructor

open-source

structured outputs for llms

12.6k 135/movoice-agents

NadirClaw

open-source

Open-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium — automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenCl

375 53/mocoding-agents

TTS-WebUI

open-source

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, Mus

3.0k 90/movoice-agents

index-tts

free

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

19.7k 840/movoice-agents

langroid

open-source

Harness LLMs with Multi-Agent Programming

3.9k 15/movoice-agents

SWE-agent

open-source

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

18.9k 240/mocoding-agents

arcade-mcp

open-source

The best way to create, deploy, and share MCP Servers

841 53/movoice-agents

insanely-fast-whisper

open-source

12.2k 3413/movoice-agents

gorilla

open-source

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

12.8k 60/movoice-agents

jarvis-ai-assistant

open-source

Jarvis AI Assistant - Voice-powered AI assistant for Mac

464 8/movoice-agents

llama-cpp-agent

free

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured ou

624 8/movoice-agents

ChatTTS

free

A generative speech model for daily dialogue.

39.0k 53/movoice-agents

AppAgent

open-source

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

6.6k 45/movoice-agents

agents

open-source

A framework for building realtime voice AI agents 🤖🎙️📹

5.9k 38/movoice-agents

ultravox

open-source

A fast multimodal LLM for real-time voice

4.4k 15/movoice-agents

core

open-source

AI agent microservice

3.0k 15/movoice-agents

RealChar

open-source

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI G

6.2k 15/movoice-agents

AgentPilot

free

A versatile workflow automation platform to create, organize, and execute AI workflows, from a single LLM to complex AI-driven workflows.

539 8/movoice-agents

LLocalSearch

open-source

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress o

6.0k 0/movoice-agents

npi

open-source

Action library for AI Agent

228 0/movoice-agents

open-assistant-api

open-source

The Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It

359 0/movoice-agents

TaskingAI

open-source

The open source platform for AI-native application development.

5.4k 0/movoice-agents

WhisperS2T

open-source

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

558 0/movoice-agents

EmotiVoice

open-source

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

8.5k 0/movoice-agents

turbopilot

open-source

Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU

3.8k 0/mocoding-agents

go-openai

open-source

OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go

10.6k 8/movoice-agents

seamless_communication

free

Foundational Models for State-of-the-Art Speech and Text Translation

11.8k 8/movoice-agents

llm.ts

open-source

Call any LLM with a single API. Zero dependencies.

213 8/movoice-agents

openlm

open-source

OpenAI-compatible Python client that can call any LLM

369 15/movoice-agents

AudioGPT

free

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

10.2k 30/movoice-agents