TEN Agent is a conversational voice AI agent powered by TEN, integrating DeepSeek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.
| Community Channel | Purpose |
| --- | --- |
| Discord | Join our Discord community to connect with developers |
| X | Follow TEN Framework on X for updates and announcements |
| WeChat | Join our WeChat group for Chinese community discussions |
> [!IMPORTANT]
> Star Our Repository ⭐️
>
> Get instant notifications for new releases and updates. Your support helps us grow and improve TEN Agent!
TEN Agent now integrates with Llama 4, Meta’s latest large language model. With no setup or waiting required, you can simply start a real-time conversation with TEN Agent.
TEN Agent now integrates seamlessly with MCP servers, expanding its LLM capabilities. To get started:
This integration allows you to leverage MCP’s diverse server offerings while maintaining TEN Agent’s powerful conversational abilities.
Build engaging AI avatars with TEN Agent using Trulience’s diverse collection of free avatar options. To get it up and running, you only need 2 steps:
TEN is a highly versatile framework, and TEN Agent is compatible with DeepSeek R1. Try experiencing realtime conversations with DeepSeek R1!
TEN Agent is now running on the Espressif ESP32-S3 Korvo V3 development board, an excellent way to integrate realtime communication with LLM on hardware.
Try the Google Gemini Multimodal Live API with realtime vision and realtime screenshare detection. It is a ready-to-use extension, and powerful tools like Weather Check and Web Search are integrated seamlessly into TEN Agent.
Describe a topic and ask TEN Agent to tell you a story while also generating images of the story to provide a more immersive experience for kids.
TEN offers great support for making the realtime interactive experience even better on other LLM platforms as well. Check out the docs for more.
TEN seamlessly integrates with Coze platform to enhance real-time interactive experiences. Check out our documentation to learn how to leverage these powerful integrations.
| Category | Requirements |
| --- | --- |
| Keys | • Agora App ID and App Certificate (free minutes every month)<br>• OpenAI API key (or any LLM compatible with the OpenAI API)<br>• Deepgram ASR (free credits available with signup)<br>• ElevenLabs TTS (free credits available with signup) |
| Installation | • Docker / Docker Compose<br>• Node.js (LTS) v18 |
| Minimum System Requirements | • CPU >= 2 cores<br>• RAM >= 4 GB |
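You can quickly confirm that the installation prerequisites listed above are available on your machine. This is only a minimal sanity-check sketch; version output formats vary across platforms, and v18 is simply the minimum stated in the table:

```bash
# Verify Docker, Docker Compose, and Node.js are installed.
docker --version
docker compose version
node --version   # expect an LTS release, v18 or newer per the requirements above
```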
> [!NOTE]
> macOS: Docker setting on Apple Silicon
>
> Uncheck “Use Rosetta for x86/amd64 emulation” in Docker settings. This may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.
1. Create the `.env` file from `.env.example`:

   ```bash
   cp ./.env.example ./.env
   ```

2. Setup Agora App ID and App Certificate in `.env`:

   ```
   AGORA_APP_ID=
   AGORA_APP_CERTIFICATE=
   ```

3. Start the agent development containers:

   ```bash
   docker compose up -d
   ```

4. Enter the container:

   ```bash
   docker exec -it ten_agent_dev bash
   ```

5. Build the agent with the default graph ( ~5min - ~8min); check the `/examples` folder for more examples:

   ```bash
   # use the default agent
   task use

   # or use the demo agent
   task use AGENT=agents/examples/demo
   ```

6. Start the web server:

   ```bash
   task run
   ```
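If everything started correctly, the Playground and web server should be listening on their default ports (3000 and 8080, as described in the architecture overview below). A minimal sanity-check sketch from another terminal, assuming `curl` is installed and the default ports are unchanged:

```bash
# Check that the Playground (port 3000) and the Golang web server (port 8080)
# are accepting connections on localhost.
for port in 3000 8080; do
  if curl -s -o /dev/null "http://localhost:${port}"; then
    echo "port ${port}: reachable"
  else
    echo "port ${port}: not reachable"
  fi
done
```

Then open the playground at localhost:3000 in your browser to customize and test your agent.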
Now, we have successfully set up the playground. This is just the beginning of TEN Agent. There are many different ways to explore and utilize TEN Agent. To learn more, please refer to the documentation.
GitHub offers a free Codespace for each repository, so you can run the playground in a Codespace without using Docker. Also, Codespaces are much faster than localhost.
Check out this guide for more details.
The Playground and the Demo serve different purposes. In a nutshell, think of it this way: the Playground is for customizing your agent, and the Demo is for deploying your agent.
Check out this guide for more details.
Once you have customized your agent (either by using the playground or editing `property.json` directly), you can deploy it by creating a release Docker image for your service.
Read the Deployment Guide for detailed information about deployment.
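As a rough illustration only, a release build and run might look like the sketch below. The Dockerfile location, image name, and flags are placeholders here; the Deployment Guide is the authoritative reference:

```bash
# Build a release image for your customized agent (image name is hypothetical).
docker build -t my-ten-agent:latest .

# Run the release image, exposing the web server port mentioned in this README
# and reusing the .env file that holds your keys.
docker run --rm -it --env-file .env -p 8080:8080 my-ten-agent:latest
```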
coming soon…
1️⃣ TEN Agent App: Core application that manages extensions and data flow based on graph configuration

2️⃣ Dev Server: `port:49480` - local server for development purposes

3️⃣ Web Server: `port:8080` - Golang server handling HTTP requests and agent process management

4️⃣ Front-end UI:
- `port:3000` Playground - To customize and test your agent configurations
- `port:3002` Demo - To deploy your agent without the module picker

| Project | Preview |
| --- | --- |
| 🏚️ TEN Framework<br>TEN, an AI agent framework for creating various AI agents that support real-time conversation. | |
| 🎙️ TEN Agent<br>TEN Agent is a conversational voice AI agent powered by TEN, integrating DeepSeek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze. | |
| 🎨 TMAN Designer (alpha)<br>TMAN Designer is a low/no-code option for building a cool voice agent. With its easy-to-use workflow UI, you can build things easily. It comes with runtime, dark/light themes, integrated editors and integrated terminals. | |
| 📒 TEN Portal<br>The official site of the TEN framework, with documentation, blog and showcases. | |
We welcome all forms of open-source collaboration! Whether you’re fixing bugs, adding features, improving documentation, or sharing ideas - your contributions help advance personalized AI tools. Check out our GitHub Issues and Projects to find ways to contribute and show your skills. Together, we can build something amazing!
> [!TIP]
> Welcome all kinds of contributions 🙏
>
> Join us in building TEN better! Every contribution makes a difference, from code to documentation. Share your TEN Agent projects on social media to inspire others!
Connect with TEN maintainer @elliotchen100 on 𝕏 or @cyfyifanchen on GitHub for project updates, discussions and collaboration opportunities.
Contributions are welcome! Please read the contribution guidelines first.
This project is Apache 2.0 licensed.