27:47 Setting Up & Testing DramaBox Expressive TTS | Voice Cloning & Design | #1 Tech Giant 142 views - 2 weeks ago
27:40 OmniVoice Document Reader: Zero-shot MultiLingual Voice Cloning, 600 languages Tech Giant 500 views - 1 month ago
16:26 I Tested VoxCPM2's Voice Design, Cloning & Multilingual Support Tech Giant 618 views - 2 months ago
17:14 Testing Microsoft's VibeVoice Realtime 0.5B — Local Setup, Text & Voice Input, Multilingual Tech Giant 796 views - 2 months ago
16:05 Voxtral TTS - Open Source Text-to-speech Model by Mistral AI Local Setup & Testing w/ Voice Presets Tech Giant 1.3K views - 3 months ago
28:39 Testing Qwen3-TTS — Voice Cloning, Custom Voices & Voice Design Locally Tech Giant 2.1K views - 4 months ago
16:13 Local Setup of Supertonic 2 - Lightning Fast Multilingual Text-To-Speech Model Tech Giant 903 views - 5 months ago
24:18 I Tested Liquid AI's New Voice Model So You Don't Have To (LFM2.5-Audio) Tech Giant 2.9K views - 5 months ago
27:00 Chatterbox Turbo: Expressive Voice Cloning Model by Resemble AI Tech Giant 2.9K views - 6 months ago
25:52 Realtime End-To-End Multimodal Audio Model | LFM2 Audio 1.5B Tech Giant 3.4K views - 7 months ago
17:35 NeuTTS Air: Open source Text-To-Speech & Voice Cloning Model Tech Giant 4.2K views - 8 months ago
16:18 Open Source TTS & Voice Cloning Model with a Web UI | IndexTTS 2 Setup Guide Tech Giant 1.2K views - 9 months ago
23:49 Vibevoice 1.5B Text To Speech & Voice Cloning Model by Microsoft Tech Giant 1.9K views - 9 months ago
18:58 Kyutai's Open-source Text-to-Speech Model | 100% Local Setup Tech Giant 3.2K views - 10 months ago
22:43 Voxtral Mini 3B Audio + Text Multimodal Model by Mistral AI Tech Giant 1.3K views - 11 months ago
22:55 Gemma 3n E4B Multimodal Model Test w/ PyQt6 GUI and Gemini CLI Tech Giant 745 views - 11 months ago
17:47 Openaudio S1 & S1 mini Free Voice Cloning Model by Fish Audio | Tests & Local Setup Tech Giant 4.5K views - 1 year ago
22:01 Chatterbox: Open-source Text-to-speech & Voice Cloning Model | 100% FREE TTS Tech Giant 1.6K views - 1 year ago
27:26 Open-source Voice Cloning & Text to Speech with the new OuteTTS v1.0 Model Tech Giant 840 views - 1 year ago
20:10 CSM 1B (Conversational Speech Model) by Sesame AI Labs | Voice Cloning Tech Giant 1.2K views - 1 year ago
21:23 Dia 1.6B Highly Realistic TTS Model for audio dialogue generation Tech Giant 1.9K views - 1 year ago
10:49 Multilingual SOTA Text To Speech Model | Kokoro TTS version 1.0 Tech Giant 2.1K views - 1 year ago
26:27 Deepseek R1 Open-Source SOTA Reasoning Model | Local Setup | Reasoning tool Tech Giant 289 views - 1 year ago
20:41 Kokoro 82M Text to speech model (FREE SOTA TTS Model) | ONNX & Pytorch Models Setup Tech Giant 4.3K views - 1 year ago
17:27 Real-Time Speech-to-Text & Speaker Identification using Whisper, Vosk & Pyannote (Open-Source) Tech Giant 12K views - 1 year ago
19:21 Open-source Voice Cloning & Text to Speech with the new OuteTTS v0.2 500M Model | Local Setup Tech Giant 1.3K views - 1 year ago
4:53 Improving AI Agents with Background Tasks | A Different Approach to Handling Tools Tech Giant 474 views - 1 year ago
19:39 Setting up Janus 1.3B Multimodal LLM Locally | Image Generation & Understanding Tech Giant 1.4K views - 1 year ago
12:32 Open-source Voice Cloning with the new F5 TTS Model | Local Setup, CLI Inference & Gradio Web UI Tech Giant 6.8K views - 1 year ago
6:32 CosyVoice Text to Speech WebUI (Open-source) - English Version Tech Giant 1.3K views - 1 year ago
11:21 CosyVoice TTS #3 | Open-source Instruct Model Text-to-Speech Tech Giant 1.3K views - 1 year ago
12:22 CosyVoice TTS #2 | Open-source Base Model Voice Cloning & Cross-Lingual Tech Giant 1.8K views - 1 year ago
12:50 Setting up CosyVoice TTS #1 | Open-source SFT Model Text to Speech Tech Giant 1.5K views - 1 year ago
16:44 Setting up Fish Speech TTS v1.4 by @FishAudio locally - High Quality Open-source Voice Cloning Model Tech Giant 6.1K views - 1 year ago
22:21 High Quality Voice Cloning TTS Model - Fish Speech 1.2 by Fish Audio Tech Giant 2.6K views - 1 year ago
12:17 Setting up a Realistic Text-to-speech; Bark (by Suno AI) locally Tech Giant 6.9K views - 1 year ago
36:59 Simple AI Agent/Chatbot | MegaMind | I/O with Whisper.cpp & Piper TTS Tech Giant 773 views - 2 years ago
11:30 Speech to text with Whisper CPP in a Python Project (with CoreML/Apple Silicon Support) Tech Giant 3.9K views - 2 years ago
8:13 Gemini 1.5 Pro (latest) with Langchain's ChatVertexAI Package Tech Giant 521 views - 2 years ago
13:27 Vision Tool & Screenshot Tool for Langchain Structured Chat Agent (Powered by Gemini 1.5 Pro) Tech Giant 459 views - 2 years ago
20:39 Installing Piper Text To Speech Engine (on a Macbook w/ Apple Silicon) Tech Giant 3.7K views - 2 years ago
36:55 Setting up Openvoice version 2 and MeloTTS for AI voice cloning Tech Giant 14K views - 2 years ago
3:26 WizardLM2 function call - Using Llama 3 Tokenizer & Langchain's Pydantic OpenAI function converter Tech Giant 153 views - 2 years ago
5:34 LLAMA 3: function calling review using llama index framework and Ollama locally. Tech Giant 621 views - 2 years ago