ComfyUI Qwen3 ASR

ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports high-accuracy ASR and language identification for 52 languages/dialects, including 22 Chinese dialects and various English accents. Features word-level timestamps, long audio transcription, and VRAM-optimized inference.

A software interface displaying an audio processing workflow. It includes modules for loading audio, aligning, transcribing, and previewing text. Inputs and settings such as language, device, and precision are visible.


Comfy UI Audio Waveform Visualiser

A suite of custom nodes for ComfyUI designed to generate high-quality audio waveform visualizations. Whether you need a real-time preview on your node or a high-fidelity image for video synthesis, this package provides multiple ways to see your sound.

A digital audio processing interface displaying a workflow with audio waveforms. The screen shows three waveform panels with blue and red frequency visualizations and various node connections. RAM and CPU usage indicators are visible at the top.


YouTube Thumbnail Preview

Code

Preview


Simple Password Generator

Code

Preview