WakkaQt - Karaoke App

Help requested: testers needed on macOS.

A Windows x86_64 ZIP bundle is available at the end of this readme; follow the instructions there (mostly: unpack, then run WakkaQt.exe). On Windows 11 you must disable Audio Enhancements on your microphone, because they interfere with the Qt6 recording process.

WakkaQt is a karaoke application built with C++ and Qt6, designed to record vocals over a video or audio track and mix them into a rendered file. It features webcam recording, YouTube video downloading, real-time sound visualization, and post-recording video rendering with FFmpeg. It automatically applies some mastering to the vocal tracks. It also has a custom auto-tuner class called VocalEnhancer that provides slight pitch shifting/correction with formant preservation.

Features

Record karaoke sessions with synchronized video and audio playback.
Mix webcam video and vocals with the karaoke video and export the result.
Real-time sound visualization (green waveform microphone meter).
Download YouTube videos and use them as karaoke tracks.
Video and audio device selection for recording.
Rendering with FFmpeg for high-quality output, with automatic mastering and vocal enhancement.
Vocal enhancement with a few effects and a custom auto-tuning class, written to improve the vocals slightly without distorting or corrupting vocal formants. All of this is automatic, but you can adjust volumes before each rendering operation.
Intended cross-platform compatibility (Windows, macOS, Linux).
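
As a rough illustration of what a microphone level meter has to compute, the sketch below derives an RMS level and a dBFS value from one buffer of 16-bit PCM samples. The function names `rmsLevel` and `toDbfs` and the -60 dB floor are assumptions made for this example; they are not taken from the WakkaQt sources, and the real widget may compute its levels differently.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical helper: RMS level of a 16-bit PCM buffer, normalized to 0.0..1.0.
double rmsLevel(const std::vector<std::int16_t>& samples) {
    if (samples.empty()) return 0.0;
    double sumSquares = 0.0;
    for (std::int16_t s : samples) {
        const double x = s / 32768.0;  // scale the sample to [-1, 1)
        sumSquares += x * x;
    }
    return std::sqrt(sumSquares / static_cast<double>(samples.size()));
}

// Hypothetical helper: convert a normalized level to dBFS.
// The floor keeps silence at a finite value (assumed -60 dB here).
double toDbfs(double level, double floorDb = -60.0) {
    if (level <= 0.0) return floorDb;
    return std::max(floorDb, 20.0 * std::log10(level));
}
```

A meter widget could call `rmsLevel` on each incoming audio buffer and map the dBFS result to the height of the drawn bar.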

Requirements

To build and run this application, ensure you have the following:

C++17 or later
Qt6 (Qt Multimedia module)
FFmpeg (for video/audio mixing and rendering)
yt-dlp (for downloading YouTube videos)
fftw3 (for the custom VocalEnhancer class)

Installation

Clone the repository:

git clone https://github.com/guprobr/WakkaQt.git
cd WakkaQt

Install dependencies:
- Qt6: Install via your system package manager or the official Qt website.
- FFmpeg: Install from the FFmpeg website or via your system package manager.
- yt-dlp: Install from the yt-dlp GitHub page. On Ubuntu 24.04 and earlier you must get the latest version from GitHub.
- libfftw3: Required by the custom VocalEnhancer class.

Ubuntu/Debian

sudo apt update
sudo apt install qt6-base-dev qt6-multimedia-dev ffmpeg yt-dlp libfftw3-dev

Fedora

sudo dnf install qt6-qtbase-devel qt6-qtmultimedia-devel ffmpeg yt-dlp fftw-devel

Arch Linux

sudo pacman -S qt6-base qt6-multimedia ffmpeg yt-dlp fftw

openSUSE

sudo zypper install qt6-qtbase-devel qt6-qtmultimedia-devel ffmpeg yt-dlp fftw3-devel

Build the project:
```
mkdir build
cd build
cmake ..
make
```
Run the application:
```
./WakkaQt
```

Usage

Load Karaoke Track: Use the "Load playback" option on the "File" menu to load a video or audio file for the karaoke session. This starts a preview of the playback and enables the SING button so you can start recording.
Select Input Device: Choose the microphone or audio input device for recording.
Sing & Record: Click the "♪ SING ♪" button to start recording. The webcam records video while the audio input records your voice.
Stop Recording: Once finished, click the "Finish!" button to stop the recording.
Adjust Vocals Volume: Once recording has finished, a dialog appears with a knob that lets you amplify or reduce the volume of the vocals. This preview uses very low-quality amplification, meant only for adjusting volumes. After rendering it will sound much better.
Render the Video: You can render and preview the mix of vocals and the karaoke track before producing the final video or audio file.
Download YouTube Video: You can enter a YouTube URL to download a video and use it as a karaoke track. URLs from other streaming services might work as well.
Render Again: This button appears after rendering, so you can choose a new filename, adjust the options, and render again with different settings.
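
The volume adjustment step boils down to scaling each recorded sample by a gain factor. The sketch below shows one simple way to do that for 16-bit PCM, with clamping so loud passages saturate instead of wrapping around. The function name `applyGain` and the 16-bit sample format are assumptions for illustration; the actual AudioAmplifier class may work differently.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical helper: scale 16-bit PCM samples by `gain`.
// Results are rounded and clamped to the int16 range to avoid wrap-around.
std::vector<std::int16_t> applyGain(const std::vector<std::int16_t>& in, double gain) {
    std::vector<std::int16_t> out;
    out.reserve(in.size());
    for (std::int16_t s : in) {
        const double scaled = std::round(s * gain);
        const double clamped = std::min(32767.0, std::max(-32768.0, scaled));
        out.push_back(static_cast<std::int16_t>(clamped));
    }
    return out;
}
```

A gain of 1.0 leaves the samples unchanged, values above 1.0 amplify, and values below 1.0 attenuate. Plain scaling like this is crude, which is consistent with the preview being described as low quality compared with the final render.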

Project Structure

mainwindow.cpp / mainwindow.h: Core application logic, including UI setup and media control, orchestration of audio and video recording, downloading playbacks from streaming services, and rendering.
sndwidget.cpp / sndwidget.h: Custom widget that displays sound levels from the current audio input source; this is the green audio visualizer at the top of the UI.
previewdialog.cpp / previewdialog.h: Preview dialog for reviewing and adjusting vocal levels before rendering. This is where mastering and vocal enhancement take place, letting the user amplify or reduce the vocal volume while listening to the backing track at the same time. Note that this is a low-quality preview; the final rendering with FFmpeg will sound much better.
audioamplifier.cpp / audioamplifier.h: Class that manipulates samples for volume adjustment and acts as a media player to mix the backing track with the recorded vocals.
audiorecorder.cpp / audiorecorder.h: Class that records audio. It allows different sample formats, channel counts, and sample rates to be configured for recording, since QAudioInput with QMediaCaptureSession refuses to record in different formats.
audiovizmediaplayer.cpp / audiovizmediaplayer.h: Class that mimics QMediaPlayer while extracting audio from the playback and feeding visualization data to the AudioVisualizer widget.
audiovisualizerwidget.cpp / audiovisualizerwidget.h: Class that implements the Yelloopy© audio visualizer widget.
vocalenhancer.cpp / vocalenhancer.h: Class that implements VocalEnhancer, a custom pitch shifter applied automatically right after each recording. It enhances vocals through a multi-step process that combines pitch detection, harmonic scaling, gain normalization, and windowed overlap-add techniques. Its three-pass scaling approach to pitch correction is intended to enhance the vocals without distorting their characteristics.
resources.qrc: Resource file for including images such as the app logo.
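
To make the pitch detection step mentioned for VocalEnhancer more concrete, here is a minimal sketch of one common approach: time-domain autocorrelation. This is an assumption chosen for illustration. The readme does not say which detector VocalEnhancer uses, and the names `estimatePitchHz` and `makeSine` as well as the 80 to 1000 Hz search range are hypothetical.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: estimate the fundamental frequency of a mono frame
// by picking the lag with the highest autocorrelation score.
// Returns 0.0 if the frame is too short or no positive peak is found.
double estimatePitchHz(const std::vector<double>& x, double sampleRate,
                       double minHz = 80.0, double maxHz = 1000.0) {
    const std::size_t minLag = static_cast<std::size_t>(sampleRate / maxHz);
    const std::size_t maxLag = static_cast<std::size_t>(sampleRate / minHz);
    if (minLag == 0 || x.size() <= maxLag) return 0.0;

    std::size_t bestLag = 0;
    double bestScore = 0.0;
    for (std::size_t lag = minLag; lag <= maxLag; ++lag) {
        double score = 0.0;
        // Longer lags sum fewer terms, which mildly favors the shortest
        // matching period over its multiples (fewer octave errors).
        for (std::size_t i = 0; i + lag < x.size(); ++i) {
            score += x[i] * x[i + lag];
        }
        if (score > bestScore) {
            bestScore = score;
            bestLag = lag;
        }
    }
    return bestLag != 0 ? sampleRate / static_cast<double>(bestLag) : 0.0;
}

// Hypothetical test signal: a pure sine tone of `count` samples.
std::vector<double> makeSine(double hz, double sampleRate, std::size_t count) {
    const double twoPi = 6.283185307179586;
    std::vector<double> tone(count);
    for (std::size_t i = 0; i < count; ++i) {
        tone[i] = std::sin(twoPi * hz * static_cast<double>(i) / sampleRate);
    }
    return tone;
}
```

Calling `estimatePitchHz(makeSine(220.0, 44100.0, 2048), 44100.0)` should return a value close to 220 Hz, limited by the integer lag resolution. A pitch corrector would compare such an estimate with the nearest target note to decide how far to shift.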

FFmpeg Integration

The application uses FFmpeg to mix the recorded webcam video and vocals with the karaoke video. It applies audio filters such as normalization, echo, and compression to enhance vocal quality. Using FFmpeg also lets the app work with many different media formats for input and output.
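
The sketch below shows how an FFmpeg argument list for this kind of mix might be assembled. It is not WakkaQt's actual command line: the helper names `buildVocalFilter` and `buildMixArgs`, the filter order, and the filter parameter values are assumptions. The filters themselves (`volume`, `acompressor`, `aecho`, `loudnorm`, `amix`) are standard FFmpeg audio filters. For brevity it handles only the audio mix and leaves the webcam video out.

```cpp
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper: build a filter graph that processes the vocals
// (input 1) and mixes them with the backing track audio (input 0).
std::string buildVocalFilter(double vocalGain) {
    std::ostringstream f;
    f << "[1:a]volume=" << vocalGain
      << ",acompressor,aecho=0.8:0.7:60:0.3,loudnorm[vox];"
      << "[0:a][vox]amix=inputs=2:duration=first[mix]";
    return f.str();
}

// Hypothetical helper: arguments to pass to the ffmpeg executable.
std::vector<std::string> buildMixArgs(const std::string& backing,
                                      const std::string& vocals,
                                      const std::string& output,
                                      double vocalGain) {
    return {"-y",                 // overwrite the output file if it exists
            "-i", backing,        // input 0: karaoke track
            "-i", vocals,         // input 1: recorded vocals
            "-filter_complex", buildVocalFilter(vocalGain),
            "-map", "[mix]",      // use the mixed audio as the output stream
            output};
}
```

In a Qt application, a list like this could be converted to a `QStringList` and handed to `QProcess::start` together with the path to the ffmpeg binary. Passing arguments as a list rather than one shell string avoids quoting problems with file names that contain spaces.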

About Windows bundle ZIP

A suitable FFmpeg binary is already included in the root of the application directory.
yt-dlp is included too, for your convenience.
NOTE: antivirus software degrades this application's performance considerably, and VPNs may cause streaming services to block yt-dlp from fetching the video file.
You can download the Windows x64 ZIP here on my website.

Contributing

Feel free to contribute by submitting pull requests or reporting issues on the GitHub Issues page.

License

This project is licensed under the MIT License.
