← KARAOKE.VOCALA.NET  /  Help

How to Create a Karaoke Video Online

A complete step-by-step guide — no special skills needed.

Go to the app →

Contents

1

Upload your song

When you open the site, you will land on the «1 Tracks» tab.

Click Choose file and select an audio file from your computer.

ℹ️ Supported formats: MP3, WAV, FLAC. The higher the audio quality, the more accurate the result.

Alternatively, drag and drop the file directly onto the grey upload area.

Once the file loads, its name will appear and the next step will become available.

2

Separate vocals and instrumental

Click Separate tracks.

The site will automatically split the song into two tracks:

  • Vocal - the singer's voice only
  • Instrumental - the music without the voice
This takes 1-3 minutes depending on the track length. Simply wait - do not close the browser tab.

When separation is complete, you will see two waveform tracks and a playback button. Press to preview the result.

If both the vocal and instrumental tracks sound clean, you are ready to move on.

Volume controls (left panel):

  • Vocal - the speaker icon mutes or unmutes the vocal track. The slider and number field set the volume from 0 to 100.
  • Instrumental - the same controls for the music-only track.
  • Video / Photo - the monitor icon shows or hides the background image in the preview window. The slider controls background opacity (0 = fully transparent, 100 = fully opaque).
3

Edit the vocal track (optional)

Vocal editing only affects how the site synchronizes lyrics - it does not change the final exported video.

Use this step if the song contains unwanted audio that might confuse the synchronizer: background noise at the start or end, spoken words before or after the singing, or stray vocal runs and ad-libs in the middle. Muting those sections helps the site pinpoint where each lyric line actually starts and ends.

ℹ️ If your song is clean with no extraneous noise, skip this step and go to Step 4.

How to open the vocal editor:

  1. Make sure you are on the 1 Tracks tab
  2. Click Vocal editing - the vocal waveform will appear

Three editing tools:

  • Cursor - the default mode; used to navigate along the track
  • Cut - click anywhere on the waveform to split it at that point, dividing the vocal into separate segments
  • Mute - click a segment to silence it; muted segments are excluded from lyric synchronization

Typical workflow:

  1. Select the Cut tool
  2. Click on the waveform at the point where noise ends and singing begins - the track splits there
  3. Select the Mute tool
  4. Click the noise segment at the beginning - it turns grey and is silenced
  5. Repeat the same process at the end of the track if needed
ℹ️ The and + buttons zoom in and out on the waveform, making it easier to cut precisely.
⚠️ Do not mute sections where the singer is actually singing - only silence noise and silence before or after the vocals. Muting real singing will prevent those lyrics from syncing correctly.

When finished, click Vocal editing again to close the editor, then continue to Step 4.

4

Enter the song lyrics

Click the 2 Text tab at the top of the screen.

Paste or type the lyrics into the large text field.

⚠️ Important: each line of the song must be on its own line. The text must exactly match what the singer sings - any extra words or mismatches will reduce synchronization accuracy.

Correct:

Walking down the empty street
Rain falling on my face
I don't know what I'm looking for
In this forgotten place

Incorrect:

Walking down the empty street Rain
falling on my face I don't know
what I'm looking for In this
← all on one line or
random line breaks
1. Walking down the empty street
2. Rain falling on my face
[Chorus]
I don't know what I'm looking for
← line numbers, section labels,
or extra annotations
ℹ️ You can find lyrics online by searching for: "Song name lyrics". Copy and paste the result, then remove any line numbers, section headings, or annotations.

Below the text field you will find a language selector. We recommend leaving it on «Auto-detect» - the site identifies the language correctly in most cases.

5

Synchronize the lyrics

Click Synchronize text.

The site analyzes the vocal track and determines exactly when each word is sung, then assigns timing to every line automatically.

This also takes 1-2 minutes. Do not close the tab while it is running.

When «Done» appears on the button, synchronization is complete.

Press - or tap Space on your keyboard - to preview how the lyrics scroll along with the music.

ℹ️ If some lines are slightly off-beat, you can fine-tune them manually on the «3 Editing» tab - see Step 6.
6

Fine-tune timing (optional)

If some words or lines are slightly out of sync after automatic synchronization, the 3 Editing tab lets you adjust them by hand.

ℹ️ If the synchronization looks accurate, skip this step and go directly to Step 7.

Two editing modes:

  • Edit line-by-line - each block on the timeline represents one full line. This is the faster approach for overall adjustments.
  • Edit word-by-word - each block represents a single word, giving you precise control over every syllable. More time-consuming but highly accurate.

Moving and resizing clips:

  • Move a clip - click and drag it left or right along the timeline. This shifts when that line or word begins.
  • Resize a clip - drag the left or right edge of the block. This controls how long the fill animation runs for that line or word.
  • Edit text - double-click a block to open an edit window. Changes are instantly reflected in the lyrics text field.

Selecting multiple blocks and moving them together:

  • Select a block - click on it. It will be highlighted with a white border.
  • Add to selection - hold Shift or Ctrl and click additional blocks.
  • Select an area - click on an empty part of the track and drag — all blocks within the rectangle will be selected.
  • Move all selected - drag any selected block and all others will shift by the same amount.
  • Deselect - click on an empty part of the track.
ℹ️ When multiple blocks are selected, resizing by edge is disabled — dragging an edge also moves the whole group.

Countdown block before singing starts (1·2·3·4 or 1·2·3):

If you want to give the vocalist time to get ready before the first line — add a countdown block. In the top panel of the Editing tab, drag the 1·2·3·4 button onto the lyrics track at the desired position. Use the arrows next to the button to choose the mode: ▲ = 4 beats, ▼ = 3 beats.

  • Place a block - hold and drag the 1·2·3·4 button onto the timeline. The block will appear exactly where you drop it.
  • Move - drag the block left or right along the timeline.
  • Stretch / shrink - drag the left or right edge of the block. The wider the block, the slower the countdown.
  • Delete - click the block to select it, then press Delete.
  • Multiple blocks are supported — for example, a countdown before each verse.

In the preview window and in the final video, the countdown number appears on the left inside a circle filled with the lyrics fill color.

Controls at the bottom of the screen:

  • + Zoom - shrink or expand the timeline view. Zoom in for precise editing.
  • Offset - shifts all lyrics earlier or later by 50 ms per click. Hold the button down for continuous adjustment. Useful when the entire text is consistently early or late.
ℹ️ To move the playback cursor, click on the bar ruler, the vocal waveform track, or the Video / Photo track. Then press or Space to instantly hear the selected moment.
7

Customize the appearance (optional)

This step is entirely optional - if you are happy with the default look, jump straight to Step 8.

Options available on the «2 Text» tab:

  • Font - choose a typeface from the dropdown list
  • Size - make the text larger or smaller
  • Base color - the color of the text before it is filled in during singing
  • Fill color - the color that sweeps across each word as it is sung (the classic karaoke highlight color)
  • Outline color - the border drawn around letters to improve readability on any background
  • Outline size - the thickness of the outline in pixels
  • Background color - the solid color shown behind the text when no video or photo is used
  • Stripe color - the color of the horizontal bar displayed behind the text
  • Stripe height - how tall the background stripe is
  • Stripe opacity - how transparent the stripe is (0 = fully transparent, 100 = fully opaque)
  • Fade in - how many seconds the line takes to fade onto the screen
  • Fade out - how many seconds the line takes to fade off the screen
  • Lead time - how many seconds before the singing starts the line appears on screen
  • Trail time - how many seconds after the line ends before it disappears from the screen

How to add a video or photo background:

  1. On the «1 Tracks» tab, find the «Video / Photo» track
  2. Drag a file onto it (MP4, MOV, JPG, or PNG)
  3. The clip will appear on the timeline and the background will show in the preview window

Working with background clips on the timeline:

  • Move a clip - drag it left or right to choose which part of the song the background is shown during
  • Precise keyboard movement - click the clip to select it, then use ← → arrow keys to nudge it by one frame (≈ 33 ms). Hold Shift to move by one second at a time.
  • Resize a clip - drag the left or right edge to set how long the background is displayed
  • Delete a clip - select it and press Delete or Backspace

Red triangles - fade handles:

  • Each clip has small red triangles in its top-left and top-right corners
  • Left triangle - fade in: drag it to the right to make the background appear gradually rather than cutting in abruptly
  • Right triangle - fade out: drag it to the left to make the background dissolve smoothly at the end
  • The farther you drag the triangle, the longer and smoother the transition
ℹ️ All changes appear instantly in the preview window. Press ▶ at any time to watch how the finished video will look.
8

Export and download your video

Click the 4 Export tab at the top of the screen.

Choose what to include in the video:

  • Include vocals - the singer's voice will be in the final video
  • Include instrumental - the backing track without the voice
  • Include Video / Photo - the background image or video clip (if you added one)
ℹ️ You can keep both tracks checked to produce a standard video with the full song. Or uncheck vocals to create an instrumental-only (backing track) version.

Click the large Render project button.

Do not close the tab while rendering is in progress - the video is being recorded in real time. To cancel, press Esc on your keyboard.

When «Done» appears, click Download.

Your finished MP4 file at full HD 1920×1080 will be saved to your computer.

🎉 That's it! Your karaoke video is ready. You can open it in any media player or upload it to YouTube.

Keyboard shortcuts

These shortcuts let you work faster without reaching for the mouse.

Space

Play / Pause

Start or pause playback. Works on any tab, as long as the cursor is not inside a text input field.

Esc

Cancel active operation

Cancels stem separation if it is currently running. Also cancels video rendering during export.

Move a selected clip frame by frame

Click a video or photo clip to select it, then use the arrow keys to nudge it precisely - each press moves it by 1 frame (≈ 33 ms). Hold Shift to move by 1 second per press.

Delete
Backspace

Delete selected clip

Removes the selected video or photo clip from the timeline, as well as a selected 1·2·3·4 countdown block. You must click the clip first to select it.

Shift
Ctrl

Select multiple lyrics blocks

Hold Shift or Ctrl and click lyrics blocks on the timeline — each clicked block is added to the selection. Then drag any selected block to move them all together. You can also select an area by dragging over an empty part of the track.

ℹ️ Arrow keys and Delete only work when a clip is selected (click it to highlight it). Clicking an empty area of the timeline deselects everything.
?

Frequently asked questions

The lyrics are out of sync - what should I do?

Make sure the text in the lyrics field matches exactly what the singer sings. Extra words, missing lines, section headers (like [Chorus] or [Verse]), or line number prefixes will throw off synchronization. Clean up the text and click «Synchronize text» again.

Stem separation is taking a long time or failed with an error?

Refresh the page and try again. Make sure the audio file is not corrupted and is in a supported format (MP3, WAV, or FLAC).

Can I create an instrumental-only (backing track) version?

Yes. On the Export tab, uncheck «Include vocals» and leave «Include instrumental» checked. The exported video will contain only the music without the voice.

The site says my screen resolution is not supported?

The site requires a screen resolution of 1920 × 1080 or higher. It is not designed for phones or tablets - please use a desktop or laptop computer.

How do I start a new project?

Click the New project button at the top of the page. This will clear all current project data so you can start fresh.

What happens to my files - is my data safe?

Only two tasks are sent to our server:

  • Vocal and instrumental separation - this requires significant processing power and is handled server-side
  • Lyrics synchronization - also performed on the server

Everything else - timing adjustments, appearance settings, vocal editing, video rendering, and export - runs entirely in your browser, using your computer's own resources. Nothing else is sent to our servers.

🔒 We do not store your files. Your uploaded song, lyrics, and exported video are never saved on our servers - everything stays on your device.

Rendering and export speed depends on your computer's hardware. A modern CPU and GPU will produce the finished video significantly faster.

You're all set!

Follow these 8 steps and you'll have a finished HD karaoke video.
The whole process typically takes 5-10 minutes.

Create a karaoke video →