is a Japanese phase-cancellation tool used to extract vocals (acapellas) from a song. It works by "subtracting" a high-quality instrumental track from the full song, leaving only the differences—the vocals—behind. Prerequisites Audio Files : You must have the Official Instrumental
: Use high-quality, lossless files (WAV is best). Both files must have identical sample rates and bit depths. Alignment Tool : You often need an editor like to align the tracks perfectly before using Utagoe. Step 1: Align Your Tracks (Audacity)
Because Utagoe relies on subtraction, the tracks must be sample-perfect in alignment. Import both the full song and the instrumental into Convert to Mono
: If Utagoe is having trouble with stereo, split both tracks to mono and keep only one channel of each. Perfect Alignment Time Shift Tool
(F5) to align the waveforms. Zoom in until you see individual sample dots and ensure the instrumental's peaks line up exactly with the full song's peaks. Sync Starts
: Trim any silence at the beginning so both files start at the exact same millisecond. : Export both as 16-bit PCM WAV files. Name them clearly (e.g., song_full.wav song_inst.wav Step 2: Using Utagoe
Since the program is in Japanese, the interface may show question marks ( ) on non-Japanese systems. Load Files : Select the Middle Bar : Select the Instrumental (the track to be subtracted). Bottom Bar : Specify the name and location for the Output Vocal Adjust Sensitivity (Wrench Icon) Click the wrench tool to adjust settings. The slider or numerical value (often set between 1.2 and 4.0 ) controls how much "noise" is removed. Start around
and experiment if the result is too "washy" or still has music bleed. : Click the Large Green Button
(the musical note icon) to begin extraction. A progress bar will appear at the bottom. Alternative: Modern AI Extraction
Utagoe is an older method that requires a perfect instrumental. If you don't have an instrumental, modern AI tools often produce better results: Ultimate Vocal Remover (UVR)
: Widely considered the best free AI tool for high-quality stem separation. BandLab Splitter : A free mobile/web tool for quick extraction. Canva Vocal Remover : A simple, free web-based option. Do you have the official instrumental file for the song you are trying to rip?
Can you remove vocals from a song without losing quality on computer?
Utagoe is a lightweight, classic Japanese freeware utility designed for phase cancellation to extract vocals, requiring a matching official instrumental track for best results. Unlike modern AI tools that estimate frequencies, Utagoe enables manual alignment and produces a bit-perfect vocal extraction when perfect audio alignment is met.
Utagoe is a specialized tool designed for vocal extraction using a technique called "phase cancellation" or audio subtraction. Unlike modern AI-based separators, Utagoe requires you to have both the original full track and the official instrumental (karaoke) version to isolate the vocals. How to Use Utagoe for Vocal Ripping
To get a clean acapella, you need to match the two tracks perfectly so the software can "subtract" the instrumental from the full song.
Prepare Your Files: Ensure both your full song and instrumental are in high-quality WAV format. Align in Audacity: Drag both files into Audacity. Zoom in until you see individual sample dots.
Use the Time Shift Tool to align the waveforms of the instrumental and the original song exactly.
Export both tracks as separate WAV files starting at the exact same millisecond. Process in Utagoe:
Open Utagoe and load the original song into the first field and the instrumental into the second.
Fine-tuning: Click the wrench/settings icon. Many users find success by setting the right-hand slider (noise reduction/sensitivity) to around 1.2 to 3.6 depending on the song's complexity.
Click Start (the big green button) to generate your "VO" (Vocal Only) track.
These tutorials provide step-by-step visual guidance on aligning tracks in Audacity and using Utagoe's settings for the cleanest results:
Utagoe is a classic, lightweight Japanese freeware tool specifically designed to extract vocals from a song or, conversely, remove vocals to create an instrumental track. It operates on the principle of phase cancellation
, requiring both the original full track and a perfectly matched instrumental version to work effectively. How Utagoe Works
Unlike modern AI-driven "vocal rippers" (like LALAL.AI or StemRoller) that use neural networks to identify and separate sound sources, Utagoe is a "subtraction" tool. The Process
: You load a vocal track and its corresponding instrumental. Utagoe inverts the phase of the instrumental and layers it over the vocal track. The Result
: Because the instrumental frequencies in both files are identical but out of phase, they cancel each other out, leaving behind only the unique frequencies—the vocals. Key Features of the "Full" Version
The term "full" usually refers to the complete freeware package (often version 1.80 or similar), which includes the necessary components to run on Windows systems without installation. Sample-Level Alignment
: Its most critical feature is the ability to shift tracks by tiny increments (milliseconds) to ensure the two files are perfectly aligned for the best cancellation. Pitch and Tempo Matching
: It allows for minor adjustments to the pitch or tempo of the files if they don't perfectly match the source material. No-Frills Interface
: The UI is extremely minimalist, focusing entirely on the technical parameters of the subtraction process. Pros and Cons Clean Extraction
: When you have a high-quality instrumental, the vocal isolation can be cleaner than AI methods, with fewer "underwater" artifacts.
: It remains a free legacy tool for the enthusiast community.
: It runs entirely offline on your PC; no need to upload files to a cloud server. Requires an Instrumental
: It cannot "rip" vocals from a single file. If you don't already have the backing track, Utagoe is useless. Sensitivity
: If the two tracks are encoded differently (e.g., one is a 128kbps MP3 and the other is a FLAC), the cancellation will be poor and "ghost" sounds will remain. Legacy Software
: It may require "Compatibility Mode" or specific Japanese locale settings (AppLocale) to display text correctly on modern versions of Windows.
While modern AI separators have largely replaced Utagoe for general users because they don't require a matching instrumental,
Utagoe remains a gold standard for "studio quality" DIY acapellas
when a high-quality instrumental is available. It is a surgical tool rather than a magic wand. step-by-step guide
on how to align tracks within the software for the best results?
To use Utagoe (also known as the "vocal ripper" or "Utigo") for vocal extraction, you typically use a "phase cancellation" method which requires both the original full track and the official instrumental version of the song. How to Use Utagoe for Vocal Extraction Prepare Your Files: You must have two files in .WAV format: The original full song. The official instrumental/karaoke version.
Align Tracks (Optional but Recommended): Use a program like Audacity to ensure the two tracks start at the exact same millisecond. Load into Utagoe: Open Utagoe. Drag the Original Track into the first field. Drag the Instrumental Track into the second field.
The program will automatically name the output file (usually ending in "VO" or "vocal").
Fine-Tune and Start: Click the wrench/settings icon to adjust the subtraction strength (often around 3.6 is recommended) and then press the start/green button to begin processing. Better Modern Alternatives
While Utagoe was popular for older methods, modern AI-based "stem splitters" are now much faster and don't require an instrumental version.
Ultimate Vocal Remover 5 (UVR5): Widely considered the best free, open-source software for high-quality isolation.
VocalRemover.org: A quick, free online tool that uses AI to separate vocals and instruments in seconds.
BandLab Splitter: A mobile-friendly free tool for isolating vocals and specific instruments like drums or bass.
LALAL.AI: A popular browser-based splitter with high-fidelity results (though full features may require payment).
Utagoe is a specialized Japanese audio utility designed to extract vocals or instrumentals from songs by using a technique called "phase cancellation" (or audio subtraction). Unlike modern AI separation tools, it relies on having two nearly identical files: the full song and the official instrumental. Core Mechanism: Phase Cancellation
Utagoe works on the mathematical principle that if you subtract the instrumental audio data from the full track, only the difference (the vocals) remains.
Requirements: You must have a high-quality "Original" track (vocals + music) and a high-quality "Instrumental" track.
Precision: The two tracks must be perfectly aligned down to the millisecond. Even a tiny offset will result in "ghosting" or poor separation. Workflow Overview
According to tutorials from the IsolatedVocals community and Lazytown forums, the typical process involves:
Preparation: Import both tracks into an editor like Audacity.
Alignment: Manually align the waveforms until they match exactly. Exporting: Export the aligned tracks as separate WAV files. Utagoe Processing: Load the original and instrumental files into Utagoe.
Adjust the sensitivity (often set around 3.6 for best results).
The software then "subtracts" the instrumental frequencies from the original track to produce a .VO file containing the isolated vocals. Comparison with Modern AI Tools
While Utagoe was a standard for years, newer AI-based solutions like Ultimate Vocal Remover (UVR) or LALAL.AI are now often preferred because they do not require an official instrumental track to work. Modern AI (e.g., UVR) Input Needed Full Song + Official Instrumental Full Song Only Mechanism Waveform Subtraction Deep Learning Models Ease of Use Difficult (Manual Alignment) Easy (One-click) Result Quality Perfect (if files match) High (but may have artifacts) Legal Note
It is important to remember that extracting vocals for commercial use without permission is generally illegal, as it involves copyrighted material.
is a specialized Japanese software used to isolate vocals or instrumentals from a track using phase inversion
(also known as the "subtraction" method). Unlike modern AI tools that guess what a voice sounds like, Utagoe works by taking a full song and an official instrumental, then mathematically "subtracting" the instrumental from the song to leave only the vocals behind. The "Subtraction" Requirements
To use Utagoe effectively, you must have two specific files: The Full Song : The original track with vocals. The Official Instrumental : This must be the
same version (same mix, length, and tempo) as the full song. WAV Format
: Both tracks should be in lossless WAV format for the best results; lossy formats like MP3 can cause "noise" or artifacts. Step-by-Step Usage Guide
Because the software was developed in Japan and never received an official English port, many menu items may appear as question marks ("?") on English operating systems. Preparation (Alignment) Open both tracks in a program like
Zoom in closely to match the waveforms exactly. Even a millisecond of misalignment will result in poor vocal extraction.
Export both tracks starting at the exact same timestamp as WAV files. File Input First Field : Select your (often marked as "O" for Original). Second Field : Select the Instrumental (often marked as "I"). Third Field : Name your output file (e.g., "Vocal_Track.wav"). Settings & Execution Wrench/Tool icon to adjust the "strength" of the pass. Strength Range : Recommended settings are usually between 1.2 and 3.6
. Higher values "scrape" harder to remove background noise but can reduce vocal quality. Large Start Button (often featuring a musical symbol) to begin the process. Modern Alternatives (AI-Based)
While Utagoe is a "classic" tool for high-quality extraction when you have an instrumental, modern AI-driven tools can now extract vocals needing an instrumental track: Ultimate Vocal Remover (UVR)
: Widely considered the best free, open-source offline tool. It uses advanced AI models like MDX-Net and Demucs
to separate stems (vocals, drums, bass) directly from a single file. VocalRemover.org : A popular free online AI tool for quick separations without installing software. Ultimate Vocal Remover Do you have the official instrumental
for the song you're working on, or would you like to explore options that don't require one?
The “Full” label in user forums and download archives (e.g., SoundFont.ru, old audio blogs) usually refers to:
Because the original developer (a Japanese creator known as “UTagoe”) distributed it as freeware, the “Full” version is essentially the standard release, but some later repacks added presets or bundled encoders.
Because the official website is long gone, you will likely need to find the software on archives or audio forums.
Utagoe.zip or the specific version (often Ver 5.0 is the most sought after). It is a very small file (usually under 1MB)..exe). Just extract the ZIP file to a folder on your desktop and run Utagoe.exe.Utagoe Vocal Ripper is a pioneering, yet technically obsolete, Japanese software utility designed for audio processing. Its primary function is to isolate or "rip" the vocal track from a mixed audio file (such as an MP3 or WAV file of a full song).
Developed in the early 2000s, Utagoe became legendary in the early internet music remix community—particularly within the Doujin music scene and early karaoke enthusiasts—for being one of the first accessible tools capable of vocal extraction.
First, let's break down the name. "Utagoe" (歌声) is a Japanese term meaning "singing voice." The software was initially developed to cater to the Vocaloid and UTAU community—hobbyists who needed to extract raw vocals from commercial songs to create cover versions or "harmony" tracks.
Unlike basic "center channel extraction" (which simply removes everything in the stereo center, usually destroying the bass in the process), Utagoe Vocal Ripper utilizes Deep Neural Networks (DNN) . It analyzes the audio spectrally, identifying the signature frequencies of the human voice versus the backing instruments.