Lalal.ai Audio Splitter Review: Can You Split Mono Into Stereo?

Podcasters can’t always get what they want. Maybe you landed an interview with your dream guest, but only if you do it right here and now, and you don’t have your preferred recording equipment available. Maybe you have to record in a noisy environment, but you want to make sure your audiences can understand the voices. What’s a podcaster to do? Fortunately, there are software tools that can help. With Lalal.ai Audio Splitter software, we’ll separate the wheat from the chaff as best we can. Let’s see what artificial intelligence can do to salvage cluttered audio and clarify those voices.

How Does Lalal.ai Work?

Lalal.ai uses artificial intelligence to detect multiple sound sources (vocals and music) and split them into two separate sound files. The company was kind enough to send me a promotion code, so I could test the software’s features at all levels.

Signup is a little different, but still easy. You submit your email address to their website. Lalal.ai sends a link to your email address. Click on the link, and voilà, you’re signed up.

At their website, you drag the sound file you want to split and drop it onto the page to upload it.

Once it’s processed, you can listen to the split audio files and select how much processing you want: Free, Lite, or Professional.

Lalal.ai Audio Splitter processes the file, splits the vocal from any background sounds, and displays two files for download: one marked vocal, the other marked instrumental. It takes only a few minutes, depending on how long your audio file is. Then you just click the down arrow to download the file.

Pricing

Lalal.ai Audio Splitter has three price tiers: Free, Lite, and Professional. Higher tiers accept more file formats, larger files, and more minutes of audio.

Free: Users can upload and process up to ten minutes of audio-only files of up to 50 MB, in MP3, OGG, or WAV format.
Lite: For $10, users can upload and split up to 90 minutes of audio or video files of up to 2 GB. The accepted file formats are MP3, OGG, WAV, FLAC, AVI, MP4, MKV, AIFF, and AAC. Lite also has a faster processing queue.
Professional: For $20, users can upload and split up to 300 minutes of audio or video files of up to 2 GB. This tier accepts the same file formats as Lite, and uses the same faster processing queue.
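If you like to script things, the tier limits above can be turned into a quick pre-upload check. This is only a sketch in Python, not anything Lalal.ai provides: the names (Tier, cheapest_tier) are made up for illustration, 2 GB is treated as 2000 MB, and the minute limit is assumed to apply to the one file you are checking.

```python
from dataclasses import dataclass

# Formats and limits copied from the tier list above. Everything else
# (names, the 2 GB = 2000 MB conversion) is an assumption for illustration.
AUDIO_ONLY = frozenset({"mp3", "ogg", "wav"})
AUDIO_VIDEO = AUDIO_ONLY | {"flac", "avi", "mp4", "mkv", "aiff", "aac"}


@dataclass(frozen=True)
class Tier:
    name: str
    price_usd: int
    max_minutes: int
    max_megabytes: int
    formats: frozenset


# Ordered from cheapest to most expensive.
TIERS = [
    Tier("Free", 0, 10, 50, AUDIO_ONLY),
    Tier("Lite", 10, 90, 2000, AUDIO_VIDEO),
    Tier("Professional", 20, 300, 2000, AUDIO_VIDEO),
]


def cheapest_tier(minutes, megabytes, extension):
    """Return the name of the cheapest tier that fits the file, or None."""
    ext = extension.lower().lstrip(".")
    for tier in TIERS:
        fits_length = minutes <= tier.max_minutes
        fits_size = megabytes <= tier.max_megabytes
        if fits_length and fits_size and ext in tier.formats:
            return tier.name
    return None
```

For example, cheapest_tier(8, 30, "flac") skips Free, because FLAC isn't on the Free format list, and lands on Lite.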

Essentially, it’s reasonably priced and convenient. Let’s hear how it performs.

Samples of Audio Processed with Lalal.ai Audio Splitter software

Bear in mind that I know nothing about AI or machine learning, so I can’t begin to dissect this software. All I can describe is what I hear. Here’s what I tried:

Split a track into vocals and music.
Split a track into two vocal tracks. This isn’t recommended on the website, but I think podcasters would need this use.

Splitting audio and music

I started with a sample audio file I had lying around (as you do), from when I made screenshots for our GarageBand Podcast Production article.

I care about copyright, so here are the credits. Music is The Maple Leaf Rag by Scott Joplin, performed by Kevin MacLeod. For more music, visit his website at incompetech.com. The joke is probably floating around in the zeitgeist, but I first heard it performed by Fozzie Bear on The Muppets. I know, when Matthew tests mics, he reads Beatrix Potter. I’ll try to step up my game for the next one.

Here’s what the vocal track sounds like, after processing it with Lalal.ai.

The background music is gone, mostly; I can still hear a tiny snippet of piano around the words. How’d the music turn out?

You probably remember from the Intro to GarageBand article that I lowered the volume while I was speaking. That’s why you hear the volume get lower here, too, and then increase after I stopped talking. I notice that between seven and ten seconds, there’s almost an underwater sound, as if the software is trying to fill in notes that my voice covered up. So, yes, it’s music. But, I wouldn’t pick this out for karaoke night.

Splitting Vocals from Background Vocals

What about background voices? Let’s say you’re waiting in line for a ride at the carnival, and you notice the Dalai Lama is in line right behind you. You get him to agree to a quick interview for your podcast, in exchange for letting him budge ahead of you in line. But, all you have is your phone. Plus, a carnival isn’t exactly a silent home studio. What do you do?

interviewing a quiet monk on a noisy carousel

For this example, I recorded myself telling the same horse joke, while holding my phone up playing a clip of the final speech from Charlie Chaplin’s seminal cinematic masterpiece, The Great Dictator.

You get the idea. If Selma and Patty were ahead of you in line yapping about funnel cake and their boyfriends, their voices would bleed into your recording. You don’t want that. Now, let’s see what happens when we try to split the vocal tracks, in this case, mine and Charlie’s.

Lalal.ai recognizes my voice as a voice. But, it recognizes anything other than the voice closest to the mic as an “instrumental.”

Lalal.ai did cut off the first 4 seconds of Chaplin’s speech (where Charlie talks about not wanting to be an emperor) before I started speaking. But, you can still hear his voice in and around my vocals. So, if you were trying to split primary vocals away from background sound, it can do it, mostly. But, you’ll still have artifacts.

Splitting Two Podcast Speakers into Two Tracks

I really thought the best use of this software would be to take a recording of two people, both on mics but recorded as one track, and split it into two tracks. So, Matthew and I had a quick Zoom call.

I could really benefit from Cleanvoice.ai. In any case, this is an example of the closest thing to a zero-effort podcast clip. Again, what I hoped to get was two vocal tracks, even if they were labelled Vocal and Instrumental.

Look at the waveforms on these tracks. One takes up a lot of space; it’s loud and clear. The other one takes up hardly any space; it’s barely audible. I know Matthew’s quiet, but I’m not that much louder than him, am I?
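If you’d rather put a number on “loud and clear” versus “barely audible” than eyeball the waveforms, one common measure is the RMS level in decibels. The Python sketch below is an illustration only, not part of Lalal.ai. It assumes you have already loaded each track as a list of float samples between -1.0 and 1.0, and the function name rms_dbfs is made up.

```python
import math


def rms_dbfs(samples):
    """Return the RMS level in dB relative to full scale (1.0).

    Expects float samples in the range -1.0 to 1.0.
    Silence (or an empty list) comes back as negative infinity.
    """
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0:
        return float("-inf")
    # 10 * log10(mean square) is the same as 20 * log10(RMS).
    return 10 * math.log10(mean_square)
```

Run it on both downloaded tracks and subtract one result from the other. The bigger the gap in dB, the more lopsided the split.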

Lalal.ai’s artificial intelligence seems to look solely for the human voice, as opposed to any other sound. We were both speaking, so both of our voices went on the same track. By comparison, Descript’s AI can tell speakers apart and transcribe what each one says. As transcription software, Descript does a good job of telling the difference between male and female voices, and differently-accented voices. So, this could have been a slam dunk. But Lalal.ai is looking for only two things: human voice, and not-human-voice.

To show you what the AI removed, here’s the “instrumental” track.

It seems like it extracted room noise (I had a fan running in the room), some bass frequencies, and a couple of mouth tics. This isn’t bad, necessarily, if you like experimental audio. I’m sure David Lynch would be very excited to hear the negative space of a conversation.

But I Really Want My Podcast In Stereo!

Again, I’m a writer, not a sound designer. Please bear in mind, we have resources that can help you get good audio for your podcast.

There are better ways to achieve this result. I’m showing you this anyway, so you can see the idea. Here’s what I did.

I opened up GarageBand as a vocal project, and made two tracks. Then, I took the .mp3 file of both Matthew and me talking, which Lalal.ai had split from the room noise. I dragged and dropped it into each track, once. So, what you see here is the same file, copied and pasted into two different tracks.

I renamed the tracks “Matthew” and “Lindsay,” respectively. Then, I used that little wheel to the right of the volume slider to pan Matthew’s vocal track hard to one side, and mine to the other.

Finally, I cut all of my voice out of Matthew’s track, and all of Matthew’s voice out of my track.
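The copy, pan, and cut steps above can also be sketched in code. This is not what GarageBand does under the hood, just the same idea written in Python: start from one mono track, and use hand-marked time ranges for one speaker to decide which channel each stretch of audio lands on. The function name split_to_stereo and the left_segments list are made up for illustration, and you would still have to mark those time ranges by ear.

```python
def split_to_stereo(mono, sample_rate, left_segments):
    """Build (left, right) channels from a single mono track.

    mono: list of samples from the combined recording.
    sample_rate: samples per second.
    left_segments: hand-marked (start_sec, end_sec) ranges where the
        left-channel speaker is talking. Everything outside these
        ranges stays on the right channel.
    """
    left = [0.0] * len(mono)   # starts silent, like a fully cut track
    right = list(mono)         # starts as a full copy of the recording
    for start, end in left_segments:
        first = max(0, int(start * sample_rate))
        last = min(len(mono), int(end * sample_rate))
        for i in range(first, last):
            left[i] = mono[i]  # keep this speaker on the left...
            right[i] = 0.0     # ...and cut them from the right
    return left, right
```

Like the manual version, this can only assign each moment to one side. Anywhere two people talk at once, both voices end up in the same channel.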

Not gonna lie, you’ll hear a couple of spots where we talk over each other a wee bit. More importantly, now that our voices are on separate tracks, I can process each differently. If you listen to this through headphones, it should sound more like stereo.

This isn’t “splitting,” in the sense that you break the track into two things. It’s copying, pasting, and cutting. But, this lets me dig in and give each track the settings that make Matthew and me both sound our best. At least, I would, were I an experienced sound designer.

Pros and Cons of the Lalal.ai Audio Splitter Software

If you need to remove extraneous background sound from a recording, Lalal.ai is quick and won’t cost much money, if any. Plus, it’s web-based, so you can use it pretty much anywhere that you have Internet access.

But, if you want to split one cluttered vocal track into two clean, separate tracks, and preserve the integrity of the sound, Lalal.ai isn’t the right choice.

There are lots of ways to kill background noise. You can find workarounds to split sides of the call. Call recorder software is improving. There are all kinds of workarounds for odd podcasting situations. Lalal.ai Audio Splitter might not be your best bet, but it’s accessible, and won’t break the bank.

Part of the fun of podcasting is finding new strategies and using new tech to make it easier to get your story out into the world. That’s one of the reasons we created Podcraft Academy. We don’t want you to be stuck in a trial-and-error cycle. We test new gear, methods, and software, so you can more easily launch and grow your show. On top of that, our all-in-one podcasting solution, Alitu, can help you record, edit, polish and publish your podcast, so you can focus on making great content and building relationships. Give it a try!
